Претрага
1103 items
-
Fourth Summer Datathon on Linguistic Linked Open Data
Tijana Radović, Ranka Stanković (2023)The 4th Summer Datathon on Linguistic Linked Open Data (SD-LLOD-22) was held in Spain, in Cersedilla near Madrid, in May 2022, and organized by the COST Action NexusLinguarum. The school gathered interested researchers, academics, students who wanted to acquire and/or expand their knowledge in the field of linguistic linked data science. During the school, a spectrum of topics from the field of linked data was presented, from various ontologies, through document integration, annotation and natural language text processing tools ...Tijana Radović, Ranka Stanković. "Fourth Summer Datathon on Linguistic Linked Open Data" in Infotheca, Faculty of Philology, University of Belgrade (2023). https://doi.org/10.18485/infotheca.2023.23.1.6
-
Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC
OntoLex, dominantni standard zajednice za mašinski čitljive leksičke resurse u kontekstu RDF-a, Linked Data i tehnologija Semantičkog veba, trenutno se proširuje sa posebnim modulom za Frekvencije, Primere i Informacije zasnovane na Korpusu (OntoLex-FrAC). Predlažemo novi komponent za OntoLex-FrAC, koji se bavi inkorporacijom korpusnih upita za (a) povezivanje rečnika sa korpusnim mašinama, (b) omogućavanje RDF baziranih web servisa da dinamički razmenjuju korpusne upite i podatke odgovora, i (c) korišćenje konvencionalnih upitačkih jezika za formalizaciju unutrašnje strukture kolokacija, skica reči i ...standardizacija, digitalna leksikografija, OntoLex, upiti korpusa, povezani podaci, Lingvistički povezani otvoreni podaciChristian Chiarcos, Ranka Stanković, Maxim Ionov, Gilles Sérasset. "Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, 20-25 May 2024, LREC (2024)
-
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)
In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data
Овај рад описује студију случаја о генерисању повезаних података креираних на основу обечежених текстуалних корпуса коришћењем формата размене података у обради природних језика (NIF). Као основа за ово истраживање послужио је подскуп корпуса ELTeC, који се састоји од 900 романа из периода 1840-1920 за 9 европских језика. Верзија романа са коментарима, у такозваном TEI level-2 формату, трансформисана је у NIF, формат заснован на RDF/OWL који има за циљ постизање интероперабилности између алата за обраду природних језика, језичких ресурса и ...Ranka Stanković, Christian Chiarcos, Miloš Utvić, Olivera Kitanović. "Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj
-
OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
Ovaj rad predstavlja novi jezički resurs za pretraživanje i istraživanje verbalnih aspektnih parova u BCS (bosanskom, hrvatskom i srpskom), kreiran korišćenjem principa Lingvističkih Povezanih Otvorenih Podataka (LLOD). Pošto ne postoji resurs koji bi pomogao učenicima bosanskog, hrvatskog i srpskog kao stranih jezika da prepoznaju aspekt glagola ili njegove parove, kreirali smo novi resurs koji će korisnicima pružiti informacije o aspektu, kao i link ka aspektnim parovima glagola. Ovaj resurs takođe sadrži spoljne linkove ka monolingvalnim rečnicima, Wordnetu i BabelNetu. ...Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević. "OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
SrpELTeC: A Serbian Literary Corpus for Distant Reading
U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analyticsRanka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
-
Football terminology: compilation and transformation into OntoLex-Lemon resource
У овом раду представља се пројекат који је у развоју, креирање првог дигиталног фудбалског речника на српском језику, као и да демонстрација примене модела OntoLex и љегових модула. OntoLex-FrAC модул укључује информације о учесталости и примерима употребе екстрахованих из корпуса. У овом случају, креиран је корпус за специфичан домен под називом СрФудКо, који садржи чланке вести о фудбалу на српском језику. Вишечлани термини аутоматски су екстраховани из српског корпуса, а затим ручно евалуирани и класификовани као спортски или ...Jelena Lazarević, Ranka Stanković, Mihailo Škorić, Biljana Rujević. "Football terminology: compilation and transformation into OntoLex-Lemon resource" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj
-
Infotheca (Q25460443) in Wikidata
Ranka Stanković, Lazar Davidović (2021)Vikipodaci su baza znanja Zadužbine Vikimedija koja predstavlja zajednički izvor različitih vrsta podataka koje koriste ne samo drugi Vikipedijini projekti, već sve više i brojne aplikacije semantičkog veba. U ovom radu ćemo prezentovati primer integracije Vikipodataka sa digitalnim bibliotekama i eksternim sistemima, kao i mogućnost ubrzanja pripreme i unosa podataka na primeru radova iz časopisa za digitalnu humanistiku Infoteka.... given in the parenthesis. An item is, thus, linked to a unique identifier (QID), the identifier is, in turn, linked to the item’s corresponding title and description, so as to remove any ambiguity. An identifier of a data item (QID) can, in addition to being linked to a title and a description, have a number ...
... continued activity where special attention will be paid to linked open data in the domain of linguistics – LLOD and its application. We must certainly be aware of the problems and limitations related to Wikidata and other kinds of linked open data, so as to be able to look into the ways of overcoming or ...
... preparation and entry using the articles published in In- fotheca, Journal for Digital Humanities as an example. KEYWORDS: Semantic Web, Open Linked Data, Wikidata, Infotheca, journal metadata. PAPER SUBMITTED: 24 June 2021 PAPER ACCEPTED: 16 July 2021 Ranka Stanković ranka.stankovic@rgf.bg.ac.rs University ...Ranka Stanković, Lazar Davidović. "Infotheca (Q25460443) in Wikidata" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.5
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School
Prva škola za obuku polaznika koju je organizovala COST akcija NexusLinguarum održana je od 8. do 12. februara 2021. godine sa ciljem da studenti, istraživači i stručnjaci nauče osnove lingvističke nauke o podacima. Tokom obuke polaznici su se upoznali sa širokim spektrom tema: od semantičkog veba, RDF -a i ontologija, do modeliranja i pretraživanja jezičkih podataka pomoću najsavremenijih ontoloških modela i alata. Škola je održana u okviru serije letnjih škola EUROLAN-a i organizovalo ju je virtuelno (onlajn) nekoliko instituta; ...nauka o lingvističkim podacima, povezani podaci u lingvistici, jezički podaci, EUROLAN, NexusLinguarum, COST akcija, škola za obuku... validation; (Cimiano et al. 2020) – Linguistic linked data; (Chiarcos et al. 2013) – Lemon-OntoLex7 (McCrae et al. 2017; Declerck, Tiberius, and Wandl- Vogt 2017; Stanković et al. 2018) – Linguistic linked data generation; (Cimiano et al. 2020) – Corpora and linked data; (Chiarcos 2012) – Linguistic annotations; ...
... (Hellmann et al. 2013) – Tools and applications of linguistic linked data. (Declerck et al. 2020) The first day started with an opening session and a brief introduction to Linguistic Linked Data (LLD), followed by an introduction to Linked Data and RDF dedicated sessions. The second day covered topics related ...
... M. et al., eurolan 2021: . . . Linked Data. . . , pp. 113–120 and semantically interoperable linguistic data is required. Training schools are one of the means for reaching this goal, and therefore the NexusLin- guarum core team organized the Introduction to Linked Data for Linguistics online training ...Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković. "EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.7
-
A Data Driven Approach for Raw Material Terminology
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja (2021)The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has ...sirovine, rudarstvo, terminologija, rečnik, terminološka aplikacija, mobilna aplikacija, digitizacija, leksički podaci, korpusi, otvoreni povezani podaci... terminology system that includes data, application and user interface layers covering different data and software technologies. The automation of data publishing in the form of linked data, as one of the core pillars of the Semantic Web or the Web of Data, provides links between data sets that are Appl. Sci ...
... between lexical data sets and with other LOD resources [46]. In our research we were also aiming at compatibility with the Linked Data approach, using its set of design principles for sharing machine-readable interlinked data on the Web. This vision of globally accessible and linked data on the internet ...
... tion; digitization; lexical data; corpus data; linguistic linked open data 1. Introduction During the last decade, lexicography entered a new era due both to rapid development of advanced computational methods and availability of previously unseen abundance of language data in different modalities. These ...Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja. "A Data Driven Approach for Raw Material Terminology" in Applied Sciences, MDPI AG (2021). https://doi.org/10.3390/app11072892
-
Глаголи у кухињи и за столом
Цветана Крстев, Биљана Лазић (2015)У раду је приказано истраживање лексике на српском језику кулинарског домена које се заснива на коришћењу доменског корпуса, електронских лексичких ресурса, пре свега WordNet-а и морфолошких речника, и локалних граматика. Приказане су доменске специфичности ових ресурса, како се користе, и међусобно употпуњују. Посебно је приказано како се коришћењем доменског корпуса могу екстраховати глаголи специфични за кулинарски домен и описати начини њиховог коришћења. Дат је попис глагола са основним подацима који је добијен применом представљених метода.аутоматска обрада, коначни трансдуктори, електронски речници, семантичке мреже, локалне граматике, кулинарство... обогаћивање да пите обојити обојити, обојен да растопљеном посном чоколадом, јаје обрати обран да павлака, јогурт, млеко одбацити одбацивање да воде одвојити одвојен да млеко одлежати одлежан да печеница одмарати одмарање да теста одмрзнути одмрзнут да риба одсећи одсечен, осећи да опраним тиквицама ...
... распадање да рибе располовити располовлјен да црни лук распоредити распоредити да по сложеним пишкотама растањити растањен да тесто растворити растворен да квасац растопити растопљен да чоколада, маслац, бутер растресати растресање да теста расхладити расхлађен да кафа рафинисати рафинисан да уље, ...
... различитих лема уз неке карактеристичне примере дати су у Табели 4. На овај начин екстраховано је 376 различитих глагола с изведеним трпним придевима и глаголским именицама који су сви уз основне добијене податке наведени у Додатку овог рада. Утврђено је да 35 од ових глагола није описано у Речнику Матице ...Цветана Крстев, Биљана Лазић. "Глаголи у кухињи и за столом" in Научни састанак слависта у Вукове дане - Српски језик и његови ресурси: теорија, опис и преимене, Вол. 44/3, Београд : Међународни славистички центар (2015)
-
WebGIS Cadastre of Abandoned Mines in Autonomous Province of Vojvodina
Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis (2015)... to work with data. Administrators have the possibility to manage all the data, while editors have the option of editing data related to pits and commercial organizations, but not data related to user accounts and data dictionary. 3.3 Web application for textual (attribute) data Authorized ...
... spatial data needed for different aspects of the study of the terrain of abandoned mines and/or their remediation. These data include: basic data, hydrogeology, biology, geology and mining as well as specific information related to the terrain of abandoned mines [7]. Spatial data in the a ...
... solution is compliant with recommendations related to spatial data infrastructure (SDI), as a data infrastructure implementing a framework of geographic data, metadata, users and tools that are interactively connected in order to use spatial data in an efficient and flexible way [13] The WebGIS ...Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis. "WebGIS Cadastre of Abandoned Mines in Autonomous Province of Vojvodina" in Proceedings of the 5th International Symposium Mining And Environmental Protection,June 10-13,2015, Vrdnik, Serbia, Belgrade : Faculty of Mining and Geology (2015)
-
Comparative analysis of correlation coefficients in mineralogical and geophysical data from the mine tailing site “Rudnik” (Serbia)
Vesna Cvetkov, Filip Arnaut, Dragana Životić. "Comparative analysis of correlation coefficients in mineralogical and geophysical data from the mine tailing site “Rudnik” (Serbia)" in 5th Congress Geologists of the Republic of North Macedonia, Ohrid, 28-29. 10. 2024, Македонско геолошко друштво (2024)
-
A WebGIS Decision Support System for Management of Abandoned Mines
Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis (2016)... who manage the meta data and system preferences data, and editors, who have the option of editing data related to mine sites and companies, but not data related to user accounts and data dictionary. 2.2. Web Application for Non-Spatial (Attribute) Data For the non-spatial data, authorized users can ...
... multimedia data panel for the same mine. Figure 5. The panels showing basic information and multimedia data fora mine. 2.3. Reporting and Data Analysis Within the reporting system the following reports are available: ‚ Reports giving data on individual abandoned mines (general data, properties ...
... namely data and information, but also regarding the quality of structural relations among data, as well as quality of user interface featuring easy and simple search and display od structured data. Development of the software solution was compliant with recommendations related to spatial data infr ...Ranka Stanković, Nikola Vulović, Nikola Lilić, Ivan Obradović, Radule Tošović, Milica Pešić-Georgiadis. "A WebGIS Decision Support System for Management of Abandoned Mines" in Energies 7 no. 9 (2016): 567. https://doi.org/10.3390/en9070567
-
Advantages and Disadvantages of a Parallel and Zigzag Method of Acquisition in Walking Mode in Magnetometric Archeological Research
магнетометријска испитивања, цик-цак и паралелна аквизиција у ходајућем моду, линеарне аномалије, археологија... processing of magnetic data. For the testing polygon, the fi eld above the previously identifi ed archaeological anomaly in the archaeological site “Kremenite Njive” (Barajevo, Republic of Serbia) was used. An investigative polygon with dimen- sions of 25 x 25 m was used and the data was acquired using ...
... completion of the acquisition, the data was prepared for software processing. In the fi rst three cases, the data which had been acquired outside of the polygons was clipped from the da- tabase, and also the data whose value was outside a range of ± 70 nT/m. The data was arranged in a regular grid using ...
... of processing, data outside the test polygon was clipped along with data whose gra- dient fell outside a range of ± 70 nT/m. The data was gridded using the same interpolation method and displayed as a map of the vertical gradient of TMI (Figure 5.). Figure 5 In cases where data was sampled in a ...Mirko Petković, Vesna Cvetkov, Branislav Sretenović. "Advantages and Disadvantages of a Parallel and Zigzag Method of Acquisition in Walking Mode in Magnetometric Archeological Research" in Arheologija i prirodne nauke (2014)
-
Improvement of geodatabase queries within GeolISS
Ranka Stanković (2008)... introduces the approach to GeolISS development, whose main goal is the integration of existing geologic archives, data from published maps on different scales, newly acquired field data, Intranet and Internet publishing of geologic information. The Faculty of Mining and Geology of Belgrade University ...
... raster and the vector data model to represent reality. Raster data sets record a value for each point in the area of interest, which may require more storage space than the representation in vector format that stores only the Ranka Stanković 66 necessary data. Vector data can be displayed as vector ...
... vector data is usually much smaller than the space required for raster data. Another advantage of vector data is that they can be easily updated and maintained. In GeolISS vectorization of geologic maps is chosen as the approach to digitization of geological structures, namely geospatial data in general ...Ranka Stanković. "Improvement of geodatabase queries within GeolISS" in Review of the National Center for Digitization, Beograd : Faculty of Mathematics, Belgrade (2008)
-
Srbija u OneGeology Europe
Геолошки завод Србије као носилац Пројекта ОneGeologyEurope заједно са Рударско геолошким факултетом и Министарством за природне ресурсе, рударство и просторно планирање су се укључили у међународни Пројекат OneGeology Europe у мају 2013. године у већ поодмаклој фази израде Пројекта. До краја 2013. године испунили су завршене активности које треба да доведу до пуноправног укључења у Пројекат чиме је Република Србија нашла своје место на Геолошкој карти Европе 1:1М. Геолошка карта Србије 1:1М представља компилациону односно поједностављену верзију ОГК 1:500 ...... geological maps within 1G-E. Schematic harmonization ensures that all data sets have a coherent data structure, to enable web services to read and search the data in the right way. Syntax and data language, as well as systems for data management were implemented in accordance with the OpenGIS (OGS) ...
... geological maps within 1G-E. Schematic harmonization ensures that all data sets have a coherent data structure, to enable web services to read and search the data in the right way. Syntax and data language, as well as systems for data management were implemented in accordance with the OpenGIS (OGS) standards ...
... implement the required interchange standards, for their data to be interoperable. This can be achieved by using GML (Geography Mark-up Language) based data. GML based data (including GeoSciML) can be used in many different ways. For example, basic data can either be rendered into a map that can be i ...Danka Blagojević, Ranka Stanković, Petar Stejić, Velizar Nikolić. "Srbija u OneGeology Europe" in Zapisnici Srpskog geološkog društva za 2013. godinu, Beograd : Srpsko geološko društvo (2014)
-
From post-disaster landslides inventory to open landslides data
Biljana Abolmasov, Miloš Marjanović, Uroš Đurić, Jelka Krušić. "From post-disaster landslides inventory to open landslides data" in Proceedings of 3rd European Regional Conference of IAEG/ Athens/ Greece/ 6-10 October 2021, International Association for Engineering Geology and the Environment (2021)