From DELA Based Dictionary to Leximirka Lexical Database
This paper presents an approach to transforming Serbian morphological dictionaries from the DELA text format into a lexical database dubbed Leximirka. Given the benefits of storing data in a database rather than in text documents, we outline some of the functionality that the database has made possible. We also show how hand-made rules that use the category labels with which lexical entries are marked can be used to link lexical entries.
... ACCEPTED: 28 December 2019 Biljana Lazić biljana.lazic@rgf.bg.ac.rs Mihailo Škorić mihailo.skoric@rgf.bg.ac.rs University of Belgrade Faculty of Mining and Geology Belgrade, Serbia 1 Introduction Prof. Dr. Dusko Vitas and Prof. Dr. Cvetana Krstev started working on the development of Serbian ...
Stanković, Ranka, Cvetana Krstev, Biljana Lazić and Mihailo Škorić. "Electronic Dictionaries – from File System to lemon Based Lexical Database". In Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics
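As a rough illustration of the first step in the transformation described above, the sketch below parses one DELAF-style line of the general shape `form,lemma.POS+marker:features` into a record that could then be stored as a database row. The sample entry, the `+Hum` marker, the `ms2v` feature code and the `Entry` class are assumptions made for illustration; they are not taken from the Serbian dictionaries or from Leximirka's actual schema, and escaped commas or dots inside entries are not handled.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    """One inflected form with its lemma and category labels (hypothetical record)."""
    form: str
    lemma: str
    pos: str
    markers: list = field(default_factory=list)
    features: list = field(default_factory=list)


def parse_delaf_line(line: str) -> Entry:
    """Parse 'form,lemma.POS+marker1+marker2:feat1:feat2' into an Entry."""
    form, rest = line.strip().split(",", 1)
    lemma, codes = rest.split(".", 1)
    head, *features = codes.split(":")
    pos, *markers = head.split("+")
    # In DELA dictionaries an empty lemma means the lemma equals the form.
    return Entry(form, lemma or form, pos, markers, features)


# Hypothetical sample line, used only to show the shape of the result.
entry = parse_delaf_line("studenta,student.N+Hum:ms2v")
print(entry)
```

Keeping markers and features as separate lists makes it straightforward to write rules over category labels later, for example selecting all entries that share a given marker.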
Preparation of Multimedia Document “YU Rock Scene”
This study presents the preparation of a multimedia document entitled YU ROCK SCENE, in which the participants were senior undergraduate students of the Department of Library and Information Science at the University of Belgrade Faculty of Philology during the academic year 2014/2015, as part of the course Multimedia Documents. The study gives an overview of the historical development of rock and roll in the territory of the former Yugoslavia, the rock scene in the Yugoslav republics, ...
... Arsenijević, Milica Ninković and Milena Obradović), rest of Serbia (Aleksandra Kojić), Croatia (Jovana Došenović, Maja Ivančić and Marko Petrović), Bosnia and Herzegovina (Violeta Kolaković, Milica Perišić and Petar Popović), Slovenia (Mihailo Škorić), Macedonia and Montenegro (Aleksandar ...
... 7424@gmail.com Mihailo Škorić miccersoft@gmail.com University of Belgrade, Faculty of Philology 1 Historical Development of Rock and Roll in the Territory of Former Yugoslavia Rock and roll in the territory of former Yugoslavia is rooted in the 1920s when new instruments, such as saxophone and guitar ...
Milena Obradović, Aleksandra Arsenijević, Mihailo Škorić. "Preparation of Multimedia Document “YU Rock Scene”" in Infotheca - Journal for Digital Humanities, Faculty of Philology, University of Belgrade (2017). https://doi.org/10.18485/infotheca.2016.16.1_2.6
Using the Omeka Web Platform for Digital Libraries in the Mining Domain
This paper presents Omeka, a web platform for displaying digital collections and a system for managing their content. Its application in the technical sciences, and specifically in mining, is illustrated with the digital library ROmeka@RGF. We chose Omeka primarily because it is simple to use, has extensive supporting documentation, and does not require narrowly specialized IT skills, which makes it accessible to most users, and especially to mining engineers, for whom this digital library is primarily intended. Documents ...
... Александра Томашевић aleksandra.tomasevic@rgf.bg.ac.rs Биљана Лазић biljana.lazic@rgf.bg.ac.rs Далибор Воркапић dalibor.vorkapic@rgf.bg.ac.rs Михаило Шкорић mihailo.skoric@rgf.bg.ac.rs Љиљана Колоња ljiljana.kolonja@rgf.bg.ac.rs University of Belgrade, Faculty of Mining and Geology 1. Introduction For the purposes of ...
... Tomašević and Bojan Zlatić. “Terminological and Lexical Resources Used to Provide Open Multilingual Educational Resources”. Belgrade, Serbia, 2016. http://www.baektel.eu/documents/conferences/eLearning_2016_BL_DS_AT_BZ.pdf Stanković, Ranka, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac and Miloš ...
Александра Томашевић, Биљана Лазић, Далибор Воркапић, Михаило Шкорић, Љиљана Колоња. "Употреба веб платформе Омека за дигиталне библиотеке из домена рударства" in Инфотека, Филолошки факултет, Универзитет у Београду; Универзитетска библиотека „Светозар Марковић“; Заједница библиотека универзитета у Србији (2017)
Towards Automatic Definition Extraction for Serbian
This paper presents preliminary results of the automatic extraction of candidates for dictionary definitions from unstructured texts in Serbian, with the aim of speeding up dictionary development. Definitions in the dictionary of the Serbian Academy of Sciences and Arts (SANU) were used to model different types of definitions (descriptive, grammatical, referential and synonym-based), which have different syntactic and lexical characteristics. The research corpus consists of 61,213 noun definitions, which were analysed using morphological e-dictionaries and local grammars implemented as finite-state transducers in the corpus processing suite of open ...
... Stanković, R., Šandrih, B., Krstev, C., Utvić M. & Škorić M. (2020). Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian. In: Proceedings of the 12th International Conference on Language Resources and Evaluation, LREC eds. Nicoletta Calzolari et al., ...
... Mining and Geology archives faculty publications available in open access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Towards Automatic Definition Extraction for Serbian Stanković Ranka1, Krstev Cvetana1, Stijović Rada2, Gočanin Mirjana2, Škorić Mihailo1 ...
Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Association for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
Language Models, What Are They?
Михаило Шкорић (2023)
Михаило Шкорић. "Језички модели, шта је то?" in Језик данас, Нови Сад : Матица српска (2023)
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment
... Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian | Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić | Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France | 2020 | | http://dr.rgf.bg.ac.rs/s/repo ...
... (ELRA), licensed under CC-BY-NC 3954 Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić {Faculty of Mining and Geology, Faculty of Philology} University of Belgrade ...
Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May 2020, Marseille, France, European Language Resources Association (2020)
Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons
The goal of this paper is to draw attention to the possibility of using emoticon-riddled text on the web in language-neutral sentiment analysis. It introduces several innovations in the existing framework of research and tests their effectiveness. It also presents a software tool made especially for that purpose, explains how it builds a database of sentiment values of terms, and provides a user manual. Finally, it presents a software tool that tests the new database and gives some examples
... results. KEYWORDS: data mining, information extraction, emotions, text on the web. PAPER SUBMITTED: 24 January 2017 PAPER ACCEPTED: 25 March 2017 Mihailo Škorić miks@tesla.rcub.bg.ac.rs University of Belgrade 1 Introduction When creating natural language understanding software, there are two widely ...
... will be searched for and the values that they stand for. Determiners and values can be set one by one, or this step can be skipped. In case this step is skipped, the software will automatically load the default determiners and their values (Table 1). All default determiners and their values listed in ...
Mihailo Škorić. "Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons" in Infotheca, Faculty of Philology, University of Belgrade (2017). https://doi.org/10.18485/infotheca.2017.17.1.4
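The idea of deriving term polarity from emoticons can be sketched as follows. This is a minimal illustration, not the paper's tool: the determiners and their values in `DEFAULT_DETERMINERS`, the whitespace tokenization and the averaging scheme are all assumptions, since the default table (Table 1) and the actual scoring procedure are not reproduced in this listing.

```python
from collections import defaultdict

# Hypothetical default determiners (emoticons) and their values.
DEFAULT_DETERMINERS = {":)": 1.0, ":D": 1.0, ":(": -1.0, ":'(": -1.0}


def build_term_scores(messages, determiners=DEFAULT_DETERMINERS):
    """Give each term the mean emoticon value of the messages it occurs in."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for message in messages:
        tokens = message.split()
        values = [determiners[t] for t in tokens if t in determiners]
        if not values:
            continue  # no emoticon, so the message carries no signal
        message_value = sum(values) / len(values)
        # Count each distinct term once per message.
        for term in {t.lower() for t in tokens if t not in determiners}:
            totals[term] += message_value
            counts[term] += 1
    return {term: totals[term] / counts[term] for term in totals}


scores = build_term_scores([
    "great concert :)",
    "great weather :D",
    "terrible traffic :(",
])
print(scores["great"], scores["terrible"])
```

Because the only language-specific step is splitting on whitespace, the same sketch applies to any language whose words are space-separated, which is in the spirit of the language-neutral approach the abstract describes.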
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model
This study presents a sentiment analysis of old Serbian novels from the period 1840-1920, using the large language model (LLM) Mistral with zero-shot and few-shot learning techniques. The main approach innovates by designing prompts that combine the text with classification instructions, either without examples or with a few examples, enabling the language model to classify sentiment into positive, negative or objective categories. This methodology aims to simplify sentiment analysis by constraining the responses, thereby increasing precision ...
Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024), BAS (2024)
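The difference between the zero-shot and few-shot setups, and the effect of constraining the answer to a fixed label set, can be sketched as below. The prompt wording, the function names and the example sentences are assumptions for illustration; the study's actual prompts are not given in this listing, and no model is called here.

```python
LABELS = ("positive", "negative", "objective")


def build_prompt(sentence, examples=()):
    """Return a zero-shot prompt, or a few-shot one if examples are given."""
    lines = [
        "Classify the sentiment of the sentence.",
        "Answer with exactly one word: " + ", ".join(LABELS) + ".",
    ]
    for text, label in examples:
        if label not in LABELS:
            raise ValueError(f"unknown label: {label}")
        lines.append(f"Sentence: {text}\nSentiment: {label}")
    # The final block is left open for the model to complete.
    lines.append(f"Sentence: {sentence}\nSentiment:")
    return "\n\n".join(lines)


def parse_label(response):
    """Map a raw model reply to one of LABELS, or None if it does not match."""
    word = response.strip().lower().rstrip(".")
    return word if word in LABELS else None


zero_shot = build_prompt("The house stood at the end of the road.")
few_shot = build_prompt(
    "The house stood at the end of the road.",
    examples=[("What a joyful day!", "positive"), ("He wept bitterly.", "negative")],
)
print(few_shot)
```

Restricting the reply to one of three words keeps parsing trivial: anything that `parse_label` cannot map to a label can be logged and retried rather than guessed at.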
Electronic Dictionaries - from File System to lemon Based Lexical Database
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same
... rs Electronic Dictionaries – from File System to lemon Based Lexical Database Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić {Faculty of Mining and Geology, Faculty of Philology } University of Belgrade {Djušina 7, Studentski trg 3} Belgrade, Serbia {ranka.stankovic, biljana ...
... variants (for instance, istorija and historija ‘history’), full forms and their abbreviation (e.g. kilogram and kg), derivationally related lexical entries (e.g. istorija and istorijski ‘relating to the study of history’), and different pronunciations (Ekavian dete and Ijekavian dijete ‘child’). ...
Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
The Many Faces of SrpKor
The acronym SrpKor denotes a family of electronic corpora of contemporary Serbian whose construction began in the late 1970s and which became more widely visible to the interested research community when its first version was published on the web in 2002. During this long period, especially before useful textual resources appeared on the web, corpus development consisted of collecting and processing material as well as developing corpus processing methods. An electronic corpus is not merely a collection of texts in digital form (as is stated, for example, ...
Duško Vitas, Ranka Stanković, Cvetana Krstev. "The Many Faces of SrpKor" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)
A Data Driven Approach for Raw Material Terminology
The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja (2021)
... Data Driven Approach for Raw Material Terminology Olivera Kitanović 1,*,† , Ranka Stanković 1,† , Aleksandra Tomašević 1,† , Mihailo Škorić 1,† , Ivan Babić 2,† and Ljiljana Kolonja 1,† Citation: Kitanović, O; Stanković, R.; Tomašević, A.; Škorić, M.; Babić, I.; Kolonja ...
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja. "A Data Driven Approach for Raw Material Terminology" in Applied Sciences, MDPI AG (2021). https://doi.org/10.3390/app11072892
Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit
In the digital environment of South Slavic languages, emotion analysis of social media texts is becoming increasingly important for understanding public opinion, creating personalized content, and analysing interactions among users. In this paper we present a detailed methodology and the results of annotating a Serbian-language corpus according to Plutchik's categorization model, which recognizes eight basic emotional categories: joy, sadness, anger, fear, trust, disgust, anticipation and surprise. The aim of the research is to analyse the emotional content of texts taken from the social networks X (formerly Twitter) ...
Milena Šošić, Ranka Stanković, Jelena Graovac. "Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024., University of Belgrade - Faculty of Philology (2024)
OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian
This paper presents a new language resource for searching and exploring verbal aspectual pairs in BCS (Bosnian, Croatian and Serbian), created using the principles of Linguistic Linked Open Data (LLOD). Since there is no resource that helps learners of Bosnian, Croatian and Serbian as foreign languages to recognize the aspect of a verb or its aspectual pairs, we created a new resource that provides users with information about aspect as well as links to the verb's aspectual pairs. The resource also contains external links to monolingual dictionaries, Wordnet and BabelNet.
Ranka Stanković, Maxim Ionov, Medina Bajtarević, Lorena Ninčević. "OntoLex Publication Made Easy: A Dataset of Verbal Aspectual Pairs for Bosnian, Croatian and Serbian" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School
The first training school organized by the COST Action NexusLinguarum was held on February 8-12, 2021, with the aim of teaching students, researchers and practitioners the basics of linguistic data science. During the training, participants were introduced to a wide range of topics: from the Semantic Web, RDF and ontologies to modelling and querying linguistic data with state-of-the-art ontology models and tools. The school was held as part of the EUROLAN series of summer schools and was organized virtually (online) by several institutes; ...
... linguistics and natural language processing (NLP). The goal of this 15th EUROLAN School was to bring together scholars, teachers and students of linguistics, NLP and information technology to discuss the principles and best practices for representing, publishing and linking linguistic data and the issues ...
... on February 8-12, 2021 and was aimed at students, academics, and practitioners wishing to learn the basics of Linguistic Data Science. During the training school, the participants were introduced to a wide range of topics: from Semantic Web, RDF and ontologies, to modeling and querying linguistic ...
Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković. "EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.7
New paleoecological perspectives on Late Pleistocene Neanderthals in northern Balkans: the rodent assemblages from Smolućka cave (Serbia)
During the Late Pleistocene, the Balkans came to be an important region with many isolated areas, enabling fauna, alongside Neanderthals, to thrive in the area. This work is focused on paleoenvironmental and paleoclimatic changes that occurred in the northern Balkan Peninsula, with a special focus on the fossil record from Smolućka cave dating from MIS 5 to MIS 3. Based on available data, an attempt has been made to establish a synthetic chronological context for the faunal assemblages recovered from
Mihailo Jovanović, Katarina Bogićević, Draženko Nenadić, Jordi Agustí, Christian Sánchez Bandera, Juan Manuel López García, Hugues Alexandre Blain. "New paleoecological perspectives on Late Pleistocene Neanderthals in northern Balkans: the rodent assemblages from Smolućka cave (Serbia)" in Archaeological and Anthropological Sciences (2022)
Assessment of the Quality and Usability of Groundwater for Drinking and Irrigation in the Ralja River Basin
The possibility of using groundwater for public water supply and for irrigation was analysed in the Ralja River basin. The study area of about 280 km2 covered the larger part of the Ralja River basin that administratively belongs to the City of Belgrade. Groundwater is the main source for water supply and irrigation in this area. Rural settlements lack municipal infrastructure and are therefore a significant factor in the degradation of groundwater quality in this area. In the period 2012-2014, 100 samples were collected ...
Sunčica Ninković, Nebojša Atanacković, Sava Magazinović, Jakov Andrijašević, Mihailo Šević. "Ocena kvaliteta i mogućnost korišćenja podzemnih voda za piće i navodnjavanje u slivu reke Ralje" in XV Srpski simpozijum o hidrogeologiji sa međunarodnim učešćem, Kopaonik, 14-17.septembar 2016. godine, Univerzitet u Beogradu- Rudarsko-geološki fakultet (2016)
The analysis of the geothermal energy capacity for power generation in Serbia
Jana Stojković, Goran Marinković, Petar Papić, Mihailo Milivojević, Maja Todorović, Marina Ćuk (2013)
... POWER GENERATION IN SERBIA by Jana S. STOJKOVIĆ a*, Goran H. MARINKOVIĆ b, Petar J. PAPIĆ a, Mihailo G. MILIVOJEVIĆ a, Maja M. TODOROVIĆ a, and Marina D. ĆUK a a University of Belgrade, Faculty of Mining and Geology, Belgrade, Serbia b Geological Survey of Serbia, Belgrade, Serbia Original scientific ...
... Petar Papić, Mihailo Milivojević, Maja Todorović, Marina Ćuk Digital Repository of the Faculty of Mining and Geology, University of Belgrade [ДР РГФ] The analysis of the geothermal energy capacity for power generation in Serbia | Jana Stojković, Goran Marinković, Petar Papić, Mihailo Milivojević, ...
... research and utilization of this geothermal resource. Geothermometers are based on the temperature relation of some chemical reactions or the solubility of some minerals. Researchers mostly use silicon-based (quartz, chalcedony, amorphous silica) and cation-based (Na-K, Na-K-Ca, Na-K-Mg, and so forth) ...
Jana Stojković, Goran Marinković, Petar Papić, Mihailo Milivojević, Maja Todorović, Marina Ćuk. "The analysis of the geothermal energy capacity for power generation in Serbia" in Thermal Science, National Library of Serbia (2013). https://doi.org/10.2298/TSCI120215033S
Hydrogeothermal Resources as a Factor in the Development of Serbia
Milenić Dejan, Milivojević Mihailo, Krunić Olivera, Vranješ Ana. "Hidrogeotermalni resursi kao faktor razvoja Srbije" in Srpska akademija nauka i umetnosti-Odbor za selo, Lukovska banja, Srbija (2014)
Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities
This paper presents the activities on the development of the ELEXIS-sr corpus, the Serbian addition to the ELEXIS multilingual annotated corpus, which consists of semantic annotations and a word-sense repository. ELEXIS is a parallel multilingual annotated corpus in ten European languages, which can be used as a multilingual benchmark for evaluating low- and medium-resourced European languages. The focus of this paper is on multiword expressions and named entities, their recognition in the ELEXIS-sr sentence set, and the comparison with annotations in other languages. The paper discusses the first steps ...
Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov. "Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
Vebran Web Services for Corpus Query Expansion
This paper discusses the development of the Vebran web services and their application in improving corpus search. The Vebran web services are used to consult external lexical resources for Serbian (mainly electronic morphological dictionaries and the Serbian WordNet) and to expand user queries in order to obtain more relevant results from Serbian corpora.
Ranka Stanković, Miloš Utvić (2020)
... Language Resources and Evaluation (LREC),(Istanbul, Turkey, 2012, 1710–1717 Stanković, Ranka. “Modeli ekspanzije upita nad tekstuelnim resursima”. PhD thesis, Univerzitet u Beogradu, Matematički fakultet, Beograd, 2009 Stanković, Ranka, Cvetana Krstev, Biljana Lazić and Mihailo Škorić. “Electronic ...
... instanced with a part of the text or with any characters; 3) Use regular expressions and graphs of automata and transducers for searching and extraction; and 4) Build cascades of rules. Corpora SrpKor2013 (cf. 2.1) and RudKor (cf. 2.2) can be searched by OCWB, while a search of RudKor is also available through ...
Ranka Stanković, Miloš Utvić. "Vebran Web Services for Corpus Query Expansion" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.5
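Query expansion with a morphological dictionary can be illustrated with a small sketch that turns a lemma into a CQP-style token query matching all of its known inflected forms. The `FORMS` table, the function name and the choice of the `word` attribute are assumptions for illustration and do not reflect the actual Vebran interface; Python's `re.escape` is used as a stand-in for whatever escaping the corpus engine requires.

```python
import re

# Hypothetical excerpt of a morphological dictionary: lemma -> inflected forms.
FORMS = {
    "rudnik": ["rudnik", "rudnika", "rudniku", "rudnikom", "rudnici"],
}


def expand_query(lemma, forms=FORMS):
    """Build a CQP-style token query matching any known form of the lemma."""
    # Fall back to the lemma itself when the dictionary has no entry for it.
    variants = sorted(set(forms.get(lemma, [lemma])))
    alternatives = "|".join(re.escape(v) for v in variants)
    return f'[word="{alternatives}"]'


print(expand_query("rudnik"))
```

Searching for all inflected forms at once is what lets a user type a single lemma and still retrieve occurrences in every case and number, which matters for a highly inflected language such as Serbian.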