Претрага
93 items
-
Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface
Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stankovic, Christian Chiarcos (2024)Predstavljamo trenutne aktivnosti na definisanju interfejsa leksikona i korpusa koji će služiti kao referenca u prikazu polileksemskih jedinica - višečlanih izraza - (različitih tipova - imenskih, glagolskih, itd.) u specijalizovanim leksikonima i povezivanju ovih unosa sa njihovim pojavljivanjima u korpusima. Konačni cilj je korišćenje ovakvih resursa za automatsko identifikovanje višečlanih izraza u tekstu. Uključivanje nekoliko prirodnih jezika ima za cilj univerzalnost rešenja koje nije usredsređeno na određeni jezik, kao i prilagođavanje idiosinkrazijama. Raspravljaju se izazovi u leksikografskom opisu višerečnih ...Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stankovic, Christian Chiarcos. "Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy and the Lexicon-Corpus Interface" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
-
Sentiment Analysis of Serbian Old Novels
In this paper we present first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing with additional extensive corrections. The first phase of dataset refinement included filtering the word that are not found in Serbian morphological dictionary and in second automatic POS tagging and lemma were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ...Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
Novi koncept izrade Osnovne hidrogeološke karte Srbije
Igor Jemcov, Zoran Stevanović, Vladimir Živanović, Saša Milanović, Dušan Polomčić, Veselin Dragišić (2022)Osnovna hidrogeološka karta (OHGK) predstavlja bazični dokument u hidrogeologiji, a ima za cilj sagledavanje osnovnih tipova izdani što da omogućava sagledavanje podzemnih vodnih resursa na području obuhvaćenom kartom. Primena postojećeg Uputstva za izradu Osnovne hidrogeološke karte SFRJ 1:100.000 (iz 1984, odnosno 1988. godine), vezana je za brojne poteškoće, što je uslovilo da je u proteklom periodu od 30 godina bilo je više inicijativa za formiranjem novog Uputstva. Sagledavajući postojeću situaciju uz činjenice o savremenim trendovima razvoja hidrogeoloških karata u ...Igor Jemcov, Zoran Stevanović, Vladimir Živanović, Saša Milanović, Dušan Polomčić, Veselin Dragišić. "Novi koncept izrade Osnovne hidrogeološke karte Srbije" in Zbornik radova XVI srpskog Simpozijum o hidrogeologiji sa međunarodnim učešćem, Univerzitet u Beograd, Rudarsko-geološki fakultet (2022)
-
Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names
In this paper we present a rule- and lexicon-based system for the recognition of Named Entities (NE) in Serbian news paper texts that was used to prepare a gold standard annotated with personal names. It was further used to prepare training sets for four different levels of annota tion, which were further used to train two Named Entity Recognition (NER) sys tems: Stanford and spaCy. All obtained models, together with a rule- and lexicon based system were evaluated on ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Development and Evaluation of Three Named Entity Recognition Systems for Serbian - The Case of Personal Names" in Proceedings - Natural Language Processing in a Deep Learning World, Incoma Ltd., Shoumen, Bulgaria (2019). https://doi.org/10.26615/978-954-452-056-4_122
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
Using Lexical Resources for Irony and Sarcasm Classification
The paper presents a language dependent model for classification of statements into ironic and non-ironic. The model uses various language resources: morphological dictionaries, sentiment lexicon, lexicon of markers and a WordNet based ontology. This approach uses various features: antonymous pairs obtained using the reasoning rules over the Serbian WordNet ontology (R), antonymous pairs in which one member has positive sentiment polarity (PPR), polarity of positive sentiment words (PSP), ordered sequence of sentiment tags (OSA), Part-of-Speech tags of words (POS) ...Miljana Mladenović, Cvetana Krstev, Jelena Mitrović, Ranka Stanković. "Using Lexical Resources for Irony and Sarcasm Classification" in Proceedings of the 8th Balkan Conference in Informatics (BCI '17), New York, NY, USA, : ACM (2017). https://doi.org/
-
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...Cvetana Krstev, Ranka Stanković, Vitas Duško. "A Description of Morphological Features of Serbian: a Revision using Feature System Declaration" in Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2010, Valetta, Malta : European Language Resources Association (2010)
-
Strateško odlučivanje pri izboru novih rotornih bagera za površinske kopove lignita: primer rudarskog basena Kolubara
U ovom radu je dat primer izbora najpovoljnijih postojećih rotornih bagera u uslovima površinskih kopova RB Kolubara, zbog budućeg odlučivanja o optimalnom izboru bagera za rad na otkrivci, uglju i međuslojnoj jalovini. Korišćeni su parametri performansi specifičnog iskorišćenja mase i snage bagera, vremensko i kapacitetno iskorišćenje bagera, ali i specifični operativni troškovi rada rotornog bagera. Znajući da je ovo neka vrsta višekriterijumskog problema, iskorišćena je metoda TOPSIS za optimalno i tačno rangiranje rotornih bagera. Za analizu je korišćen period ...Predrag Jovančić, Stevan Đenadić, Goran Todorović, Dragan Novaković, Filip Miletić. "Strateško odlučivanje pri izboru novih rotornih bagera za površinske kopove lignita: primer rudarskog basena Kolubara" in XI simpozijum sa međunarodnim učešćem "Rudarstvo 2020", Vrnjačka Banja, Srbija, 8-11. septembar, Institut za tehnologiju nuklearnih i drugih mineralnih sirovina (Beograd), Privredna komora Srbije (Beograd) (2020)
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018)
-
Spatial and temporal variability of precipitation in Serbia for the period 1961-2020
Boško Milanović, Phillip Schuster, Milan Radovanović, Vesna Ristić Vakanjac, Shristoph Schneider (2017)У овом раду анализиране су месечне, сезонске и годишње суме падавина у Србији за период 1961–2010. Географска ширина, дужина и надморска висина 421 падавинске станице и карактеристике терена у њиховом блиском окружењу (нагиб и аспект терена у радијусу од 10 км око станице) коришћене су за развој регресионог модела на основу којег је прорачуната просторна расподела падавина. Приказан је просторни распоред годишњих, јунских (максималне вредности за скоро све станице) и фебруарских (минималне вредности за скоро све станице) падавина. Годишње ...Boško Milanović, Phillip Schuster, Milan Radovanović, Vesna Ristić Vakanjac, Shristoph Schneider. "Spatial and temporal variability of precipitation in Serbia for the period 1961-2020" in Theoretical and Applied Climatology (2017). https://doi.org/10.1007/s0070-017-2118-5
-
Fuzzy expert analysis of the severity of mining machinery failure
Mining machinery failure is almost an everyday occurrence. Usually the failures bare certain consequences, which require additional financial costs to repair and restore the system to its operational state. The consequences are viewed through negative and damaging effects a failure has on the machine, health and safety of the employees, work environment, and on the environment. The removal of the consequences of the failure requires additional financial investment, which has a negative impact on the company’s business. In order ...Dejan V. Petrović, Miloš Tanasijević, Saša Stojadinović, Jelena Ivaz, Pavle Stojković. "Fuzzy expert analysis of the severity of mining machinery failure" in Applied Soft Computing, Elsevier BV (2020). https://doi.org/10.1016/j.asoc.2020.106459
-
The state and perspective of the natural gas sector in Serbia
The strategy of long-term energy development of Serbia identifies an increase of share of natural gas in final energy consumption as one of the main aims. Serbia has signed a strategic agreement with the Russian Federation on cooperation in the oil and gas sector, within which the project South Stream pipeline is planned to be realized. In addition, the Republic of Serbia has signed the Treaty that establishes the Energy Community and accepted the obligation to implement the Energy ...Dejan Ivezić, Marija Živković, Dušan Danilović, Aleksandar Madžarević, Miloš Tanasijević. "The state and perspective of the natural gas sector in Serbia" in Energy Sources, Part B: Economics, Planning and Policy, Taylor & Francis Group, LLC (2016). https://doi.org/http://dx.doi.org/10.1080/15567249.2013.858796
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
A Data Driven Approach for Raw Material Terminology
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja (2021)The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has ...sirovine, rudarstvo, terminologija, rečnik, terminološka aplikacija, mobilna aplikacija, digitizacija, leksički podaci, korpusi, otvoreni povezani podaciOlivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja. "A Data Driven Approach for Raw Material Terminology" in Applied Sciences, MDPI AG (2021). https://doi.org/10.3390/app11072892
-
Data from the Digital Repository of the Faculty of Mining and Geology in eScience (eNauka)
Biljana Rujević, Mihailo Škorić (2024)The paper describes linking the Digital Repository of the University of Belgrade, Faculty of Mining and Geology, with the eScience system in terms of transferring metadata about the results of researchers' scientific work. The steps taken to ensure a smooth harvesting of metadata are outlined. Additionally, a presentation of additional improvements to the OAI system is provided, aiming to contribute to the automatic linking of authors with their results in the eScience system.Biljana Rujević, Mihailo Škorić. "Data from the Digital Repository of the Faculty of Mining and Geology in eScience (eNauka)" in Infotheca, Faculty of Philology, University of Belgrade (2024). https://doi.org/10.18485/infotheca.2023.23.2.4
-
Security of Supply as a Major Part of the Energy Security Puzzle
Sigurnost snadbevanja prirodnim gasom Republike Srbije se kroz poslednje dve decenije tretira kao hitno, strateško, političko i bezbednosno pitanje. U sektoru prirodnog gasa, Republika Srbija je veoma zavisna od gasa koji uvozi iz Rusije. Indikatori sigurnosti snadbevanja predstavljaju jedan od osnovnih elemenata za određivanje energetske bezbednosti i snažne alate za usmeravanje energetskog sektora ka održivom razvoju. Metodolaška analiza prikazana u radu je bila koncentrisana na pokazatelje sigurnosti snadbevanja u oblasti energetske bezbednosti koji se odnose na sektor prirodnog gasa ...energetska bezbednost, sigurnost snabdevanja, energetski indikator, dostupnost energije, diversifikacija izvora i pravacaAleksandar Madžarević, Miroslav Crnogorac. "Security of Supply as a Major Part of the Energy Security Puzzle" in Energija, ekonomija, ekologija, University Library in Kragujevac (2022). https://doi.org/10.46793/EEE22-4.28M
-
Combining Heterogeneous Lexical Resources
Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić. "Combining Heterogeneous Lexical Resources" in Proceedings of the Fourth Interantional Conference on Language Resources and Evaluation, Lisabon, Portugal , May 2004, vol. 4, ELRA - European Language Resources Association (2004)
-
Football terminology: compilation and transformation into OntoLex-Lemon resource
У овом раду представља се пројекат који је у развоју, креирање првог дигиталног фудбалског речника на српском језику, као и да демонстрација примене модела OntoLex и љегових модула. OntoLex-FrAC модул укључује информације о учесталости и примерима употребе екстрахованих из корпуса. У овом случају, креиран је корпус за специфичан домен под називом СрФудКо, који садржи чланке вести о фудбалу на српском језику. Вишечлани термини аутоматски су екстраховани из српског корпуса, а затим ручно евалуирани и класификовани као спортски или ...Jelena Lazarević, Ranka Stanković, Mihailo Škorić, Biljana Rujević. "Football terminology: compilation and transformation into OntoLex-Lemon resource" in LDK 2023 – 4th Conference on Language, Data and Knowledge, 12-15 September in Vienna, Austria, Lisabon : NOVA FCSH - CLUNL (2023). https://doi.org/10.34619/srmk-injj