Претрага
92 items
-
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian
The training of new tagger models for Serbian is primarily motivated by the enhancement of the existing tagset with the grammatical category of a gender. The harmonization of resources that were manually annotated within different projects over a long period of time was an important task, enabled by the development of tools that support partial automation. The supporting tools take into account different taggers and tagsets. This paper focuses on TreeTagger and spaCy taggers, and the annotation schema alignment ...Ranka Stanković, Branislava Šandrih, Cvetana Krstev, Miloš Utvić, Mihailo Škorić. "Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian" in Proceedings of the 12th Language Resources and Evaluation Conference, May Year: 2020, Marseille, France, European Language Resources Association (2020)
-
Parallel Bidirectionally Pretrained Taggers as Feature Generators
In a setting where multiple automatic annotation approaches coexist and advance separately but none completely solve a specific problem, the key might be in their combination and integration. This paper outlines a scalable architecture for Part-of-Speech tagging using multiple standalone annotation systems as feature generators for a stacked classifier. It also explores automatic resource expansion via dataset augmentation and bidirectional training in order to increase the number of taggers and to maximize the impact of the composite system, which ...Ranka Stanković, Mihailo Škorić, Branislava Šandrih Todorović. "Parallel Bidirectionally Pretrained Taggers as Feature Generators" in Applied Sciences, MDPI AG (2022). https://doi.org/10.3390/app12105028
-
The Effects of Multi-Word Tagging on Text Disambiguation
Utvić Miloš, Obradović Ivan, Krstev Cvetana, Vitas Duško. "The Effects of Multi-Word Tagging on Text Disambiguation" in Proceedings of the 29th International Conference on Lexis and Grammar, LGC 2010, September 2010, Belgrade, Serbia, D. Vitas and C. Krstev (eds.), Belgrade:Faculty of Mathematics, University of Belgrade (2010): 333-342
-
Нове технологије за оживљавање старих текстова
удаљено читање, књижевни корпус, обрада српског језика, анотација врстом речи, лематизација, именовани ентитетиЦветана Крстев, Ранка Станковић, Бранислава Шандрих Тодоровић, Милица Иконић Нешић. "Нове технологије за оживљавање старих текстова" in Зборник радова Међународне научне конференције Дигитална хуманистика и словенско културно наслеђе II, Београд, 28-29 јуни 2021., Београд : Савез славистичких друштава Србије (2023)
-
Part of Speech Tagging for Serbian language using Natural Language Toolkit
Ranka Stanković, Boro Milovanović (2020)Dok se razvijaju složeni algoritmi za NLP (obrada prirodnog jezika), osnovni zadaci kao što je označavanje ostaju veoma važni i još uvek izazovni. NLTK (Natural Language Toolkit) je moćna Python biblioteka za razvoj programa zasnovanih na NLP-u. Pokušavamo da iskoristimo ovu biblioteku za kreiranje PoS (vrsta reči) oznake za savremeni srpski jezik. Jedanaest različitih modela je kreirano korišćenjem NLTK API-ja za označavanje. Najbolji modeli se transformišu sa Brill tagerom da bi se poboljšala tačnost. Obučili smo modele na označenom ...Ranka Stanković, Boro Milovanović. "Part of Speech Tagging for Serbian language using Natural Language Toolkit" in 7th International Conference on Electrical, Electronic and Computing Engineering IcETRAN 2020, Academic Mind, Belgrade (2020)
-
Annotation of the Serbian ELTeC Collection
Ovaj rad predstavlja takozvano izdanje nivoa 2 kolekcije tekstova SrpELTeC razvijene u okviru aktivnosti Radne grupe 2 – Metode i alati COST akcije CA 16204 (Distant Reading for European Literary History) i njene specifikacije šeme. Izdanje nivoa 2 je nastavak izdanja nivoa 1, koje se koristi kao ulaz za morfosintaksičke i NER anotacije romana. Srpska obrada nivoa-2 je navedena kroz potrebne korake, uključujući metode i alate koji se koriste u tom procesu. Neki statistički podaci iz srpske kolekcije nivoa ...udaljeno čitanje, literarni korpus, tagiranje, prepoznavanje imenovanih entiteta, lematizacija, ELTeCRanka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Mihailo Škorić. "Annotation of the Serbian ELTeC Collection" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.3
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
A Model for Determining the Dependability of Continuous Subsystems in Coal Mines Using the Fuzzy Logic Approach
Nikola Stanić, Miljan Gomilanović, Petar Marković, Daniel Krzanović, Aleksandar Doderović, Saša Stepanović (2024)This study presents a unique model for assessing the dependability of continuous parts of combined systems in open-pit mining through the application of fuzzy logic. Continuous sub-systems as part of the combined system of coal exploitation in surface mines have the basic function of ensuring safe operation, high capacity with high reliability, and low costs. These subsystems are usually part of the thermal power plant’s coal supply system and ensure stable fuel supply. The model integrates various independent partial ...fuzzy logic, max-min composition, continuous part of combined system (CCS), open pit, mining, dependabilityNikola Stanić, Miljan Gomilanović, Petar Marković, Daniel Krzanović, Aleksandar Doderović, Saša Stepanović. "A Model for Determining the Dependability of Continuous Subsystems in Coal Mines Using the Fuzzy Logic Approach" in Applied Sciences, Basel, August 2024, MDPI (2024). https://doi.org/https://doi.org/10.3390/app14177947
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022)In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ...Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
BERT Downstream Task Analysis: Named Entity Recognition in Serbian
This paper compares different architectures and techniques for preparing named entity recognition (NER) models for the Serbian language via integrating BERT with spaCy. Models were trained to recognize seven different named entity types (persons, locations, organisations, professions, events, demonyms, and artworks), and are trained on the dataset containing Serbian novels published between 1840 and 1920, publicly available newspaper articles and sentences generated from the Wikidata knowledge base and Leximirka lexical database. We explore various configurations and several training pipelines ...Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković. "BERT Downstream Task Analysis: Named Entity Recognition in Serbian" in Lecture Notes in Networks and Systems, Springer Nature Switzerland (2024). https://doi.org/10.1007/978-3-031-71419-1_29
-
Sentiment Analysis of Serbian Old Novels
In this paper we present first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing with additional extensive corrections. The first phase of dataset refinement included filtering the word that are not found in Serbian morphological dictionary and in second automatic POS tagging and lemma were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ...Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
-
Transformer-Based Composite Language Models for Text Evaluation and Classification
Parallel natural language processing systems were previously successfully tested on the tasks of part-of-speech tagging and authorship attribution through mini-language modeling, for which they achieved significantly better results than independent methods in the cases of seven European languages. The aim of this paper is to present the advantages of using composite language models in the processing and evaluation of texts written in arbitrary highly inflective and morphology-rich natural language, particularly Serbian. A perplexity-based dataset, the main asset for the ...Mihailo Škorić, Miloš Utvić, Ranka Stanković. "Transformer-Based Composite Language Models for Text Evaluation and Classification" in Mathematics, MDPI AG (2023). https://doi.org/10.3390/math11224660
-
Spatial and temporal variability of precipitation in Serbia for the period 1961-2020
Boško Milanović, Phillip Schuster, Milan Radovanović, Vesna Ristić Vakanjac, Shristoph Schneider (2017)У овом раду анализиране су месечне, сезонске и годишње суме падавина у Србији за период 1961–2010. Географска ширина, дужина и надморска висина 421 падавинске станице и карактеристике терена у њиховом блиском окружењу (нагиб и аспект терена у радијусу од 10 км око станице) коришћене су за развој регресионог модела на основу којег је прорачуната просторна расподела падавина. Приказан је просторни распоред годишњих, јунских (максималне вредности за скоро све станице) и фебруарских (минималне вредности за скоро све станице) падавина. Годишње ...Boško Milanović, Phillip Schuster, Milan Radovanović, Vesna Ristić Vakanjac, Shristoph Schneider. "Spatial and temporal variability of precipitation in Serbia for the period 1961-2020" in Theoretical and Applied Climatology (2017). https://doi.org/10.1007/s0070-017-2118-5
-
Fuzzy expert analysis of the severity of mining machinery failure
Mining machinery failure is almost an everyday occurrence. Usually the failures bare certain consequences, which require additional financial costs to repair and restore the system to its operational state. The consequences are viewed through negative and damaging effects a failure has on the machine, health and safety of the employees, work environment, and on the environment. The removal of the consequences of the failure requires additional financial investment, which has a negative impact on the company’s business. In order ...Dejan V. Petrović, Miloš Tanasijević, Saša Stojadinović, Jelena Ivaz, Pavle Stojković. "Fuzzy expert analysis of the severity of mining machinery failure" in Applied Soft Computing, Elsevier BV (2020). https://doi.org/10.1016/j.asoc.2020.106459
-
The state and perspective of the natural gas sector in Serbia
The strategy of long-term energy development of Serbia identifies an increase of share of natural gas in final energy consumption as one of the main aims. Serbia has signed a strategic agreement with the Russian Federation on cooperation in the oil and gas sector, within which the project South Stream pipeline is planned to be realized. In addition, the Republic of Serbia has signed the Treaty that establishes the Energy Community and accepted the obligation to implement the Energy ...Dejan Ivezić, Marija Živković, Dušan Danilović, Aleksandar Madžarević, Miloš Tanasijević. "The state and perspective of the natural gas sector in Serbia" in Energy Sources, Part B: Economics, Planning and Policy, Taylor & Francis Group, LLC (2016). https://doi.org/http://dx.doi.org/10.1080/15567249.2013.858796
-
Data from the Digital Repository of the Faculty of Mining and Geology in eScience (eNauka)
Biljana Rujević, Mihailo Škorić (2024)The paper describes linking the Digital Repository of the University of Belgrade, Faculty of Mining and Geology, with the eScience system in terms of transferring metadata about the results of researchers' scientific work. The steps taken to ensure a smooth harvesting of metadata are outlined. Additionally, a presentation of additional improvements to the OAI system is provided, aiming to contribute to the automatic linking of authors with their results in the eScience system.Biljana Rujević, Mihailo Škorić. "Data from the Digital Repository of the Faculty of Mining and Geology in eScience (eNauka)" in Infotheca, Faculty of Philology, University of Belgrade (2024). https://doi.org/10.18485/infotheca.2023.23.2.4
-
Security of Supply as a Major Part of the Energy Security Puzzle
Sigurnost snadbevanja prirodnim gasom Republike Srbije se kroz poslednje dve decenije tretira kao hitno, strateško, političko i bezbednosno pitanje. U sektoru prirodnog gasa, Republika Srbija je veoma zavisna od gasa koji uvozi iz Rusije. Indikatori sigurnosti snadbevanja predstavljaju jedan od osnovnih elemenata za određivanje energetske bezbednosti i snažne alate za usmeravanje energetskog sektora ka održivom razvoju. Metodolaška analiza prikazana u radu je bila koncentrisana na pokazatelje sigurnosti snadbevanja u oblasti energetske bezbednosti koji se odnose na sektor prirodnog gasa ...energetska bezbednost, sigurnost snabdevanja, energetski indikator, dostupnost energije, diversifikacija izvora i pravacaAleksandar Madžarević, Miroslav Crnogorac. "Security of Supply as a Major Part of the Energy Security Puzzle" in Energija, ekonomija, ekologija, University Library in Kragujevac (2022). https://doi.org/10.46793/EEE22-4.28M
-
Combining Heterogeneous Lexical Resources
Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić. "Combining Heterogeneous Lexical Resources" in Proceedings of the Fourth Interantional Conference on Language Resources and Evaluation, Lisabon, Portugal , May 2004, vol. 4, ELRA - European Language Resources Association (2004)
-
Споменица 1991. – 2015. година: 135 година геологије и 70 година рударства на Универзитету у Београду
главни и одговорни уредник Душан Поломчић. Споменица 1991. – 2015. година: 135 година геологије и 70 година рударства на Универзитету у Београду, Београд : Универзитет у Београду, Рударско-геолошки факултет, 2016