Претрага ⚒ Радови ⚒ Др РГФ - Репозиторијум РГФ

Претрага

Per page

Sort by

216 items

Веб-алат за управљање грађом Речника САНУ и анотација листића

Рада Стијовић, Ранка Станковић, Михаило Шкорић (2020)

Грађа на основу које се израђује Речник српскохрватског књижевног и народног језика САНУ, а која садржи материјал из преко 4.500 писаних извора и 300 рукописних збирки речи са подручја народних говора штокавског наречја, забележена је на око 5.000.000 листића. Богат лексички материјал, који обухвата књижевни и народни језик у протекла два века и на основу кога треба да се напише још најмање 15 томова Речника, пружа могућност и за разноврсна лингвистичка и ванлингвистичка истраживања. Из тог разлога се приступило ...

лексикографска грађа, листићи, лексикографски алат, дигитализација, анотација

Рада Стијовић, Ранка Станковић, Михаило Шкорић. "Веб-алат за управљање грађом Речника САНУ и анотација листића" in Rasprave Instituta za hrvatski jezik i jezikoslovlje, Institute of Croatian Language and Linguistics (2020). https://doi.org/10.31724/rihjj.46.2.32
CliRtheRoads: An Integrated Approach to Landslide Risk Management on Roads in Serbia

Biljana Abolmasov, Ranka Stanković, Miloš Marjanović, Nikola Vulović, Uroš Đurić (2023)

baza podataka o klizištima, mobilna aplikacija, upravljanje putevima, klimatske promene

Biljana Abolmasov, Ranka Stanković, Miloš Marjanović, Nikola Vulović, Uroš Đurić . "CliRtheRoads: An Integrated Approach to Landslide Risk Management on Roads in Serbia" in Progress in Landslide Research and Technology, Springer Cham (2023). https://doi.org/https://doi.org/10.1007/978-3-031-44296-4_23
BERT Downstream Task Analysis: Named Entity Recognition in Serbian

Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković (2024)

This paper compares different architectures and techniques for preparing named entity recognition (NER) models for the Serbian language via integrating BERT with spaCy. Models were trained to recognize seven different named entity types (persons, locations, organisations, professions, events, demonyms, and artworks), and are trained on the dataset containing Serbian novels published between 1840 and 1920, publicly available newspaper articles and sentences generated from the Wikidata knowledge base and Leximirka lexical database. We explore various configurations and several training pipelines ...

Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković. "BERT Downstream Task Analysis: Named Entity Recognition in Serbian" in Lecture Notes in Networks and Systems, Springer Nature Switzerland (2024). https://doi.org/10.1007/978-3-031-71419-1_29
Application of pulsed flash thermography method for specific defect estimation in aluminum

Tomić Ljubiša D, Jovanović Dalibor B, Karkalić Radovan M, Damnjanović Vesna, Kovačević Branko V, Filipović Dalibor D, Radaković Sonja S. (2015)

Tomić Ljubiša D, Jovanović Dalibor B, Karkalić Radovan M, Damnjanović Vesna, Kovačević Branko V, Filipović Dalibor D, Radaković Sonja S.. "Application of pulsed flash thermography method for specific defect estimation in aluminum" in Thermal Science 19 no. 5, Belgrade:Vinca Institute of Nuclear Sciences (2015): 1845-1854
Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela

Savković Željko, Unković Nikola, Stupar Miloš, Franković Maja, Jovanović Milena, Erić Suzana, Šarić Kristina, Stanković Slaviša, Dimkić Ivica, Vukojević Jelena, Ljaljević Grbić Milica (2016)

Savković Željko, Unković Nikola, Stupar Miloš, Franković Maja, Jovanović Milena, Erić Suzana, Šarić Kristina, Stanković Slaviša, Dimkić Ivica, Vukojević Jelena, Ljaljević Grbić Milica. "Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela" in International Biodeterioration & Biodegradation no. 115, Amsterdam, Netherlands :Elsevier (2016): 212-223. https://doi.org/http://dx.doi.org/10.1016/j.ib
An Approach to Efficient Processing of Multi-Word Units

Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas (2013)

Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...

Natural Language Processing, Grammatical Category, Lexical Representation, MWU, multi-word unit

Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
Indexing of textual databases based on lexical resources: A case study for Serbian

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović (2015)

In this paper we describe an approach to improvement of information retrieval results for large textual databases by pre-indexing documents using bag-of-words and Named Entity Recognition. The approach was applied on a database of geological projects financed by the Republic of Serbia in the last half century. Each document within this database is described by metadata, consisting of several fields such as title, domain, keywords, abstract, geographical location and the like. A bag of words was produced from these ...

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Indexing of textual databases based on lexical resources: A case study for Serbian" in Semantic Keyword-based Search on Structured Data Sources : First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers, Springer (2015). https://doi.org/10.1007/978-3-319-27932-9_15
A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić (2012)

This paper outlines the main features of Bibliša, a tool that offers various possibilities of enhancing queries submitted to large collections of TMX documents generated from aligned parallel articles residing in multilingual digital libraries of e-journals. The queries initiated by a simple or multiword keyword, in Serbian or English, can be expanded by Bibliša, both semantically and morphologically, using different supporting monolingual and multilingual resources, such as wordnets and electronic dictionaries. The tool operates within a complex system composed ...

multilingual digital libraries, query expansion, TMX

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac, Miloš Utvić. "A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals" in Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, May 2012, Istanbul, Turkey, Istanbul, Turkey : European Language Resources Association (2012)
Electronic Dictionaries - from File System to lemon Based Lexical Database

Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić (2018)

In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...

Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
Using English Baits to Catch Serbian Multi-Word Terminology

Cvetana Krstev, Branislava Šandrih, Ranka Stanković (2018)

In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...

aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inﬂection

Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
Развој геолошког информационог система Републике Србије

Бранислав Благојевић, Бранислав Тривић, Ненад Бањац, Ранка Станковић, Велизар Николић (2005)

Геолошки информациони систем Србије (ГеолИСС) је пројектован, првенствено, са намером ефикасног дигиталног архивирања геолошких и њима сродних података. У овом раду је приказана структура базе података као основа за развој геолошки конципираног ГИС-а. Нови, објектно орјентисани (О-О) начин моделирања омогућио је дефинисање самосталних типова објеката, хијерархијски повезаних кроз тополошке и друге релације, чиме је обезбеђена њихова медјусобна интеракција. Објектно оријентисано моделирање извршено је коришћењем унифицираног језика моделирања (UML) и CASE алата, кроз концептуални и логички ниво. Физички модел ће ...

ГеолИСС, геолошки подаци, управљање базама података, концептуални модел, логички модел, ГИС

Бранислав Благојевић, Бранислав Тривић, Ненад Бањац, Ранка Станковић, Велизар Николић. "Развој геолошког информационог система Републике Србије" in 14. конгрес геолога Србије и Црне Горе са међународним учешћем, Нови Сад, 18-20. октобар 2005, Cpпско геолошко друштво и Caвeз геолошких друштава Србије и Црне Горе (2005)
Multi-word Expressions for Abusive Speech Detection in Serbian

Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev (2020)

Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...

uvredljiv govor, govor mržnje, leksički izvori, višejezični leksikon, izrazi sa više reči

Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others (2020)

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages ...

lexical semantic resources, sense alignment, lexicography, language resource

Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others . "A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment" in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, European Language Resources Association (ELRA) (2020)
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++

Branislava Šandrih, Ranka Stanković (2020)

U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...

ekstrakcija terminologije, validacija terminologije, GIZA++, grafovi, Unitex, klasifikacija teksta

Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
Annotation of the Serbian ELTeC Collection

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Mihailo Škorić (2021)

Ovaj rad predstavlja takozvano izdanje nivoa 2 kolekcije tekstova SrpELTeC razvijene u okviru aktivnosti Radne grupe 2 – Metode i alati COST akcije CA 16204 (Distant Reading for European Literary History) i njene specifikacije šeme. Izdanje nivoa 2 je nastavak izdanja nivoa 1, koje se koristi kao ulaz za morfosintaksičke i NER anotacije romana. Srpska obrada nivoa-2 je navedena kroz potrebne korake, uključujući metode i alate koji se koriste u tom procesu. Neki statistički podaci iz srpske kolekcije nivoa ...

udaljeno čitanje, literarni korpus, tagiranje, prepoznavanje imenovanih entiteta, lematizacija, ELTeC

Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Mihailo Škorić. "Annotation of the Serbian ELTeC Collection" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.2.3
Sentiment Analysis of Serbian Old Novels

Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović (2022)

In this paper we present first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing with additional extensive corrections. The first phase of dataset refinement included filtering the word that are not found in Serbian morphological dictionary and in second automatic POS tagging and lemma were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ...

sentiment lexicon, sentiment analysis, distant-reading, machine learning, old novels

Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
Transformer-Based Composite Language Models for Text Evaluation and Classification

Mihailo Škorić, Miloš Utvić, Ranka Stanković (2023)

Parallel natural language processing systems were previously successfully tested on the tasks of part-of-speech tagging and authorship attribution through mini-language modeling, for which they achieved significantly better results than independent methods in the cases of seven European languages. The aim of this paper is to present the advantages of using composite language models in the processing and evaluation of texts written in arbitrary highly inflective and morphology-rich natural language, particularly Serbian. A perplexity-based dataset, the main asset for the ...

General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)

Mihailo Škorić, Miloš Utvić, Ranka Stanković. "Transformer-Based Composite Language Models for Text Evaluation and Classification" in Mathematics, MDPI AG (2023). https://doi.org/10.3390/math11224660
Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela

Savković Željko, Unković Nikola, Stupar Miloš, Franković Maja, Jovanović Milena, Erić Suzana, Šarić Kristina, Stanković Slaviša, Dimkić Ivica, Vukojević Jelena, Ljaljević Grbić Milica (2016)

Savković Željko, Unković Nikola, Stupar Miloš, Franković Maja, Jovanović Milena, Erić Suzana, Šarić Kristina, Stanković Slaviša, Dimkić Ivica, Vukojević Jelena, Ljaljević Grbić Milica. "Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela" in International Biodeterioration & Biodegradation International Biodeterioration & Biodegradation no. 115, Netherlands Amsterdam:Elsevier (2016): 212-223. https://doi.org/http://dx.doi.org/10.1016/j.ib
Development of Open Educational Resources (OER) for Natural Language Processing

Cvetana Krstev, Biljana Lazić, Ranka Stanković, Giovanni Schiuma, Miladin Kotorčević (2015)

In this paper we present the development of an online course at the edX BAEKTEL platform named “Lexical Recognition in the Natural Language Processing (NLP)”. It is based on the course of the same name for PhD studies at the University of Belgrade, Faculty of Philology. There are not many courses in Computational Linguistics (CL) on OER platforms, and there is none in Serbian either for CL or NLP. We have developed this course in order to improve this ...

E-Learning, Open Educational Resources, Computational Linguistics, Lexical Resources, edX

Cvetana Krstev, Biljana Lazić, Ranka Stanković, Giovanni Schiuma, Miladin Kotorčević. "Development of Open Educational Resources (OER) for Natural Language Processing" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade : Belgrade Metropolitan Univesity (2015)
Rule-based Automatic Multi-word Term Extraction and Lemmatization

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac (2016)

In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...

term extraction, terminology, multi-word units, lemmatization, finite-state transducers

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)

Претрага

216 items

Веб-алат за управљање грађом Речника САНУ и анотација листића cite

CliRtheRoads: An Integrated Approach to Landslide Risk Management on Roads in Serbia cite

BERT Downstream Task Analysis: Named Entity Recognition in Serbian cite

Application of pulsed flash thermography method for specific defect estimation in aluminum cite

Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela cite

An Approach to Efficient Processing of Multi-Word Units cite

Indexing of textual databases based on lexical resources: A case study for Serbian cite

A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals cite

Electronic Dictionaries - from File System to lemon Based Lexical Database cite

Using English Baits to Catch Serbian Multi-Word Terminology cite

Развој геолошког информационог система Републике Србије cite

Multi-word Expressions for Abusive Speech Detection in Serbian cite

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment cite

Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++ cite

Annotation of the Serbian ELTeC Collection cite

Sentiment Analysis of Serbian Old Novels cite

Transformer-Based Composite Language Models for Text Evaluation and Classification cite

Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela cite

Development of Open Educational Resources (OER) for Natural Language Processing cite

Rule-based Automatic Multi-word Term Extraction and Lemmatization cite

Веб-алат за управљање грађом Речника САНУ и анотација листића

CliRtheRoads: An Integrated Approach to Landslide Risk Management on Roads in Serbia

BERT Downstream Task Analysis: Named Entity Recognition in Serbian

Application of pulsed flash thermography method for specific defect estimation in aluminum

Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela

An Approach to Efficient Processing of Multi-Word Units

Indexing of textual databases based on lexical resources: A case study for Serbian

A Tool for Enhanced Search of Multilingual Digital Libraries of E-journals

Electronic Dictionaries - from File System to lemon Based Lexical Database

Using English Baits to Catch Serbian Multi-Word Terminology

Развој геолошког информационог система Републике Србије

Multi-word Expressions for Abusive Speech Detection in Serbian

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++

Annotation of the Serbian ELTeC Collection

Sentiment Analysis of Serbian Old Novels

Transformer-Based Composite Language Models for Text Evaluation and Classification

Diversity and biodeteriorative potential of fungal dwellers on ancient stone stela

Development of Open Educational Resources (OER) for Natural Language Processing

Rule-based Automatic Multi-word Term Extraction and Lemmatization