Претрага
242 items
-
New Language Models for South Slavic Languages
Mihailo Škorić (2024)Izlaganje će predstaviti izazove i perspektive modelovanja južnoslovenskih jezika, sa posebnim osvrtom opšte jezičke modele građene na arhitekturi transformera (BERT, GPT), na dostupne skupove tekstova za obučavanje tih modela, te kvantitet i kvalitet tih skupova. Izlaganje će ponuditi pregled dostupnih skupova i modela, dok će posebna pažnja biti posvećena najnovijim korpusima tekstova. Prvi korpus, Kišobran, predstavlja krovni veb korpus južnoslovenskih jezika i ujedno trenutno najveći korpus tekstova na našim prostorima koji broji preko osamnaest milijardi reči i uključuje sve ...Mihailo Škorić. "New Language Models for South Slavic Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)
-
Industrija 4.0 - koncept prediktivnog održavanja 4.0 (PdM 4.0) u rudarstvu
Četvrtu industrijsku revoluciju – Industriju 4.0 karakteriše upotreba cyber-fizičkih sistema. Da bi se postigla optimalna strategija održavanja (ali i eksploatacije), neophodno je razviti sisteme koji podržavaju napredne inteligentne sisteme održavanja ili tehnologije pametnog održavanja. Iz toga su proizašli postulati Prediktivnog održavanja 4.0 (PdM 4.0) koji definišu veoma blisku budućnost u oblasti održavanja tehničkih sistema pa i rudarske opreme. PdM 4.0 uključuje iskorišćenje snage veštačke inteligencije za stvaranje stalnog uvida u otkrivanje uzroka i anomalija u radu opreme, koje se ...Predrag Jovančić, Vesna Damnjanović, Dragan Ignjatović, Miloš Tanasijević, Stevan Đenadić. "Industrija 4.0 - koncept prediktivnog održavanja 4.0 (PdM 4.0) u rudarstvu" in IX Međunarodna konferencija UGALJ 2019, Zlatibor, Srbija, 23-26. oktobar 2019., Jugoslovenski komitet za površinsku eksploataciju (2019)
-
Building learning capacity by blending different sources of knowledge
... tools, and e-learning applications in higher education. He was one of the key persons in several scientific projects related to human language technologies and e-learning. Ranka Stanković is associate professor at University of Belgrade, Faculty of Mining and Geology, where she is teaching ...
... tools to be developed. Available literature lacks examples of effective solutions offered to this challenge, especially those using new learning technologies, as the one presented in this article. With the advancement of information technology (IT) a powerful mechanism for blending academic and ...
... original approach to bridging the gap between academic education and knowledge assets needed within enterprises, by means of emerging learning technologies and methods. The initial OER materials in BAEKTEL are published by universities and enterprises of the Western Balkans (WB), mostly in WB ...Ivan Obradović, Ranka Stanković, Olivera Kitanović, Dalibor Vorkapić. "Building learning capacity by blending different sources of knowledge" in International Journal of Learning and Intellectual Capital (2016). https://doi.org/10.1504/IJLIC.2016.075698
-
Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities
Овај рад представља активности на развоју корпуса ELEXIS-sr, српском додатку вишејезичном анотираном корпусу ELEXIS-а, који се састоји од семантичких анотација и репозиторија значења речи. ELEXIS је паралелни вишејезични анотирани корпус на десет европских језика, који може да се користи као вишејезички репер за евалуацију европских језика са мање и средње развијеним ресурсима. Фокус овог рада је на вишечланим изразима и именованим ентитетима, њиховом препознавању у скупу реченица ELEXIS-sr и поређењу са анотацијама на другим језицима. Разматрају се први кораци ...Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov. "Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)
-
Navigacija budućnosti: poslovni rizici i prilike u rudarskoj industriji
Sektor rudarstva i metalurgije se globalno suočava sa brzo evoluirajućim okruženjem u 2024. godini, koje karakteriše jedinstvena međusobna zavisnost rizika i prilika. Sektor je u središtu globalne energetske tranzicije, uz rastuću potražnju za kritičnim mineralnim sirovinama poput litijuma, nikla i bakra, koji su od suštinskog značaja za tehnologije obnovljive energije i električna vozila. Međutim, ova sve veća odgovornost donosi brojne izazove, uključujući pojačanu pažnju na ekološke, društvene i upravljačke (ESG) faktore, fizičke i tranzicione rizike uzrokovane klimatskim promenama, i ...Petar Marković, Dejan Stevanović, Mirjana Banković, Vuk Lazić. "Navigacija budućnosti: poslovni rizici i prilike u rudarskoj industriji" in XVI međunarodna konferencija OMC 2024, Beograd : Jugoslovenski komitet za površinsku eksploataciju (2024)
-
A business intelligence approach to mine safety management
Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Dejan Stevanović, Marija Radojičić (2016)... bg.ac.rs 13th ISCSM 2016 BELGRADE 2 system consists of several software components representing a combination of different information technologies, where interoperability between diverse software components is secured by a system of ontologies developed for mining engineering named RudOnto ...
... manner [13]. The BI system for support of decision making in safety management described in this paper uses a combination of various information technologies, such as OLTP (On Line Transactional Processing), OLAP, WEB, SQL Server Reporting Services, etc. Hence, it is a system for direct analytical ...Ljiljana Kolonja, Ranka Stanković, Ivan Obradović, Olivera Kitanović, Dejan Stevanović, Marija Radojičić . "A business intelligence approach to mine safety management" in 13th International Symposium Continuous Surface Mining, Beograd : Yugoslav Opencast Mining Committee (2016)
-
Energetska bezbednost sektora prirodnog gasa Srbije
U prve dve decenije 21. veka obezbeđivanje sigurnosti snabdevanja prirodnim gasom domaćeg tržišta bio je jedan od prioriteta razvoja energetike Srbije. Istovremeno, aspekt sigurnog snabdevanja bio je neizostavni deo slagalice stvaranja energetske bezbednosti. Izražena uvozna zavisnost je dominantna karakteristika sektora prirodnog gasa, tačnije, Republika Srbija je snažno zavisna od ruskog gasa, sa više od 80% uvezenih količina gasa, a do pre dve godine snabdevala se isključivo kroz jednu interkonekciju. U radu se razmatra aktuelna situacija u sektoru prirodnog ...... madzarevicOrgf.bg.ac.rs (corresponding author only) predrag.jovancicO'rgf.bg.ac.rs miroslav.crnogorac(orgf.bg.ac.rs Conference topic Process technologies MG Energy Security of Serbian Natural Gas Sector Keywords (max. 5 words) Abstract In the first tvo decades of the 21st century, ensuring ...Aleksandar Madžarević, Predrag Jovančić, Miroslav Crnogorac. "Energetska bezbednost sektora prirodnog gasa Srbije" in 36. Međunarodni kongres o procesnoj industriji – Procesing ’23, Šabac, 1. i 2. juna 2023. , Beograd : Savez mašinskih i elektrotehničkih inženjera i tehničara Srbije (SMEITS) Društvo za procesnu tehniku (2023)
-
A comparison between ARIMA, LSTM, ARIMA-LSTM and SSA for cross-border rail freight traffic forecasting: the case of Alpine-Western Balkan Rail Freight Corridor
Miloš Milenković, Miloš Gligorić, Nebojša Bojović, Zoran Gligorić. "A comparison between ARIMA, LSTM, ARIMA-LSTM and SSA for cross-border rail freight traffic forecasting: the case of Alpine-Western Balkan Rail Freight Corridor" in Transportation Planning and Technology, Informa UK Limited (2023). https://doi.org/10.1080/03081060.2023.2245389
-
Towards translation of educational resources using GIZA++
... and Geology, miladin.kotorcevic@rgf.bg.ac.rs Abstract: E-learning courses are becoming progressively popular. Thanks to the Internet and new technologies, education has never been more available to everyone. The main obstacle to studying new subjects is often the language, given the number of different ...
... X-Serbian Bitexts”. In Cristina Vertan and Walther v. Hahn (eds.) Multilingual Processing in Eastern and Southern EU Languages: Low-Resourced Technologies and Translation, pp. 207-227, Cambridge Scholars Publishing,. ISBN (13) 978-1-4438-3878-8, 2012. [21] A. Obuljen, Kvantitativna metoda za poravnanje ...Ivan Obradović, Dalibor Vorkapić, Ranka Stanković, Nikola Vulović, Miladin Kotorčević. "Towards translation of educational resources using GIZA++" in The Seventh International Conference on e-Learning (eLearning-2016), September 2016, Belgrade : Metropolitan Univesity (2016)
-
Part of Speech Tagging for Serbian language using Natural Language Toolkit
Ranka Stanković, Boro Milovanović (2020)Dok se razvijaju složeni algoritmi za NLP (obrada prirodnog jezika), osnovni zadaci kao što je označavanje ostaju veoma važni i još uvek izazovni. NLTK (Natural Language Toolkit) je moćna Python biblioteka za razvoj programa zasnovanih na NLP-u. Pokušavamo da iskoristimo ovu biblioteku za kreiranje PoS (vrsta reči) oznake za savremeni srpski jezik. Jedanaest različitih modela je kreirano korišćenjem NLTK API-ja za označavanje. Najbolji modeli se transformišu sa Brill tagerom da bi se poboljšala tačnost. Obučili smo modele na označenom ...... Serbia (e-mail: ranka.stankovic@rgf.bg.ac.rs). [6]. Later attempts relied on CRF (Conditional Random Fields) [7-8] which is among supported technologies by NLTK and will be used for training one of the taggers. Introducing the dataset and the tagset will be done in the Section 2. The creation ...
... unseen data shows 0.88 precision with Spacy tagger and 0.93 with TreeTagger19 while the best tagger produced in this research achieves 0.87. The technologies in this research are not able to produce us a generalized, multi-purpose, all-around PoS tagger that can be a standard for a Serbian language ...Ranka Stanković, Boro Milovanović. "Part of Speech Tagging for Serbian language using Natural Language Toolkit" in 7th International Conference on Electrical, Electronic and Computing Engineering IcETRAN 2020, Academic Mind, Belgrade (2020)
-
Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages
Jelena Lazarević, Olivera Kitanović (2024.)Cilj rada je istraživanje kolokabilnosti kao načina na koji se leksičke jedinice povezuju sa rečima iz različitih kategorija, formirajući veće jedinice. Istraživanje semantičkih i sintaksičkih principa ovih kombinacija u španskom i srpskom jeziku fudbala izvedeno je na komparabilnim fudbalskim korpusima SrFudKo i EsFudko, razvijenim u okviru doktorske disertacije Jelene Lazarević pod nazivom: Jezičke odlike diskursa novih medija o fudbalu: kontrastivna analiza na korpusu srpskog i španskog jezika. Korpus fudbala SrFudKo, kreiran na osnovu tekstova o fudbalu sa pet srpskih veb-portala: ...Jelena Lazarević, Olivera Kitanović . "Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024.)
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Keyword-Based Search on Bilingual Digital Libraries
This paper outlines the main features of Biblisha, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Biblishsa supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic ...Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović. "Keyword-Based Search on Bilingual Digital Libraries" in Semantic Keyword-Based Search on Structured Data Sources - Second COST Action IC1302 International KEYSTONE Conference, IKC 2016, Springer (2017). https://doi.org/10.1007/978-3-319-53640-8_10
-
Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit
U digitalnom okruženju južnoslovenskih jezika, analiza emocija u tekstovima na društvenim mrežama postaje sve važnija za razumevanje javnog mnjenja, kreiranje personalizovanog sadržaja i analizu međusobnih interakcija korisnika. U okviru ovog rada predstavljamo detaljnu metodologiju i rezultate označavanja korpusa na srpskom jeziku prema Plutčikovom modelu kategorizacije, koji prepoznaje osam osnovnih emocionalnih kategorija, kao što su radost, tuga, bes, strah, poverenje, gađenje, iščekivanje i iznenađenje. Cilj istraživanja je da se analizira emocionalni sadržaj tekstova preuzetih sa društvenih mreža X (nekada Twitter) ...Milena Šošić, Ranka Stanković, Jelena Graovac. "Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024., University of Belgrade - Faculty of Philology (2024)
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...... 248--255. Koeva, S. (2007). Multi-word term extraction for Bulgarian. In Proc. of the Workshop on BSNLP: Information Extraction and Enabling Technologies, pp. 59--66. Krstev, C., Obradović, I., Stanković, R., and Vitas, D. (2013). An Approach to Efficient Processing of Multi-Word Units. In: ...
... Lemmatization of Polish person names. In Proc. of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies, Stroudsburg: Association for Computational Linguistics, pp. 27--34. Savary, A., Zaborowski, B., Krawczyk-Wieczorek A., and Makowiecki F. (2012) ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Improving efficiency of thermal power plants through mine coal quality planning and control
Главни циљ контроле квалитета угља у рудницима лигнита је снабдевање термоелектрана угљем чији квалитет мора да се креће унутар одређених квалитативних ограничења. Карактеристике угља могу да утичу на ефикасност, поузданост и расположивост како котла тако и јединица за контролу емисије. У овом раду аутори су презентовали интегрисану симулацију рударског процеса као нови приступ у истраживању променљивости калоричне вредности угља приликом експлоатације комплексног лежишта лигнита. Резултати таквог приступа омогућавају драгоцен увид у перформансе континуалног рударског система у смислу контроле променљивости ...Mirjana Banković, Dejan Stevanović, Milica Pešić, Aleksandra Tomašević, Ljiljana Kolonja. "Improving efficiency of thermal power plants through mine coal quality planning and control" in Thermal Science, Vinča Institute of Nuclear Sciences (2018). https://doi.org/10.2298/TSCI170605209B
-
Bilingual lexical extraction based on word alignment for improving corpus search
Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056
-
Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model
Ova studija predstavlja analizu sentimenta srpskih starih romana iz perioda 1840-1920, koristeći veliki jezički model (LLM) Mistral za tehniku učenja sa zasnovani na takozvanim "zero" i "few-shot" pokušajima. Glavni pristup uvodi inovacije osmišljavanjem istraživačkih upita (promptova) uključuju tekst sa uputstvom za klasifikaciju bez primera i na osnovu nekoliko primera, omogućavajući jezičkom modelu da klasifikuje osećanja u pozitivne, negativne ili objektivne kategorije. Ova metodologija ima za cilj da pojednostavi analizu osećanja ograničavanjem odgovora, čime se povećava preciznost ...Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković, Biljana Rujević. "Advancing Sentiment Analysis in Serbian Literature: A Zero and Few-Shot Learning Approach Using the Mistral Model" in In Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024), BAS (2024)
-
Fragmentation Size Prediction of Blasted Material Using a Specialized Software for Drilling and Blasting
Stefan Milanović, Nikola Simić, Lazar Kričak, Milanka Negovanović, Nikola Đokić, Miljan Gomilanović (2024)Design and optimizing the drilling and blasting parameters, should fill the requirements for the capacity, fragmentation size, and technical characteristics of loading and transport equipment, and enable a safe work at the open pit. Besides mentioned, it also achieves the minimal impact on the environment of the open pit and decreases the negative effects on the environment, especially in the blast vibration and flyrock. To obtain the best possible blasting effects and consider all the factors, a specialized software is ...Stefan Milanović, Nikola Simić, Lazar Kričak, Milanka Negovanović, Nikola Đokić, Miljan Gomilanović. "Fragmentation Size Prediction of Blasted Material Using a Specialized Software for Drilling and Blasting" in The 55th International October Conference on Mining and Metallurgy, Mining and Metallurgy Institute Bor (2024). https://doi.org/10.5937/IOC24077M
-
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...... approach to wordnet enrichment. 1. Introduction Semantic networks, such as wordnets, are among the most important resources in Human Language Technologies. Thus, for example, the Princeton WordNet - PWN (Fellbaum, 1998), has been in use for more than two decades as the standard lexical database for ...
... 9http://eurovoc.europa.eu/ Proceedings of CLIB 2018 107 Office, which moved forward to ontology-based thesaurus management and semantic web technologies compliant to W3C recommendations, as well as latest trends in thesaurus standards. For this research we used the bilingual en-sr version 4.7 in ...Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)