Претрага
58 items
-
Knowledge and Rule-Based Diacritic Restoration in Serbian
In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the dataobtainedfromSrpKorandlocalgrammarsassistsinmakingadecisionbetween several candidates in cases of ambiguity. The evaluation results reveal that,dependingonthetext,accuracyrangesfrom95.03%to99.36%,whilethe precision (average 98.93%) is always higher than the recall (average 94.94%).... Restoration in Serbian Cvetana Krstev, Ranka Stanković, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Knowledge and Rule-Based Diacritic Restoration in Serbian | Cvetana Krstev, Ranka Stanković, Duško Vitas | Proceedings of the Third International Conference ...Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
-
Megaklizišta u gornjem toku Drine od Foče do Višegrada i njihov uticaj na hidroenergetske objekte
Sunarić Duško, Jevremović Dragutin, Lolin M.. "Megaklizišta u gornjem toku Drine od Foče do Višegrada i njihov uticaj na hidroenergetske objekte" in Tehnički institut, Zbornik radova no. 3, Bijeljina:Arhiv za tehničke nauke Bijeljina (2010): 65-81
-
Zagađivanje reka u Srbiji kliženjem i odronjavanjem
Jevremović Dragutin, Sunarić Duško, Kostić Srđan. "Zagađivanje reka u Srbiji kliženjem i odronjavanjem" in Tehnika - časopis Saveza inženjera i tehničara Srbije, Beograd:Savez tehničara Srbije (2011): 731-736
-
The Nooj System as Module within an Integrated Language Processing Environment
... Processing Environment Ranka Stanković, Duško Vitas, Cvetana Krstev Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Nooj System as Module within an Integrated Language Processing Environment | Ranka Stanković, Duško Vitas, Cvetana Krstev | Proceedings of ...
... www.dr.rgf.bg.ac.rs The NooJ system as module within an integrated language processing environment Ranka Stanković, ranka@rgf.bg.ac.yu Duško Vitas, vitas@matf.bg.ac.yu Cvetana Krstev, cvetena@matf.bg.ac.yu 1. Introduction In this paper we describe the main structure and possible ap ...Ranka Stanković, Duško Vitas, Cvetana Krstev. "The Nooj System as Module within an Integrated Language Processing Environment" in Proceedings of the 2007 International Nooj Conference, Cambridge Scholars Publishing (2008)
-
SrpELTeC: A Serbian Literary Corpus for Distant Reading
U članku je predstavljen SrpELTeC, korpus razvijen u okviru akcije COST Distant Reading for European Literary History (CA16204). Svi romani u SrpELTeC-u su odabrani, pripremljeni i obeleženi korišćenjem zajedničkih principa uspostavljenih za sve jezičke zbirke u Evropskoj zbirci književnog teksta (ELTeC). Navedeni su izazovi i rešenja u pripremi SrpELTeC od nule. Svi romani su ručno kodirani u TEI sa bogatim metapodacima i strukturnim napomenama. Automatska anotacija je uključivala POS-označavanje, lematizaciju i imenovane entitete, oslanjajući se na resurse za obradu ...digital humanities, Serbian literature, text corpora, distant reading , linked data, named entity recognition, text analyticsRanka Stanković, Cvetana Krstev, Duško Vitas. "SrpELTeC: A Serbian Literary Corpus for Distant Reading" in Primerjalna književnost, Research Centre of the Slovenian Academy of Sciences and Arts (2024). https://doi.org/10.3986/pkn.v47.i2.03
-
The Many Faces of SrpKor
Акроним СрпКор означава фамилију електронских корпуса савременог српског језика чија је изградња почела крајем седамдесетих година прошлога века, а која је постала шире видљива заинтересованој истраживачкој заједници објављивањем његове прве верзије на вебу 2002. године. У овом дугом периоду, посебно пре појаве корисних текстуелних ресурса на вебу, развој корпуса се састојао у прикупљању и обради грађе као и у развоју метода обраде корпуса. Наиме, електронски корпус није само колекција текстова у дигиталном облику (како се то, на пример, наводи ...Duško Vitas, Ranka Stanković, Cvetana Krstev. "The Many Faces of SrpKor" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024.)
-
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...... Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] An Approach to Efficient Processing of Multi-Word Units | Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas | Computational Linguistics - Applications ...Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
-
Nelinearna analiza prostorne dinamike kretanja klizišta ,,Trbosilje”
Kostić Srđan, Jevremović Dragutin, Sunarić Duško, Vasović Nebojša. "Nelinearna analiza prostorne dinamike kretanja klizišta ,,Trbosilje”" in Зборник радова XIV симпозијума из инжењерске геологије и геотехнике са међународним учешћем, Београд, 27. и 28. септембар, 2012., Beograd:Друштво геолошких инжењера и техничара Србије (2012): 405-416
-
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansion... Obradović, Cvetana Krstev, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Production of morphological dictionaries of multi-word units using a multipurpose tool | Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas | Proceedings of the ...Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
-
The Effects of Multi-Word Tagging on Text Disambiguation
Utvić Miloš, Obradović Ivan, Krstev Cvetana, Vitas Duško. "The Effects of Multi-Word Tagging on Text Disambiguation" in Proceedings of the 29th International Conference on Lexis and Grammar, LGC 2010, September 2010, Belgrade, Serbia, D. Vitas and C. Krstev (eds.), Belgrade:Faculty of Mathematics, University of Belgrade (2010): 333-342
-
E-Dictionaries and Finite-State Automata for the Recognition of Named Entities
Krstev Cvetana, Vitas Duško, Obradović Ivan, Utvić Miloš. "E-Dictionaries and Finite-State Automata for the Recognition of Named Entities" in Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2011, July 2010, Blois, France, A. Maletti and M. Constant (eds.), :Association for Computational Linguistics (2011): 48-56
-
E-Connecting Balkan Languages
In this paper we present a versatile language processing tool that can be successfully used for many Balkan languages. This tool relies for its work on several sophisticated textual and lexical resources that were developed for most of Balkan languages. These resources are based on several de facto standards in natural language processing.... E-Connecting Balkan Languages Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] E-Connecting Balkan Languages | Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva | Proceedings of the Workshop Workshop ...
... University of Belgrade cvetana@matf.bg.ac.rs Ranka Stanković Faculty of Mining and Geology University of Belgrade ranka@rgf.bg.ac.rs Duško Vitas Faculty of Mathematics University of Belgrade vitas@matf.bg.ac.rs Svetla Koeva Dep. of Computational Linguistics Institute for ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva. "E-Connecting Balkan Languages" in Proceedings of the Workshop Workshop on Multilingual resources, technologies and evaluation for Central and Eastern European Languages, 17 September 2009, eds. C. Vertan, S. Piperidis, E. Paskaleva and Milena Slavcheva, Borovets, Bulgaria : Association for Computational Linguistics Stroudsburg, PA, USA (2009)
-
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
In this paper we present how resources and tools developed within the Human Language Technology Group at the University of Belgrade can be used for tuning queries before submitting them to a web search engine. We argue that the selection of words chosen for a query, which are of paramount importance for the quality of results obtained by the query, can be substantially improved by using various lexical resources, such as morphological dictionaries and wordnets. These dictionaries enable semantic ...LR web services, MultiWord Expressions & Collocations, Information Extraction, Information Retrieval... Stanković Ranka, Vitas Duško, Obradović Ivan Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines | Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan | LREC ...
... rs The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines Cvetana Krstev 1 , Ranka Stanković2 , Duško Vitas 3 , Ivan Obradović4 1 professor, Faculty of Philology, Belgrade, 2 assistant, Faculty of Mining and Geology, Belgrade 3 professor, Faculty ...Krstev Cvetana, Stanković Ranka, Vitas Duško, Obradović Ivan. "The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines" in LREC 2008: Conference on Language Resources and Evaluation, Marrakesh, Morocco, May 2008, European Language Resources Association (ELRA) (2008)
-
WS4LR - a Worksation for Lexical Resources
... Lexical Resources Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] WS4LR - a Worksation for Lexical Resources | Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović | Proceedings of the Fifth ...
... publications. - The Repository is available at: www.dr.rgf.bg.ac.rs WS4LR: A Workstation for Lexical Resources Cvetana Krstev 1 , Ranka Stanković2 , Duško Vitas 3 and Ivan Obradović2 1Faculty of Philology, Studentski trg 3, CS-11000 Belgrade, 2Faculty of Mining and Geology, Đušina 7, CS-11000 Belgrade ...Cvetana Krstev, Ranka Stanković, Duško Vitas, Ivan Obradović. "WS4LR - a Worksation for Lexical Resources" in Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, Genoa, Italy, May 2006, ELRA - European Language Resources Association (2006)
-
A System for Named Entity Recognition Based on Local Grammars
Krstev Cvetana, Obradović Ivan, Utvić Miloš, Vitas Duško. "A System for Named Entity Recognition Based on Local Grammars" in Journal of Logic and Computation 24 no. 2, :Oxford University Press (2014): 473-489. https://doi.org/10.1093/logcom/exs079
-
Споменица 1991. – 2015. година: 135 година геологије и 70 година рударства на Универзитету у Београду
... вентилације рудника као основа за развој експертног система Вент. и тех. заштита Н. Лилић Ј. Пејчиновић, И. Обрадновић, К. Ђиновић 81. Ђукановић Н. Душко 24.06.2002. Утицај техничко - организационих параметара на бризину израде подземних просторија у рудницима угља Србије Израда подзем ...
... геолошким условима на примеру горњег слива Западне Мораве Геологија С. Прохаска, М. Лазић С. Прохаска; М. Лазић; Ђ. Радиновић 126. Ђукановић Душко 28.01.2005. Модел оптимизације техно - економских показатеља при изради подземних просторија у рудницима угља Србије Рударство С. Трајковић ...
... ZEKOVIĆ M., ĐUKANOVIĆ N., 1999: Rudarski materijali. Stalni univerzitetski udžbenik, RGF Beograd. TRIFUNOVIĆ P., TOKALIĆ R., 2004: Tehnologija materijala u rudarstvu-Metode ispitivanja. Stalni pomoćni univerzitetski udžbenik, II izdanje, RGF Beograd. TRIFUNOVIĆ P., TOKALIĆ R., ĐUKANOVIĆ N., 2009.: ...главни и одговорни уредник Душан Поломчић. Споменица 1991. – 2015. година: 135 година геологије и 70 година рударства на Универзитету у Београду, Београд : Универзитет у Београду, Рударско-геолошки факултет, 2016
-
Белешка о дигитализацији речника
У раду ће се анализирати ограничења која проистичу из линеарног процеса традиционалне израде речника на примеру Речника САНУ. Начин да се превазиђу ова ограничења се састоји у формирању електронске лексикографске базе која не представља само пуку дигиталну транскрипцију папирног издања речника. Посебно се указује на чињеницу да текст речника може представљати корпус и приказују се одабрани примери анализе таквог корпуса формираног из текстове 1. и 19. тома Речника САНУ.... до зоре. (стр. 60) Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић60 ЛИТЕРАТУРА Витас 2007: Душко Витас, „О проблему не(пре)познате речи у обради текс- това на српском језику”, зборник матице српске за филологију и линг- вистику, L, 111–120. Витас/Крстев 2015: Душко Витас и Цветана Крстев ...
... Измењено: 2023-10-14 04:07:54 Белешка о дигитализацији речника Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Белешка о дигитализацији речника | Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић | Српски ...
... publications. - The Repository is available at: www.dr.rgf.bg.ac.rs 811.163.41’322.2 004.9:811.163.41’374 https://doi.org/10.18485/msc.2019.48.3.ch3 Душко М. ВИТАС, Цветана Ј. КРСТЕВ, Ранка М. СТАНКОВИЋ Филолошки факултет Универзитета у Београду БЕЛЕШКА О ДИГИТАЛИЗАЦИЈИ РЕЧНИКА У раду ће се анализирати ...Душко М. Витас, Цветана Ј. Крстев, Ранка М. Станковић. "Белешка о дигитализацији речника" in Српски језик и његови ресурси, Међународни славистички центар, Филолошки факултет, Универзитет у Београду (2019). https://doi.org/10.18485/msc.2019.48.3.ch3
-
Keyword-Based Search on Bilingual Digital Libraries
This paper outlines the main features of Biblisha, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in bilingual digital library. Biblishsa supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded, both semantically, morphologically and in other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic ...Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović. "Keyword-Based Search on Bilingual Digital Libraries" in Semantic Keyword-Based Search on Structured Data Sources - Second COST Action IC1302 International KEYSTONE Conference, IKC 2016, Springer (2017). https://doi.org/10.1007/978-3-319-53640-8_10
-
Distribution of canonical syllable types in Serbian
Obradović Ivan, Obuljen Aljoša, Vitas Duško, Krstev Cvetana, Radulović Vanja. "Distribution of canonical syllable types in Serbian" in Text and Language, Structures · Functions · Interrelations. Quantitative Perspectives, P. Grzybek, E. Kelih, J. Mačutek (eds.), Wien:Praesens Verlag (2010): 145-157
-
Automatic construction of a morphological dictionary of multi-word units
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multiwordn units, noun phrases, query expansion... Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Automatic construction of a morphological dictionary of multi-word units | Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić | Lecture ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić. "Automatic construction of a morphological dictionary of multi-word units" in Lecture Notes in Computer Science 6233, Advances in Natural Language Processing, Proceedings of the 7thInternational Conference on NLP, IceTAL 2010, Reykjavik, Iceland, August 2010, Springer (2010): 226-237. https://doi.org/10.1007/978-3-642-14770-8_26