Претрага
83 items
-
The Dictionary of the Serbian Academy: from the Text to the Lexical Database
In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encoding was used in the course of the almost 60-year long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text of and transform it into structured data stored in relational lexical database. This approach ...... Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] The Dictionary of the Serbian Academy: from the Text to the Lexical Database | Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo | Proceedings ...
... Stanković1, Rada Stijović2, Duško Vitas1, Cvetana Krstev1, Olga Sabo2 1University of Belgrade, 2Institute for Serbian Language, Serbian Academy of Sciences and Arts E-mail: ranka.stankovic@rgf.bg.ac.rs, rada.stijovic@isj.sanu.ac.rs, vitas@matf.bg.ac.rs, cvetana@matf.bg.ac.rs, olga011@yahoo.com Abstract ...
... in recent years were these ideas revitalized, and various possibilities of updating the work on this vocabulary have since been considered (Vitas & Krstev, 2015; Ivanović et al. 2016). The digitization (which is also the topic of the present paper) of the published volumes and raw materials (lexicographic ...Ranka Stanković, Rada Stijović, Duško Vitas, Cvetana Krstev, Olga Sabo. "The Dictionary of the Serbian Academy: from the Text to the Lexical Database" in Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts, Ljubljana : Ljubljana University Press, Faculty of Arts (2018)
-
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Uvredljivi govor na društvenim medijima, uključujući psovke, pogrdni govor i govor mržnje, dostigao je nivo pandemije. Sistem koji bi bio u stanju da detektuje takve tekstove mogao bi da pomogne da internet i društveni mediji postanu bolji virtuelni prostor sa više poštovanja. Istraživanja i komercijalna primena u ovoj oblasti do sada su bili fokusirani uglavnom na engleski jezik. Ovaj rad predstavlja rad na izgradnji AbCoSER-a, prvog korpusa uvredljivog govora na srpskom jeziku. Korpus se sastoji od 6.436 ručno označenih ...... Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian | Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih | 3rd Conference ...
... Detection in Serbian Danka Jokić # University of Belgrade, Serbia Ranka Stanković # Faculty of Mining and Geology, University of Belgrade, Serbia Cvetana Krstev # ↸ Faculty of Philology, University of Belgrade, Serbia Branislava Šandrih # ↸ Faculty of Philology, University of Belgrade, Serbia Abstract ...
... such as race, color, ethnicity, gender, sexual orientation, nationality, religion, or other characteristic”. © Danka Jokić, Ranka Stanković, Cvetana Krstev, and Branislava Šandrih; licensed under Creative Commons License CC-BY 4.0 3rd Conference on Language, Data and Knowledge (LDK 2021). Editors: ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
-
Combining Heterogeneous Lexical Resources
... Heterogeneous Lexical Resources Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Combining Heterogeneous Lexical Resources | Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan ...
... publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Combining Heterogeneous Lexical Resources Cvetana Krstev, professor, Faculty of Philology, Belgrade, cvetana@matf.bg.ac.yu Duško Vitas, professor, Faculty of Mathematics, Belgrade, vitas@matf.bg.ac.yu Ranka Stankoviæ , assistant ...Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić. "Combining Heterogeneous Lexical Resources" in Proceedings of the Fourth Interantional Conference on Language Resources and Evaluation, Lisabon, Portugal , May 2004, vol. 4, ELRA - European Language Resources Association (2004)
-
Integrisanje heterogenih leksičkih resursa
Osnovna aktivnost Grupe za obradu prirodnih jezika na Matematičkom fakulteta Univeziteta u Beogradu je usmerena na razvoj različitih resursa za obradu srpskog jezika. Među njima su posebno značajni sistem morfoloških rečnika srpskog jezika razvijenih u okviru mreže RELEX [1] i semantička mreža (tipa wordnet) za srpski jezik razvijena u okviru međunarodnog projekta Balkanet. Radi se o dva heterogena leksička resursa, razvijena na osnovu sasvim različitih modela, koji samim tim sadrže i različite vrste leksičkih informacija. Integracijom ovih resursa, informacije ...... leksičkih resursa Ranka Stanković, Cvetana Krstev, Duško Vitas, Ivan Obradović, Gordana Pavlović-Lažetić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Integrisanje heterogenih leksičkih resursa | Ranka Stanković, Cvetana Krstev, Duško Vitas, Ivan Obradović, Gordana ...
... available at: www.dr.rgf.bg.ac.rs Integrisanje heterogenih leksičkih resursa Ranka Stanković, Rudarsko-geološki fakultet, Beograd Cvetana Krstev, Filološki fakultet, Beograd Duško Vitas, Matematički fakultet, Beograd Ivan Obradović, Rudarsko-geološki fakultet, Beograd Gordana Pavl ...
... [5] Vossen, P. (ed.) (1998). EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Dordrecht: Kluwer Academic Publishers. [6] Krstev C., et al. (2004) Combining Heterogeneous Lexical Resources, Proceedings of LREC2004, 4th International Conference On Language Resources And Evaluation ...Ranka Stanković, Cvetana Krstev, Duško Vitas, Ivan Obradović, Gordana Pavlović-Lažetić. "Integrisanje heterogenih leksičkih resursa" in Festivalski katalog 11. Festivala informatičkih dostignuća INFOFEST 2004, 26th September - 2nd October, 2004, Budva, Montenegro, INFOFEST (2004)
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...... ion Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Rule-based Automatic Multi-word Term Extraction and Lemmatization | Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana ...
... Stanković1, Cvetana Krstev2, Ivan Obradović1, Biljana Lazić1, Aleksandra Trtovac3 1University of Belgrade, Faculty of Mining and Geology 2 University of Belgrade, Faculty of Philology 3 University Library “Svetozar Marković”, Belgrade E-mail: ranka.stankovic@rgf.bg.ac.rs, cvetana@poincare.matf ...
... , pp. 59--66. Krstev, C., Obradović, I., Stanković, R., and Vitas, D. (2013). An Approach to Efficient Processing of Multi-Word Units. In: Przepiórkowski, A., Piasecki, M., Jassem, K., Fuglewicz, P. (Eds.) Computational Linguistics. Berlin: Springer, pp. 109--129. Krstev, C., Stanković R. ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansion... Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Production of morphological dictionaries of multi-word units using a multipurpose tool | Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas | Proceedings ...
... Mining and Geology, Djušina 7, 11000 Belgrade, Serbia Email: {ranka,ivano}@rgf.bg.ac.rs Cvetana Krstev University of Belgrade — Faculty of Philology, Studentski trg 3, 11000 Belgrade, Serbia Email: cvetana@matf.bg.ac.rs Duško Vitas University of Belgrade — Faculty of Mathematics, Studentski trg ...
... vol. 5070. Springer, 2009, pp. 111–141. [10] C. Krstev, R. Stanković, D. Vitas, and I. Obradović, “The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines,” in 6th LREC, Marrakech, Marocco, 2008. [11] C. Krstev, R. Stanković, D. Vitas, and S. Koeva, “E-Connecting ...Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
-
On the compatibility of lexical resources for NooJ
Lexical resources for many languages are provided for the NooJ linguistic development environment. Meta-data descriptions of morphosyntactic and semantic properties of these languages and their resources are a mandatory part of each language module. In this paper we analyze how well the meta-data actually describe resources for a chosen subset of languages and to what extent are they compatible across languages to support multilingual processing. We show that there is place for improvement in both directions.... Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] On the compatibility of lexical resources for NooJ | Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović | Automatic Processing ...
... the 2011 International NooJ Conference 1 ON THE COMPATIBILITY OF LEXICAL RESOURCES FOR NOOJ RANKA STANKOVIĆ, MILOŠ UTVIĆ, DUŠKO VITAS, CVETANA KRSTEV AND IVAN OBRADOVIĆ Abstract Lexical resources for many languages are provided for the NooJ linguistic development environment. Meta-data ...
... Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC'10, Paris: ELRA. (http://nl.ijs.si/ME/V4/) Erjavec, T., Krstev, C., Petkevič, V., Simov, K., Tadić, M., Vitas, D., 2003, “The MULTEXT -East Morphosyntactic Specifications for Slavic Languages”, in Proc. of the ...Ranka Stanković, Miloš Utvić, Duško Vitas, Cvetana Krstev, Ivan Obradović. "On the compatibility of lexical resources for NooJ" in Automatic Processing of Various Levels of Linguistic Phenomena: Selected Papers from the 2011 International Nooj Conference, Cambridge Scholars Publishing (2012): 96-108
-
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...... Ivan Obradović, Marko Vitas, Cvetana Krstev Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Resource-based WordNet Augmentation and Enrichment | Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev | Proceedings of the Third In ...
... bg.ac.rs Miljana Mladenović College for Preschool Teachers Bujanovac, Serbia ml.miljana@gmail.com Cvetana Krstev and Marko Vitas Faculty of Philology University of Belgrade, Serbia cvetana@matf.bg.ac.rs vitas.marko@gmail.com Abstract In this paper we present an approach to support production ...
... engleski. Narodna biblioteka Srbije. Krstev, C., Stanković, R., Vitas, D., and Obradović, I. (2006). WS4LR: A Workstation for Lexical Resources. In Proceedings of the Fifth International Conference on Language Resources and Evaluation, LREC 2006, pages 1692–1697. Krstev, C., Stanković, R., and Vitas, ...Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
-
Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources
Large collections of textual documents represent an example of big data that requires the solution of three basic problems: the representation of documents, the representation of information needs and the matching of the two representations. This paper outlines the introduction of document indexing as a possible solution to document representation. Documents within a large textual database developed for geological projects in the Republic of Serbia for many years were indexed using methods developed within digital humanities: bag-of-words and named ...... Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources | Ranka Stanković, Cvetana Krstev, Ivan Obradović ...
... Address Belgrade, Serbia Email ranka@rgf.bg.ac.rs Author Family Name Krstev Particle Given Name Cvetana Prefix Suffix Division Faculty of Philology Organization University of Belgrade Address Belgrade, Serbia Email cvetana@matf.bg.ac.rs Author Family Name Obradović Particle Given Name Ivan Prefix ...
... Ranka Stanković1(B), Cvetana Krstev2, Ivan Obradović1, and Olivera Kitanović1 1 Faculty of Mining and Geology, University of Belgrade, Belgrade, Serbia {ranka,ivan.obradovic,olivera.kitanovic}@rgf.bg.ac.rs 2 Faculty of Philology, University of Belgrade, Belgrade, Serbia cvetana@matf.bg.ac.rs Abstract ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Olivera Kitanović. "Improving Document Retrieval in Large Domain Specific Textual Databases Using Lexical Resources" in Trans. Computational Collective Intelligence - Lecture Notes in Computer Science 26, Springer (2017). https://doi.org/10.1007/978-3-319-59268-8_8
-
Automatic construction of a morphological dictionary of multi-word units
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multiwordn units, noun phrases, query expansion... dictionary of multi-word units Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Automatic construction of a morphological dictionary of multi-word units | Cvetana Krstev, Ranka Stanković, Ivan ...
... publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Automatic Construction of a Morphological Dictionary of Multi-Word Units Cvetana Krstev1, Ranka Stanković2, Ivan Obradović2, Duško Vitas3, and Miloš Utvić1 1 Faculty of Philology, University of Belgrade 2 Faculty of Mining ...
... produced need not be revised. References 1. Courtois, B., Silberztein, M.: Dictionnaires électroniques du français. Larousse, Paris (1990) 2. Krstev, C.: Processing of Serbian - Automata, Texts and Electronic Dictionaries. Faculty of Philology, University of Belgrade, Belgrade (2008) 3. Savary ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić. "Automatic construction of a morphological dictionary of multi-word units" in Lecture Notes in Computer Science 6233, Advances in Natural Language Processing, Proceedings of the 7thInternational Conference on NLP, IceTAL 2010, Reykjavik, Iceland, August 2010, Springer (2010): 226-237. https://doi.org/10.1007/978-3-642-14770-8_26
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian
Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković (2019)У овом раду представљамо модел за избор добрих примера за речник српског језика и развој иницијалних компоненти модела. Метода која се користи заснива се на детаљној анализи различитих лексичких и синтактичких карактеристика у корпусу састављених од примера из пет дигитализованих свезака речника САНУ. Почетни скуп функција био је инспирисан сличним приступом и за друге језике. Дистрибуција карактеристика примера из овог корпуса упоређује се са карактеристиком дистрибуције узорака реченица ексцерпираних из корпуса који садрже различите текстове. Анализа је показала да ...Српски, добри примери из речника, аутоматизација израде речника, издвајање својстава, Машинско учење... Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian | Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško ...
... Stijović2, Cvetana Krstev1, Duško Vitas1, Aleksandra Marković2 1 University of Belgrade, Studentski trg 1, Belgrade, Serbia 2 Institute for Serbian Language, SASA, Knez Mihailova 36, Belgrade, Serbia E-mail: ranka@rgf.rs, branislava.sandrih@fil.bg.ac.rs, rada.stijovic@isj.sanu.ac.rs, cvetana@matf.bg ...
... University of Belgrade). Vitas D. & Krstev C. (2015). Blueprint for the computerized dictionary of the Serbian language [Nacrt za informatizovani rečnik srpskog jezika]. Naučni sastanak slavista u Vukove dane, 44(3), pp. 105–116. (In Serbian, Cyrillic.) Vitas, D. & Krstev, C. (2012). Processing of Corpora ...Ranka Stanković, Branislava Šandrih, Rada Stijović, Cvetana Krstev, Duško Vitas, Aleksandra Marković. "SASA Dictionary as the Gold Standard for Good Dictionary Examples for Serbian" in Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference , Lexical Computing CZ, s.r.o. (2019)
-
Towards Automatic Definition Extraction for Serbian
U radu su prikazani preliminarni rezultati automatske ekstrakcije kandidata za definicije rečnika iz nestrukturiranih tekstova na srpskom jeziku u cilju ubrzanja razvoja rečnika. Definicije u rečniku Srpske akademije nauka i umetnosti (SANU) korišćene su za modelovanje različitih tipova definicija (opisnih, gramatičkih, referentnih i sinonimskih) koje imaju različite sintaksičke i leksičke karakteristike. Korpus istraživanja sastoji se od 61.213 definicija imenica, koje su analizirane korišćenjem morfoloških e-rečnika i lokalnih gramatika implementiranih kao pretvarači konačnih stanja u paketu za obradu korpusa otvorenog ...... for Serbian Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Towards Automatic Definition Extraction for Serbian | Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin ...
... employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Towards Automatic Definition Extraction for Serbian Stanković Ranka1, Krstev Cvetana1, Stijović Rada2, Gočanin Mirjana2, Škorić Mihailo1 1 University of Belgrade, Serbia 2 Institute for the Serbian Language of SASA, Serbia ...
... dictionary production by automatically associating some grammatical information to lemmas, namely, word class, word forms and different types of markers (Krstev 2008). Since the research related to extraction of dictionary examples has shown that information extraction from a corpus can be used to speed up ...Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Assocition for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
-
Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis
U ovom radu predstavljen je model koji omogućava prikupljanje, pripremu, opis metapodataka, upravljanje i eksploataciju, uključujući pretragu punog teksta dokumenata iz domena kriminalistike napisanih na srpskom jeziku. Predloženi pristup primenjuje se na veb portalu koji sakuplja različite tekstove nastale iz časopisa Akademije za kriminalistiku i policijske studije, Krivičnog zakona Srbije, konferencija „Tara“ i „Reiss“, kao i iz nekih doktorskih disertacija vezanih za ovu oblast istraživanje. Nakon obrade teksta, korpus koji sadrži preko 5500 stranica običnog teksta, kreiran je i ...... Cardiff 9 Cvetana Krstev, Duško Vitas, “Corpus and Lexicon - Mutual Incompletness ”, in Proceedings of the Corpus Linguistics Conference, 14-17 July 2005, Birmingham, eds. Pernilla Danielsson and Martijn Wagenmakers, ISSN 1747-9398, http://www.corpus.bham.ac.uk/PCLC/, 2005 10 Cvetana Krstev, Ranka Stanković ...
... resources and from the other with Omeka KPA digital library. 15 Cvetana Krstev. Processing of Serbian – Automata, Text and Electronic Dictionaries, Faculty of philology, Belgrade, 2008 16 Duško Vitas, Cvetana Krstev, Ivan Obradović, Ljubomir Popović, Gordana Pavlović-Lažetić”, An Processing ...
... Analysis as a next step in a study of forensic texts. REFERENCES 1. Cvetana Krstev. Processing of Serbian – Automata, Text and Electronic Dictionaries, Faculty of philology, Belgrade, 2008. 2. Cvetana Krstev, Duško Vitas, “Corpus and Lexicon - Mutual Incompletness ”, in Proceedings of the ...Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović. "Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis" in International Scientific Conference “Archibald Reiss Days” Thematic Conference Proceedings Of International Significance, Belgrade, 7-9 November 2017, Academy Of Criminalistic And Police Studies Belgrade (2017)
-
Увођење доменских и семантичких маркера за област рударства у српске електронске речнике
... and Lan- guage Processing, Draft of November 7, 2016. Крстев 2008: Cvetana Krstev, Processing of Serbian – Automata, Texts and Elec- tronic dictionaries Faculty of Philology, University of Belgrade, Belgrade. Крстев и др., 2008: Cvetana Krstev, DuškoVitas, Gordana Pavlović-Lažetić, “Re- sources and ...
... Peter Lang: Frankfurt am Main, pp. 3–17. Крстев и др., 2013: Cvetana Krstev, Ivan Obradović, Miloš Utvić, DuškoVitas, “A system for namedentity recognition based on local grammars”, In: J Logic Computation 24 (2), pp. 473–489. Крстев/Лазић, 2015: Цветана Крстев, Биљана Лазић, „Глаголи у кухињи и за ...
... за столом”, Научни састанак слависта у Вукове дане, 44/3, 117–136. Крстев и др. 2015: Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić “Terminology Acquisition and Description Using Lexical Resources and Local Grammars”, In: Proc. of the 11th Conferenceon Terminology and Artificial ...Иван Обрадовић, Александра Томашевић, Ранка Станковић, Биљана Лазић. "Увођење доменских и семантичких маркера за област рударства у српске електронске речнике" in Научни састанак слависта у Вукове дане - Српски језик и његови ресурси: теорија, опис и примене, Београд : Међународни славистички центар на Филолошком факултету, Филолошки факултет (2017). https://doi.org/10.18485/msc.2017.46.3.ch10
-
Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking
U radu se prikazuju rezultati istraživanja vezanih za pripremu paralelnih korpusa, fokusirajući se na transformaciju u RDF grafove koristeći NLP Interchange Format (NIF) za lingvističku anotaciju. Pružamo pregled paralelnog korpusa koji je korišćen u ovom studijskom slučaju, kao i proces označavanja delova govora, lematizacije i prepoznavanja imenovanih entiteta (NER). Zatim opisujemo povezivanje imenovanih entiteta (NEL), konverziju podataka u RDF, i uključivanje NIF anotacija. Proizvedene NIF datoteke su evaluirane kroz istraživanje triplestore-a korišćenjem SPARQL upita. Na kraju, razmatra se povezivanje Linked ...paralelni korpusi, povezivanje imenovanih entiteta, prepoznavanje imenovanih entiteta, NER, NEL, povezani podaci, NIF, VikipodaciRanka Stanković, Milica Ikonić Nešić, Olja Perisic, Mihailo Škorić, Olivera Kitanović. "Towards Semantic Interoperability: Parallel Corpora as Linked Data Incorporating Named Entity Linking" in Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024, Turin, 20-25 May 2024, ELRA and ICCL (2024)
-
Речници у дигиталном добу - информатичка подршка за српски језик
Биљана Рујевић (2022)Морфолошки речници српског језика представљају електронски језички ресурс који има значајну историју развоја и коришћења за потребе обраде природних језика. С обзиром на то да су чувани у облику датотека чији је број нарастао па је самим тим управљање речницима постало отежано јавила се потреба за смештањем информација из речника у облик лексикографске базе. Како би се омогућио симултани рад на развоју речника за више корисника јавила се потреба за веб-апликацијом заснованој на лексикографској бази. Како би се размотриле ...Биљана Рујевић. Речници у дигиталном добу - информатичка подршка за српски језик, Београд : [Б. Рујевић], 2022
-
Речник САНУ као база терминолошких речника (на примеру речника кулинарства)
... Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić and Aleksandra Trtovac. „Rule based automatic multi-word term extraction and lemmatization.” In: 10th edition of the Language Resources and Evaluation Conference (LREC), 23-28 May 2016, Portorož. 7. Ranka Stanković, Ivan Obradović, Cvetana Krstev ...
... док се електронски речници користе у истраживањима језика и креирању језичких алата. Морфолошке речнике српског језика развили су проф. др Цветана Крстев и проф. др Душко Витас уз помоћ Групe за језичке технологије Универзитета у Београду. Анализа обрађеног корпуса обухватила је екстракцију речи ...
... Катарина, Велики српски кувар. Нови Сад, 1904. Речник српскохрватског књижевног и народног језика. Београд, 1959–2014, I–XIX. Литература 1. Krstev, Cvetana, Duško Vitas and Gordana Pavlović-Lažetić. „Resources and methods in the morphosyntactic processing of Serbo-Croatian.” In Gerhild Zybatow et ...Рада Стијовић, Олга Сабо, Ранка Станковић. "Речник САНУ као база терминолошких речника (на примеру речника кулинарства)" in Словенска терминологија данас, Београд : Српска академија наука и уметности (2017)
-
Српски језик у дигиталном добу -- The Serbian Language in the Digital Age
Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević (2012)... Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Српски језик у дигиталном добу -- The Serbian Language in the Digital Age | Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan ...
... Popović Cvetana Krstev Ivan Obradović Gordana Pavlović-Lažetić Mladen Stanojević White Paper Series THE SERBIAN LANGUAGE IN THE DIGITAL AGE Серија белих књига СРПСКИ ЈЕЗИК У ДИГИТАЛНОМ ДОБУ Duško Vitas University of Belgrade Ljubomir Popović University of Belgrade Cvetana Krstev University ...
... [46] Open Office. http://ooo.matf.bg.ac.rs. [47] Resursi srpskog jezika (Serbian language resources). http://www.korpus.matf.bg.ac.rs. [48] Cvetana Krstev. Processing of Serbian – Automata, Texts and Electronic dictionaries. Faculty of Philology, University of Belgrade, 2008. [49] Đorđe Kostić. Corpus ...Duško Vitas, Ljubomir Popović, Cvetana Krstev, Ivan Obradović, Gordana Pavlović-Lažetić, Mladen Stanojević. "Српски језик у дигиталном добу -- The Serbian Language in the Digital Age" in META-NET White Paper Series, G. Rehm, H. Uszkoreit (eds.), Springer (2012)
-
From DELA Based Dictionary to Leximirka Lexical Database
Biljana Lazić, Mihailo Škorić (2020)In this paper, we will present an approach in transforming Serbian language Morphological dictionaries from a DELA text format to a lexical database dubbed Leximirka. Considering the benefits of storing data within a database when compared to storing them in textual documents, we will outline some of the functionality that the database has made possible. We will also show how hand-made rules that use category labels lexical entries are marked with can be used to link lexical entries. ...... https://halshs.archives-ouvertes.fr/ halshs-00278316/document Krstev, Cvetana. “Jedan prilaz informatičkom modeliranju teksta i algoritmi njegove transformacije”. Phdthesis, Univerzitet u Beogradu, Matematički fakultet, 1997 Krstev, Cvetana. Processing of Serbian. Automata, Texts and Electronic Dic- ...
... Mining and Geology Belgrade, Serbia 1 Introduction Prof. Dr. Dusko Vitas and Prof. Dr. Cvetana Krstev started working on the development of Serbian morphological dictionaries more than 25 years ago (Vitas, 1993; Krstev, 1997; Vitas et al., 1993). Morphological dictionaries represent a significant linguistic ...
... of Belgrade, 2008 Krstev, Cvetana, Ranka Stanković, Duško Vitas and Ivan Obradović. “WS4LR - a Worksation for Lexical Resources”. In Proceedings of the Fifth Interantional Conference on Language Resources and Evaluation, 1692–1697, 2006. http://poincare.matf.bg.ac.rs/~cvetana/biblio/ Krstev_467_new ...Biljana Lazić, Mihailo Škorić. "From DELA Based Dictionary to Leximirka Lexical Database" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.4