228 items
Bilingual lexical extraction based on word alignment for improving corpus search
Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056
Formative evaluation of e-learning projects with the logical framework approach
... Projects and Programs results can be distinguished in fact, in: outputs, outcomes and impacts [4][10][11][12]. Outputs are the products and/or services carried out from the project implementation. Outcomes and impacts are both effects of the output, that are observable along the time in the ...
... defined as “the use of new multimedia technologies and the Internet to improve the quality of learning by facilitating access to resources and services, as well as remote exchange and collaboration” [16]. A value-oriented definition of E-learning, sees it as a broad combination of processes ...
... focus. A project of E-Learning can be defined as a temporary endeavour aimed to creating an ICT-based infrastructure, to deliver support services to education, learning, whose effects are detectable along the time, in terms of higher effectiveness/efficiency of learning, wider and higher ...
Roberto Linzalone, Giovanni Schiuma, Ivan Obradović, Ranka Stanković. "Formative evaluation of e-learning projects with the logical framework approach" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade Metropolitan University (2015)
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...
electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansion
... which provides for embedding Google searches into web pages or web applications. The abundance of Google services (Web Search, Local Search, Video Search, Blog Search, News Search and Book Search) are used by this library, consisting of simple web objects aimed at performing “inline” search. A ...
... in different scenarios: as a stand alone Windows application LeXimir.exe or as a web application VeBrana.aspx2, also known as VeBrana (previously WS4QE), which is supported by the wsQueryExpand.asm web service. The web service accepts and generates data sets in XML form, which are further converted ...
... subsequently generates the inflected forms. Query expansion in the web environment is implemented in a similar way, with different levels for expansion details. VeBrana accepts the query from the user and submits it to the local web service, which then expands the query and forwards it to the Google ...
Ranka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
Electronic Dictionaries - from File System to lemon Based Lexical Database
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same ...
... Buitelaar, P., McCrae, J., and Sintek, M. (2011). LexInfo: A declarative model for the lexicon-ontology interface. Web Semantics: Science, Services and Agents on the World Wide Web, 9(1):29–51. Courtois, B. and Silberztein, M. (1990). Dictionnaires électroniques du français, volume 87 of Langue ...
... been designed for ontology lexicons on the Semantic Web. It is aimed at enriching the conceptualization represented by a given ontology by means of a lexico-terminological layer (McCrae et al., 2012). In order to enable sharing on the semantic web, and for interface with tools lemon is based on RDF ...
... desktop application, was that dictionary updates by one user could not be synchronized with other users in real time. Thus, we decided to develop a web application for dictionary management, and enhance the development environment from single-user to multi-user. In addition to that, LeXimir did ...
Ranka Stanković, Cvetana Krstev, Biljana Lazić, Mihailo Škorić. "Electronic Dictionaries - from File System to lemon Based Lexical Database" in Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
Development of the Geological Terminological Dictionary GeolISSTerm (Развој геолошког терминолошког речника ГеолИССТерм)
... Obrst, Leo & Smith, Kevin 2003. The Semantic Web: A Guide to the Future of XML, Web Services, and Knowledge Management, John Wiley & Sons. McGuinness, Deborah. 2003. Ontologies Come of Age. Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential, Dieter Fensel, Jim Hendler ...
... existing between them. Ontologies are used in important fields of computer and information science, such as artificial intelligence, semantic web, information systems, etc., as a form of representation of knowledge about the world or some part of it. Generally speaking, ontologies describe: ...
... t environment for the GIS systems (http://www.esri.com/arcgis), as an extension of the ArcMap tool for cartographic content management and – Web application for browsing and searching the dictionary The data entry interface (Figure 6) displays the structure i.e. organization of concepts and ...
Ranka Stanković, Branislav Trivić, Olivera Kitanović, Branislav Blagojević, Velizar Nikolić. "Развој геолошког терминолошког речника ГеолИССТерм" in INFOteka: časopis za informatiku i bibliotekarstvo, Beograd : Zajednica biblioteka univerziteta u Srbiji (2011)
A Data Driven Approach for Raw Material Terminology
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja (2021)
The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has ...
raw materials, mining, terminology, dictionary, terminological application, mobile application, digitization, lexical data, corpora, linked open data
... corpus-evidence-based approach was needed. A method for the selection of good examples for Serbian terms was developed based on a feature extraction web services and knowledge retrieved from SASA Dictionary as the Gold Standard for Good Dictionary Examples (GDEX) for Serbian [54]. The method is based on a ...
... using VocBench, a web-based, multilingual, collaborative development platform for managing Ontolex-lemon lexicons among other RDF datasets [59], for publishing terminology as RDF data, in order to meet the needs of semantic web and linked data environments. VocBench is an open source web platform for ...
... as one of the core pillars of the Semantic Web or the Web of Data, provides links between data sets that are understandable not only to humans, but also to machines, by sharing machine-readable interlinked data on the Web. The next big challenge for the future is ...
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja. "A Data Driven Approach for Raw Material Terminology" in Applied Sciences, MDPI AG (2021). https://doi.org/10.3390/app11072892
Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons
Mihailo Škorić (2017)
The goal of this paper is to draw attention to the possibility of using emoticon-riddled text on the web in language-neutral sentiment analysis. It introduces several innovations in the existing framework of research and tests their effectiveness. It also presents a software tool especially made for that purpose, explains how it builds a database with sentimental value of terms and offers the user manual. Finally, it presents a software tool that tests the new database and gives some examples ...
... two groups. Social and demographic research: – Marketing research: exploration of current vs alternative approach to the marketing of products and services. This is the most common use of similar studies primarily for financial reasons, because using these companies can save money or get a new influx ...
... DOI 10.18485/infotheca.2017.17.1.4 ABSTRACT: The goal of this paper is to draw attention to the possibility of using emoticon-riddled text on the web in language-neutral sentiment analysis. It introduces several innovations in the existing framework of research and tests their effectiveness. It ...
... new database and gives some examples of the analysis of the obtained results. KEYWORDS: data mining, information extraction, emotions, text on the web. PAPER SUBMITTED: 24 January 2017 PAPER ACCEPTED: 25 March 2017 Mihailo Škorić miks@tesla.rcub.bg.ac.rs University of Belgrade 1 Introduction ...
Mihailo Škorić. "Classification of Terms on a Positive-Negative Feelings Polarity Scale Based on Emoticons" in Infotheca, Faculty of Philology, University of Belgrade (2017). https://doi.org/10.18485/infotheca.2017.17.1.4
Digitalization in Mining: Creating a System for Efficient Business Reporting (Digitalizacija u rudarstvu: Kreiranje sistema za efikasno poslovno izveštavanje)
The decision-making process in mining and geology depends on the timely availability of high-quality data and information. The complexity of mining processes requires data to be collected on a daily or per-shift basis. Such data alone, without an analytical approach, are not sufficient. For data access to be fast and efficient, an adequate digital solution backed by adequate centralized databases is necessary. This paper gives an overview of the current position of mining from the perspective of digital transformation, as well as a proposal for a simple prototype in the form of a digital system for business ...
... an MMS example of a web work order is shown for entering data on loading and haulage equipment, which can be used on mobile phones, portable tablets, or computers. In addition, web forms must also support deferred submission when no internet connection is available. Figure 2. Example of web work orders for ...
... [7] Wolf, M., Semm, A. & Erfurth, C. (2018). Digital transformation in companies-challenges and success factors. In Innovations for Community Services: 18th International Conference, I4CS 2018, Žilina, Slovakia, pp. 178-193. [8] Earley S. (2014). The digital transformation: staying competitive ...
Stevan Đenadić, Aleksandar Mirković, Veljko Rupar. "Digitalizacija u rudarstvu: Kreiranje sistema za efikasno poslovno izveštavanje" in XVI Međunarodna rudarska konferencija OMC 2024, Zlatibor, 9 - 12. oktobar 2024, Jugoslovenski komitet za površinsku eksploataciju (2024)
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...
... which provides for embedding Google searches into web pages or web applications. The abundance of Google services (Web Search, Local Search, Video Search, Blog Search, News Search and Book Search) are used by this library, consisting of simple web objects aimed at performing “inline” search. Ack ...
... in different scenarios: as a standalone Windows application LeXimir.exe or as a web application VebRanka.aspx3, also known as VebRanka (previously WS4QE), which is supported by the wsQueryExpand.asm web service. The web service accepts and generates data sets in XML format, which are further converted ...
... which is, as we have seen, the correct one in most cases. Query expansion in the web environment offers different levels for expansion details. VebRanka accepts the query from the user and submits it to the local web service, which then expands the query and forwards it to the Google search engine ...
Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
Resource-based WordNet Augmentation and Enrichment
In this paper we present an approach to support production of synsets for Serbian WordNet (SerWN) by adjusting Princeton WordNet (PWN) synsets using several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled from bilingual resources. Preliminary results obtained from a set of 1248 selected PWN synsets show that the produced Serbian synsets contain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...
... 2 Google Apps Script is a scripting language based on JavaScript that provides easy ways to automate tasks across Google products and third party services and build web applications – https://developers.google.com/apps-script/overview 3 https://cloud.google.com/translate/docs/translating-text Proceedings of ...
... el.eu/ 9 http://eurovoc.europa.eu/ Office, which moved forward to ontology-based thesaurus management and semantic web technologies compliant to W3C recommendations, as well as latest trends in thesaurus standards. For this research we used the bilingual en-sr version ...
... on-line resources and textbooks and used for generating lists of translational equivalents. Finally, we compiled a list of aligned en-sr terms from the web site of the Serbian Institute for Standardization. 3. Production The final parallel list of translation equivalents compiled from all of the abov ...
Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School
The first training school organized by the COST Action NexusLinguarum was held from 8 to 12 February 2021, with the aim of teaching students, researchers, and practitioners the basics of linguistic data science. During the training, participants were introduced to a wide range of topics: from the Semantic Web, RDF, and ontologies to modelling and querying language data with state-of-the-art ontological models and tools. The school was held as part of the EUROLAN series of summer schools and was organized virtually (online) by several institutes; ...
linguistic data science, linked data in linguistics, language data, EUROLAN, NexusLinguarum, COST Action, training school
... linguistic resources using semantic web technologies, together with the means to extract knowledge from language resources and exploit it using semantic web query languages and reasoning capabilities. The topics addressed in the school were the following: – Semantic Web and Linked Data4 (Berners-Lee et ...
... RDF(S), RDF-S, or RDF/S), Web Ontology Language (OWL),5 etc.); – SPARQL query language- a semantic query language for databases able to retrieve and manipulate data stored in the RDF format; 2. EUROLAN 3. Deliverable D1.1 4. Introducing Linked Data and the Semantic Web 5. OWL 114 Infotheca Vol. 21 ...
... m “European Network for Web-centred Linguistic Data Science”. References Berners-Lee, Tim, Yuhsin Chen, Lydia Chilton, Dan Connolly, Ruth Dhanaraj, James Hollenbach, Adam Lerer, and David Sheets. 2006. “Tabulator: Exploring and analyzing linked data on the semantic web.” In Proceedings of the 3rd ...
Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković. "EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.7
Open Educational Resources in Serbia
... Metadata Portal (BMP) with metadata for all published OER within BAEKTEL network. ● Terminological web application for management, browse and search of terminological resources. ● Web services for linguistic support (query expansion, information retrieval, OER indexing, etc) ● Annotation of ...
... includes distance learning, intelligent tutoring systems, adaptive user modeling and Semantic Web. She teaches traditional, online and blended IT courses. She was coordinator of the regional Tempus project DL@WEB for improving the quality of distance learning in higher education institutions in the Western ...
... other portals, that are evaluated as good quality and might be interesting for a wider audience. Apart from several OERs related to programming and web design, it includes learning materials on Social Networks analysis, GIS application in geology, Lexical Analysis in Natural Language Processing, ...
Ivan Obradović, Ranka Stanković, Marija Blagojević, Danijela Milošević. "Open Educational Resources in Serbia" in Current State of Open Educational Resources in the “Belt and Road” Countries, Springer Singapore (2020). https://doi.org/10.1007/978-981-15-3040-1_10
Possibilities for introduction of Bioenergy Village concept - a case study of Serbia
Ivezic Dejan, Gluščević Miodrag, Jerotić Slobodan. "Possibilities for introduction of Bioenergy Village concept - a case study of Serbia" in Proceedings of the 3rd Renewable Energy Sources - Research and Business - BOOK OF ABSTRACTS, Brisel:Wojciech Budzianowski Consulting Services (2018): 53-54
INVENTS: A Hybrid Mine Ventilation Planning and Design System
Ventilation system analysis is a complex process based on the calculation and analysis of numerous parameters. These problems can be successfully solved by the SimVent numerical package, but a full understanding and use of the obtained results require the involvement of an experienced specialist in the ventilation field. The solution was found in the creation of a hybrid system INVENTS, whose knowledge base represents a formalization of the expert knowledge in the mine ventilation field. In this paper we ...
... characteristic is the separation of domain model and user interface. The domain model is represented by business services and data services, while the user interface is represented by user services. Figure 5 depicts the three-level class diagram architecture of the SimVent package. Fig. 5. Three level ...
... package can successfully be used in the analysis of ventilation system stability for mine defense and rescue plan verification within mine ventilation services, design and research companies. Fig. 4. Examples of interface forms for the ResNet software package The object-oriented approach in system s ...
... level class diagram architecture of the SimVent package Five class forms can be identified within the user services of SimVent: frmOsnovni, frmPrikaz, frmSimul, Graf and Mreza. These classes represent interface forms for entering, viewing and searching the data, a form for drawing the ventilation system ...
Lilić Nikola, Stanković Ranka, Obradović Ivan. "INVENTS: A Hybrid Mine Ventilation Planning and Design System" in Proceedings of International Scientific Conference of FME Session 4: Automation Control and Applied Informatics, Hong Kong : iConcept Press (2013)
INVENTS: a hybrid system for subsurface ventilation analysis
Ventilation system analysis is a complex process based on the calculation and analysis of numerous parameters. These problems can be successfully solved by the SimVent numerical package, but a full understanding and use of the obtained results require the involvement of an experienced specialist in the ventilation field. The solution was found in the creation of a hybrid system INVENTS, whose knowledge base represents a formalization of the expert knowledge in the mine ventilation field. In this paper we ...
... characteristic is the separation of domain model and user interface. The domain model is represented by business services and data services, while the user interface is represented by user services. Figure 5 depicts the three-level class diagram architecture of the SimVent package. Fig. 5. Three level ...
... package can successfully be used in the analysis of ventilation system stability for mine defense and rescue plan verification within mine ventilation services, design and research companies. Fig. 4. Examples of interface forms for the ResNet software package The object-oriented approach in system s ...
... level class diagram architecture of the SimVent package Five class forms can be identified within the user services of SimVent: frmOsnovni, frmPrikaz, frmSimul, Graf and Mreza. These classes represent interface forms for entering, viewing and searching the data, a form for drawing the ventilation system ...
Nikola Lilić, Ranka Stanković, Ivan Obradović. "INVENTS: a hybrid system for subsurface ventilation analysis" in Proc. of International Scientific Conference of FME, September 2000, Ostrava, FME (2000)
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...
... in Figure 2 is based on web services, thus enabling other applications to use some of them, such as indexing or document information retrieval, for term extraction. The current application is developed and tested within a Windows environment, while a corresponding web application, which would ...
... al dictionaries, which would provide coverage of terms from a specific domain. In our future work we will concentrate on: Finalization of the web application; Improvement of the precision of correct lemma production (development of additional strategies to avoid offering of incorrect lemmas) ...
Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23-28 May 2016, European Language Resources Association (2016)
The Many Faces of SrpKor
The acronym SrpKor denotes a family of electronic corpora of contemporary Serbian whose construction began in the late 1970s and which became more widely visible to the interested research community when its first version was published on the web in 2002. Over this long period, especially before useful textual resources appeared on the web, corpus development consisted of collecting and processing material as well as developing corpus processing methods. Namely, an electronic corpus is not merely a collection of texts in digital form (as is stated, for example, ...
Duško Vitas, Ranka Stanković, Cvetana Krstev. "The Many Faces of SrpKor" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)
Towards translation of educational resources using GIZA++
... with ID: 1.2010.1.4. From aligned TMX documents is easy to produce parallel text form for tools like Giza++, or JSON format suitable for web services and Mongo and other NoSQL databases. Image 2: An example excerpt from a TMX document 5. TOWARDS MACHINE TRANSLATION FOR SERBIAN Moses ...
... augmentation of Biblisha library. The detailed evaluation will be performed when we reach at least 100000 sentence pairs. Our aim is to publish SMT based web service (API) and integrate it with eLearning systems that we use: Moodle and edX, REFERENCES [1] Class Central • Discover Free Online Courses ...
Ivan Obradović, Dalibor Vorkapić, Ranka Stanković, Nikola Vulović, Miladin Kotorčević. "Towards translation of educational resources using GIZA++" in The Seventh International Conference on e-Learning (eLearning-2016), September 2016, Belgrade : Metropolitan University (2016)
An Approach to Development of Bilingual Lexical Resources
... European languages [Piperidis, 2012]. Another project, the Multilingual Web Initiative, led by W3C, is a thematic network, exploring standards and best practices supporting the creation, localization and use of multilingual web-based information [Filip et al., 2012]. Hence, the importance of m ...
... morphological and semantic expansion, This is essentially handled by a web service (wsQueryExpand.asmx), which is part of the LeXimir software package, a multipurpose tool also developed by the HLT Group [Stanković et al., 2011]. The web service invokes LeXimir’s function library LeXimirCore, whose ...
... their analysis, the synset pair {browser, web browser}-{prelistač, veb prelistač, pregledač veba} was entered into Biblimir. Query expansion for the keyword browser after this addition to Biblimir is depicted in Figure 4. Figure 4: Part of the web page for query expansion. The English term ...
Stanković Ranka, Obradović Ivan, Trtovac Aleksandra. "An Approach to Development of Bilingual Lexical Resources" in Proceedings of the Fifth Balkan Conference in Informatics BCI 2012, Workshop on Computational Linguistics and Natural Language Processing of Balkan Languages – CLoBL 2012, September 2012, Novi Sad : BCI (2012)
New Language Models for South Slavic Languages
Mihailo Škorić (2024)
The talk will present the challenges and prospects of modelling South Slavic languages, with particular attention to general-purpose language models built on the transformer architecture (BERT, GPT), to the text collections available for training those models, and to the quantity and quality of those collections. The talk will offer an overview of the available collections and models, with special attention given to the most recent text corpora. The first corpus, Kišobran, is an umbrella web corpus of South Slavic languages and currently the largest text corpus in the region, with over eighteen billion words, and includes all ...
Mihailo Škorić. "New Language Models for South Slavic Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)