Search
391 items
-
New Language Models for South Slavic Languages
Mihailo Škorić (2024) The talk will present the challenges and perspectives of modeling South Slavic languages, with particular attention to general-purpose language models built on the transformer architecture (BERT, GPT), to the text datasets available for training those models, and to the quantity and quality of those datasets. The talk will offer an overview of the available datasets and models, with special attention devoted to the newest text corpora. The first corpus, Kišobran, is an umbrella web corpus of South Slavic languages and currently the largest text corpus in the region, numbering over eighteen billion words and including all ... Mihailo Škorić. "New Language Models for South Slavic Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)
-
Integracija heterogenih tekstualnih resursa
Ranka Stanković, Ivan Obradović (2007) The paper describes an approach to integrating heterogeneous textual resources for Serbian by means of a complex software tool developed specifically for this purpose. The structure and basic components of the developed system are described. The paper also sets out the possibilities, provided by the developed integrated environment, for improving the resources through mutual exchange of information. Finally, it describes the possible application of the integrated heterogeneous resources to query expansion and to text search in general, and indicates some directions for further development. ... Technology. Bucureş of the Romanian academy. t al. 2003 – Vitas, D. et al. (2003): Processing Serbian Written Texts: An Overview of Resources an (Hg.): Proceedings of the International Workshop on Balkan Language Resources and Tools. Thessaloniki, November 2003. S. 97–104. tanković – Ivan Obradović ...
... outils/ALIGN/align.html). l Conference on Language Silberz ti: Publishing house Vitas e d Basic Tools. In: Piperidis, S./Karakaletsis, V. Ranka S Integration of heterogeneous textual resources for Serbian developed within the Human Language Technology Group at the s constant enrichment ...
... these resource through mutual interchange of information. We also describe its even more important feature which opens new possibilities for processing of texts, namely resource combining, in particular the combining of morphological information from the dictionaries and semantic information from ...Ranka Stanković, Ivan Obradović. "Integracija heterogenih tekstualnih resursa" in Zbornik radova međunarodnog simpozijuma Razlike između bosanskog/bošnjačkog, hrvatskog i srpskog jezika, Graz, Austria, April 2007, - (2007)
-
Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit
In the digital environment of South Slavic languages, the analysis of emotions in social media texts is becoming increasingly important for understanding public opinion, creating personalized content, and analyzing interactions among users. In this paper we present a detailed methodology and the results of annotating a Serbian-language corpus according to Plutchik's categorization model, which recognizes eight basic emotional categories: joy, sadness, anger, fear, trust, disgust, anticipation and surprise. The aim of the research is to analyze the emotional content of texts taken from the social networks X (formerly Twitter) ... Milena Šošić, Ranka Stanković, Jelena Graovac. "Social-Emo.Sr: Emotional Multi-Label Categorization of Conversational Messages from Social Networks X and Reddit" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)
-
Coupling of artificial intelligence methods in the development of hybrid intelligent systems
In this paper we present an approach which couples various artificial intelligence (AI) methods in the solution of complex problems that cannot adequately be solved by a single AI method. We argue that the resulting, hybrid intelligent systems (HIS) can be successfully implemented with the use of available AI software libraries. Different coupling methods are analyzed and a classification of hybrid systems based on the chosen method is given. Two case studies of hybrid systems used in mining engineering ... hybrid intelligent systems, coupled systems, artificial intelligence methods, mining applications of artificial intelligence ... offers safe data archiving for complex data models as this one, as well as all procedures for data manipulation. The use of SQL as a standard query language for data manipulation secures the openness of the hybrid system INVENTS for a connection with different environments. The VENTEX [2] system was ...
... of different intelligent methods within a single architecture. Hybrid systems can be classified accordingly, on basis of their functionality, processing architecture and communication needs (fig.1). Function-replacing Intercommunicating Polymorphic Fig.1 Three proposed hybrid classes Hybrid ...
... suggests measures for air pollution regulation. In the development of the system we have used NeuroWindows, GeneHunter and VisualBasic for data processing and NN training. Hybrid system for analysis of area air pollution load GIS PollutNet AIRPRES Expert system for air pollution analysis Data ... Ranka Stanković, Ivan Obradović, Nikola Lilić. "Coupling of artificial intelligence methods in the development of hybrid intelligent systems" in X Kongres Matematičara, Matematički fakultet, Beograd (2001)
-
Веб-алат за управљање грађом Речника САНУ и анотација листића
The material on which the SANU Dictionary of the Serbo-Croatian Literary and Vernacular Language is based, which draws on over 4,500 written sources and 300 manuscript collections of words from the vernacular speech of the Shtokavian dialect area, is recorded on about 5,000,000 slips. This rich lexical material, which covers the literary and vernacular language of the past two centuries and on the basis of which at least 15 more volumes of the Dictionary are still to be written, also offers opportunities for a variety of linguistic and extralinguistic research. For that reason, ... Рада Стијовић, Ранка Станковић, Михаило Шкорић. "Веб-алат за управљање грађом Речника САНУ и анотација листића" in Rasprave Instituta za hrvatski jezik i jezikoslovlje, Institute of Croatian Language and Linguistics (2020). https://doi.org/10.31724/rihjj.46.2.32
-
Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages
Jelena Lazarević, Olivera Kitanović (2024) The aim of the paper is to investigate collocability as the way in which lexical units combine with words from different categories, forming larger units. The study of the semantic and syntactic principles of these combinations in the Spanish and Serbian language of football was carried out on the comparable football corpora SrFudKo and EsFudko, developed as part of Jelena Lazarević's doctoral dissertation titled Jezičke odlike diskursa novih medija o fudbalu: kontrastivna analiza na korpusu srpskog i španskog jezika. The football corpus SrFudKo, created from texts about football from five Serbian web portals: ... Jelena Lazarević, Olivera Kitanović. "Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024)
-
Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution
This paper explores the effectiveness of parallel stylometric document embeddings in solving the authorship attribution task by testing a novel approach on literary texts in 7 different languages, totaling 7051 unique 10,000-token chunks from 700 PoS and lemma annotated documents. We used these documents to produce four document embedding models using Stylo R package (word-based, lemma-based, PoS-trigrams-based, and PoS-mask-based) and one document embedding model using mBERT for each of the seven languages. We created further derivations of these ... Mihailo Škorić, Ranka Stanković, Milica Ikonić Nešić, Joanna Byszuk, Maciej Eder. "Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution" in Mathematics, MDPI AG (2022). https://doi.org/10.3390/math10050838
-
Advantages and challenges in presenting mathematical content using EDX platform
... resources as mathematical term bases are needed. According to [11] there is a great difference between natural languages and mathematical terms. For instance, in Serbian natural language the word “prava” is an adjective but within mathematical terms in Serbian it is a noun. Thus, there is a ...
... development of terminological dictionaries in various fields. The realization of the application was based on the ASP.NET Framework for C# programming language and MVC design pattern, as well as HTML and JavaScript, whereas SQL Server served as support for the database. The application is located at h ...
... http://termi.rgf.bg.ac.rs/ and consists of 5 specific units: browse, search, update, bibliography and profiles. Termi currently supports the processing and presentation of terms in Serbian and English, but support for other languages is also planned. On the Browse page all terms verified by editors ...Marija Radojičić, Ivan Obradović, Ranka Stanković, Olivera Kitanović, Roberto Linzalone. "Advantages and challenges in presenting mathematical content using EDX platform" in The Seventh International Conference on e-Learning (eLearning-2016), Belgrade : Metropolitan University (2016)
-
Дигиталне библиотеке у рударству и геологији са посебним освртом на представљање сиве литературе
Given the need to find information stored in the various forms of documentation generated in the fields of mining and geology at the Faculty of Mining and Geology of the University of Belgrade, development of the digital library ROmeka@RGF was begun on Omeka, a platform for presenting digital collections. A significant part of the documentation is so-called grey literature, which is predominantly present in the form of multi-volume documentation. The first challenge overcome was linking the various multi-volume parts of project reports into a single whole that would be easily accessible and searchable. ... which are designed to define document relations. We will also present some language resources for Serbian language which are used to improve information retrieval. Keywords: digital libraries, grey literature, Omeka, language resources, dictionaries. ...
... to Improve the Performance of Web Search Engines”. Sixth International Conference on Language Resources and Evaluation (LREC ‘08), Marrakech, Morocco. Nicoletta Calzolari et al. (ur.). Marrakech : European Language Resources Association (ELRA), 2008. . Okoroma, Francisca. „Grey Literature Management ...
... Search of Multilingual Digital Libraries of E-journals”. Eighth International Conference on Language Resources and Evaluation (LREC), Istanbul, Turkey. Nicoletta Calzolari et al. (ur.). Istanbul : European Language Resources Association (ELRA), 2012. 1710-1717. Stanković, Ranka, Cvetana Krstev, Ivan Obradović ...Биљана Лазић, Александра Томашевић, Михаило Шкорић. "Дигиталне библиотеке у рударству и геологији са посебним освртом на представљање сиве литературе" in Научна конференција Библиоинфо — 55 година од покретања наставе библиотекарства на високошколском нивоу, Београд 18. мај 2017., Филолошки факултет Универзитета у Београду (2019). https://doi.org/10.18485/biblioinfo.2017.ch13
-
Advantages of python programming language in hydrological model development
Milan Tucaković, Dragoljub Bajić, Vesna Ristić Vakanjac, Dušan Polomčić. "Advantages of python programming language in hydrological model development" in Proceedings of the XVIII Serbian Geological Congress, Divčibare, Serbia, 01-04 June 2022, Serbian Geological Society (2022)
-
English for Geology Students 1 – Dyslexia friendly
Lidija Beko (2023) Lidija Beko. English for Geology Students 1 – Dyslexia friendly, Belgrade : The Faculty of Mining and Geology, 2023
-
The application of ArcGIS for assessing the potential of solar energy in urban area: The case of Vranje
In order to determine the solar energy potential for a specified location, it is crucial to consider the latitude, altitude, slope, terrain morphology, atmospheric conditions, etc. Such a complex calculation and mapping of solar energy can be done using the ArcGIS geoprocessing tool, named Area Solar Radiation (ASR). By using the ASR tool, supported with the adequate input data, it is possible to calculate the maximum solar radiation energy (irradiation) for a defined area and for a specified time ...... .mre.gov.rs/doc/efikasnost-izvor/:ENERGETSKI- BILANS-REPUBLIKE-SRBIJE-ZA-2019-Sluzbeni-glasnik-RS-broj-105-18.pdf) "Available only in Serbian language". Pavlovic, T. Milosavljevic, D., Lambić, M., et al., 2011. "Solar energy in Serbia". Contemporary Materials (Renewable energy sources), Volume ...
... w.mre.gov.rs/doc/efikasnost-izvor/ENERGETSKI- BILANS-REPUBLIKE-SRBIJE-ZA-2019-Sluzbeni-glasnik-RS-broj-105-18.pdf) "Available only in Serbian language". Republic of Serbia, Ministry of Mining and Energy, Department for strategic planning in energy sector, 2016. "Energy Sector Development Strategy ...
... Energy, Volume 132, p. 68-80. Stamenkovic LJ., 2009. "Using the solar PV energy in Serbia" Jefferson Institute, Belgrade "Available only in Serbian language". (No available internet link). 12th International Conference on Energy and Climate Change, 9-11 October 2019, Athens - Greece Statistical ... Boban Pavlović, Milica Pešić-Georgiadis. "The application of ArcGIS for assessing the potential of solar energy in urban area: The case of Vranje" in 12th International Conference on Energy and Climate Change, 9-11 October 2019, Athens - Greece, Energy Policy and Development Centre (KEPA) of the National and Kapodistrian University of Athens (2019)
-
English for Geology Students 2 - Dyslexia friendly
Lidija Beko (2023) Lidija Beko. English for Geology Students 2 - Dyslexia friendly, Belgrade : The Faculty of Mining and Geology, 2023
-
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022) In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ... Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
Sentiment Analysis of Serbian Old Novels
In this paper we present the first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of the sentiment lexicon was based on three existing lexicons: NRC, AFFIN and Bing, with additional extensive corrections. The first phase of dataset refinement included filtering out the words that are not found in the Serbian morphological dictionary, and in the second, the automatic POS tags and lemmas were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ... Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
-
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)
In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action "Distant Reading for European Literary History" (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840-1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ... Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
-
Razvoj ARCGIS geobaze površinskog kopa korišćenjem UML CASE alata
... ad logical modeling. In this paper we shall outline the third, most systematic way: the creation of a database model using the Unified Modeling Language (UML) and Computer-Aided Software Engineering (CASE) tools, which support the geodatabase development process. The first two approaches to geodatabase ...
... geodatabases with a larger number of feature and attribute classes. In the software engineering field UML is the standard object-oriented modeling language, which encompasses a set of techniques enabling visual representation of the model [Naiburg, Maksimchuk, 2002]. The model represents the foundation ...
... establishment of relations between descriptive, alphanumeric, non-spatial data and spatial data within a geodatabase, and enabling their analysis, processing and presentation. In a traditional database the user cannot make spatial queries such as: “Which boreholes are at a distance of 50 meters from ... Aleksandra Tomašević, Ljiljana Kolonja, Ivan Obradović, Ranka Stanković, Olivera Kitanović. "Razvoj ARCGIS geobaze površinskog kopa korišćenjem UML CASE alata" in Podzemni radovi, Beograd : Univerzitet u Beogradu - Rudarsko-geološki fakultet (2012)
-
Аутоматска екстракција дефиниција – допринос убрзању израде речника
descriptive dictionaries, meta-analysis of lexicographic definitions, automatic definition extraction, electronic dictionaries, Serbian language ... Рада Стијовић, Цветана Крстев, Ранка Станковић. "Аутоматска екстракција дефиниција – допринос убрзању израде речника" in Лексикологија и лексикографија у светлу актуелних проблема, Институт за српски језик САНУ (2021)
-
Testing the energy value of different types of coal by the method of active thermography
In this paper, coal thermograms are presented and analyzed in order to determine their energy value. Two types of coal of different categories, brown and lignite, were selected for active thermographic imaging. The tested coal samples were processed before measurement so that they are similar in dimensions and have two plane-parallel smooth surfaces. The test samples were "primarily" heated under the same conditions and the process of their cooling was monitored by a thermal camera. "Then" they were cooled ...Stevan Đenadić, Ljubiša Tomić, Vesna Damnjanović, Predrag Jovančić, Dragutin Jovković. "Testing the energy value of different types of coal by the method of active thermography" in 9th International Scientific Conference on Defensive Technologies, Belgrade, Serbia, 15-16 October 2020, The Military Technical Institute (2020)
-
Building Terminological Resources in an e-Learning Environment
... functionality within the information system, a UML (Unified Modeling Language) engineering model with a special structure has been developed, whose main features are depicted in Figure 2. Assuming basic familiarity with this language we will briefly comment on this model. The class Rečnik in the model ...
... synonyms of the basic term, its available translational equivalent in the chosen language, and the inflectional forms of the Serbian term and its synonyms. Namely, as Serbian is a morphologically very rich language, there was a need to provide for all inflectional forms of terms, as they can be ...
... they are indispensable in information and document retrieval systems. In addition to monolingual resources, machine translation systems and cross-language information retrieval emphasize the need for development of bilingual and multilingual terminological resources as well. However, terminological ... Ranka Stanković, Ivan Obradović, Olivera Kitanović, Ljiljana Kolonja. "Building Terminological Resources in an e-Learning Environment" in Proceedings of the Third International Conference on e-Learning, eLearning-2012, September 2012, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2012)