Претрага
179 items
-
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++
Branislava Šandrih, Ranka Stanković (2020)U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...... Измењено: 2023-10-14 04:19:58 Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++ Branislava Šandrih, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++ ...
... Vol. 43, no. 4 (2017): 837–892 Cram, D. and B. Daille. “Terminology Extraction with Term Variant Detec- tion”. In Proceedings of ACL-2016 System Demonstrations, 13–18, 2016. Fawi, F. and R. Delmonte. “Italian-Arabic Domain Terminology Extraction From Parallel Corpora”. In Proceedings of the Second Italian ...
... available at: www.dr.rgf.bg.ac.rs Scientific paper Extraction of Bilingual Terminology using Graphs, Dictionaries and GIZA++ UDC 81’322.2 DOI 10.18485/infotheca.2019.19.2.6 ABSTRACT: In science, industry and many research fields, terminology is rapidly developing. Most often, a language that is ...Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...... tables and graphs in Section 4. The solution to terminology extraction outlined in this paper will by all means speed up the development of e-dictionaries, as in addition to the terminology extraction, the approach can be applied to the extraction of MWUs belonging to general lexica. Expanding the ...
... 97% were associated with correct lemmas. Keywords: term extraction, terminology, multi-word units, lemmatization, finite-state transducers 1. Motivation Various approaches have been proposed for multi-word term (MWT) extraction as this problem has been gaining in importance in the field ...
... after manual evaluation 5. Concluding remarks and Future Work The paper presents an approach to terminology extraction for Serbian based on e-dictionaries and local grammars. For extraction purposes 14 graphs were developed, which extract the most frequent syntactic structures identified by ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Using English Baits to Catch Serbian Multi-Word Terminology
In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflection... domain terminology. Keywords: aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflection 1. Motivation Terminology is rapidly developing in many research and technological fields. It is very difficult to produce and main- tain up-to-date terminology resources ...
... The rule-based system for the terminology extraction The approach to terminology extraction for Serbian based on e-dictionaries and local grammars described in (Stankovic et al., 2016) was improved with additional syntactic patterns in order to cover as many terminology structures as possible (Ranka ...
... present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach ...Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
A Data Driven Approach for Raw Material Terminology
Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja (2021)The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has ...sirovine, rudarstvo, terminologija, rečnik, terminološka aplikacija, mobilna aplikacija, digitizacija, leksički podaci, korpusi, otvoreni povezani podaci... material terminology, and have also been included in this research. Some important related issues discussed are collocation extraction methods, the use of domain labels, lexical and semantic relations, definitions and subentries. Keywords: raw material; mining; terminology; dictionary; terminology application; ...
... material terminology, and have also been included in this research. Some important related issues discussed are collocation extraction methods, the use of domain labels, lexical and semantic relations, definitions and subentries. Keywords: raw material; mining; terminology; dictionary; terminology application; ...
... other tasks, corpora are used for extraction of mining terminology, definitions and usage examples by applying different methods and tools. 3.3. Adding New Serbian Terms to General Lexica Dictionaries Terminology from digitized dictionaries of raw material terminology in Serbian was checked by SrpMD ...Olivera Kitanović, Ranka Stanković, Aleksandra Tomašević, Mihailo Škorić, Ivan Babić, Ljiljana Kolonja. "A Data Driven Approach for Raw Material Terminology" in Applied Sciences, MDPI AG (2021). https://doi.org/10.3390/app11072892
-
Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages
Jelena Lazarević, Olivera Kitanović (2024.)Cilj rada je istraživanje kolokabilnosti kao načina na koji se leksičke jedinice povezuju sa rečima iz različitih kategorija, formirajući veće jedinice. Istraživanje semantičkih i sintaksičkih principa ovih kombinacija u španskom i srpskom jeziku fudbala izvedeno je na komparabilnim fudbalskim korpusima SrFudKo i EsFudko, razvijenim u okviru doktorske disertacije Jelene Lazarević pod nazivom: Jezičke odlike diskursa novih medija o fudbalu: kontrastivna analiza na korpusu srpskog i španskog jezika. Korpus fudbala SrFudKo, kreiran na osnovu tekstova o fudbalu sa pet srpskih veb-portala: ...Jelena Lazarević, Olivera Kitanović . "Contrastive Analysis of Syntax Patterns in Comparable Football Corpora in Spanish and Serbian Languages" in South Slavic Languages in the Digital Environment JuDig Book of Abstracts, University of Belgrade - Faculty of Philology, Serbia, November 21-23, 2024, University of Belgrade - Faculty of Philology (2024.)
-
Terminological and lexical resources used to provide open multilingual educational resources
Open educational resources (OER) within BAEKTEL (Blending Academic and Entrepreneurial Knowledge in Technology enhanced learning) network will be available in different languages, mostly in the languages of Western Balkans, Russian and English. University of Belgrade (UB) hosts a central repository based on: BAEKTEL Metadata Portal (BMP), terminological web application for management, browse and search of terminological resources, web services for linguistic support (query expansion, information retrieval, OER indexing, etc.), annotation of selected resources and OER repository on local edX ...... Knowledge in Technology enhanced learning) resources, lexical resources, the process of terminology extraction and a presentation of TERMI, an application for terminology management. 2. TERMINOLOGICAL RESOURCES Terminology is considered to be a young interdisciplinary scientific field. The interest in it ...
... specific domains (e. g. law, biomedicine, mythology, culinary etc.). 5. TERMINOLOGY EXTRACTION Bearing in mind rapid changes in scientific domains and new terms production, automatic terminology recognition and extraction has become an important task. The extracted terms are then included in ontologies ...
... semiautomatic approach for term recognition, extraction and lemmatization. Picture 1 illustrates steps in terminology extraction. Crucial resources are morphological dictionaries and grammars. They are combined with some statistical measures for term extraction. The first step is analysis of terms in ...Biljana Lazić, Danica Seničić, Aleksandra Tomašević, Bojan Zlatić. "Terminological and lexical resources used to provide open multilingual educational resources" in The Seventh International Conference on eLearning (eLearning-2016), 29-30 September 2016, Belgrade, Serbia, Belgrade : Belgrade Metropolitan University (2016)
-
A bilingual digital library for academic and entrepreneurial knowledge management
A generic knowledge management process of organization, storage and retrieval of knowledge can suitably be fitted in a digital library. In the digital and knowledge age digital libraries can be used in knowledge management to handle intellectual assets and support knowledge creation. A multilingual digital library either stores content in more than one language or provides multilingual query access to monolingual content. In Serbia 18 of 308 scientific journals regularly published are bi-lingual, with papers simultaneously being in English ...... (Chen, 2004). Having in mind that a terminology research consists of: analyzing the concepts and concept structures used in a field or domain of activity, identifying the terms assigned to the concepts and in the case of bilingual or multilingual terminology, establishing correspondences between ...
... makes it a valuable tool for terminological research. Keywords – Knowledge management, Digital library, Multilingualism, Language Resources, Terminology Paper type – Academic Research Paper 2 ...
... between terms in the various languages, we have made an effort to develop a software tool and bi-lingual resource that supports terminology research. This paper presents a developed solution, named Bibliša1, which is free for use and publicly available. In the second section of this paper are presented ...Ranka Stanković, Cvetana Krstev, Biljana Lazić, Dalibor Vorkapić. "A bilingual digital library for academic and entrepreneurial knowledge management" in Proceeding of 10th International Forum on Knowledge Asset Dynamics — IFKAD 2015: Culture, Innovation and Entrepreneurship: connecting the knowledge dots, Bari, Italy, 10-12 June 2015, Bari : IFKAD (2015)
-
Keyword Extraction from Parallel Abstracts of Scientific Publications
... is publicly available from http://langnet.uniri. hr/resources.html. The previous research [14] for terminology extraction in the Serbian language used the rule-based method for multi-word term extraction that relies on lexical resources for modeling various syntactic structures of multi-word terms. It ...
... text. Keywords: Graph-based keyword extraction Bilingual keyword extraction · SBKE method · Parallel abstracts 1 Introduction The task of keyword extraction is to automatically identify a set of terms that best describes the document [1,2]. Keyword extraction can be a demanding task, especially when ...
... keyword extraction from bilingual or multilingual tex- tual sources. In such a case, a keyword extraction method should be insensitive to natural language or appropriate for extraction in different natural languages at the same time. One of the open research questions in the keyword extraction task is ...Slobodan Beliga, Olivera Kitanović, Ranka Stanković, Sanda Martinčić-Ipšić . "Keyword Extraction from Parallel Abstracts of Scientific Publications" in Sematic Keyword-Based Search on Structured Data Sources - Third International KEYSTONE Conference, IKC 2017 Gdańsk, Poland, September 11–12, 2017 Revised Selected Papers and COST Action IC1302 Reports, Springer (2017)
-
Comparison of sequential and single extraction in order to estimate environmental impact of metals from fly ash
летећи пепео угља, екстракција са једним агенсом, секвенцијална екстракција, микроталасне пећнице, ултраталасиAleksandra Tasić, Ivana Sredović-Ignjatović, Ljubiša Ignjatović, Marija Ilić, Mališa Antić. "Comparison of sequential and single extraction in order to estimate environmental impact of metals from fly ash" in Journal of the Serbian Chemical Society (2016). https://doi.org/10.2298/JSC160307038T
-
Some examples of interactions between certain rare earth elements and soil
... happened in several different phases and the extraction in phases with milder extraction agents had occurred. In case of neodymium, the nature of soil is important in particular, since only in certain extraction phases there came to ion extraction, and also due to slightly different neodymium ...
... stic for humus. The extraction was noticed in ion exchange phase (Phase I) and in extraction phases with acids (Phases IV and V). The results show that in sand soil about 80 % of neodymium ions have been extracted and in the clay type of soil 96 % (Table VI). The extraction from humus soil was ...
... quantities depend on soil type, nature of extraction agent and metallic ion characteristics. Neodymium, first of all, due to its ionic diameter reacts differently with soil. For that reason the extraction happened only with solutions of certain extraction agents depending on their nature and ionic ...Zlatko Nikolovski, Jelena Isailović, Dejan Jeremić, Sabina Kovač, Ilija Brčeski. "Some examples of interactions between certain rare earth elements and soil" in Journal of the Serbian Chemical Society, National Library of Serbia (2021). https://doi.org/10.2298/JSC211006095N
-
Corpus-based bilingual terminology extraction in the power engineering domain
Ovaj rad predstavlja resurse i alate koji se koriste za ekstrkciju i evaluaciju dvojezične, englesko-srpske terminologije u domenu energetike. Resursi se sastoje od postojeće opšte i domenske leksike i domenskog paralelnog korpusa; alati uključuju ekstraktore termina za oba jezika i alat za poravnavanje segmenata koji pripadaju korpusnim rečenicama. Sistem je testiran variranjem funkcije podudaranja koja utvrđuje prisustvo ekstrahovanog termina u poravnatom segmentu (odsečak), u rasponu od veoma labavog do strogog. Procena rezultata je pokazala da je preciznost izdvajanja termina ...Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev. "Corpus-based bilingual terminology extraction in the power engineering domain" in Terminology, John Benjamins Publishing Company (2022). https://doi.org/10.1075/term.20038.iva
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
Towards Automatic Definition Extraction for Serbian
U radu su prikazani preliminarni rezultati automatske ekstrakcije kandidata za definicije rečnika iz nestrukturiranih tekstova na srpskom jeziku u cilju ubrzanja razvoja rečnika. Definicije u rečniku Srpske akademije nauka i umetnosti (SANU) korišćene su za modelovanje različitih tipova definicija (opisnih, gramatičkih, referentnih i sinonimskih) koje imaju različite sintaksičke i leksičke karakteristike. Korpus istraživanja sastoji se od 61.213 definicija imenica, koje su analizirane korišćenjem morfoloških e-rečnika i lokalnih gramatika implementiranih kao pretvarači konačnih stanja u paketu za obradu korpusa otvorenog ...... 2021), we focused our present research on the extraction of the sentences contained in the definition. The extraction also implies recognition of paradigmatic lexical relations, e.g. synonyms, antonyms, hypernyms, hyponyms. The problem of automatic extraction of definitions from the text has not been ...
... approach for definition modelling and extraction, relying on the existing Serbian dictionaries (morphological and descriptive), as well as the results of the preliminary experiments in automatic extraction of definitions from unstructured Serbian text. Definition extraction task can be formalized either ...
... concept and permits its differentiation from other concepts within a system of concepts.”. According to the standard “ISO 1087:2019 (en) Terminology work and terminology science — Vocabulary”, a definition is “representation of a concept by an expression that describes it and differentiates it from related ...Ranka Stanković, Cvetana Krstev, Rada Stijović, Mirjana Gočanin, Mihailo Škorić. "Towards Automatic Definition Extraction for Serbian" in Proceedings of the XIX EURALEX Congress of the European Assocition for Lexicography: Lexicography for Inclusion (Volume 2). 7-9 September (virtual), Democritus University of Thrace (2021)
-
Bilingual lexical extraction based on word alignment for improving corpus search
Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056
-
Project REASONING: Characterization and technological procedures for recycling and reusing of the rudnik mine flotation tailings
Vesna Cvetkov, Vladimir Simić, Stefan Petrović, Filip Arnaut, Milena Kostović, Dragan Radulović, Jovica Stojanović, Vladimir Jovanović, Dejan Todorović, Nina Nikolić, Jelena Senćanski, Grozdanka Bogdanović, Dragana Marilović (2024)Vesna Cvetkov, Vladimir Simić, Stefan Petrović, Filip Arnaut, Milena Kostović, Dragan Radulović, Jovica Stojanović, Vladimir Jovanović, Dejan Todorović, Nina Nikolić, Jelena Senćanski, Grozdanka Bogdanović, Dragana Marilović. "Project REASONING: Characterization and technological procedures for recycling and reusing of the rudnik mine flotation tailings" in 5th Congress Geologists of the Republic of North Macedonia, Ohrid, 28-29. 10. 2024, Македонско геолошко друштво (2024)
-
Groundwater management by riverbank filtration and an infiltration channel, the case of Obrenovac, Serbia
Dušan Polomčić, Bojan Hajdin, Zoran Stevanović, Dragoljub Bajić, Katarina Hajdin. "Groundwater management by riverbank filtration and an infiltration channel, the case of Obrenovac, Serbia" in Hydrogeology Journal, Berlin, Heidelberg : Springer, International Association of Hydrogeologists (2013). https://doi.org/10.1007/s10040-013-1025-9
-
Global trend and negative synergy: Climate changes and groundwater over-extraction
Stevanović Zoran (2013)Stevanović Zoran. "Global trend and negative synergy: Climate changes and groundwater over-extraction" in Proceedings of the International Conference “Climate Change Impact on Water Resources”, 17-18 Oct.2013, Belgrade, Belgrade:Institute of Wat. Manag. J.Cerni & WSDAC (2013): 42-45
-
Terminology Acquisition and Description Using Lexical Resources and Local Grammars
Acquisition of new terminology from specific domains and its adequate description within terminological dictionaries is a complex task, especially for languages that are morphologically complex such as Serbian. In this paper we present an approach to solving this task semi-automatically on basis of lexical resources and local grammars developed for Serbian. Special attention is given to automatic inflectional class prediction for simple adjectives and nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for ...... outlined in this paper tackles the extraction of domain terminology and its integra- tion into terminological dictionaries using lexical resources and local grammars. Results obtained by following this approach justify its basic as- sumption that the task of term extraction, both in the case of simple words ...
... nouns and the use of syntactic graphs for extraction of Multi-Word Unit (MWU) candidates for termbases, their lem- matization and assignment of inflectional classes. 1 Introduction In this paper we present a semi-automatic proce- dure for terminology acquisition in Serbian. Rap- id changes in ...
... representation of terms is proposed in (Ammar et al., 2015) and applied on Arabic scientific and technical corpus. In (Savary et al., 2012) terminology extraction in the domain of ecsonomy is presented for Polish. It has two modules: a grammatical lexicon of terminological MWEs and a fully lexicalized ...Cvetana Krstev, Ranka Stanković, Ivan Obradović, Biljana Lazić. "Terminology Acquisition and Description Using Lexical Resources and Local Grammars" in Proceedings of the 11th Conference on Terminology and Artificial Intelligence, Granada, Spain, 2015, Granada : LexiCon (Universidad de Granada) (2015)
-
Application of contour blasting for the extraction of dimension stone blocks
Kričak Lazar, Negovanović Milanka, Janković Ivan, Zeković D., Mitrović S.. "Application of contour blasting for the extraction of dimension stone blocks" in Proceedings of the 2nd International Conference „Harmony of nature and spirituality in stone“, Kragujevac, Serbia:Stone Studio Association, (2012): 79-85
-
Hydrodynamic analysis of potential groundwater extraction capacity increase: case study of Nelt groundwater source at Dobanovci
Bajić Dragoljub, Polomčić Dušan, Ratković Jelena, Matić Ivan. "Hydrodynamic analysis of potential groundwater extraction capacity increase: case study of Nelt groundwater source at Dobanovci" in Tehnika 4 no. 68, Belgrade:Union of Engineers and Technicians of Serbia (2017): 512-525. https://doi.org/10.5937/tehnika1704512B