Претрага
808 items
-
Knowledge and Rule-Based Diacritic Restoration in Serbian
In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the dataobtainedfromSrpKorandlocalgrammarsassistsinmakingadecisionbetween several candidates in cases of ambiguity. The evaluation results reveal that,dependingonthetext,accuracyrangesfrom95.03%to99.36%,whilethe precision (average 98.93%) is always higher than the recall (average 94.94%).Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
-
Towards translation of educational resources using GIZA++
Ivan Obradović, Dalibor Vorkapić, Ranka Stanković, Nikola Vulović, Miladin Kotorčević. "Towards translation of educational resources using GIZA++" in The Seventh International Conference on e-Learning (eLearning-2016), September 2016, Belgrade : Metropolitan Univesity (2016)
-
Using English Baits to Catch Serbian Multi-Word Terminology
In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inflectionCvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
-
An Approach to Efficient Processing of Multi-Word Units
Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making ...Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6
-
Geochemical characterization of sediments from the archaeological site Vinča – Belo Brdo, Serbia
Gorica Veselinović, Dragana Životić, Kristina Penezić, Milica Kašanin-Grubin, Nevenka Mijatović, Jovana Malbašić, Aleksandra Šajnović (2020)... 6. n-Alkanes The aliphatic hydrocarbon fraction consists mainly of n-alkanes and a certain amount of isoprenoid alkanes. The concentrations of n-alkanes are determined based on the ratio of integrated peak areas of n-alkanes and an internal standard of known concentration. GC–MS fragmento- grams of ...
... n-C31)/(n-C15 + n- C17 + n-C19). d L/H – low to high n-alkanes = (C14 + C15 + C16 + C17 + C18 + C19)/ (C27 + C28 + C29 + C30 + C31 + C32 + C33). e LSR - long chain n-alkanes to short and mid chain n-alkanes = ∑ (n-al- kanes) ≥ n-C25 / ∑ (n-alkanes) < n-C25. f ACL - average chain length = (25 × n-C25 ...
... amplitude of n-alkanes (Tissot & Welte, 1984)) = ½ x [(∑oddC17-C31)/(∑evenC16-C30) + (∑oddC15-C31)/ (∑evenC18-C32)]. b OEP - odd-over-even predominance (calculated by Peters et al., 2005) = (n-C27 + n-C29 + n-C31 + n-C33)/(n-C26 + n-C28 + n-C30 + n-C32). c TAR - terrigenous/aquatic ratio = (n-C27 + n-C29 + ...Gorica Veselinović, Dragana Životić, Kristina Penezić, Milica Kašanin-Grubin, Nevenka Mijatović, Jovana Malbašić, Aleksandra Šajnović. "Geochemical characterization of sediments from the archaeological site Vinča – Belo Brdo, Serbia" in CATENA, Elsevier BV (2020). https://doi.org/10.1016/j.catena.2020.104914
-
Production of morphological dictionaries of multi-word units using a multipurpose tool
The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation ...electronic dictionary, Serbian, morphology, inflection, multi-word units, noun phrases, query expansionRanka Stanković, Ivan Obradović, Cvetana Krstev, Duško Vitas. "Production of morphological dictionaries of multi-word units using a multipurpose tool" in Proceedings of the Computational Linguistics-Applications Conference, October 2011, Jachranka, Poland, Jachranka, Poland : PTI - Polish Information Processing Society (2011)
-
The use of biological markers in determination of origin and type of organic matter in the Tisza river sediments
Snežana Štrbac, Gordana Gajica, Aleksandra Šajnović, Nebojša Vasić, Ksenija Stojanović, Branimir Jovančićević (2013)The objective of the study was to determine the origin and type of organic matter (OM) of the Tisza recent sediments along the distance of 153 km through the territory of Serbia. For this purpose group organic-geochemical parameters and biomarker compositions were used. All samples contain approximately same amount of OM, which was deposited under uniform, slightly reducing conditions. Based on the distribution of n-alkanes, the origin and type of OM could not be precisely estimated. However, n-alkane patterns ...Snežana Štrbac, Gordana Gajica, Aleksandra Šajnović, Nebojša Vasić, Ksenija Stojanović, Branimir Jovančićević. "The use of biological markers in determination of origin and type of organic matter in the Tisza river sediments" in Journal of Serbian Chemical Society, Beograd : Srpsko hemijsko društvo (2013). https://doi.org/10.2298/JSC130614087S
-
Multi-word Expressions for Abusive Speech Detection in Serbian
Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
-
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...... tag_0 C POS-tag of the 1st word tag_1 C POS-tag of the 2nd word tag_2 C POS-tag of the 3rd word tag_3 C POS-tag of the 4th word tag_4 C POS-tag of the 5th word tag_5 C POS-tag of the 6th word is_compound C Component is a compound J o in t F e a t u r e s perc_of_cmn_tokens N Comm. tokens to a total num ...
... 23(5):763–788. Baldwin, T., and Kim, S. N. 2010. Multiword Expressions. Handbook of Natural Language Processing, 2:267–292. Bouamor, D., Semmar, N., and Zweigenbaum, P. 2012. Identifying Bilingual Multi-Word Expressions for Statistical Machine Translation. In Calzolari, N., Choukri, K., Declerck, T., Doğan ...
... approach to classification of terms was proposed by Hakami and Bollegala (2017). Authors represented each term using two types of features: character n-grams extracted from a term and contextual features. We examine performance of several binary classi- fiers, which are based on different lexical and syntactic ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
-
Rule-based Automatic Multi-word Term Extraction and Lemmatization
In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
-
Determining the groundwater movement velocity using cross-correlation analysis: Velika Morava alluvium case study
Milan Kresojević, Vesna Ristić Vakanjac, Dušan Polomčić, Boris Vakanjac, Dragan Trifković, Jugoslav Nikolić (2024)Monitoring of water levels and flow rates of the Velika Morava River, due to its significance for the Republic of Serbia, was established over 100 years ago at the profiles of Ljubičevski Most and Ćuprija. This was followed a year later by the activation of the Varvarin monitoring station, then in 1935 by the Žabarski Most, and in 1952 by the Bagrdan station. Groundwater monitoring started in 1977 with 12 piezometers and the network was gradually expanded to include ...režim površinskih i podzemnih voda, kroskorelacione analize, brzina kretanja podzemnih voda, Velika MoravaMilan Kresojević, Vesna Ristić Vakanjac, Dušan Polomčić, Boris Vakanjac, Dragan Trifković, Jugoslav Nikolić. "Determining the groundwater movement velocity using cross-correlation analysis: Velika Morava alluvium case study" in Review of the Bulgarian Geological Society, Bulgarian Geological Society (2024). https://doi.org/10.52215/rev.bgs.2024.85.3.210
-
Geochemistry of neutral mine drainage at sulfide deposits ‒ example of the „Grot“ Pb-Zn mine, South-Eastern Serbia
U ovom radu proučavan je hemijski sastav voda iz rudnika i izdvojeni su hidro-geohemijski faktori koji utiču na njegovo formiranje. Sakupljeno je 11 uzoraka vode sa 6 lokacija na području Krive Feje, kako bi se utvrdio njihov hemijski sastav. Analizom podataka utvrđeno je da su vode sa ovog područja visokog kvaliteta, HCO3--SO42--Ca2+ i SO42--Ca2+ tipa, sa neutralnom pH vrednošću. Koncentracije metala u ovim vodama (cinka, olova, barijuma, bakra i hroma) generalno su niske i većina uzoraka voda ispunjava kriterijume ...podzemne vode, kisele rudničke vode, hidrogeohemijski faktori, metali, PHREEQC modeliranje, indeks saturacijeSnežana Kretić, Jana Štrbački, Nebojša Atanacković. "Geochemistry of neutral mine drainage at sulfide deposits ‒ example of the „Grot“ Pb-Zn mine, South-Eastern Serbia" in Journal of the Serbian Chemical Society, National Library of Serbia (2024). https://doi.org/10.2298/jsc230811013k
-
Mineral and Thermal Waters of Serbia: Multivariate Statistical Approach to Hydrochemical Characterization
... similarity be- tween clusters decreases (Roques et al. 2014). Taking into account parameters that were taken for cluster analysis, Stiff dia- grams were chosen for better presentation of the parameters within clusters (Fig. 2b). The average Stiff diagram of each cluster (based on median conce ...
... Dimitrijević N (1988) Hidrohemija (Hydrochemistry). University of Belgrade, Faculty of Mining and Geology, Belgrade Filipović B (2003) Mineralne, termalne i termomineralne vode Srbije (Mineral, thermal and thermomineral waters). Faculty of mining and geology, Belgrade Filipović B, Dimitrijević N (1991) ...
... of hi- erarchical cluster and principal component analyses. Appl Geochem 26: 1399–1413. doi:10.1016/j.apgeochem.2011.05.013 Nikolov J, Todorović N, Petrović Pantić T, Forkapić S, Mrdja D, Bikit I, Krmar M, Vesković M (2012) Exposure to radon in the radon spa Niška Banja, Serbia. Radiation Measurements ...Maja Todorović, Jana Štrbački, Marina Ćuk, Jakov Andrijašević, Jovana Šišović, Petar Papić . "Mineral and Thermal Waters of Serbia: Multivariate Statistical Approach to Hydrochemical Characterization" in Mineral and Thermal Waters of Southeastern Europe, Springer International Publishing (2016). https://doi.org/10.1007/978-3-319-25379-4
-
A survey of greenhouse gases production in central European lignites
Anna Pytlak, Anna Szafranek-Nakonieczna, Weronika Goraj, Izabela Śnieżyńska, Aleksandra Krążała, Artur Banach, Ivica Ristović, Mirosław Słowakiewicz, Zofia Stępniewska (2021)Anna Pytlak, Anna Szafranek-Nakonieczna, Weronika Goraj, Izabela Śnieżyńska, Aleksandra Krążała, Artur Banach, Ivica Ristović, Mirosław Słowakiewicz, Zofia Stępniewska. "A survey of greenhouse gases production in central European lignites" in Science of The Total Environment, Elsevier (2021). https://doi.org/10.1016/j.scitotenv.2021.149551
-
Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC
OntoLex, dominantni standard zajednice za mašinski čitljive leksičke resurse u kontekstu RDF-a, Linked Data i tehnologija Semantičkog veba, trenutno se proširuje sa posebnim modulom za Frekvencije, Primere i Informacije zasnovane na Korpusu (OntoLex-FrAC). Predlažemo novi komponent za OntoLex-FrAC, koji se bavi inkorporacijom korpusnih upita za (a) povezivanje rečnika sa korpusnim mašinama, (b) omogućavanje RDF baziranih web servisa da dinamički razmenjuju korpusne upite i podatke odgovora, i (c) korišćenje konvencionalnih upitačkih jezika za formalizaciju unutrašnje strukture kolokacija, skica reči i ...standardizacija, digitalna leksikografija, OntoLex, upiti korpusa, povezani podaci, Lingvistički povezani otvoreni podaciChristian Chiarcos, Ranka Stanković, Maxim Ionov, Gilles Sérasset. "Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC" in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, 20-25 May 2024, LREC (2024)
-
Keyword Extraction from Parallel Abstracts of Scientific Publications
Slobodan Beliga, Olivera Kitanović, Ranka Stanković, Sanda Martinčić-Ipšić . "Keyword Extraction from Parallel Abstracts of Scientific Publications" in Sematic Keyword-Based Search on Structured Data Sources - Third International KEYSTONE Conference, IKC 2017 Gdańsk, Poland, September 11–12, 2017 Revised Selected Papers and COST Action IC1302 Reports, Springer (2017)
-
Late and post-collisional tectonic evolution of the Adria-Europe suture in the Vardar Zone
The Vardar Zone is a product of the Triassic-Jurassic opening of the Neotethys, Jurassic obduction, Late Cretaceous/Paleogene consumption of the oceanic crust and continental collision. During the last process, the Eastern Vardar Zone was thrust over the Central and eventually both onto the Western Vardar Zone. The present paleomagnetic and structural study provided new results from the first two zones in the Belgrade area. The younger set of data, together with published ones from the third zone, provide firm ...Emő Márton, Marinko Toljić, Vesna Cvetkov. "Late and post-collisional tectonic evolution of the Adria-Europe suture in the Vardar Zone" in Journal of Geodynamics, Elsevier BV (2022). https://doi.org/10.1016/j.jog.2021.101880
-
A comparative study of the molecular and isotopic composition of biomarkers in immature oil shale (Aleksinac deposit, Serbia) and its liquid pyrolysis products (open and closed systems)
Gordana Gajica, Aleksandra Šajnović, Ksenija Stojanović, Jan Schwarzbauer, Aleksandar Kostić, Branimir Jovančićević (2021)The molecular and isotopic composition of biomarkers in initial bitumen isolated from immature (0.41% Rr) oil shale samples (Aleksinac deposit) and liquid products obtained by pyrolysis in open (OS) and closed (CS) systems are studied. The influence of pyrolysis type and variations of kerogen type on biomarkers composition and their isotopic signatures in liquid products is determined. The applicability of pyrolysis type, numerous biomarkers and carbon isotopic compositions (δ13C) of n-alkanes in liquid pyrolysates is established. Pyrolysis experiments were ...Uljni šejl, Aleksinac, organska supstanca, otvoreni i zatvoreni sistem pirolize, biomarkeri, izotopski sastav ugljjenika... mentarily to the biomarkers of the aliphatic fraction due to their struc- tural similarity to n-alkanes. n-Alkan-2-ones and fatty acid methyl esters were identified in aromatic fraction based on typical ion fragmento- grams, m/z 58 and 74, respectively. G. Gajica et al. ...
... CP I – Ca rb on Pr efe ren ce In de x, CP I ( C 1 5– C 3 5) = ½ x [Σ od d( n-C 15 -n- C 3 5)/ Σ ev en (n- C 1 4-n -C 34 ) + Σ od d( n- C 1 5-n -C 35 )/Σ ev en (n- C 1 6-n -C 36 )]; Pr /P h = Pr ist an e/ Ph yta ne ; % C 2 7 α α α (R ) – C 2 7/( C 2 7 ...
... from C15 to C35, with dominance of n-C27, n-C29, n-C31 homologues (Fig. 2a, d). The distribution of n-alkanes in liquid pyrolysates differs from those in bitumen of raw samples. n-Alkanes are identified in almost the same range of C14–C40 in all liquid pyrolysates (Fig. 2). The chromatograms reveal ...Gordana Gajica, Aleksandra Šajnović, Ksenija Stojanović, Jan Schwarzbauer, Aleksandar Kostić, Branimir Jovančićević. "A comparative study of the molecular and isotopic composition of biomarkers in immature oil shale (Aleksinac deposit, Serbia) and its liquid pyrolysis products (open and closed systems)" in Marine and Petroleum Geology, Elsevier BV (2021). https://doi.org/10.1016/j.marpetgeo.2021.105383
-
Evidence of Variscan and Alpine tectonics in the structural and thermochronological record of the central Serbo-Macedonian Massif (south-eastern Serbia)
... and complete “ Ar/”Ar data. A brief overview of critical sample information is given in Table 1, while relevant “Ar/”Ar date spectra, K/Ca dia- grams, and isotope correlation plots are shown in Fig. 14. Additionally, the “Ar/Ar dates are presented in Fig. 2 along with previously published K/Ar ...
... predominantly shallow dipping Int J Earth Sci (Geol Rundsch) (2017) 106:1665–1692 1671 @ \N, s N=5 Western part ofthe Lower Complex Vrvi Kobila area N Umin N=19 imylonitc • lneadon N shear-zone % foliation foliation outside" % ihe shear:zone - Vranjska Banja area - see Figure ...
... Simplified tectonic map of the study area with arrows representing local directions of tectonic transport. See text for i w details N O 42 *% 0' N || ~ N y j }a ~ AOI NM / i w\ S 1673 “ Outcrop-scale high conf. “ Outcrop-scale low conf. m Microstructure s Mineral lineation ...Milorad D. Antić, Alexandre Kounov, Branislav Trivić, Richard Spikings, Andreas Wetzel. "Evidence of Variscan and Alpine tectonics in the structural and thermochronological record of the central Serbo-Macedonian Massif (south-eastern Serbia)" in International Journal of Earth Sciences, Springer Science and Business Media LLC (2016). https://doi.org/10.1007/s00531-016-1380-6
-
Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities
Овај рад представља активности на развоју корпуса ELEXIS-sr, српском додатку вишејезичном анотираном корпусу ELEXIS-а, који се састоји од семантичких анотација и репозиторија значења речи. ELEXIS је паралелни вишејезични анотирани корпус на десет европских језика, који може да се користи као вишејезички репер за евалуацију европских језика са мање и средње развијеним ресурсима. Фокус овог рада је на вишечланим изразима и именованим ентитетима, њиховом препознавању у скупу реченица ELEXIS-sr и поређењу са анотацијама на другим језицима. Разматрају се први кораци ...Cvetana Krstev, Ranka Stanković, Aleksandra Marković, Teodora Mihajlov. "Towards the semantic annotation of SR-ELEXIS corpus: Insights into Multiword Expressions and Named Entities" in Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, Turin, May 25, 2024, ELRA and ICCL (2024)