Goto

Collaborating Authors

 Tufis, Dan


PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors

arXiv.org Artificial Intelligence

EuroVoc is a multilingual thesaurus that was built for organizing the legislative documentary of the European Union institutions. It contains thousands of categories at different levels of specificity and its descriptors are targeted by legal texts in almost thirty languages. In this work we propose a unified framework for EuroVoc classification on 22 languages by fine-tuning modern Transformer-based pretrained language models. We study extensively the performance of our trained models and show that they significantly improve the results obtained by a similar tool - JEX - on the same dataset. The code and the fine-tuned models were open sourced, together with a programmatic interface that eases the process of loading the weights of a trained model and of classifying a new document.


The Semantic Web and Language Technology, Its Potential and Practicalities: EUROLAN-2003

AI Magazine

EUROLAN, which has been held biennially since 1993, is one of the most significant European summer schools in the area of natural language processing. Each of the EUROLAN sessions has focused on an area of timely interest to researchers in the field; this year's EUROLAN involved students in tutorials and hands-on sessions concerned with semantic web technologies as applied to language processing, ontology creation and use, and consideration of the semantic web's potential and limitations.


The Semantic Web and Language Technology, Its Potential and Practicalities: EUROLAN-2003

AI Magazine

Later in the school, the focus turned to ontologies, which is where the true power of the semantic web lies. EUROLAN lecturers treated its potential in terms of what the topic of ontology development it might--and might not--bring to us in the future. This year's and how great its impact will really start somewhere, somehow, even if school was organized by the Faculty be. Although it is not yet clear what emerges is a variety of ontological of Computer Science at the A. I. Cuza whether the current vision of the semantic stores from which to choose. University of Iasi, the Research Institute web will indeed reach its expectations, The EUROLAN summer school also for Artificial Intelligence at the there are more and more included a workshop on ontologies Romanian Academy in Bucharest, opinions that it represents a major and information extraction, a student and the Department of Computer technological step that will permanently workshop on applied natural Science at Vassar College.