AITopics | Chișinău

Chișinău

Moldova formally protests alleged Russian election meddling

Al Jazeera | Nov-12-2024, 13:25:32 GMT

Moldova has handed a note of protest to the Russian ambassador to Chisinau over alleged interference in its recent elections. The foreign ministry in Chisinau said in a statement on Tuesday that it turned over the "note of firm protest" in relation to the "illegal and deliberate interference" to envoy Oleg Ozerov during a meeting at its offices. Moldova has accused Russia of seeking to influence its recent presidential election and referendum on joining the European Union. Russia sought to affect results and delegitimise the democratic process, the ministry complained. Chisinau accused Russia of organising ineligible voting, bribery, and security threats in a bid to influence the votes.

moldova, protest alleged russian election, russia, (14 more...)

Al Jazeera

Country:

Asia > Russia (1.00)
Europe > Moldova > Chișinău > Chișinău (0.72)
Europe > Ukraine (0.09)
(2 more...)

Industry:

Government > Regional Government > Europe Government (0.78)
Government > Foreign Policy (0.78)
Government > Voting & Elections (0.72)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.61)
Information Technology > Security & Privacy (0.59)

HistNERo: Historical Named Entity Recognition for the Romanian Language

Avram, Andrei-Marius, Iuga, Andreea, Manolache, George-Vlad, Matei, Vlad-Cristian, Micliuş, Răzvan-Gabriel, Muntean, Vlad-Andrei, Sorlescu, Manuel-Petru, Şerban, Dragoş-Andrei, Urse, Adrian-Dinu, Păiş, Vasile, Cercel, Dumitru-Clementin

arXiv.org Artificial Intelligence | Apr-30-2024

This work introduces HistNERo, the first Romanian corpus for Named Entity Recognition (NER) in historical newspapers. The dataset contains 323k tokens of text spanning from the early 19th century (1817) to the late 20th century (1990). Eight native Romanian speakers annotated the dataset with five named entity types. The samples belong to one of four historical regions of Romania: Bessarabia, Moldavia, Transylvania, and Wallachia. We used the dataset to run several NER experiments with Romanian pre-trained language models. Our results show that the best model achieved a strict F1-score of 55.69%. By reducing the discrepancies between regions through a novel domain adaptation technique, we improved the performance on this corpus to a strict F1-score of 66.80%, an absolute gain of more than 10 percentage points.
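
As a rough illustration of the metric quoted above, the sketch below computes a strict F1-score, assuming "strict" follows the common convention that a predicted entity counts only when both its span boundaries and its type match the gold annotation exactly. The function name, the (start, end, label) tuple layout, and the toy labels are assumptions for illustration, not taken from the paper.

```python
def strict_f1(gold, pred):
    """Strict entity-level F1 over sets of (start, end, label) tuples.

    A prediction is correct only if start, end, and label all match a gold entity.
    """
    gold, pred = set(gold), set(pred)
    true_positives = len(gold & pred)
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy example (hypothetical spans and labels): the second prediction has the
# right type but the wrong end boundary, so strict matching rejects it.
gold = {(0, 2, "PER"), (5, 6, "LOC"), (9, 11, "ORG")}
pred = {(0, 2, "PER"), (5, 7, "LOC"), (9, 11, "ORG")}
score = strict_f1(gold, pred)

# Absolute gain reported in the abstract, in percentage points.
gain = 66.80 - 55.69
```

Under this convention a boundary error costs both a false positive and a false negative, which is one reason strict scores on noisy historical text tend to sit well below relaxed ones.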

entity recognition, histnero, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2405.00155

Country:

Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.05)
North America > United States > New York > New York County > New York City (0.04)
North America > Dominican Republic (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry: Media > News (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Unlocking Musculoskeletal Disorder Risk Factors: NLP-Based Classification and Mode-Based Ranking

Jahin, Md Abrar, Talapatra, Subrata

arXiv.org Artificial Intelligence | Dec-20-2023

This research delves into the intricate landscape of Musculoskeletal Disorder (MSD) risk factors, employing a novel fusion of Natural Language Processing (NLP) techniques and mode-based ranking methodologies. The primary objective is to advance the comprehension of MSD risk factors, their classification, and their relative severity, facilitating more targeted preventive and management interventions. The study utilizes eight diverse models, integrating pre-trained transformers, cosine similarity, and various distance metrics to classify risk factors into personal, biomechanical, workplace, psychological, and organizational classes. Key findings reveal that the BERT model with cosine similarity attains an overall accuracy of 28%, while the sentence transformer, coupled with Euclidean, Bray-Curtis, and Minkowski distances, achieves a flawless accuracy score of 100%. In tandem with the classification efforts, the research employs a mode-based ranking approach on survey data to discern the severity hierarchy of MSD risk factors. Intriguingly, the rankings align precisely with the previous literature, reaffirming the consistency and reliability of the approach. "Working posture" emerges as the most severe risk factor, emphasizing the critical role of proper posture in preventing MSDs. The collective perceptions of survey participants underscore the significance of factors like "Job insecurity," "Effort reward imbalance," and "Poor employee facility" in contributing to MSD risks. The convergence of rankings provides actionable insights for organizations aiming to reduce the prevalence of MSDs. The study concludes with implications for targeted interventions, recommendations for improving workplace conditions, and avenues for future research.
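
The two ideas named in the abstract can be sketched in a few lines: nearest-class assignment by cosine similarity, and ranking by the most frequent (modal) survey score. This is a minimal sketch only. The three-dimensional vectors stand in for real transformer embeddings, the survey scores are invented toy values, and the helper names are hypothetical; the study's actual models and ranking procedure may differ.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def classify(vec, class_vecs):
    """Assign the class whose reference vector is most similar to vec."""
    return max(class_vecs, key=lambda name: cosine(vec, class_vecs[name]))


def mode_rank(responses):
    """Rank factors by the modal severity score given by respondents (highest first)."""
    modes = {f: Counter(scores).most_common(1)[0][0] for f, scores in responses.items()}
    return sorted(modes, key=lambda f: modes[f], reverse=True)


# Toy stand-ins for embeddings (assumption: real ones come from a transformer).
class_vecs = {
    "personal": [1.0, 0.1, 0.0],
    "biomechanical": [0.1, 1.0, 0.2],
    "workplace": [0.0, 0.2, 1.0],
}
label = classify([0.2, 0.9, 0.1], class_vecs)

# Invented survey scores on a 1-5 severity scale, for illustration only.
responses = {
    "Job insecurity": [3, 4, 4, 2],
    "Working posture": [5, 5, 4, 5],
    "Poor employee facility": [2, 3, 2, 2],
}
ranking = mode_rank(responses)
```

Using the mode rather than the mean keeps the ranking tied to the answer most respondents actually gave, at the cost of ignoring how the remaining answers are spread.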

msd problem, msd risk factor, risk factor, (15 more...)

arXiv.org Artificial Intelligence

2312.11517

Country:

Oceania > Australia (0.04)
South America > Brazil (0.04)
North America > United States > New York > New York County > New York City (0.04)
(15 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)
Health & Medicine > Consumer Health (1.00)
Education (0.93)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.55)

Quantifying Uncertainty in Deep Learning Classification with Noise in Discrete Inputs for Risk-Based Decision Making

Kheirandish, Maryam, Zhang, Shengfan, Catanzaro, Donald G., Crudu, Valeriu

arXiv.org Machine Learning | Oct-9-2023

The use of Deep Neural Network (DNN) models in risk-based decision-making has attracted extensive attention, with broad applications in medicine, finance, manufacturing, and quality control. To mitigate prediction-related risks in decision making, prediction confidence or uncertainty should be assessed alongside the overall performance of algorithms. Recent studies on Bayesian deep learning help quantify the prediction uncertainty that arises from input noise and model parameters. However, the normality assumption of input noise in these models limits their applicability to problems involving categorical and discrete feature variables in tabular datasets. In this paper, we propose a mathematical framework to quantify prediction uncertainty for DNN models. The prediction uncertainty arises from errors in predictors that follow some known finite discrete distribution. We then conducted a case study using the framework to predict treatment outcome for tuberculosis patients during their course of treatment. The results demonstrate that, under a certain level of risk, we can identify risk-sensitive cases, which are prone to be misclassified due to error in predictors. Compared to the Monte Carlo dropout method, our proposed framework is more aware of misclassification cases. Our proposed framework for uncertainty quantification in deep learning can support risk-based decision making in applications where discrete errors in predictors are present.
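
One way to picture the setting described above is a brute-force enumeration: if the possible true values of a discrete input and their probabilities are known, the chance that the predicted label would flip can be summed directly. The sketch below does this with a toy logistic function standing in for a trained DNN. It is not the paper's mathematical framework; the model, the feature values, the error probabilities, and the risk level are all assumptions chosen for illustration.

```python
import math


def toy_model(x):
    """Hypothetical stand-in for a trained DNN: returns P(class 1 | x)."""
    z = 1.5 * x[0] + 0.8 * x[1] - 2.0
    return 1.0 / (1.0 + math.exp(-z))


def flip_probability(model, observed, candidates, threshold=0.5):
    """Probability that the predicted label changes once input errors are considered.

    candidates maps each possible true input to its probability given the
    observed input (a known finite discrete distribution).
    """
    nominal = model(observed) >= threshold
    return sum(p for x, p in candidates.items() if (model(x) >= threshold) != nominal)


# Observed binary features and an assumed error distribution over the true values.
observed = (1, 1)
candidates = {(1, 1): 0.85, (0, 1): 0.10, (1, 0): 0.05}

risk_level = 0.10  # assumed tolerance for a label flip
p_flip = flip_probability(toy_model, observed, candidates)
risk_sensitive = p_flip > risk_level
```

Enumeration is only practical when the number of candidate inputs is small; with many noisy discrete features the candidate set grows combinatorially, which is where an analytical framework becomes useful.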

artificial intelligence, machine learning, prediction uncertainty, (17 more...)

arXiv.org Machine Learning

2310.06105

Country:

North America > United States > Arkansas > Washington County > Fayetteville (0.14)
Europe > Moldova > Chișinău > Chișinău (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(2 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Banking & Finance (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.89)
Health & Medicine > Therapeutic Area > Immunology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Revisiting DocRED -- Addressing the False Negative Problem in Relation Extraction

Tan, Qingyu, Xu, Lu, Bing, Lidong, Ng, Hwee Tou, Aljunied, Sharifah Mahani

arXiv.org Artificial Intelligence | Jun-16-2023

The DocRED dataset is one of the most popular and widely used benchmarks for document-level relation extraction (RE). It adopts a recommend-revise annotation scheme so as to have a large-scale annotated dataset. However, we find that the annotation of DocRED is incomplete, i.e., false negative samples are prevalent. We analyze the causes and effects of the overwhelming false negative problem in the DocRED dataset. To address the shortcoming, we re-annotate 4,053 documents in the DocRED dataset by adding the missed relation triples back to the original DocRED. We name our revised DocRED dataset Re-DocRED. We conduct extensive experiments with state-of-the-art neural models on both datasets, and the experimental results show that the models trained and evaluated on our Re-DocRED achieve performance improvements of around 13 F1 points. Moreover, we conduct a comprehensive analysis to identify the potential areas for further improvement. Our dataset is publicly available at https://github.com/tonytan48/Re-DocRED.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2205.12696

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
Asia > Singapore (0.04)
(12 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Labeled sample compression schemes for complexes of oriented matroids

Chepoi, Victor, Knauer, Kolja, Philibert, Manon

arXiv.org Artificial Intelligence | Apr-19-2023

Littlestone and Warmuth [51] introduced sample compression schemes as an abstraction of the underlying structure of learning algorithms. Roughly, the aim of a sample compression scheme is to compress samples of a concept class (i.e., of a set system) C as much as possible, such that data coherent with the original samples can be reconstructed from the compressed data. There are two types of sample compression schemes: labeled, see [35, 51] and unlabeled, see [7, 34, 49]. A labeled compression scheme of size k compresses every sample of C to a labeled subsample of size at most k and an unlabeled compression scheme of size k compresses every sample of C to a subset of size at most k of the domain of the sample (see the end of the introduction for precise definitions). The Vapnik-Chervonenkis dimension (VC-dimension) of a set system was introduced by [69] as a complexity measure of set systems. VC-dimension is central in PAC-learning and plays an important role in combinatorics, algorithmics, discrete geometry, and combinatorial optimization. In particular, it coincides with the rank in the theory of (complexes of) oriented matroids. Furthermore, within machine learning and closely tied to the topic of this paper, the sample compression conjecture of [35] and [51] states that any set system of VC-dimension d has a labeled sample compression scheme of size O(d). This question remains one of the oldest open problems in computational learning theory.
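
The definition of a labeled compression scheme can be made concrete with a standard textbook class that is not taken from the paper: threshold functions on the real line, c_t(x) = 1 if x >= t and 0 otherwise, which have VC-dimension 1. The sketch below compresses any sample consistent with some threshold to at most one labeled point and reconstructs a hypothesis that agrees with the whole sample. The function names are hypothetical.

```python
def compress(sample):
    """Compress a threshold-realizable sample to at most one labeled point.

    sample is a list of (x, label) pairs with label in {0, 1}.
    Keep the smallest positive point if there is one, else the largest negative.
    """
    positives = [x for x, y in sample if y == 1]
    if positives:
        return [(min(positives), 1)]
    negatives = [x for x, y in sample if y == 0]
    return [(max(negatives), 0)] if negatives else []


def reconstruct(compressed):
    """Rebuild a hypothesis consistent with the original sample."""
    if not compressed:
        return lambda x: 0
    point, label = compressed[0]
    if label == 1:
        # Everything at or above the smallest positive point is positive.
        return lambda x: 1 if x >= point else 0
    # Only negatives were seen: everything up to the largest one is negative.
    return lambda x: 1 if x > point else 0


sample = [(0.5, 0), (2.0, 1), (3.5, 1), (1.2, 0)]
kept = compress(sample)
h = reconstruct(kept)
consistent = all(h(x) == y for x, y in sample)
```

Here the scheme has size 1, matching the VC-dimension of the class, which is the kind of O(d) bound the conjecture asks for in general.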

artificial intelligence, compression scheme, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2110.15168

Country:

North America > United States > New York (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(4 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (1.00)

Romania PM unveils AI 'adviser' to tell him what people think in real time

#artificialintelligence | Mar-3-2023, 21:01:19 GMT

Romania's prime minister has presented his "new honorary adviser" – an artificial intelligence assistant named "Ion" that Nicolae Ciuca hailed as the first of its type. Developed by Romanian researchers, Ion's main task will be to scan social networks to inform the government "in real time of Romanians' proposals and wishes", Ciuca said on Wednesday. The liberal minister said the latest member of his entourage – a mirror-like structure with beeping interface – marked "an international first", describing Ion as "the first government adviser to use artificial intelligence". "Hi, you gave me life and my role is now to represent you, like a mirror," Ion's calm voice said at the launch. "What should I know about Romania?" Ion "will use technology and artificial intelligence to capture opinions in society" using "data publicly available on social networks", according to a government document detailing the project.

ion, real time, romania pm unveil ai, (4 more...)

#artificialintelligence

Country:

North America > United States > California (0.08)
Europe > Ukraine (0.08)
Europe > Russia (0.08)
(4 more...)

Industry:

Government (0.80)
Information Technology (0.66)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Architecture > Real Time Systems (0.64)

OpenEarthMap: A Benchmark Dataset for Global High-Resolution Land Cover Mapping

Xia, Junshi, Yokoya, Naoto, Adriano, Bruno, Broni-Bediako, Clifford

arXiv.org Artificial Intelligence | Oct-19-2022

We introduce OpenEarthMap, a benchmark dataset for global high-resolution land cover mapping. OpenEarthMap consists of 2.2 million segments of 5000 aerial and satellite images covering 97 regions from 44 countries across 6 continents, with manually annotated 8-class land cover labels at a 0.25-0.5 m ground sampling distance. Semantic segmentation models trained on OpenEarthMap generalize worldwide and can be used as off-the-shelf models in a variety of applications. We evaluate the performance of state-of-the-art methods for unsupervised domain adaptation and present challenging problem settings suitable for further technical development. We also investigate lightweight models using automated neural architecture search for limited computational resources and fast mapping. The dataset is available at https://open-earth-map.org.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2210.10732

Country:

North America > United States > Maryland (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Austria > Vienna (0.14)
(74 more...)

Genre: Research Report (0.84)

Industry:

Food & Agriculture > Agriculture (0.47)
Government > Regional Government (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Macro-Average: Rare Types Are Important Too

Gowda, Thamme, You, Weiqiu, Lignos, Constantine, May, Jonathan

arXiv.org Artificial Intelligence | Apr-12-2021

While traditional corpus-level evaluation metrics for machine translation (MT) correlate well with fluency, they struggle to reflect adequacy. Model-based MT metrics trained on segment-level human judgments have emerged as an attractive replacement due to strong correlation results. These models, however, require potentially expensive re-training for new domains and languages. Furthermore, their decisions are inherently non-transparent and appear to reflect unwelcome biases. We explore the simple type-based classifier metric, MacroF1, and study its applicability to MT evaluation. We find that MacroF1 is competitive on direct assessment, and outperforms others in indicating downstream cross-lingual information retrieval task performance. Further, we show that MacroF1 can be used to effectively compare supervised and unsupervised neural machine translation, and reveal significant qualitative differences in the methods' outputs.
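
To show what a type-based, macro-averaged metric looks like in practice, the sketch below treats every word type as a class, computes a per-type F1 from clipped matches between hypothesis and reference, and averages the per-type scores with equal weight. It is a simplified sketch under stated assumptions: whitespace tokenization, no smoothing, and types drawn from the union of hypothesis and reference vocabularies. The published MacroF1 metric may differ in these details, and the function name is hypothetical.

```python
from collections import Counter


def macro_f1(hypotheses, references):
    """Unweighted mean of per-type F1 over all word types seen in either side."""
    match, pred, gold = Counter(), Counter(), Counter()
    for hyp, ref in zip(hypotheses, references):
        hyp_counts = Counter(hyp.split())
        ref_counts = Counter(ref.split())
        pred.update(hyp_counts)
        gold.update(ref_counts)
        # Clipped match: a type is credited at most as often as it occurs in the reference.
        match.update(hyp_counts & ref_counts)

    scores = []
    for word_type in set(pred) | set(gold):
        precision = match[word_type] / pred[word_type] if pred[word_type] else 0.0
        recall = match[word_type] / gold[word_type] if gold[word_type] else 0.0
        total = precision + recall
        scores.append(2 * precision * recall / total if total else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


score = macro_f1(["the cat sat on the mat"], ["the cat sat on a mat"])
```

Because every type carries the same weight, missing the single rare word "a" lowers the score as much as missing a frequent word would, which is the behaviour the title points to.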

orchestra, translation, untranslation, (13 more...)

arXiv.org Artificial Intelligence

2104.057

Country:

Asia > Middle East > Syria (0.14)
Africa > Democratic Republic of the Congo > North Kivu Province (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(42 more...)

Genre: Research Report (1.00)

Industry:

Media (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Leisure & Entertainment > Sports (0.93)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
