AITopics | Montrouge

Collaborating Authors

Montrouge

MIX : a Multi-task Learning Approach to Solve Open-Domain Question Answering

Chaybouti, Sofian, Saghe, Achraf, Shabou, Aymen

arXiv.org Artificial Intelligence, Mar-13-2025

This paper introduces MIX, a multi-task deep learning approach to solve open-ended question-answering. First, we design our system as a multi-stage pipeline of three building blocks: a BM25-based Retriever to reduce the search space, and a RoBERTa-based Scorer and an Extractor that rank retrieved paragraphs and extract relevant text spans, respectively. Finally, we further improve the computational efficiency of our system to address the scalability challenge: thanks to multi-task learning, we parallelize the closely related tasks solved by the Scorer and the Extractor. Our system is on par with state-of-the-art performance on the SQuAD-open benchmark while being conceptually simpler.
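To make the Retriever stage concrete, here is a minimal sketch of BM25 paragraph ranking. It uses the standard Okapi BM25 formula with whitespace tokenization; the function name `bm25_rank` and the parameter defaults `k1=1.5` and `b=0.75` are illustrative assumptions, not details taken from the paper.

```python
import math
from collections import Counter


def bm25_rank(query, paragraphs, k1=1.5, b=0.75):
    """Return paragraph indices sorted by descending BM25 score.

    Hypothetical helper: tokenization and parameter values are assumptions.
    """
    docs = [p.lower().split() for p in paragraphs]
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Document frequency: number of paragraphs containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # Smoothed inverse document frequency, always positive.
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturated by k1 and normalized by paragraph length.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return sorted(range(n), key=lambda i: scores[i], reverse=True)
```

In a pipeline like the one described, only the top-ranked paragraphs would then be passed on to the heavier neural Scorer and Extractor.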

machine learning, natural language, question answering, (18 more...)

arXiv: 2012.09766

Country:

Europe > Italy > Tuscany > Florence (0.04)
Europe > France > Île-de-France > Hauts-de-Seine > Montrouge (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.79)

EfficientQA : a RoBERTa Based Phrase-Indexed Question-Answering System

Chaybouti, Sofian, Saghe, Achraf, Shabou, Aymen

arXiv.org Artificial Intelligence, Mar-10-2025

State-of-the-art extractive question-answering models achieve superhuman performance on the SQuAD benchmark. Yet, they are unreasonably heavy and need expensive GPU computing to answer questions in a reasonable time. Thus, they cannot be used in the open-domain question-answering paradigm for real-world queries on hundreds of thousands of documents. In this paper, we explore the possibility of transferring the natural language understanding of language models into dense vectors representing questions and answer candidates to make question-answering compatible with a simple nearest neighbor search task. This new model, which we call EfficientQA, takes advantage of the sequence-pair input format of BERT-based models to build meaningful, dense representations of candidate answers. The latter are extracted from the context in a question-agnostic fashion. Our model achieves state-of-the-art results in Phrase-Indexed Question Answering (PIQA), beating the previous state of the art by 1.3 points in exact match and 1.4 points in F1 score. These results show that dense vectors can embed rich semantic representations of sequences, although these were built from language models not originally trained for the use case. Thus, to build more resource-efficient NLP systems in the future, training language models better adapted to build dense representations of phrases is one of the possibilities.
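The nearest neighbor search that phrase-indexed question answering reduces to can be sketched as follows. The toy vectors, the function names, and the use of cosine similarity are assumptions for illustration; the abstract does not state which similarity measure or index EfficientQA uses.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_answer(question_vec, candidates):
    """Return the candidate phrase whose vector is closest to the question.

    `candidates` is a list of (phrase, vector) pairs assumed to have been
    encoded offline, independently of any question.
    """
    return max(candidates, key=lambda c: cosine(question_vec, c[1]))[0]


# Toy index: in practice the vectors would come from a language model encoder.
index = [
    ("Paris", [0.9, 0.1, 0.0]),
    ("1889", [0.0, 0.2, 0.9]),
]
best = nearest_answer([1.0, 0.0, 0.1], index)
```

Because the candidate vectors are built without seeing the question, they can be precomputed once, and answering a query then costs one question encoding plus a vector search.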

benchmark, language model, representation, (15 more...)

arXiv: 2101.02157

Country:

Europe > Italy > Tuscany > Florence (0.04)
Europe > France > Île-de-France > Hauts-de-Seine > Montrouge (0.04)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

A graph-based approach to extracting narrative signals from public discourse

Pournaki, Armin, Willaert, Tom

arXiv.org Artificial Intelligence, Nov-1-2024

Narratives are key interpretative devices by which humans make sense of political reality. As the significance of narratives for understanding current societal issues such as polarization and misinformation becomes increasingly evident, there is a growing demand for methods that support their empirical analysis. To this end, we propose a graph-based formalism and machine-guided method for extracting, representing, and analyzing selected narrative signals from digital textual corpora, based on Abstract Meaning Representation (AMR). The formalism and method introduced here specifically cater to the study of political narratives that figure in texts from digital media such as archived political speeches, social media posts, political manifestos and transcripts of parliamentary debates. We conceptualize these political narratives as a type of ontological narratives: stories by which actors position themselves as political beings, and which are akin to political worldviews in which actors present their normative vision of the world, or aspects thereof. We approach the study of such political narratives as a problem of information retrieval: starting from a textual corpus, we first extract a graph-like representation of the meaning of each sentence in the corpus using AMR. Drawing on transferable concepts from narratology, we then apply a set of heuristics to filter these graphs for representations of 1) actors, 2) the events in which these actors figure, and 3) traces of the perspectivization of these events. We approach these references to actors, events, and instances of perspectivization as core narrative signals that initiate a further analysis by alluding to larger political narratives. By means of a case study of State of the European Union addresses, we demonstrate how the formalism can be used to inductively surface signals of political narratives from public discourse.

artificial intelligence, natural language, text processing, (19 more...)

arXiv: 2411.00702

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Ireland (0.14)
Asia > Russia (0.14)
(16 more...)

Genre: Research Report (0.82)

Industry:

Government > Regional Government > Europe Government (1.00)
Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)

Molyé: A Corpus-based Approach to Language Contact in Colonial France

Dent, Rasul, Janès, Juliette, Clérice, Thibault, Suarez, Pedro Ortiz, Sagot, Benoît

arXiv.org Artificial Intelligence, Aug-8-2024

Whether or not several Creole languages which developed during the early modern period can be considered genetic descendants of European languages has been the subject of intense debate. This is in large part due to the absence of evidence of intermediate forms. This work introduces a new open corpus, the Molyé corpus, which combines stereotypical representations of three kinds of language variation in Europe with early attestations of French-based Creole languages across a period of 400 years. It is intended to facilitate future research on the continuity between contact situations in Europe and Creolophone (former) colonies.

corpus, creole, pronoun, (15 more...)

arXiv: 2408.04554

Country:

Africa > Mauritius (0.05)
South America > French Guiana (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(17 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Evaluating Adversarial Robustness on Document Image Classification

Fronteau, Timothée, Paran, Arnaud, Shabou, Aymen

arXiv.org Artificial Intelligence, May-1-2023

Adversarial attacks on computer vision systems, and defenses against them, have gained increasing interest in recent years, but as of today, most investigations are limited to natural images. However, many artificial intelligence models actually handle documentary data, which is very different from real-world images. Hence, in this work, we try to apply the adversarial attack philosophy to documentary data and to protect models against such attacks. Our methodology is to implement untargeted gradient-based, transfer-based and score-based attacks and evaluate the impact of defenses such as adversarial training, JPEG input compression and grey-scale input transformation on the robustness of ResNet50 and EfficientNetB0 model architectures. To the best of our knowledge, no such work has been conducted by the community to study the impact of these attacks on the document image classification task.

artificial intelligence, image understanding, machine learning, (16 more...)

arXiv: 2304.12486

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > France > Île-de-France > Hauts-de-Seine > Montrouge (0.04)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (0.58)
Government (0.58)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.65)

DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents

Dhouib, Mohamed, Bettaieb, Ghassen, Shabou, Aymen

arXiv.org Artificial Intelligence, May-1-2023

Information Extraction from visually rich documents is a challenging task that has gained a lot of attention in recent years due to its importance in several document-control based applications and its widespread commercial value. The majority of the research work conducted on this topic to date follows a two-step pipeline: first, the text is read using an off-the-shelf Optical Character Recognition (OCR) engine; then, the fields of interest are extracted from the obtained text. The main drawback of these approaches is their dependence on an external OCR system, which can negatively impact both performance and computational speed. Recent OCR-free methods were proposed to address these issues. Inspired by their promising results, we propose in this paper an OCR-free end-to-end information extraction model named DocParser. It differs from prior end-to-end approaches by its ability to better extract discriminative character features. DocParser achieves state-of-the-art results on various datasets, while still being faster than previous works.

data mining, machine learning, natural language, (19 more...)

arXiv: 2304.12484

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Switzerland > Vaud > Lausanne (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining > Text Mining (0.84)

Financial Risk Management on a Neutral Atom Quantum Processor

Leclerc, Lucas, Ortiz-Guitierrez, Luis, Grijalva, Sebastian, Albrecht, Boris, Cline, Julia R. K., Elfving, Vincent E., Signoles, Adrien, Henriet, Loïc, Del Bimbo, Gianni, Sheikh, Usman Ayub, Shah, Maitree, Andrea, Luc, Ishtiaq, Faysal, Duarte, Andoni, Mugel, Samuel, Caceres, Irene, Kurek, Michel, Orus, Roman, Seddik, Achraf, Hammammi, Oumaima, Isselnane, Hacene, M'tamon, Didier

arXiv.org Artificial Intelligence, Dec-6-2022

Machine Learning models capable of handling the large datasets collected in the financial world can often become black boxes that are expensive to run. The quantum computing paradigm suggests new optimization techniques that, combined with classical algorithms, may deliver competitive, faster and more interpretable models. In this work we propose a quantum-enhanced machine learning solution for the prediction of credit rating downgrades, also known as fallen-angels forecasting in the financial risk management field. We implement this solution on a neutral atom Quantum Processing Unit with up to 60 qubits on a real-life dataset. We report competitive performance against the state-of-the-art Random Forest benchmark, while our model achieves better interpretability and comparable training times. We examine how to improve performance in the near term, validating our ideas with Tensor Networks-based numerical simulations.

artificial intelligence, classifier, machine learning, (19 more...)

arXiv: 2212.03223

Country:

North America > United States > California (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Spain > Basque Country > Biscay Province > Bilbao (0.04)
(3 more...)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Banking & Finance > Credit (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach

Kerroumi, Mohamed, Sayem, Othmane, Shabou, Aymen

arXiv.org Artificial Intelligence, Oct-13-2020

We introduce a novel approach for scanned document representation to perform field extraction. It allows the simultaneous encoding of the textual, visual and layout information in a 3D matrix used as an input to a segmentation model. We improve the recent Chargrid and Wordgrid models in several ways, first by taking into account the visual modality, then by boosting their robustness on small datasets while keeping the inference time low. Our approach is tested on public and private document-image datasets, showing higher performance than recent state-of-the-art methods.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv: 2010.02358

Country:

North America > United States > California > San Diego County > San Diego (0.04)
Europe > France > Île-de-France > Hauts-de-Seine > Montrouge (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Genre: Research Report > Promising Solution (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.68)

Federated Survival Analysis with Discrete-Time Cox Models

Andreux, Mathieu, Manoel, Andre, Menuet, Romuald, Saillard, Charlie, Simpson, Chloé

arXiv.org Machine Learning, Jun-16-2020

Building machine learning models from decentralized datasets located in different centers with federated learning (FL) is a promising approach to circumvent local data scarcity while preserving privacy. However, the prominent Cox proportional hazards (PH) model, used for survival analysis, does not fit the FL framework, as its loss function is non-separable with respect to the samples. The naïve method to bypass this non-separability consists of calculating the losses per center and minimizing their sum as an approximation of the true loss. We show that the resulting model may suffer from significant performance loss in some adverse settings. Instead, we leverage the discrete-time extension of the Cox PH model to formulate survival analysis as a classification problem with a separable loss function. Using this approach, we train survival models using standard FL techniques on synthetic data, as well as real-world datasets from The Cancer Genome Atlas (TCGA), showing similar performance to a Cox PH model trained on aggregated data. Compared to previous works, the proposed method is more communication-efficient, more generic, and more amenable to using privacy-preserving techniques.
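The non-separability mentioned in this abstract can be seen by writing the two losses side by side. The forms below are the standard textbook ones and the notation is an assumption, not necessarily that of the paper: samples are $(x_i, t_i, \delta_i)$ with covariates, observed time and event indicator; in the discrete-time case, time is split into intervals $k = 1, \dots, K$, $k_i$ is the last interval in which sample $i$ is observed, and $y_{ik} = 1$ if its event occurs in interval $k$.

```latex
% Cox partial negative log-likelihood: the inner sum runs over the risk set
% {j : t_j >= t_i}, which can contain samples held by other centers.
\mathcal{L}_{\mathrm{Cox}}(\beta)
  = -\sum_{i:\,\delta_i = 1}
    \Big[ \beta^\top x_i
      - \log \sum_{j:\,t_j \ge t_i} \exp\big(\beta^\top x_j\big) \Big]

% Discrete-time model: a logistic hazard per interval gives a binary
% cross-entropy loss that is a plain sum over samples.
h_k(x) = \sigma\big(\alpha_k + \beta^\top x\big), \qquad
\mathcal{L}_{\mathrm{disc}}(\alpha, \beta)
  = -\sum_{i} \sum_{k \le k_i}
    \Big[ y_{ik} \log h_k(x_i) + (1 - y_{ik}) \log\big(1 - h_k(x_i)\big) \Big]
```

Since $\mathcal{L}_{\mathrm{disc}}$ decomposes sample by sample, it also decomposes center by center, which is what standard federated averaging schemes require.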

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv: 2006.08997

Country:

North America > Canada (0.04)
South America > Brazil > São Paulo (0.04)
North America > United States > New York (0.04)
Europe > France > Île-de-France > Hauts-de-Seine > Montrouge (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)