AITopics | Mikhailov, Vladislav

Collaborating Authors

Mikhailov, Vladislav

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Artificial Text Detection via Examining the Topology of Attention Maps

Kushnareva, Laida, Cherniavskii, Daniil, Mikhailov, Vladislav, Artemova, Ekaterina, Barannikov, Serguei, Bernstein, Alexander, Piontkovskaya, Irina, Piontkovski, Dmitri, Burnaev, Evgeny

arXiv.org Artificial IntelligenceApr-28-2022

The impressive capabilities of recent generative models to create texts that are challenging to distinguish from the human-written ones can be misused for generating fake news, product reviews, and even abusive content. Despite the prominent performance of existing methods for artificial text detection, they still lack interpretability and robustness towards unseen models. To this end, we propose three novel types of interpretable topological features for this task based on Topological Data Analysis (TDA) which is currently understudied in the field of NLP. We empirically show that the features derived from the BERT model outperform count- and neural-based baselines up to 10\% on three common datasets, and tend to be the most robust towards unseen GPT-style generation models as opposed to existing methods. The probing analysis of the features reveals their sensitivity to the surface and syntactic properties. The results demonstrate that TDA is a promising line with respect to NLP tasks, specifically the ones that incorporate surface and structural information.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2021.emnlp-main.50

2109.04825

Country: Europe (0.46)

Genre: Research Report > New Finding (0.34)

Industry: Media > News (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models

Fenogenova, Alena, Tikhonova, Maria, Mikhailov, Vladislav, Shavrina, Tatiana, Emelyanov, Anton, Shevelev, Denis, Kukushkin, Alexandr, Malykh, Valentin, Artemova, Ekaterina

arXiv.org Artificial IntelligenceFeb-15-2022

In the last year, new neural architectures and multilingual pre-trained models have been released for Russian, which led to performance evaluation problems across a range of language understanding tasks. This paper presents Russian SuperGLUE 1.1, an updated benchmark styled after GLUE for Russian NLP models. The new version includes a number of technical, user experience and methodological improvements, including fixes of the benchmark vulnerabilities unresolved in the previous version: novel and improved tests for understanding the meaning of a word in context (RUSSE) along with reading comprehension and common sense reasoning (DaNetQA, RuCoS, MuSeRC). Together with the release of the updated datasets, we improve the benchmark toolkit based on \texttt{jiant} framework for consistent training and evaluation of NLP-models of various architectures which now supports the most recent models for Russian. Finally, we provide the integration of Russian SuperGLUE with a framework for industrial evaluation of the open-source models, MOROCCO (MOdel ResOurCe COmparison), in which the models are evaluated according to the weighted average metric over all tasks, the inference speed, and the occupied amount of RAM. Russian SuperGLUE is publicly available at https://russiansuperglue.com/.

commonsense reasoning, natural language, russian superglue 1, (3 more...)

arXiv.org Artificial Intelligence

2202.07791

Country: Africa > Middle East > Morocco (0.24)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.53)

Add feedback

Shaking Syntactic Trees on the Sesame Street: Multilingual Probing with Controllable Perturbations

Taktasheva, Ekaterina, Mikhailov, Vladislav, Artemova, Ekaterina

arXiv.org Artificial IntelligenceSep-28-2021

Recent research has adopted a new experimental field centered around the concept of text perturbations which has revealed that shuffled word order has little to no impact on the downstream performance of Transformer-based language models across many NLP tasks. These findings contradict the common understanding of how the models encode hierarchical and structural information and even question if the word order is modeled with position embeddings. To this end, this paper proposes nine probing datasets organized by the type of \emph{controllable} text perturbation for three Indo-European languages with a varying degree of word order flexibility: English, Swedish and Russian. Based on the probing analysis of the M-BERT and M-BART models, we report that the syntactic sensitivity depends on the language and model pre-training objectives. We also find that the sensitivity grows across layers together with the increase of the perturbation granularity. Last but not least, we show that the models barely use the positional information to induce syntactic trees from their intermediate self-attention and contextualized representations.

artificial intelligence, machine learning, natural language, (4 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2021.mrl-1.17

2109.14017

Genre: Research Report (0.89)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.87)
Information Technology > Artificial Intelligence > Machine Learning (0.53)

Add feedback

RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark

Shavrina, Tatiana, Fenogenova, Alena, Emelyanov, Anton, Shevelev, Denis, Artemova, Ekaterina, Malykh, Valentin, Mikhailov, Vladislav, Tikhonova, Maria, Chertok, Andrey, Evlampiev, Andrey

arXiv.org Artificial IntelligenceNov-2-2020

In this paper, we introduce an advanced Russian general language understanding evaluation benchmark -- RussianGLUE. Recent advances in the field of universal language models and transformers require the development of a methodology for their broad diagnostics and testing for general intellectual skills - detection of natural language inference, commonsense reasoning, ability to perform simple logical operations regardless of text subject or lexicon. For the first time, a benchmark of nine tasks, collected and organized analogically to the SuperGLUE methodology, was developed from scratch for the Russian language. We provide baselines, human level evaluation, an open-source framework for evaluating models (https://github.com/RussianNLP/RussianSuperGLUE), and an overall leaderboard of transformer models for the Russian language. Besides, we present the first results of comparing multilingual models in the adapted diagnostic test set and offer the first steps to further expanding or assessing state-of-the-art models independently of language.

dataset, survey article, text processing, (19 more...)

arXiv.org Artificial Intelligence

2010.15925

Country:

Europe > Russia (0.15)
Asia > China (0.14)

Genre: Research Report (0.84)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.94)

Add feedback