AITopics | Dixit, Tanay

Collaborating Authors

Dixit, Tanay

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

LatentGAN Autoencoder: Learning Disentangled Latent Distribution

Kalwar, Sanket, Aich, Animikh, Dixit, Tanay, Chhabra, Adit

arXiv.org Artificial IntelligenceJan-20-2025

Generative models like GAN(Goodfellow et al. 2014) and In this work, we present a new way to learn control VAE(Kingma and Welling 2014) have shown remarkable over autoencoder latent distribution with the help of AAE progress in recent years.Generative adversarial networks (Makhzani et al. 2016) which approximates posterior of the have shown state-of-the-art performance in a variety of latent distribution of autoencoder using any arbitrary prior tasks like Image-To-Image translation(Isola et al. 2018), distribution and using (Chen et al. 2016) for learning disentangled video prediction(Liang et al. 2017), Text-to-Image translation(Zhang representation. The previous work by (Wang, Peng, et al. 2017), drug discovery(Hong et al. 2019), and Ko 2019) had used a similar method of learning the latent and privacy-preserving(Shi et al. 2018). VAE has shown prior using AAE along with a perceptual loss and Information state-of-the-art performance in a variety of tasks like image maximization regularizer to train the decoder with generation(Gregor et al. 2015), semi-supervised learning(Maaløe the help of an extra discriminator.

artificial intelligence, latent distribution, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2204.0201

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

Sai, Ananya B., Nagarajan, Vignesh, Dixit, Tanay, Dabre, Raj, Kunchukuttan, Anoop, Kumar, Pratyush, Khapra, Mitesh M.

arXiv.org Artificial IntelligenceJul-3-2023

The rapid growth of machine translation (MT) systems has necessitated comprehensive studies to meta-evaluate evaluation metrics being used, which enables a better selection of metrics that best reflect MT quality. Unfortunately, most of the research focuses on high-resource languages, mainly English, the observations for which may not always apply to other languages. Indian languages, having over a billion speakers, are linguistically different from English, and to date, there has not been a systematic study of evaluating MT systems from English into Indian languages. In this paper, we fill this gap by creating an MQM dataset consisting of 7000 fine-grained annotations, spanning 5 Indian languages and 7 MT systems, and use it to establish correlations between annotator scores and scores obtained using existing automatic metrics. Our results show that pre-trained metrics, such as COMET, have the highest correlations with annotator scores. Additionally, we find that the metrics do not adequately capture fluency-based errors in Indian languages, and there is a need to develop metrics focused on Indian languages. We hope that our dataset and analysis will help promote further research in this area.

artificial intelligence, computational linguistic, natural language, (15 more...)

arXiv.org Artificial Intelligence

2212.1018

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Pennsylvania (0.14)
North America > United States > Minnesota (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Government (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Improving Factuality of Abstractive Summarization without Sacrificing Summary Quality

Dixit, Tanay, Wang, Fei, Chen, Muhao

arXiv.org Artificial IntelligenceMay-24-2023

Improving factual consistency of abstractive summarization has been a widely studied topic. However, most of the prior works on training factuality-aware models have ignored the negative effect it has on summary quality. We propose EFACTSUM (i.e., Effective Factual Summarization), a candidate summary generation and ranking technique to improve summary factuality without sacrificing summary quality. We show that using a contrastive learning framework with our refined candidate summaries leads to significant gains on both factuality and similarity-based metrics. Specifically, we propose a ranking strategy in which we effectively combine two metrics, thereby preventing any conflict during training. Models trained using our approach show up to 6 points of absolute improvement over the base model with respect to FactCC on XSUM and 11 points on CNN/DM, without negatively affecting either similarity-based metrics or absractiveness.

computational linguistic, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2305.14981

Country:

North America > United States (1.00)
Europe (1.00)
Asia > Middle East > Republic of Türkiye (0.29)

Genre: Research Report (1.00)

Industry:

Banking & Finance (1.00)
Government > Regional Government (0.94)
Government > Military (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Add feedback

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

Dhole, Kaustubh D., Gangal, Varun, Gehrmann, Sebastian, Gupta, Aadesh, Li, Zhenhao, Mahamood, Saad, Mahendiran, Abinaya, Mille, Simon, Srivastava, Ashish, Tan, Samson, Wu, Tongshuang, Sohl-Dickstein, Jascha, Choi, Jinho D., Hovy, Eduard, Dusek, Ondrej, Ruder, Sebastian, Anand, Sajant, Aneja, Nagender, Banjade, Rabin, Barthe, Lisa, Behnke, Hanna, Berlot-Attwell, Ian, Boyle, Connor, Brun, Caroline, Cabezudo, Marco Antonio Sobrevilla, Cahyawijaya, Samuel, Chapuis, Emile, Che, Wanxiang, Choudhary, Mukund, Clauss, Christian, Colombo, Pierre, Cornell, Filip, Dagan, Gautier, Das, Mayukh, Dixit, Tanay, Dopierre, Thomas, Dray, Paul-Alexis, Dubey, Suchitra, Ekeinhor, Tatiana, Di Giovanni, Marco, Gupta, Rishabh, Gupta, Rishabh, Hamla, Louanes, Han, Sang, Harel-Canada, Fabrice, Honore, Antoine, Jindal, Ishan, Joniak, Przemyslaw K., Kleyko, Denis, Kovatchev, Venelin, Krishna, Kalpesh, Kumar, Ashutosh, Langer, Stefan, Lee, Seungjae Ryan, Levinson, Corey James, Liang, Hualou, Liang, Kaizhao, Liu, Zhexiong, Lukyanenko, Andrey, Marivate, Vukosi, de Melo, Gerard, Meoni, Simon, Meyer, Maxime, Mir, Afnan, Moosavi, Nafise Sadat, Muennighoff, Niklas, Mun, Timothy Sum Hon, Murray, Kenton, Namysl, Marcin, Obedkova, Maria, Oli, Priti, Pasricha, Nivranshu, Pfister, Jan, Plant, Richard, Prabhu, Vinay, Pais, Vasile, Qin, Libo, Raji, Shahab, Rajpoot, Pawan Kumar, Raunak, Vikas, Rinberg, Roy, Roberts, Nicolas, Rodriguez, Juan Diego, Roux, Claude, S., Vasconcellos P. H., Sai, Ananya B., Schmidt, Robin M., Scialom, Thomas, Sefara, Tshephisho, Shamsi, Saqib N., Shen, Xudong, Shi, Haoyue, Shi, Yiwen, Shvets, Anna, Siegel, Nick, Sileo, Damien, Simon, Jamie, Singh, Chandan, Sitelew, Roman, Soni, Priyank, Sorensen, Taylor, Soto, William, Srivastava, Aman, Srivatsa, KV Aditya, Sun, Tony, T, Mukund Varma, Tabassum, A, Tan, Fiona Anting, Teehan, Ryan, Tiwari, Mo, Tolkiehn, Marie, Wang, Athena, Wang, Zijian, Wang, Gloria, Wang, Zijie J., Wei, Fuxuan, Wilie, Bryan, Winata, Genta Indra, Wu, Xinyi, Wydmański, Witold, Xie, Tianbao, Yaseen, Usama, Yee, M., Zhang, Jing, Zhang, Yue

arXiv.org Artificial IntelligenceDec-5-2021

Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data splits according to specific features). We describe the framework and an initial set of 117 transformations and 23 filters for a variety of natural language tasks. We demonstrate the efficacy of NL-Augmenter by using several of its transformations to analyze the robustness of popular natural language models. The infrastructure, datacards and robustness analysis results are available publicly on the NL-Augmenter repository (\url{https://github.com/GEM-benchmark/NL-Augmenter}).

artificial intelligence, machine learning, natural language, (24 more...)

arXiv.org Artificial Intelligence

2112.02721

Country:

North America > Canada (1.00)
Europe (1.00)
Asia (1.00)
(5 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports (1.00)
Information Technology > Security & Privacy (0.92)
Education (0.92)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)

Add feedback