AITopics | Pappadopulo, Duccio

Collaborating Authors

Pappadopulo, Duccio

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Non-contrastive sentence representations via self-supervision

Farina, Marco, Pappadopulo, Duccio

arXiv.org Artificial IntelligenceOct-26-2023

Sample contrastive methods, typically referred to simply as contrastive are the foundation of most unsupervised methods to learn text and sentence embeddings. On the other hand, a different class of self-supervised loss functions and methods have been considered in the computer vision community and referred to as dimension contrastive. In this paper, we thoroughly compare this class of methods with the standard baseline for contrastive sentence embeddings, SimCSE. We find that self-supervised embeddings trained using dimension contrastive objectives can outperform SimCSE on downstream tasks without needing auxiliary loss functions.

non-contrastive sentence representation

arXiv.org Artificial Intelligence

2310.1769

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.40)

Add feedback

Distillation of encoder-decoder transformers for sequence labelling

Farina, Marco, Pappadopulo, Duccio, Gupta, Anant, Huang, Leslie, İrsoy, Ozan, Solorio, Thamar

arXiv.org Artificial IntelligenceFeb-10-2023

Driven by encouraging results on a wide range of tasks, the field of NLP is experiencing an accelerated race to develop bigger language models. This race for bigger models has also underscored the need to continue the pursuit of practical distillation approaches that can leverage the knowledge acquired by these big models in a compute-efficient manner. Having this goal in mind, we build on recent work to propose a hallucination-free framework for sequence tagging that is especially suited for distillation. We show empirical results of new state-of-the-art performance across multiple sequence labelling datasets and validate the usefulness of this framework for distilling a large model in a few-shot learning scenario.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2302.05454

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Hierarchical clustering in particle physics through reinforcement learning

Brehmer, Johann, Macaluso, Sebastian, Pappadopulo, Duccio, Cranmer, Kyle

arXiv.org Artificial IntelligenceNov-16-2020

Particle physics experiments often require the reconstruction of decay patterns through a hierarchical clustering of the observed final-state particles. We show that this task can be phrased as a Markov Decision Process and adapt reinforcement learning algorithms to solve it. In particular, we show that Monte-Carlo Tree Search guided by a neural policy can construct high-quality hierarchical clusterings and outperform established greedy and beam search baselines.

artificial intelligence, particle, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2011.08191

Country: North America > United States (0.68)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Dialogue Act Classification in Group Chats with DAG-LSTMs

İrsoy, Ozan, Gosangi, Rakesh, Zhang, Haimin, Wei, Mu-Hsin, Lund, Peter, Pappadopulo, Duccio, Fahy, Brendan, Nephytou, Neophytos, Ortiz, Camilo

arXiv.org Machine LearningAug-2-2019

Dialogue act (DA) classification has been studied for the past two decades and has several key applications such as workflow automation and conversation analytics. Researchers have used, to address this problem, various traditional machine learning models, and more recently deep neural network models such as hierarchical convolutional neural networks (CNNs) and long short-term memory (LSTM) networks. In this paper, we introduce a new model architecture, directed-acyclic-graph LSTM (DAG-LSTM) for DA classification. A DAG-LSTM exploits the turn-taking structure naturally present in a multi-party conversation, and encodes this relation in its model structure. Using the STAC corpus, we show that the proposed method performs roughly 0.8% better in accuracy and 1.2% better in macro-F1 score when compared to existing methods. The proposed method is generic and not limited to conversation applications.

deep learning, neural network, utterance, (18 more...)

arXiv.org Machine Learning

1908.01821

Country:

North America > United States (0.28)
Europe > United Kingdom > England (0.14)

Genre:

Overview (0.67)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Inferring the quantum density matrix with machine learning

Cranmer, Kyle, Golkar, Siavash, Pappadopulo, Duccio

arXiv.org Machine LearningApr-11-2019

In particular, There is a nexus of concepts at the heart of a rich interplay machine learning techniques have been used for between physics, statistics, machine learning, and variational optimization of ground state energy for quantum information theory. Concepts such as entropy that were systems [6]. Additionally, there have been a number key to the early work in thermodynamics are the bedrock of important developments that extend statistical inference of information theory. Similarly the Gibbs (or Boltzman) to domains where probabilistic modeling was previously distribution, which characterize the distribution of states inaccessible. These techniques have recently been in thermal equilibrium, is at the heart of energy based explored to solve statistical mechanics of classical systems models and Boltzman machines that were widely studied [7, 8]. In this work, we aim to connect recent developments in machine learning [1, 2]. Additionally, the study of in deep generative models [9-12], unsupervised complicated many-body systems gave rise to mean-field learning for implicit models [13], and variational inference methods and renormalization group methods.

deep learning, density matrix, neural network, (20 more...)

arXiv.org Machine Learning

1904.05903

Country: North America > United States > New York (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback