Collaborating Authors

 Murray, Gabriel


Visual Analytics for Generative Transformer Models

arXiv.org Artificial Intelligence

While transformer-based models have achieved state-of-the-art results in a variety of classification and generation tasks, their black-box nature makes them difficult to interpret. In this work, we present a novel visual analytics framework to support the analysis of transformer-based generative networks. In contrast to previous work, which has mainly focused on encoder-based models, our framework is one of the first dedicated to transformer-based encoder-decoder and decoder-only models for generative and classification tasks. It offers an intuitive overview that allows the user to explore different facets of the model through interactive visualizations. To demonstrate the feasibility and usefulness of the framework, we present three detailed case studies based on real-world NLP research problems.
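The framework itself is interactive, but the kind of signal such a tool surfaces can be illustrated with a small, self-contained sketch: extracting and plotting attention weights from a decoder-only model. GPT-2 and the head-averaging choice here are illustrative assumptions, not details from the paper.

```python
import torch
import matplotlib.pyplot as plt
from transformers import GPT2Tokenizer, GPT2LMHeadModel

# GPT-2 as a stand-in decoder-only model (an assumption, not the paper's setup).
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2", output_attentions=True)
model.eval()

inputs = tokenizer("Visual analytics helps interpret transformers", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions: one (batch, heads, seq, seq) tensor per layer.
# Average the heads of the final layer for a single token-to-token heatmap.
attn = outputs.attentions[-1][0].mean(dim=0).numpy()
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])

fig, ax = plt.subplots()
ax.imshow(attn, cmap="viridis")
ax.set_xticks(range(len(tokens)))
ax.set_yticks(range(len(tokens)))
ax.set_xticklabels(tokens, rotation=90)
ax.set_yticklabels(tokens)
ax.set_title("Mean attention, final layer")
plt.tight_layout()
plt.show()
```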


Mixture-of-Linguistic-Experts Adapters for Improving and Interpreting Pre-trained Language Models

arXiv.org Artificial Intelligence

In this work, we propose a method that combines two popular research areas by injecting linguistic structures into pre-trained language models in the parameter-efficient fine-tuning (PEFT) setting. In our approach, parallel adapter modules encoding different linguistic structures are combined using a novel Mixture-of-Linguistic-Experts architecture, where Gumbel-Softmax gates determine the importance of these modules at each layer of the model. To reduce the number of parameters, we first train the model for a fixed small number of steps before pruning the experts based on their importance scores. Our experimental results with three different pre-trained models show that our approach can outperform state-of-the-art PEFT methods with a comparable number of parameters. In addition, we analyze the experts selected by each model at each layer to offer insights for future studies.
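A minimal PyTorch sketch of the gating idea follows: parallel bottleneck adapters are mixed per layer with Gumbel-Softmax weights, whose softmaxed logits can serve as importance scores for pruning. The bottleneck size, temperature, and pruning criterion here are assumptions for illustration, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Adapter(nn.Module):
    """Bottleneck adapter: down-project, nonlinearity, up-project, residual."""
    def __init__(self, hidden_dim, bottleneck_dim=64):  # bottleneck size is assumed
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)

    def forward(self, x):
        return x + self.up(F.relu(self.down(x)))

class MixtureOfLinguisticExperts(nn.Module):
    """One layer's parallel adapters, mixed by Gumbel-Softmax gates (sketch)."""
    def __init__(self, hidden_dim, num_experts=3, tau=1.0):
        super().__init__()
        self.experts = nn.ModuleList([Adapter(hidden_dim) for _ in range(num_experts)])
        self.gate_logits = nn.Parameter(torch.zeros(num_experts))
        self.tau = tau

    def forward(self, x):  # x: (batch, seq, hidden)
        # Differentiable, near-one-hot weights over the experts.
        weights = F.gumbel_softmax(self.gate_logits, tau=self.tau)
        stacked = torch.stack([e(x) for e in self.experts])  # (E, batch, seq, hidden)
        return (weights.view(-1, 1, 1, 1) * stacked).sum(dim=0)

    def importance_scores(self):
        # Stand-in pruning criterion: softmax of the learned gate logits.
        return F.softmax(self.gate_logits, dim=0)
```

After the fixed number of training steps, experts whose `importance_scores()` fall below a threshold would be dropped, mirroring the pruning step described above.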


Diversity-Aware Coherence Loss for Improving Neural Topic Models

arXiv.org Artificial Intelligence

The standard approach to neural topic modeling uses a variational autoencoder (VAE) framework that jointly minimizes the KL divergence between the estimated posterior and the prior, in addition to the reconstruction loss. Since neural topic models are trained by reconstructing individual input documents, they do not explicitly capture the coherence between topic words at the corpus level. In this work, we propose a novel diversity-aware coherence loss that encourages the model to learn corpus-level coherence scores while maintaining high diversity between topics. Experimental results on multiple datasets show that our method significantly improves the performance of neural topic models without requiring any pretraining or additional parameters.
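As a rough illustration of how such a regularizer could attach to the usual VAE objective, here is a sketch that rewards corpus-level word-pair coherence in the topic-word matrix while penalizing overlap between topics. Both the functional form and the names (`beta`, `coherence`) are assumptions for illustration, not the paper's exact loss.

```python
import torch
import torch.nn.functional as F

def diversity_aware_coherence_loss(beta, coherence, diversity_weight=1.0):
    """Illustrative regularizer added to the VAE loss (reconstruction + KL).

    beta:      (K, V) topic-word logits for K topics over a V-word vocabulary.
    coherence: (V, V) precomputed corpus-level word-pair coherence scores.
    """
    beta = F.softmax(beta, dim=-1)
    # Reward topics whose high-probability words co-occur coherently.
    coherence_term = torch.einsum("kv,vw,kw->", beta, coherence, beta)
    # Penalize similarity between distinct topics to keep them diverse.
    overlap = beta @ beta.T                                   # (K, K)
    off_diag = overlap * (1.0 - torch.eye(beta.size(0), device=beta.device))
    return -coherence_term + diversity_weight * off_diag.sum()
```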


Supervised Topic Segmentation of Email Conversations

AAAI Conferences

We propose a graph-theoretic supervised topic segmentation model for email conversations that combines (i) lexical knowledge, (ii) conversational features, and (iii) topic features. We compare our results with the existing unsupervised models (i.e., LCSeg and LDA) and with their two extensions for email conversations (i.e., LCSeg+FQG and LDA+FQG), which not only use lexical information but also exploit finer conversation structure. Empirical evaluation shows that our supervised model is the best performer, achieving the highest accuracy by combining the three knowledge sources, with knowledge about the conversation proving to be the most important indicator for segmenting emails.
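The combination of the three knowledge sources can be sketched as a supervised classifier over concatenated feature groups. Everything below (feature names, dimensions, the logistic-regression choice, the toy data) is a placeholder; the paper's graph-theoretic model goes further than this sketch, which omits the graph construction and partitioning steps.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Placeholder extractors for a candidate boundary between adjacent sentences.
def lexical_features(pair):         # e.g., cosine similarity, cue phrases
    return rng.random(5)

def conversational_features(pair):  # e.g., speaker change, reply/quotation links
    return rng.random(4)

def topic_features(pair):           # e.g., topic-shift scores from LDA
    return rng.random(3)

def featurize(pair):
    # Concatenate the three knowledge sources into one feature vector.
    return np.concatenate([lexical_features(pair),
                           conversational_features(pair),
                           topic_features(pair)])

# Toy training set: y = 1 marks a topic boundary at that position.
pairs = [("sent_%d" % i, "sent_%d" % (i + 1)) for i in range(100)]
X = np.vstack([featurize(p) for p in pairs])
y = rng.integers(0, 2, size=len(pairs))

clf = LogisticRegression(max_iter=1000).fit(X, y)
print("P(boundary):", clf.predict_proba(X[:1])[0, 1])
```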