AITopics | May, Jonathan

Collaborating Authors

May, Jonathan

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Speechworthy Instruction-tuned Language Models

Cho, Hyundong, Jedema, Nicolaas, Ribeiro, Leonardo F. R., Sharma, Karishma, Szekely, Pedro, Moschitti, Alessandro, Janssen, Ruben, May, Jonathan

arXiv.org Artificial IntelligenceSep-22-2024

Current instruction-tuned language models are exclusively trained with textual preference data and thus are often not aligned with the unique requirements of other modalities, such as speech. To better align language models with the speech domain, we explore (i) prompting strategies grounded in radio-industry best practices and (ii) preference learning using a novel speech-based preference data of 20K samples, generated with a wide spectrum of prompts that induce varying dimensions of speech-suitability and labeled by annotators who listen to response pairs. Both human and automatic evaluation show that both prompting and preference learning increase the speech-suitability of popular instruction-tuned LLMs. Interestingly, we find that prompting and preference learning can be additive; combining them achieves the best win rates in head-to-head comparison, resulting in responses that are preferred or tied to the base model in 76.2% of comparisons on average. Lastly, we share lexical, syntactical, and qualitative analyses to showcase how each method contributes to improving the speech-suitability of generated responses.

large language model, machine learning, system prompt, (19 more...)

arXiv.org Artificial Intelligence

2409.14672

Country:

North America (0.67)
Asia > Thailand (0.46)
Asia > Middle East (0.28)
Asia > Japan > Honshū > Chūgoku > Tottori Prefecture (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Media > Radio (0.55)
Banking & Finance > Economy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Are Large Language Models Capable of Generating Human-Level Narratives?

Tian, Yufei, Huang, Tenghao, Liu, Miri, Jiang, Derek, Spangher, Alexander, Chen, Muhao, May, Jonathan, Peng, Nanyun

arXiv.org Artificial IntelligenceJul-18-2024

This paper investigates the capability of LLMs in storytelling, focusing on narrative development and plot progression. We introduce a novel computational framework to analyze narratives through three discourse-level aspects: i) story arcs, ii) turning points, and iii) affective dimensions, including arousal and valence. By leveraging expert and automatic annotations, we uncover significant discrepancies between the LLM- and human- written stories. While human-written stories are suspenseful, arousing, and diverse in narrative structures, LLM stories are homogeneously positive and lack tension. Next, we measure narrative reasoning skills as a precursor to generative capacities, concluding that most LLMs fall short of human abilities in discourse understanding. Finally, we show that explicit integration of aforementioned discourse features can enhance storytelling, as is demonstrated by over 40% improvement in neural storytelling in terms of diversity, suspense, and arousal.

artificial intelligence, generating human-level narrative, natural language, (1 more...)

arXiv.org Artificial Intelligence

2407.13248

Genre: Research Report (0.89)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Style Transfer with Multi-iteration Preference Optimization

Liu, Shuai, May, Jonathan

arXiv.org Artificial IntelligenceJun-17-2024

Numerous recent techniques for text style transfer characterize their approaches as variants of reinforcement learning and preference optimization. In this work, we consider the relationship between these approaches and a class of optimization approaches developed primarily for (non-neural) statistical machine translation, formerly known as 'tuning'. Inspired by these techniques from the past, we improve upon established preference optimization approaches, incorporating multiple iterations of exploration and optimization, and choosing contrastive examples by following a 'hope' vs 'fear' sampling strategy. Cognizant of the difference between machine translation and style transfer, however, we further tailor our framework with a new pseudo-parallel generation method and a dynamic weighted reward aggregation method to tackle the lack of parallel data and the need for a multi-objective reward. We evaluate our model on two commonly used text style transfer datasets. Through automatic and human evaluation results we show the effectiveness and the superiority of our model compared to state-of-the-art baselines.

computational linguistic, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2406.11581

Country:

Asia (1.00)
Europe (0.67)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre:

Research Report > Experimental Study (0.69)
Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges

Deshpande, Darshan, Sinha, Shambhavi, Kumar, Anirudh Ravi, Pal, Debaditya, May, Jonathan

arXiv.org Artificial IntelligenceJun-15-2024

Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes existing human-annotated, closed-domain datasets using Large Language Models and produces synthetic open-domain dialogues for negotiation. GNOME improves the generalizability of negotiation systems while reducing the expensive and subjective task of manual data curation. Through our experimental setup, we create a benchmark comparing encoder and decoder models trained on existing datasets against datasets created through GNOME. Our results show that models trained on our dataset not only perform better than previous state of the art models on domain specific strategy prediction, but also generalize better to previously unseen domains.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2406.10764

Country:

North America > United States > California (0.14)
North America > Canada (0.14)
Europe > Middle East > Malta (0.14)
Europe > France (0.14)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play

Wongkamjan, Wichayaporn, Gu, Feng, Wang, Yanze, Hermjakob, Ulf, May, Jonathan, Stewart, Brandon M., Kummerfeld, Jonathan K., Peskoff, Denis, Boyd-Graber, Jordan Lee

arXiv.org Artificial IntelligenceJun-7-2024

The boardgame Diplomacy is a challenging setting for communicative and cooperative artificial intelligence. The most prominent communicative Diplomacy AI, Cicero, has excellent strategic abilities, exceeding human players. However, the best Diplomacy players master communication, not just tactics, which is why the game has received attention as an AI challenge. This work seeks to understand the degree to which Cicero succeeds at communication. First, we annotate in-game communication with abstract meaning representation to separate in-game tactics from general language. Second, we run two dozen games with humans and Cicero, totaling over 200 human-player hours of competition. While AI can consistently outplay human players, AI-Human communication is still limited because of AI's difficulty with deception and persuasion. This shows that Cicero relies on strategy and has not yet reached the full promise of communicative and cooperative AI.

cicero, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2406.04643

Country:

Europe (1.00)
North America > United States > California (0.14)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games > Computer Games (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Ma, Xuezhe, Yang, Xiaomeng, Xiong, Wenhan, Chen, Beidi, Yu, Lili, Zhang, Hao, May, Jonathan, Zettlemoyer, Luke, Levy, Omer, Zhou, Chunting

arXiv.org Artificial IntelligenceApr-16-2024

The quadratic complexity and weak length extrapolation of Transformers limits their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and state space models exist, they empirically underperform Transformers in pretraining efficiency and downstream task accuracy. We introduce Megalodon, a neural architecture for efficient sequence modeling with unlimited context length. Megalodon inherits the architecture of Mega (exponential moving average with gated attention), and further introduces multiple technical components to improve its capability and stability, including complex exponential moving average (CEMA), timestep normalization layer, normalized attention mechanism and pre-norm with two-hop residual configuration. In a controlled head-to-head comparison with Llama2, Megalodon achieves better efficiency than Transformer in the scale of 7 billion parameters and 2 trillion training tokens. Megalodon reaches a training loss of 1.70, landing mid-way between Llama2-7B (1.75) and 13B (1.67). Code: https://github.com/XuezheMax/megalodon

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2404.08801

Country:

North America > United States > California (0.28)
North America > Canada > Ontario > Toronto (0.14)

Genre:

Research Report (0.64)
Personal (0.46)

Industry:

Media > Film (0.67)
Education (0.67)
Health & Medicine (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Authorship Style Transfer with Policy Optimization

Liu, Shuai, Agarwal, Shantanu, May, Jonathan

arXiv.org Artificial IntelligenceMar-12-2024

Authorship style transfer aims to rewrite a given text into a specified target while preserving the original meaning in the source. Existing approaches rely on the availability of a large number of target style exemplars for model training. However, these overlook cases where a limited number of target style examples are available. The development of parameter-efficient transfer learning techniques and policy optimization (PO) approaches suggest lightweight PO is a feasible approach to low-resource style transfer. In this work, we propose a simple two step tune-and-optimize technique for low-resource textual style transfer. We apply our technique to authorship transfer as well as a larger-data native language style task and in both cases find it outperforms state-of-the-art baseline models.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2403.08043

Country:

Asia (1.00)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Cross-lingual Lifelong Learning

M'hamdi, Meryem, Ren, Xiang, May, Jonathan

arXiv.org Artificial IntelligenceDec-28-2023

The longstanding goal of multi-lingual learning has been to develop a universal cross-lingual model that can withstand the changes in multilingual data distributions. There has been a large amount of work to adapt such multilingual models to unseen target languages. However, the majority of work in this direction focuses on the standard one-hop transfer learning pipeline from source to target languages, Figure 1: An overview of CCL: We use an example whereas in realistic scenarios, new languages of a non-stationary datastream moving from high to can be incorporated at any time in a sequential low resource languages. Each bold and dashed box manner. In this paper, we present a principled represents either a training or test data instance being Cross-lingual Continual Learning (CCL) evaluation fine-tuned or evaluated on, respectively. To support this paradigm, where we analyze different categories problem setup, we evaluate the cross-lingual capabilities of approaches used to continually adapt of continual approaches. Those capabilities include to emerging data from different languages. We knowledge preservation on old languages, accumulation provide insights into what makes multilingual to the current language, and generalization to sequential learning particularly challenging.

artificial intelligence, machine learning, natural language, (13 more...)

arXiv.org Artificial Intelligence

2205.11152

Country:

Europe (0.67)
North America > United States > California (0.46)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Education > Educational Setting > Continuing Education (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Continual Dialogue State Tracking via Example-Guided Question Answering

Cho, Hyundong, Madotto, Andrea, Lin, Zhaojiang, Chandu, Khyathi Raghavi, Kottur, Satwik, Xu, Jing, May, Jonathan, Sankar, Chinnadhurai

arXiv.org Artificial IntelligenceDec-14-2023

Dialogue systems are frequently updated to accommodate new services, but naively updating them by continually training with data for new services in diminishing performance on previously learnt services. Motivated by the insight that dialogue state tracking (DST), a crucial component of dialogue systems that estimates the user's goal as a conversation proceeds, is a simple natural language understanding task, we propose reformulating it as a bundle of granular example-guided question answering tasks to minimize the task shift between services and thus benefit continual learning. Our approach alleviates service-specific memorization and teaches a model to contextualize the given question and example to extract the necessary information from the conversation. We find that a model with just 60M parameters can achieve a significant boost by learning to learn from in-context examples retrieved by a retriever trained to identify turns with similar dialogue state changes. Combining our method with dialogue-level memory replay, our approach attains state of the art performance on DST continual learning metrics without relying on any complex regularization or parameter expansion methods.

machine learning, natural language, question answering, (17 more...)

arXiv.org Artificial Intelligence

2305.13721

Country:

Europe (0.93)
Asia > Middle East > UAE (0.14)
North America > United States > California (0.14)

Genre: Research Report (1.00)

Industry: Education (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Add feedback

Tracking the Newsworthiness of Public Documents

Spangher, Alexander, Ferrara, Emilio, Welsh, Ben, Peng, Nanyun, Tumgoren, Serdar, May, Jonathan

arXiv.org Artificial IntelligenceNov-16-2023

Journalists must find stories in huge amounts of textual data (e.g. leaks, bills, press releases) as part of their jobs: determining when and why text becomes news can help us understand coverage patterns and help us build assistive tools. Yet, this is challenging because very few labelled links exist, language use between corpora is very different, and text may be covered for a variety of reasons. In this work we focus on news coverage of local public policy in the San Francisco Bay Area by the San Francisco Chronicle. First, we gather news articles, public policy documents and meeting recordings and link them using probabilistic relational modeling, which we show is a low-annotation linking methodology that outperforms other retrieval-based baselines. Second, we define a new task: newsworthiness prediction, to predict if a policy item will get covered. We show that different aspects of public policy discussion yield different newsworthiness signals. Finally we perform human evaluation with expert journalists and show our systems identify policies they consider newsworthy with 68% F1 and our coverage recommendations are helpful with an 84% win-rate.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2311.09734

Country: North America > United States > California > San Francisco County > San Francisco (0.45)

Genre: Research Report > New Finding (0.46)

Industry:

Media > News (1.00)
Law (1.00)
Government > Regional Government (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback