AITopics | fudge

Collaborating Authors

fudge

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

79ec2a4246feb2126ecf43c4a4418002-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 11:20:15 GMT

constraint, usim, xsim, (16 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.88)

Add feedback

1be5bc25d50895ee656b8c2d9eb89d6a-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 18:33:48 GMT

coffee shop, customer rating, diffusion-lm, (16 more...)

Neural Information Processing Systems

Country:

South America > Brazil (0.04)
North America > United States > California (0.04)

Industry:

Consumer Products & Services > Restaurants (1.00)
Leisure & Entertainment > Sports (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

1 . For all authors a

Neural Information Processing SystemsAug-15-2025, 08:13:46 GMT

Do the main claims made in the abstract and introduction accurately reflect the paper's If you ran experiments... (a) Did you include the code, data, and instructions needed to reproduce the main experimental results (either in the supplemental material or as a URL)? [Y es] Provided in the Did you specify all the training details (e.g., data splits, hyperparameters, how they Did you report error bars (e.g., with respect to the random seed after running experiments multiple times)? Did you include the total amount of compute and the type of resources used (e.g., type If you are using existing assets (e.g., code, data, models) or curating/releasing new assets... (a) If your work uses existing assets, did you cite the creators? Did you include any new assets either in the supplemental material or as a URL? [Y es] Did you discuss whether and how consent was obtained from people whose data you're If you used crowdsourcing or conducted research with human subjects... (a) Figure 2 shows an overview of our proposed approach. Any number of differentiable constraints can be incorporated. D.1 Semantic similarity models We explain the semantic similarity models we use in our experiments in more detail here: 15 Weights Fluency (%) Transfer (%) wsim (w.r .t. input) wsim (w.r .t. ref.) log p ( y | x) We use this model for adding constraints in style-transfer ( 3.1) and D.2 Models used in multi-attribute transfer We collect Y elp restaurant reviews using scripts provided by Lample et al.

constraint, usim, xsim, (16 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.88)

Add feedback

Controlling Difficulty of Generated Text for AI-Assisted Language Learning

Jin, Meiqing, Dugan, Liam, Callison-Burch, Chris

arXiv.org Artificial IntelligenceJun-5-2025

Practicing conversations with large language models (LLMs) presents a promising alternative to traditional in-person language learning. However, most LLMs generate text at a near-native level of complexity, making them ill-suited for beginner learners (CEFR: A1-A2). In this paper, we investigate whether controllable generation techniques -- specifically modular methods that do not require model fine-tuning -- can adapt LLM outputs to better support absolute beginners. We evaluate these methods through both automatic metrics and a user study with university-level learners of Japanese. Our findings show that while prompting alone fails to control output difficulty, the use of future discriminators (Yang and Klein, 2021) significantly improves output comprehensibility (from 40.4\% to 84.3\%). We further introduce a novel token-level evaluation metric, Token Miss Rate (TMR), that quantifies the proportion of incomprehensible tokens per utterance and correlates strongly with human judgments. To support future research in AI-assisted language learning, we release our code, models, annotation tools, and dataset.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2506.04072

Country:

North America > United States (1.00)
Europe (0.67)
Asia > Middle East > UAE (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Education > Curriculum > Subject-Specific Education (0.91)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Revealed: The common words that used to have VERY different meanings - including 'meat', 'flirt, and 'pink'

Daily Mail - Science & techMay-3-2025, 08:06:34 GMT

If scientists had a time machine, having a conversation with a Brit from even just 250 years ago could be very confusing. Although they'd be speaking the same language as us, the meaning of many English words have dramatically changed. In fact, the mention of things like'fudge', 'meat', 'pink', 'stripe', 'flirt' and'artificial' in a certain context could send our 18th century ancestors into a muddle. Lynne Cahill, a linguistics professor at the University of Sussex, said some words change their meanings and others don't because'there are lots of things going on'. 'As our lives change, we need words for different things, so some meanings go out of use (think of different types of horse-drawn carriage) and new ones come in (think of technology, like mobile phones and computers),' she told MailOnline. 'Languages deal with these things in different ways, sometimes using existing words with related meanings to refer to new things.' MailOnline has scoured the historical records and dictionaries to find more than 40 words that once had a very different definition.

artificial intelligence, meat, natural language, (15 more...)

Daily Mail - Science & tech

Country:

North America > United States (0.15)
Europe > United Kingdom > England (0.05)

Technology:

Information Technology > Artificial Intelligence > Robots (0.49)
Information Technology > Artificial Intelligence > Natural Language (0.41)

Add feedback

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Zhang, Shujian, Wu, Lemeng, Gong, Chengyue, Liu, Xingchao

arXiv.org Machine LearningMar-25-2024

Recent works have demonstrated success in controlling sentence attributes ($e.g.$, sentiment) and structure ($e.g.$, syntactic structure) based on the diffusion language model. A key component that drives theimpressive performance for generating high-quality samples from noise is iteratively denoise for thousands of steps. While beneficial, the complexity of starting from the noise and the learning steps has limited its implementation to many NLP real-world applications. This paper proposes Language Rectified Flow ({\ours}). Our method is based on the reformulation of the standard probabilistic flow models. Language rectified flow learns (neural) ordinary differential equation models to transport between the source distribution and the target distribution, hence providing a unified and effective solution to generative modeling and domain transfer. From the source distribution, our language rectified flow yields fast simulation and effectively decreases the inference time. Experiments on three challenging fine-grained control tasks and multiple high-quality text editing show that our method consistently outperforms its baselines. Extensive experiments and ablation studies demonstrate that our method can be general, effective, and beneficial for many NLP tasks.

arxiv preprint arxiv, latent space, text generation, (13 more...)

arXiv.org Machine Learning

2403.16995

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report > New Finding (0.93)

Industry:

Health & Medicine (0.68)
Energy (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Composable Text Controls in Latent Space with ODEs

Liu, Guangyi, Feng, Zeyu, Gao, Yuan, Yang, Zichao, Liang, Xiaodan, Bao, Junwei, He, Xiaodong, Cui, Shuguang, Li, Zhen, Hu, Zhiting

arXiv.org Artificial IntelligenceNov-6-2023

Real-world text applications often involve composing a wide range of text control operations, such as editing the text w.r.t. an attribute, manipulating keywords and structure, and generating new text of desired properties. Prior work typically learns/finetunes a language model (LM) to perform individual or specific subsets of operations. Recent research has studied combining operations in a plug-and-play manner, often with costly search or optimization in the complex sequence space. This paper proposes a new efficient approach for composable text operations in the compact latent space of text. The low-dimensionality and differentiability of the text latent vector allow us to develop an efficient sampler based on ordinary differential equations (ODEs) given arbitrary plug-in operators (e.g., attribute classifiers). By connecting pretrained LMs (e.g., GPT2) to the latent space through efficient adaption, we then decode the sampled vectors into desired text sequences. The flexible approach permits diverse control operators (sentiment, tense, formality, keywords, etc.) acquired using any relevant data from different domains. Experiments show that composing those operators within our approach manages to generate or edit high-quality text, substantially improving over previous methods in terms of generation quality and efficiency.

classifier, fudge, latent space, (16 more...)

arXiv.org Artificial Intelligence

2208.00638

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(18 more...)

Genre: Research Report > New Finding (0.67)

Industry: Consumer Products & Services > Restaurants (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Choi, Sehyun, Fang, Tianqing, Wang, Zhaowei, Song, Yangqiu

arXiv.org Artificial IntelligenceOct-13-2023

Large Language Models (LLMs) have demonstrated remarkable human-level natural language generation capabilities. However, their potential to generate misinformation, often called the hallucination problem, poses a significant risk to their deployment. A common approach to address this issue is to retrieve relevant knowledge and fine-tune the LLM with the knowledge in its input. Unfortunately, this method incurs high training costs and may cause catastrophic forgetting for multi-tasking models. To overcome these limitations, we propose a knowledge-constrained decoding method called KCTS (Knowledge-Constrained Tree Search), which guides a frozen LM to generate text aligned with the reference knowledge at each decoding step using a knowledge classifier score and MCTS (Monte-Carlo Tree Search). To adapt the sequence-level knowledge classifier to token-level guidance, we also propose a novel token-level hallucination detection method called RIPA (Reward Inflection Point Approximation). Our empirical results on knowledge-grounded dialogue and abstractive summarization demonstrate the strength of KCTS as a plug-and-play, model-agnostic decoding method that can effectively reduce hallucinations in natural language generation.

computational linguistic, knowledge, proceedings, (16 more...)

arXiv.org Artificial Intelligence

2310.09044

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
North America > Canada > Ontario > Toronto (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(15 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Health & Medicine > Consumer Health (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Self-conditioning pre-trained language models

Suau, Xavier, Zappella, Luca, Apostoloff, Nicholas

arXiv.org Artificial IntelligenceJun-14-2023

In this paper we aim to investigate the mechanisms that guide text generation with pre-trained Transformer-based Language Models (TLMs). Grounded on the Product of Experts formulation by Hinton (1999), we describe a generative mechanism that exploits expert units which naturally exist in TLMs. Such units are responsible for detecting concepts in the input and conditioning text generation on such concepts. We describe how to identify expert units and how to activate them during inference in order to induce any desired concept in the generated output. We find that the activation of a surprisingly small amount of units is sufficient to steer text generation (as little as 3 units in a model with 345M parameters). While the objective of this work is to learn more about how TLMs work, we show that our method is effective for conditioning without fine-tuning or using extra parameters, even on fine-grained homograph concepts. Additionally, we show that our method can be used to correct gender bias present in the output of TLMs and achieves gender parity for all evaluated contexts. We compare our method with FUDGE and PPLM-BoW, and show that our approach is able to achieve gender parity at a lower perplexity. The proposed method is accessible to a wide audience thanks to its simplicity and minimal compute needs. The findings in this paper are a step forward in understanding the generative mechanisms of TLMs.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2110.02802

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports > Soccer (0.93)
Leisure & Entertainment > Sports > Football (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation

Liu, Yiren, Kilicoglu, Halil

arXiv.org Artificial IntelligenceFeb-2-2023

Improving the emotional awareness of pre-trained language models is an emerging important problem for dialogue generation tasks. Although prior studies have introduced methods to improve empathetic dialogue generation, few have discussed how to incorporate commonsense knowledge into pre-trained language models for controllable dialogue generation. In this study, we propose a novel framework that improves empathetic dialogue generation using pre-trained language models by 1) incorporating commonsense knowledge through prompt verbalization, and 2) controlling dialogue generation using a strategy-driven future discriminator. We conducted experiments to reveal that both the incorporation of social commonsense knowledge and enforcement of control over generation help to improve generation performance. Finally, we discuss the implications of our study for future research.

dialogue generation, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2302.01441

Country:

North America > United States > Illinois > Champaign County > Urbana (0.05)
North America > United States > Pennsylvania (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.78)

Add feedback