AITopics | pipa

PIPA: Preference Alignment as Prior-Informed Statistical Estimation

Li, Junbo, Wang, Zhangyang, Liu, Qiang

arXiv.org Machine Learning, Feb-8-2025

Offline preference alignment for language models such as Direct Preference Optimization (DPO) is favored for its effectiveness and simplicity, eliminating the need for costly reinforcement learning. Various offline algorithms have been developed for different data settings, yet they lack a unified understanding. In this study, we introduce Prior-Informed Preference Alignment (PIPA), a unified, RL-free probabilistic framework that formulates language model preference alignment as a Maximum Likelihood Estimation (MLE) problem with prior constraints. This method effectively accommodates both paired and unpaired data, as well as answer and step-level annotations. We illustrate that DPO and KTO are special cases with different prior constraints within our framework. By integrating different types of prior information, we developed two variations of PIPA: PIPA-M and PIPA-N. Both algorithms demonstrate a $3\sim10\%$ performance enhancement on the GSM8K and MATH benchmarks across all configurations, achieving these gains without additional training or computational costs compared to existing algorithms.
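
For context on the DPO baseline that the abstract mentions, here is a minimal sketch of the standard DPO loss for a single preference pair. It is not the PIPA objective, which the abstract does not spell out. The function name, argument names, and the default value of beta are illustrative assumptions, and the inputs are assumed to be summed token log-probabilities of each answer under the trained policy and a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair; not PIPA itself.

    All arguments except beta are sequence log-probabilities (assumed inputs).
    """
    # Log-ratio of policy to reference for each answer.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # Scaled margin: positive when the policy favors the chosen answer
    # more strongly than the reference model does.
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)), written as softplus(-margin) for stability.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Policy identical to the reference: the loss equals log(2).
    print(dpo_loss(-5.0, -7.0, -5.0, -7.0))
    # Policy favors the chosen answer more than the reference: lower loss.
    print(dpo_loss(-4.0, -8.0, -5.0, -7.0))
```

The loss depends only on how far the policy's preference margin moves relative to the reference model, which is why no reward model or reinforcement learning loop is needed in this baseline.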

machine learning, natural language, preprint arxiv

arXiv:2502.05773

Country: North America > United States > Texas > Travis County > Austin (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.86)

Can tweets predict the next flu epidemic? - IBM Industries

#artificialintelligence, Nov-21-2019, 10:32:43 GMT

The fall season brings many familiar favorites. It's common nowadays to see notifications from healthcare organizations on the local news alongside email reminders from employers about annual flu shots. If anything, it's a normal occurrence, perhaps even an anticipated one, alongside ads for new pumpkin spice-flavored consumables. But even with careful preparation, healthcare professionals often work behind the curve to track the progress of reported flu outbreaks. Numerous factors are at play.

flu epidemic, pipa, tweet predict

Country: Europe > Germany (0.05)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.76)
