AITopics | sipo

Iteratively Learn Diverse Strategies with State Distance Information

Neural Information Processing Systems, Feb-11-2026, 07:07:41 GMT

In addition, we examine two common computation frameworks for this problem, i.e., population-based training (PBT) and iterative learning

artificial intelligence, machine learning, natural language, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Austria (0.04)
North America > United States > Maryland > Baltimore (0.04)
(11 more...)

Genre: Research Report (0.68)

Industry:

Leisure & Entertainment > Sports (0.67)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(3 more...)

Iteratively Learn Diverse Strategies with State Distance Information

Neural Information Processing Systems, Dec-24-2025, 23:43:07 GMT

In complex reinforcement learning (RL) problems, policies with similar rewards may have substantially different behaviors. It remains a fundamental challenge to optimize rewards while also discovering as many strategies as possible, which can be crucial in many practical applications. Our study examines two design choices for tackling this challenge, i.e., diversity measure and computation framework. First, we find that with existing diversity measures, visually indistinguishable policies can still yield high diversity scores. To accurately capture the behavioral difference, we propose to incorporate the state-space distance information into the diversity measure.

iteratively learn diverse strategy, name change, state distance information, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Iteratively Learn Diverse Strategies with State Distance Information

Neural Information Processing Systems, Oct-8-2025, 14:41:40 GMT

In addition, we examine two common computation frameworks for this problem, i.e., population-based training (PBT) and iterative learning

diversity measure, international conference, sipo, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Austria (0.04)
North America > United States > Maryland > Baltimore (0.04)
(11 more...)

Genre: Research Report (0.68)

Industry:

Leisure & Entertainment > Sports (0.67)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(3 more...)

Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment

Li, Moxin, Zhang, Yuantao, Wang, Wenjie, Shi, Wentao, Liu, Zhuo, Feng, Fuli, Chua, Tat-Seng

arXiv.org Artificial Intelligence, Feb-20-2025

Multi-Objective Alignment (MOA) aims to align LLMs' responses with multiple human preference objectives, with Direct Preference Optimization (DPO) emerging as a prominent approach. However, we find that DPO-based MOA approaches suffer from widespread preference conflicts in the data, where different objectives favor different responses. This results in conflicting optimization directions, hindering the optimization on the Pareto Front. To address this, we propose to construct Pareto-optimal responses to resolve preference conflicts. To efficiently obtain and utilize such responses, we propose a self-improving DPO framework that enables LLMs to self-generate and select Pareto-optimal responses for self-supervised preference alignment. Extensive experiments on two datasets demonstrate the superior Pareto Front achieved by our framework compared to various baselines. Code is available at https://github.com/zyttt-coder/SIPO.
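The two notions the abstract relies on, a preference conflict between two responses and a Pareto-optimal response among several candidates, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the candidate names, and the two-objective scores below are all hypothetical, and higher scores are assumed to be better on every objective.

```python
from typing import Dict, List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is at least as good as b on every objective and strictly better on one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def in_conflict(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if some objective prefers a while another objective prefers b."""
    return any(x > y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(scores: Dict[str, List[float]]) -> List[str]:
    """Return the candidates that no other candidate dominates."""
    return [
        name
        for name, s in scores.items()
        if not any(dominates(t, s) for other, t in scores.items() if other != name)
    ]


# Hypothetical per-objective scores (objective 0, objective 1) for four responses.
candidates = {
    "response_a": [0.9, 0.2],
    "response_b": [0.3, 0.8],
    "response_c": [0.95, 0.85],
    "response_d": [0.5, 0.5],
}

conflict_ab = in_conflict(candidates["response_a"], candidates["response_b"])
front = pareto_front(candidates)
print(conflict_ab, front)
```

In this toy data, response_a and response_b conflict because each wins on a different objective, while response_c is at least as good as both on every objective, which is the kind of response the abstract proposes to construct and train on.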

objective, pareto-optimal response, preference conflict, (15 more...)

arXiv.org Artificial Intelligence

2502.14354

Country:

Europe > Austria > Vienna (0.15)
North America > United States > Florida > Miami-Dade County > Miami (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry:

Education (0.68)
Health & Medicine > Therapeutic Area (0.67)
Health & Medicine > Health Care Providers & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Iteratively Learn Diverse Strategies with State Distance Information

Neural Information Processing Systems, Jan-16-2025, 07:42:21 GMT

In complex reinforcement learning (RL) problems, policies with similar rewards may have substantially different behaviors. It remains a fundamental challenge to optimize rewards while also discovering as many diverse strategies as possible, which can be crucial in many practical applications. Our study examines two design choices for tackling this challenge, i.e., diversity measure and computation framework. First, we find that with existing diversity measures, visually indistinguishable policies can still yield high diversity scores. To accurately capture the behavioral difference, we propose to incorporate the state-space distance information into the diversity measure.
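The observation that near-identical policies can still score as diverse can be illustrated with a small sketch. It contrasts a hypothetical action-based score with a hypothetical state-distance score on toy 2-D trajectories. Neither function is the measure proposed in the paper; the names, the time-aligned Euclidean distance, and the data are assumptions chosen only for illustration.

```python
import math
from typing import List, Sequence, Tuple

State = Tuple[float, float]


def action_disagreement(actions_a: Sequence[int], actions_b: Sequence[int]) -> float:
    """Fraction of time steps at which the two policies chose different actions."""
    pairs = list(zip(actions_a, actions_b))
    return sum(a != b for a, b in pairs) / len(pairs)


def mean_state_distance(states_a: Sequence[State], states_b: Sequence[State]) -> float:
    """Average Euclidean distance between time-aligned states of two trajectories."""
    pairs = list(zip(states_a, states_b))
    return sum(math.dist(s, t) for s, t in pairs) / len(pairs)


# Policies A and B use different action labels but trace almost the same path.
states_a: List[State] = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
states_b: List[State] = [(0.0, 0.01), (1.0, 0.01), (2.0, 0.01)]
actions_a = [0, 0, 0]
actions_b = [1, 1, 1]

# Policy C moves in a different direction altogether.
states_c: List[State] = [(0.0, 0.0), (0.0, 1.0), (0.0, 2.0)]

print(action_disagreement(actions_a, actions_b))  # action-based score for A vs B
print(mean_state_distance(states_a, states_b))    # state-distance score for A vs B
print(mean_state_distance(states_a, states_c))    # state-distance score for A vs C
```

Here the action-based score treats A and B as maximally different even though their paths almost coincide, whereas the state-distance score is small for A versus B and much larger for A versus C.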

diversity measure, iteratively learn diverse strategy, state distance information, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.42)

China's research institutes file more AI patents than businesses

#artificialintelligence, Aug-10-2018, 17:04:24 GMT

Chinese academic institutions are more prolific patent filers in the artificial intelligence (AI) area than domestic companies, according to China's State Intellectual Property Office (SIPO). SIPO shared the statement, based on a release from China IP News, on Wednesday, August 1. The release is based on "China's AI Development Report 2018", which was recently published by Tsinghua University, in Beijing. The university's report revealed that the most prolific filers in AI tend to come from research institutions, such as universities. Unlike in other countries, industry players in China file fewer patents in the AI sphere than those in research institutions. The country's "top IT giants" such as Alibaba and Tencent are "overwhelmed" by the filings of foreign companies, such as IBM and Microsoft, SIPO said.

artificial intelligence, china, patent, (10 more...)

#artificialintelligence

Country:

Asia > China > Beijing > Beijing (0.26)
North America > United States (0.06)
Asia > Taiwan > Taiwan Province > Taipei (0.06)
(2 more...)

Industry:

Law > Intellectual Property & Technology Law (0.62)
Information Technology (0.38)

Technology: Information Technology > Artificial Intelligence (1.00)
