AITopics | Li, Yexin

Collaborating Authors

Li, Yexin

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders

Kang, Yipeng, Wang, Junqi, Li, Yexin, Zhong, Fangwei, Feng, Xue, Wang, Mengmeng, Tu, Wenming, Wang, Quansen, Li, Hengli, Zheng, Zilong

arXiv.org Artificial IntelligenceDec-31-2024

As large language models (LLMs) become increasingly integrated into critical applications, aligning their behavior with human values presents significant challenges. Current methods, such as Reinforcement Learning from Human Feedback (RLHF), often focus on a limited set of values and can be resource-intensive. Furthermore, the correlation between values has been largely overlooked and remains underutilized. Our framework addresses this limitation by mining a causal graph that elucidates the implicit relationships among various values within the LLMs. Leveraging the causal graph, we implement two lightweight mechanisms for value steering: prompt template steering and Sparse Autoencoder feature steering, and analyze the effects of altering one value dimension on others. Extensive experiments conducted on Gemma-2B-IT and Llama3-8B-IT demonstrate the effectiveness and controllability of our steering methods.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2501.00581

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

DWCL: Dual-Weighted Contrastive Learning for Multi-View Clustering

Zhang, Zhihui, Hao, Xiaoshuai, Yuan, Hanning, Chi, Lianhua, Guo, Qi, Li, Qi, Yuan, Ziqiang, Pang, Jinhui, Li, Yexin, Ruan, Sijie

arXiv.org Artificial IntelligenceNov-26-2024

Multi-view contrastive clustering (MVCC) has gained significant attention for generating consistent clustering structures from multiple views through contrastive learning. However, most existing MVCC methods create cross-views by combining any two views, leading to a high volume of unreliable pairs. Furthermore, these approaches often overlook discrepancies in multi-view representations, resulting in representation degeneration. To address these challenges, we introduce a novel model called Dual-Weighted Contrastive Learning (DWCL) for Multi-View Clustering. Specifically, to reduce the impact of unreliable cross-views, we introduce an innovative Best-Other (B-O) contrastive mechanism that enhances the representation of individual views at a low computational cost. Furthermore, we develop a dual weighting strategy that combines a view quality weight, reflecting the quality of each view, with a view discrepancy weight. This approach effectively mitigates representation degeneration by downplaying cross-views that are both low in quality and high in discrepancy. We theoretically validate the efficiency of the B-O contrastive mechanism and the effectiveness of the dual weighting strategy. Extensive experiments demonstrate that DWCL outperforms previous methods across eight multi-view datasets, showcasing superior performance and robustness in MVCC. Specifically, our method achieves absolute accuracy improvements of 5.4\% and 5.6\% compared to state-of-the-art methods on the Caltech6V7 and MSRCv1 datasets, respectively.

artificial intelligence, dataset, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2411.17354

Country: Asia > China (0.15)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

Qi, Siyuan, Chen, Shuo, Li, Yexin, Kong, Xiangyu, Wang, Junqi, Yang, Bangcheng, Wong, Pring, Zhong, Yifan, Zhang, Xiaoyuan, Zhang, Zhaowei, Liu, Nian, Wang, Wei, Yang, Yaodong, Zhu, Song-Chun

arXiv.org Artificial IntelligenceJan-19-2024

The generalization of decision-making agents encompasses two fundamental elements: learning from past experiences and reasoning in novel contexts. However, the predominant emphasis in most interactive environments is on learning, often at the expense of complexity in reasoning. In this paper, we introduce CivRealm, an environment inspired by the Civilization game. Civilization's profound alignment with human history and society necessitates sophisticated learning, while its ever-changing situations demand strong reasoning to generalize. Particularly, CivRealm sets up an imperfect-information general-sum game with a changing number of players; it presents a plethora of complex features, challenging the agent to deal with open-ended stochastic environments that require diplomacy and negotiation skills. Within CivRealm, we provide interfaces for two typical agent types: tensor-based agents that focus on learning, and language-based agents that emphasize reasoning. To catalyze further research, we present initial results for both paradigms. The canonical RL-based agents exhibit reasonable performance in mini-games, whereas both RL- and LLM-based agents struggle to make substantial progress in the full game. Overall, CivRealm stands as a unique learning and reasoning challenge for decision-making agents. The code is available at https://github.com/bigai-ai/civrealm.

machine learning, natural language, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2401.10568

Country:

Europe (0.14)
Asia > China (0.14)

Genre: Research Report (0.63)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Government > Military (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.93)
(2 more...)

Add feedback

CityNet: A Multi-city Multi-modal Dataset for Smart City Applications

Geng, Xu, Jin, Yilun, Zheng, Zhengfei, Yang, Yu, Li, Yexin, Tian, Han, Duan, Peibo, Wang, Leye, Cao, Jiannong, Yang, Hai, Yang, Qiang, Chen, Kai

arXiv.org Artificial IntelligenceJun-30-2021

Data-driven approaches have been applied to many problems in urban computing. However, in the research community, such approaches are commonly studied under data from limited sources, and are thus unable to characterize the complexity of urban data coming from multiple entities and the correlations among them. Consequently, an inclusive and multifaceted dataset is necessary to facilitate more extensive studies on urban computing. In this paper, we present CityNet, a multi-modal urban dataset containing data from 7 cities, each of which coming from 3 data sources. We first present the generation process of CityNet as well as its basic properties. In addition, to facilitate the use of CityNet, we carry out extensive machine learning experiments, including spatio-temporal predictions, transfer learning, and reinforcement learning. The experimental results not only provide benchmarks for a wide range of tasks and methods, but also uncover internal correlations among cities and tasks within CityNet that, with adequate leverage, can improve performances on various tasks. With the benchmarking results and the correlations uncovered, we believe that CityNet can contribute to the field of urban computing by supporting research on many advanced topics.

artificial intelligence, machine learning, multi-city multi-modal dataset, (4 more...)

arXiv.org Artificial Intelligence

2106.15802

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.87)

Add feedback