Collaborating Authors

 Chen, Xiaowei


MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples

arXiv.org Artificial Intelligence

Aligning Large Language Models (LLMs) with human feedback is crucial for their development. Existing preference optimization methods such as DPO and KTO, while improvements over Reinforcement Learning from Human Feedback (RLHF), are inherently derived from PPO: they require a reference model that consumes additional GPU memory and rely heavily on abundant preference data. Meanwhile, current preference optimization research mainly targets single-question scenarios with two replies, neglecting scenarios with multiple replies, which wastes data in practice. This study introduces the MPPO algorithm, which leverages the average likelihood of model responses to fit the reward function and maximizes the utilization of preference data. Through a comparison of Point-wise, Pair-wise, and List-wise implementations, we found that the Pair-wise approach achieves the best performance, significantly enhancing the quality of model responses. Experimental results demonstrate MPPO's outstanding performance across various benchmarks. On MT-Bench, MPPO outperforms DPO, ORPO, and SimPO. Notably, on Arena-Hard, MPPO surpasses DPO and ORPO by substantial margins. These results underscore the advantages of MPPO in preference optimization tasks.
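As a rough illustration of the idea described above, the sketch below shows one way a pair-wise objective built on length-averaged (average-likelihood) response scores could be implemented without a reference model; the function names, the logistic loss form, and the `beta` scaling are illustrative assumptions rather than the authors' exact formulation.

```python
import torch
import torch.nn.functional as F

def avg_loglik(token_logps: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Length-averaged log-likelihood of each response.

    token_logps: (num_responses, seq_len) per-token log-probabilities
    mask:        (num_responses, seq_len) 1 for real tokens, 0 for padding
    """
    return (token_logps * mask).sum(-1) / mask.sum(-1).clamp(min=1)

def pairwise_preference_loss(chosen_logps, chosen_mask,
                             rejected_logps, rejected_mask,
                             beta: float = 1.0) -> torch.Tensor:
    """Pair the single chosen reply against K rejected replies.

    The average log-likelihood acts as an implicit reward, so no reference
    model is needed; the chosen reply is compared with every rejected reply
    and the pair-wise logistic losses are averaged.
    """
    r_chosen = avg_loglik(chosen_logps, chosen_mask)        # shape (1,)
    r_rejected = avg_loglik(rejected_logps, rejected_mask)  # shape (K,)
    margins = beta * (r_chosen - r_rejected)                # broadcast to (K,)
    return -F.logsigmoid(margins).mean()
```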


Pricing Catastrophe Bonds -- A Probabilistic Machine Learning Approach

arXiv.org Artificial Intelligence

Catastrophe (CAT) bonds have become increasingly vital in managing and transferring catastrophic risk. These bonds offer a source of capital to cover losses arising from natural disasters, allowing investors to diversify their portfolios while helping issuers mitigate potentially devastating financial consequences. Understanding the pricing dynamics of CAT bonds is essential, both for investors seeking informed decisions and for issuers optimizing their risk management strategies. This paper introduces a probabilistic machine-learning-based predictive framework for the pricing of CAT bonds, aiming to enhance empirical pricing accuracy and discover previously undetected nonlinear dependence between the key risk factors and CAT bond spreads. Early research by Lane (2000) laid the groundwork for the CAT bond pricing literature, proposing a log-linear regression model employing conditional expected loss and probability of first loss as predictors. Subsequent studies expanded on this linear framework, incorporating additional predictors and examining pricing under diverse conditions. Gürtler et al. (2016) incorporated bond characteristics like trigger type and bond rating, while Braun (2016) integrated market condition indices, such as the Lane Synthetic Rate on Line index and the BB corporate bond spread. Götze and Gürtler (2020a) explored sponsor-related pricing inefficiencies across different market conditions, and Morana and Sbrana (2019) focused on the impact of climate change on CAT bond returns. Further extending the research scope, Zhao and Yu (2020) utilized actual catastrophe data to forecast CAT bond prices using market-based methods, Braun et al. (2022) developed factor pricing models for cross-sectional CAT bond returns, and Herrmann and Hibbeln (2023) investigated liquidity premiums in the secondary market.
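For context, a minimal sketch of the Lane-style log-linear baseline mentioned above is given below, regressing the log spread on the log probability of first loss and the log conditional expected loss; the synthetic numbers and coefficient values are purely illustrative and not taken from any dataset in the paper.

```python
import numpy as np

# Illustrative synthetic inputs (not real CAT bond data):
# pfl = probability of first loss, cel = conditional expected loss.
rng = np.random.default_rng(0)
pfl = rng.uniform(0.01, 0.10, size=200)
cel = rng.uniform(0.05, 0.60, size=200)
spread = 2.5 * pfl**0.45 * cel**0.55 * rng.lognormal(0.0, 0.1, size=200)

# Log-linear regression: log(spread) ~ const + a*log(pfl) + b*log(cel)
X = np.column_stack([np.ones_like(pfl), np.log(pfl), np.log(cel)])
y = np.log(spread)
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print("intercept and elasticities w.r.t. PFL and CEL:", coef)
```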


Distance and Hop-wise Structures Encoding Enhanced Graph Attention Networks

arXiv.org Artificial Intelligence

Many works have shown that existing neighbor-averaging Graph Neural Networks cannot efficiently capture structure information; in some cases such GNNs cannot even capture degree features. The reason is intuitive: since neighbor-averaging GNNs can only combine neighbors' feature vectors for every node, if the neighbors' feature vectors contain no structure information, hop-wise neighbor-averaging GNNs can at best capture degree information ([1]; [2]; [3]). So, as an intuitive idea, injecting structure information into feature vectors may improve the performance of GNNs. Numerous works have shown that injecting structure, distance, position, or spatial information can significantly improve the performance of neighbor-averaging GNNs ([4]; [5]; [6]; [7]; [8]; [9]; [10]). However, existing works have their problems. Some have very high computational complexity and cannot be applied to large-scale graphs (MotifNet [4]). Some simply concatenate structure information with the intrinsic feature vector (ID-GNN [6]; P-GNN [8]; DE-GNN [9]), which may confuse signals from different features. For example, in the ogbn-arxiv dataset the intrinsic feature is a semantic embedding of the headline or abstract, which provides a totally different signal from structure information. Some are graph-level-task oriented and only handle small graphs (Graphormer [7]; SubGNN [10]).
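To make the injection idea concrete, the sketch below computes a simple hop-wise structure encoding (counts of nodes first reached at each hop) and combines it with intrinsic features through separate projections rather than raw concatenation; the encoding choice and module names are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

def hopwise_counts(adj: torch.Tensor, num_hops: int = 3) -> torch.Tensor:
    """Per-node counts of nodes first reached at hops 1..num_hops.

    adj: (N, N) binary adjacency matrix (float).  Returns an (N, num_hops)
    structure encoding; the hop-1 counts are just node degrees.
    """
    frontier = adj.clone()
    seen = adj + torch.eye(adj.size(0))
    feats = [frontier.sum(-1, keepdim=True)]
    for _ in range(num_hops - 1):
        frontier = ((frontier @ adj) > 0).float() * (seen == 0).float()
        seen = ((seen + frontier) > 0).float()
        feats.append(frontier.sum(-1, keepdim=True))
    return torch.cat(feats, dim=-1)

class StructureAwareInput(nn.Module):
    """Project intrinsic and structural features separately and sum them,
    instead of concatenating two signals of very different nature."""
    def __init__(self, feat_dim: int, struct_dim: int, hidden_dim: int):
        super().__init__()
        self.feat_proj = nn.Linear(feat_dim, hidden_dim)
        self.struct_proj = nn.Linear(struct_dim, hidden_dim)

    def forward(self, x: torch.Tensor, s: torch.Tensor) -> torch.Tensor:
        return self.feat_proj(x) + self.struct_proj(s)

# Usage (shapes only): s = hopwise_counts(adj); h = StructureAwareInput(d, 3, 128)(x, s)
```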


Multi-layered Network Exploration via Random Walks: From Offline Optimization to Online Learning

arXiv.org Artificial Intelligence

The multi-layered network exploration (MuLaNE) problem is an important problem abstracted from many applications. In MuLaNE, there are multiple network layers where each node has an importance weight and each layer is explored by a random walk. The MuLaNE task is to allocate a total random walk budget $B$ across the network layers so that the total weight of the unique nodes visited by the random walks is maximized. We systematically study this problem from offline optimization to online learning. For the offline optimization setting, where the network structure and node weights are known, we provide greedy-based constant-ratio approximation algorithms for overlapping networks, and greedy- or dynamic-programming-based optimal solutions for non-overlapping networks. For the online learning setting, neither the network structure nor the node weights are known initially. We adapt the combinatorial multi-armed bandit framework and design algorithms that learn random-walk-related parameters and node weights while optimizing the budget allocation over multiple rounds, and we prove that they achieve logarithmic regret bounds. Finally, we conduct experiments on a real-world social network dataset to validate our theoretical results.
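As an illustration of the exact solution for non-overlapping layers mentioned above, the sketch below runs a dynamic program over (layer, remaining budget) pairs; the per-layer value table is assumed to be given (e.g. precomputed expected visit weights), and the toy numbers are made up.

```python
def dp_allocate(budget: int, values):
    """Exact budget split across layers by dynamic programming.

    values[l][b] is the expected total weight of unique nodes visited when
    layer l receives budget b (with values[l][0] == 0).  For non-overlapping
    layers the objective is the sum of per-layer values, so a DP over
    (layer, remaining budget) recovers an optimal allocation.
    """
    num_layers = len(values)
    best = [0.0] * (budget + 1)                      # best value using layers seen so far
    choice = [[0] * (budget + 1) for _ in range(num_layers)]
    for l in range(num_layers):
        new_best = [0.0] * (budget + 1)
        for b in range(budget + 1):
            for k in range(b + 1):                   # budget handed to layer l
                cand = best[b - k] + values[l][k]
                if cand > new_best[b]:
                    new_best[b] = cand
                    choice[l][b] = k
        best = new_best
    alloc, b = [0] * num_layers, budget              # backtrack the allocation
    for l in reversed(range(num_layers)):
        alloc[l] = choice[l][b]
        b -= alloc[l]
    return alloc, best[budget]

# Toy value tables for three layers and budget 5 (numbers are illustrative only).
vals = [[0, 3.0, 5.0, 6.0, 6.5, 6.8],
        [0, 2.0, 4.0, 6.0, 8.0, 9.0],
        [0, 1.0, 1.5, 1.8, 2.0, 2.1]]
print(dp_allocate(5, vals))
```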


Community Exploration: From Offline Optimization to Online Learning

Neural Information Processing Systems

We introduce the community exploration problem, which has various real-world applications such as online advertising. In the problem, an explorer allocates a limited budget to explore communities so as to maximize the number of members he could meet. We provide a systematic study of the community exploration problem, from offline optimization to online learning. For the offline setting, where the sizes of the communities are known, we prove that greedy methods are optimal for both non-adaptive and adaptive exploration. For the online setting, where the sizes of the communities are not known and need to be learned from multi-round explorations, we propose an ``upper confidence''-like algorithm that achieves logarithmic regret bounds. By combining the feedback from different rounds, we can achieve a constant regret bound.
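To make the offline greedy method concrete, the sketch below uses the closed-form expected number of distinct members met, assuming each visit meets a uniformly random community member, and allocates the budget unit by unit to the community with the largest marginal gain; this is a simplified illustration, not the paper's exact algorithm or its online ``upper confidence'' variant.

```python
def expected_distinct(size: int, b: int) -> float:
    """Expected number of distinct members met after b visits to a community
    of the given size, assuming each visit meets a uniformly random member."""
    return size * (1.0 - (1.0 - 1.0 / size) ** b)

def greedy_explore(sizes, budget: int):
    """Non-adaptive greedy allocation: give each budget unit to the community
    with the largest marginal gain in expected distinct members met."""
    alloc = [0] * len(sizes)
    for _ in range(budget):
        gains = [expected_distinct(m, b + 1) - expected_distinct(m, b)
                 for m, b in zip(sizes, alloc)]
        alloc[gains.index(max(gains))] += 1
    return alloc

# Example: three communities of sizes 5, 20 and 50, total budget 10.
print(greedy_explore([5, 20, 50], 10))
```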


Community Exploration: From Offline Optimization to Online Learning

arXiv.org Machine Learning

We introduce the community exploration problem, which has many real-world applications such as online advertising. In the problem, an explorer allocates a limited budget to explore communities so as to maximize the number of members he could meet. We provide a systematic study of the community exploration problem, from offline optimization to online learning. For the offline setting, where the sizes of the communities are known, we prove that greedy methods are optimal for both non-adaptive and adaptive exploration. For the online setting, where the sizes of the communities are not known and need to be learned from multi-round explorations, we propose an `upper confidence'-like algorithm that achieves logarithmic regret bounds. By combining the feedback from different rounds, we can achieve a constant regret bound.