AITopics | Tian, Yuxing

Collaborating Authors

Tian, Yuxing

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Mode

Tian, Yuxing, Qi, Yiyan, Jiang, Aiwen, Huang, Qi, Guo, Jian

arXiv.org Artificial IntelligenceJul-11-2024

Continuous-Time Dynamic Graph (CTDG) precisely models evolving real-world relationships, drawing heightened interest in dynamic graph learning across academia and industry. However, existing CTDG models encounter challenges stemming from noise and limited historical data. Graph Data Augmentation (GDA) emerges as a critical solution, yet current approaches primarily focus on static graphs and struggle to effectively address the dynamics inherent in CTDGs. Moreover, these methods often demand substantial domain expertise for parameter tuning and lack theoretical guarantees for augmentation efficacy. To address these issues, we propose Conda, a novel latent diffusion-based GDA method tailored for CTDGs. Conda features a sandwich-like architecture, incorporating a Variational Auto-Encoder (VAE) and a conditional diffusion model, aimed at generating enhanced historical neighbor embeddings for target nodes. Unlike conventional diffusion models trained on entire graphs via pre-training, Conda requires historical neighbor sequence embeddings of target nodes for training, thus facilitating more targeted augmentation. We integrate Conda into the CTDG model and adopt an alternating training strategy to optimize performance. Extensive experimentation across six widely used real-world datasets showcases the consistent performance improvement of our approach, particularly in scenarios with limited historical data.

artificial intelligence, machine learning, node, (15 more...)

arXiv.org Artificial Intelligence

2407.085

Country: North America > United States (0.68)

Genre:

Research Report (0.64)
Instructional Material (0.47)

Industry: Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Add feedback

A Survey on Large Language Model Hallucination via a Creativity Perspective

Jiang, Xuhui, Tian, Yuxing, Hua, Fengrui, Xu, Chengjin, Wang, Yuanzhuo, Guo, Jian

arXiv.org Artificial IntelligenceFeb-2-2024

Hallucinations in large language models (LLMs) are always seen as limitations. However, could they also be a source of creativity? This survey explores this possibility, suggesting that hallucinations may contribute to LLM application by fostering creativity. This survey begins with a review of the taxonomy of hallucinations and their negative impact on LLM reliability in critical applications. Then, through historical examples and recent relevant theories, the survey explores the potential creative benefits of hallucinations in LLMs. To elucidate the value and evaluation criteria of this connection, we delve into the definitions and assessment methods of creativity. Following the framework of divergent and convergent thinking phases, the survey systematically reviews the literature on transforming and harnessing hallucinations for creativity in LLMs. Finally, the survey discusses future research directions, emphasizing the need to further explore and refine the application of hallucinations in creative processes within LLMs.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2402.06647

Country: North America > United States (0.14)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey

Zhang, Weixu, Wang, Yifei, Song, Yuanfeng, Wei, Victor Junqiu, Tian, Yuxing, Qi, Yiyan, Chan, Jonathan H., Wong, Raymond Chi-Wing, Yang, Haiqin

arXiv.org Artificial IntelligenceOct-27-2023

The emergence of natural language processing has revolutionized the way users interact with tabular data, enabling a shift from traditional query languages and manual plotting to more intuitive, language-based interfaces. The rise of large language models (LLMs) such as ChatGPT and its successors has further advanced this field, opening new avenues for natural language processing techniques. This survey presents a comprehensive overview of natural language interfaces for tabular data querying and visualization, which allow users to interact with data using natural language queries. We introduce the fundamental concepts and techniques underlying these interfaces with a particular emphasis on semantic parsing, the key technology facilitating the translation from natural language to SQL queries or data visualization commands. We then delve into the recent advancements in Text-to-SQL and Text-to-Vis problems from the perspectives of datasets, methodologies, metrics, and system designs. This includes a deep dive into the influence of LLMs, highlighting their strengths, limitations, and potential for future improvements. Through this survey, we aim to provide a roadmap for researchers and practitioners interested in developing and applying natural language interfaces for data interaction in the era of large language models.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2310.17894

Country:

Europe (1.00)
Asia > China (0.68)
North America > Canada (0.67)
(3 more...)

Genre: Overview (1.00)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

M3FGM:a node masking and multi-granularity message passing-based federated graph model for spatial-temporal data prediction

Tian, Yuxing, Liu, Zheng, Qu, Yanwen, Li, Song, Luo, Jiachi

arXiv.org Artificial IntelligenceSep-7-2023

Researchers are solving the challenges of spatial-temporal prediction by combining Federated Learning (FL) and graph models with respect to the constrain of privacy and security. In order to make better use of the power of graph model, some researchs also combine split learning(SL). However, there are still several issues left unattended: 1) Clients might not be able to access the server during inference phase; 2) The graph of clients designed manually in the server model may not reveal the proper relationship between clients. This paper proposes a new GNN-oriented split federated learning method, named node {\bfseries M}asking and {\bfseries M}ulti-granularity {\bfseries M}essage passing-based Federated Graph Model (M$^3$FGM) for the above issues. For the first issue, the server model of M$^3$FGM employs a MaskNode layer to simulate the case of clients being offline. We also redesign the decoder of the client model using a dual-sub-decoders structure so that each client model can use its local data to predict independently when offline. As for the second issue, a new GNN layer named Multi-Granularity Message Passing (MGMP) layer enables each client node to perceive global and local information. We conducted extensive experiments in two different scenarios on two real traffic datasets. Results show that M$^3$FGM outperforms the baselines and variant models, achieves the best results in both datasets and scenarios.

artificial intelligence, machine learning, node, (19 more...)

arXiv.org Artificial Intelligence

2210.16193

Country:

North America > United States (0.14)
Asia > China (0.14)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.63)

Add feedback

Boosting long-term forecasting performance for continuous-time dynamic graph networks via data augmentation

Tian, Yuxing, Zhu, Mingjie, Luo, Jiachi, Li, Song

arXiv.org Artificial IntelligenceApr-12-2023

This study focuses on long-term forecasting (LTF) on continuous-time dynamic graph networks (CTDGNs), which is important for real-world modeling. Existing CTDGNs are effective for modeling temporal graph data due to their ability to capture complex temporal dependencies but perform poorly on LTF due to the substantial requirement for historical data, which is not practical in most cases. To relieve this problem, a most intuitive way is data augmentation. In this study, we propose \textbf{\underline{U}ncertainty \underline{M}asked \underline{M}ix\underline{U}p (UmmU)}: a plug-and-play module that conducts uncertainty estimation to introduce uncertainty into the embedding of intermediate layer of CTDGNs, and perform masked mixup to further enhance the uncertainty of the embedding to make it generalize to more situations. UmmU can be easily inserted into arbitrary CTDGNs without increasing the number of parameters. We conduct comprehensive experiments on three real-world dynamic graph datasets, the results demonstrate that UmmU can effectively improve the long-term forecasting performance for CTDGNs.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2304.05749

Genre: Research Report > New Finding (0.86)

Industry: Education > Educational Setting > Online (0.31)

Technology:

Information Technology > Information Management (0.93)
Information Technology > Modeling & Simulation (0.84)
Information Technology > Communications (0.73)
(4 more...)

Add feedback

Motif-aware temporal GCN for fraud detection in signed cryptocurrency trust networks

Li, Song, Zhou, Jiandong, MO, Chong, LI, Jin, Tso, Geoffrey K. F., Tian, Yuxing

arXiv.org Artificial IntelligenceMar-29-2023

Graph convolutional networks (GCNs) is a class of artificial neural networks for processing data that can be represented as graphs. Since financial transactions can naturally be constructed as graphs, GCNs are widely applied in the financial industry, especially for financial fraud detection. In this paper, we focus on fraud detection on cryptocurrency truct networks. In the literature, most works focus on static networks. Whereas in this study, we consider the evolving nature of cryptocurrency networks, and use local structural as well as the balance theory to guide the training process. More specifically, we compute motif matrices to capture the local topological information, then use them in the GCN aggregation process. The generated embedding at each snapshot is a weighted average of embeddings within a time window, where the weights are learnable parameters. Since the trust networks is signed on each edge, balance theory is used to guide the training process. Experimental results on bitcoin-alpha and bitcoin-otc datasets show that the proposed model outperforms those in the literature.

data mining, information, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2211.13123

Country: Asia > China (0.68)

Genre: Research Report > New Finding (0.36)

Industry:

Banking & Finance > Trading (1.00)
Law Enforcement & Public Safety > Fraud (0.90)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback