AITopics | Zhang, Xun

Zhang, Xun

PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration

Wu, Songhao, Lv, Ang, Feng, Xiao, Zhang, Yufei, Zhang, Xun, Yin, Guojun, Lin, Wei, Yan, Rui

arXiv.org Artificial Intelligence, Feb-1-2025

The KV cache in large language models is a dominant factor in memory usage, limiting their broader applicability. Quantizing the cache to lower bit widths is an effective way to reduce computational costs; however, previous methods struggle with quantizing key vectors due to outliers, resulting in excessive overhead. We propose a novel quantization approach called PolarQuant, which efficiently addresses the outlier challenge. We observe that outliers typically appear in only one of two dimensions, which are rotated together by a specific angle when rotary position embeddings are applied. When represented as two-dimensional vectors, these dimensions exhibit well-structured patterns, with radii and angles smoothly distributed in polar coordinates. This alleviates the outlier problem that affects per-channel quantization, making these sub-vectors well suited to quantization. Thus, PolarQuant divides key vectors into groups of two-dimensional sub-vectors, encoding them as the corresponding quantized radius and polar angle, rather than quantizing the original key vectors directly. PolarQuant achieves superior efficiency in KV cache quantization and accelerates the decoding process by turning the query-key inner product into a table lookup, all while maintaining the downstream performance of full-precision models.
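The encoding described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes consecutive dimensions form each two-dimensional sub-vector, uniform quantization grids, and a known radius bound `r_max`. The function names and the `r_bits` and `theta_bits` parameters are hypothetical.

```python
import math

def polar_quantize(key, r_max, r_bits=4, theta_bits=4):
    """Encode consecutive (x, y) pairs of `key` as (radius code, angle code).

    Assumption: pairs are consecutive dimensions and radii lie in [0, r_max].
    """
    assert len(key) % 2 == 0 and r_max > 0
    r_levels = (1 << r_bits) - 1      # largest radius code
    t_levels = 1 << theta_bits        # number of angle bins
    codes = []
    for i in range(0, len(key), 2):
        x, y = key[i], key[i + 1]
        r = math.hypot(x, y)
        theta = math.atan2(y, x) % (2 * math.pi)   # map angle to [0, 2*pi)
        r_code = min(r_levels, round(r / r_max * r_levels))
        t_code = int(theta / (2 * math.pi) * t_levels) % t_levels
        codes.append((r_code, t_code))
    return codes

def polar_dequantize(codes, r_max, r_bits=4, theta_bits=4):
    """Reconstruct an approximate key from (radius code, angle code) pairs."""
    r_levels = (1 << r_bits) - 1
    t_levels = 1 << theta_bits
    out = []
    for r_code, t_code in codes:
        r = r_code / r_levels * r_max
        theta = (t_code + 0.5) / t_levels * 2 * math.pi   # centre of the angle bin
        out.extend((r * math.cos(theta), r * math.sin(theta)))
    return out

def build_tables(query, theta_bits=4):
    """For each query pair, precompute its projection onto every angle-bin centre."""
    t_levels = 1 << theta_bits
    angles = [(t + 0.5) / t_levels * 2 * math.pi for t in range(t_levels)]
    return [
        [query[i] * math.cos(a) + query[i + 1] * math.sin(a) for a in angles]
        for i in range(0, len(query), 2)
    ]

def lookup_score(tables, codes, r_max, r_bits=4):
    """Approximate query-key inner product using table lookups on angle codes."""
    r_levels = (1 << r_bits) - 1
    return sum(
        (r_code / r_levels * r_max) * tables[j][t_code]
        for j, (r_code, t_code) in enumerate(codes)
    )
```

In this sketch the tables depend only on the query, so they can be built once per decoding step and reused for every cached key; each key then contributes one lookup and one multiplication per sub-vector.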

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2502.00527

Country: Asia (0.28)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Exact Fit Attention in Node-Holistic Graph Convolutional Network for Improved EEG-Based Driver Fatigue Detection

Xu, Meiyan, Chen, Qingqing, Chen, Duo, Ding, Yi, Wang, Jingyuan, Gu, Peipei, Pan, Yijie, Huang, Deshuang, Zhang, Xun, Guo, Jiayang

arXiv.org Artificial Intelligence, Jan-24-2025

EEG-based fatigue monitoring can effectively reduce the incidence of related traffic accidents. In the past decade, with the advancement of deep learning, convolutional neural networks (CNNs) have been increasingly used for EEG signal processing. However, due to the data's non-Euclidean characteristics, existing CNNs may lose important spatial information from EEG, specifically channel correlation. Thus, we propose the node-holistic graph convolutional network (NHGNet), a model that uses graph convolution to dynamically learn each channel's features. The interpretability is enhanced by revealing critical areas of brain activity and their interrelations in various mental states. In validations on two public datasets, NHGNet outperforms state-of-the-art methods. Specifically, in the intra-subject setting, NHGNet improved detection accuracy by at least 2.34% and 3.42%, and in the inter-subject setting, it improved by at least 2.09% and 15.06%. Visualization research on the model revealed that the central parietal area plays an important role in detecting fatigue levels, whereas the frontal and temporal lobes are essential for maintaining vigilance. Duo Chen is with the School of Artificial Intelligence and Information Technology, Nanjing University of Chinese Medicine, Nanjing 210023, China (e-mail: 380013@njucm.edu.cn). Yi Ding is with the College of Computing and Data Science, Nanyang Technological University, Singapore.

artificial intelligence, machine learning, spatial reasoning, (18 more...)

arXiv.org Artificial Intelligence

2501.15062

Country: Asia > China (0.90)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.89)

Semantic Convergence: Harmonizing Recommender Systems via Two-Stage Alignment and Behavioral Semantic Tokenization

Li, Guanghan, Zhang, Xun, Zhang, Yufei, Yin, Yifan, Yin, Guojun, Lin, Wei

arXiv.org Artificial Intelligence, Dec-18-2024

Large language models (LLMs), endowed with exceptional reasoning capabilities, are adept at discerning profound user interests from historical behaviors, thereby presenting a promising avenue for the advancement of recommendation systems. However, a notable discrepancy persists between the sparse collaborative semantics typically found in recommendation systems and the dense token representations within LLMs. In our study, we propose a novel framework that harmoniously merges traditional recommendation models with the prowess of LLMs. We initiate this integration by transforming ItemIDs into sequences that align semantically with the LLM space, through the proposed Alignment Tokenization module. Additionally, we design a series of specialized supervised learning tasks aimed at aligning collaborative signals with the subtleties of natural language semantics. To ensure practical applicability, we optimize online inference by pre-caching the top-K results for each user, reducing latency and improving efficiency. Extensive experimental evidence indicates that our model markedly improves recall metrics and displays remarkable scalability of recommendation systems.
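The pre-caching step mentioned in this abstract can be sketched roughly as follows. This is an illustration only, assuming per-user item scores are already available offline as a dictionary; the names `build_topk_cache` and `recommend` are hypothetical and do not come from the paper.

```python
import heapq

def build_topk_cache(scores, k):
    """Precompute the k highest-scoring items for every user (offline step).

    `scores` maps user -> {item: score}; this layout is an assumption.
    """
    return {
        user: [item for item, _ in
               heapq.nlargest(k, item_scores.items(), key=lambda kv: kv[1])]
        for user, item_scores in scores.items()
    }

def recommend(cache, user):
    """Online step: a dictionary lookup instead of a model call."""
    return cache.get(user, [])

# Usage with made-up scores.
scores = {
    "u1": {"a": 0.1, "b": 0.9, "c": 0.5},
    "u2": {"a": 0.7, "b": 0.2, "c": 0.3},
}
cache = build_topk_cache(scores, k=2)
```

The trade-off in this kind of design is freshness: cached lists stay fixed until the offline job runs again.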

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2412.13771

Country: Asia > China (0.14)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Optimizing Indoor Navigation Policies For Spatial Distancing

Zhang, Xun, Schwartz, Mathew, Usman, Muhammad, Faloutsos, Petros, Kapadia, Mubbasir

arXiv.org Artificial Intelligence, Jun-4-2022

In this paper, we focus on the modification of policies that can lead to movement patterns and directional guidance of occupants, which are represented as agents in a 3D simulation engine. We demonstrate an optimization method that improves a spatial distancing metric by modifying the navigation graph, and we introduce a measure of the spatial distancing of agents as a function of agent density (i.e., occupancy). Our optimization framework uses such metrics as the target function and takes a hybrid approach that combines a genetic algorithm with simulated annealing. We show that within our framework, the simulation-optimization process can help to improve spatial distancing between agents by optimizing the navigation policies for a given indoor environment.

artificial intelligence, machine learning, simulation, (16 more...)

arXiv.org Artificial Intelligence

2207.0886

Country: North America > United States > New Jersey (0.28)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (1.00)
Construction & Engineering (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

A Unifying Framework for Variance Reduction Algorithms for Finding Zeroes of Monotone Operators

Zhang, Xun, Haskell, William B., Ye, Zhisheng

arXiv.org Machine Learning, Jun-22-2019

A wide range of optimization problems can be recast as monotone inclusion problems. We propose a unifying framework for solving the monotone inclusion problem with randomized Forward-Backward algorithms. Our framework covers many existing deterministic and stochastic algorithms. Under various conditions, we can establish both sublinear and linear convergence rates in expectation for the algorithms covered by this framework. In addition, we consider algorithm design as well as asynchronous randomized Forward algorithms. Numerical experiments demonstrate the worth of the new algorithms that emerge from our framework.
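For orientation, the basic deterministic Forward-Backward iteration that such frameworks build on can be sketched on a toy problem. This is not one of the paper's randomized or variance-reduced algorithms. It assumes the inclusion 0 ∈ ∇f(x) + ∂g(x) with f(x) = ½‖x − b‖² and g(x) = λ‖x‖₁, for which the backward step is soft-thresholding; the function names and the step size are illustrative.

```python
def soft_threshold(v, t):
    """Resolvent (proximal map) of t * |.|, used as the backward step."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0

def forward_backward(b, lam, step=0.5, iters=200):
    """Iterate x <- prox_{step*g}(x - step * grad_f(x)) for the toy problem.

    grad_f(x) = x - b is 1-Lipschitz, so any step in (0, 2) converges here.
    """
    x = [0.0] * len(b)
    for _ in range(iters):
        x = [
            soft_threshold(xi - step * (xi - bi), step * lam)  # forward, then backward
            for xi, bi in zip(x, b)
        ]
    return x

# The fixed point is soft_threshold(b_i, lam) in each coordinate.
solution = forward_backward([3.0, -0.5, -2.0], lam=1.0)
```

A randomized variant would replace the full forward step with a cheaper stochastic estimate of it, which is where variance reduction comes in.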

algorithm, artificial intelligence, optimization problem, (17 more...)

arXiv.org Machine Learning

1906.09437

Country:

North America > United States (0.46)
Asia (0.46)
Europe (0.45)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Dynamic and Accelerated Partial Order Planning for Interactive Narratives

Zhang, Xun (Rutgers University) | Inampudi, Bhuvana C. (Rutgers University) | Badler, Norman I. (University of Pennsylvania) | Kapadia, Mubbasir (Rutgers University)

AAAI Conferences, Oct-1-2017

This paper explores new narrative generation paradigms for open world problems. We propose a sped-up variant of the partial planner, the accelerated partial order planner, which can automatically generate narratives for large plan spaces. To incorporate real-time free-form user interaction, we introduce a dynamic partial planning technique that self-repairs the narratives. We also propose a scalable and robust framework to craft open world narratives with minimal effort. Our approach enables content creators to craft complex open world narratives without explicitly authoring user interaction arcs. We tested our framework by developing multiple narratives with free-form interactions. Those narratives were used to test the robustness of the proposed planners.

accelerated partial order planning, interactive narrative

AAAI Conferences

Thirteenth Artificial Intelligence and Interactive Digital Entertainment Conference

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.69)
