AITopics | Xiaohan Chen

Collaborating Authors

Xiaohan Chen

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Theoretical Linear Convergence of Unfolded ISTA and Its Practical Weights and Thresholds

Xiaohan Chen, Jialin Liu, Zhangyang Wang, Wotao Yin

Neural Information Processing SystemsMar-27-2025, 00:32:22 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, lista, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.28)
North America > United States > California (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Can We Gain More from Orthogonality Regularizations in Training Deep Networks?

Nitin Bansal, Xiaohan Chen, Zhangyang Wang

Neural Information Processing SystemsMar-26-2025, 22:52:13 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, regularization, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.28)

Genre: Research Report (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

E2-Train: Training State-of-the-art CNNs with Over 80% Energy Savings

Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, Zhangyang Wang

Neural Information Processing SystemsMar-23-2025, 21:23:39 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, neural network, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

E2-Train: Training State-of-the-art CNNs with Over 80% Energy Savings

Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, Zhangyang Wang

Neural Information Processing SystemsJan-24-2025, 08:01:54 GMT

Convolutional neural networks (CNNs) have been increasingly deployed to edge devices. Hence, many efforts have been made towards efficient CNN inference in resource-constrained platforms. This paper attempts to explore an orthogonal direction: how to conduct more energy-efficient training of CNNs, so as to enable on-device training? We strive to reduce the energy cost during training, by dropping unnecessary computations, from three complementary levels: stochastic mini-batch dropping on the data level; selective layer update on the model level; and sign prediction for low-cost, low-precision back-propagation, on the algorithm level. Extensive simulations and ablation studies, with real energy measurements from an FPGA board, confirm the superiority of our proposed strategies and demonstrate remarkable energy savings for training. For example, when training ResNet-74 on CIFAR-10, we achieve aggressive energy savings of >90% and >60%, while incurring a top-1 accuracy loss of only about 2% and 1.2%, respectively. When training ResNet-110 on CIFAR-100, an over 84% training energy saving is achieved without degrading inference accuracy.

artificial intelligence, machine learning, neural network, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Industry: Education > Educational Setting (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

Theoretical Linear Convergence of Unfolded ISTA and Its Practical Weights and Thresholds

Xiaohan Chen, Jialin Liu, Zhangyang Wang, Wotao Yin

Neural Information Processing SystemsOct-8-2024, 04:41:31 GMT

In recent years, unfolding iterative algorithms as neural networks has become an empirical success in solving sparse recovery problems. However, its theoretical understanding is still immature, which prevents us from fully utilizing the power of neural networks. In this work, we study unfolded ISTA (Iterative Shrinkage Thresholding Algorithm) for sparse signal recovery. We introduce a weight structure that is necessary for asymptotic convergence to the true sparse signal. With this structure, unfolded ISTA can attain a linear convergence, which is better than the sublinear convergence of ISTA/FISTA in general cases. Furthermore, we propose to incorporate thresholding in the network to perform support selection, which is easy to implement and able to boost the convergence rate both theoretically and empirically. Extensive simulations, including sparse vector recovery and a compressive sensing experiment on real image data, corroborate our theoretical results and demonstrate their practical usefulness. We have made our codes publicly available.

artificial intelligence, lista, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.28)
North America > United States > California (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Can We Gain More from Orthogonality Regularizations in Training Deep Networks?

Nitin Bansal, Xiaohan Chen, Zhangyang Wang

Neural Information Processing SystemsOct-8-2024, 02:32:18 GMT

This paper seeks to answer the question: as the (near-) orthogonality of weights is found to be a favorable property for training deep convolutional neural networks, how can we enforce it in more effective and easy-to-use ways? We develop novel orthogonality regularizations on training deep CNNs, utilizing various advanced analytical tools such as mutual coherence and restricted isometry property. These plug-and-play regularizations can be conveniently incorporated into training almost any CNN without extra hassle.

artificial intelligence, machine learning, regularization, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.28)

Genre: Research Report (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback