SC2Net: Sparse LSTMs for Sparse Coding
Zhou, Joey Tianyi (Institute of High Performance Computing, A*STAR) | Di, Kai (Institute of High Performance Computing, A*STAR) | Du, Jiawei (Institute of High Performance Computing, A*STAR) | Peng, Xi (College of Computer Science, Sichuan University) | Yang, Hao (Amazon, Seattle) | Pan, Sinno Jialin (Nanyang Technological University) | Tsang, Ivor W. (University of Technology Sydney) | Liu, Yong (Institute of High Performance Computing, A*STAR) | Qin, Zheng (Institute of High Performance Computing, A*STAR) | Goh, Rick Siow Mong (Institute of High Performance Computing, A*STAR)
The iterative shrinkage-thresholding algorithm (ISTA) is one of the most popular optimization solvers for computing sparse codes. However, ISTA suffers from the following problems: 1) it employs a non-adaptive updating strategy, learning the parameters on each dimension with a fixed learning rate, which may lead to inferior performance due to the lack of diversity across dimensions; 2) it does not incorporate historical information into its updating rules, even though such information has been proven helpful for speeding up convergence. To address these issues, we propose a novel formulation of ISTA (termed adaptive ISTA) by introducing an \textit{adaptive momentum vector}. To solve the adaptive ISTA efficiently, we recast it as a recurrent neural network unit and show its connection with the well-known long short-term memory (LSTM) model. Building on this unit, we present a neural network (termed SC2Net) that computes sparse codes in an end-to-end manner. To the best of our knowledge, this is one of the first works to bridge the $\ell_1$-solver and LSTM, and it may provide new insights into model-based optimization and LSTM. Extensive experiments demonstrate the effectiveness of our method on both unsupervised and supervised tasks.
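For readers unfamiliar with the baseline the abstract criticises, the sketch below shows vanilla ISTA's fixed-step update for the lasso objective $\min_z \tfrac{1}{2}\|x - Dz\|_2^2 + \lambda\|z\|_1$, alongside a hand-rolled heavy-ball momentum variant. The helper names (`soft_threshold`, `momentum_ista`) and the fixed scalar `beta` are illustrative assumptions only; the paper's adaptive momentum vector is per-dimension and learned end-to-end inside SC2Net, not hand-set as here.

```python
import numpy as np

def soft_threshold(v, tau):
    """Element-wise soft-thresholding: the proximal operator of the l1 norm."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def ista(x, D, lam=0.1, n_iter=100):
    """Vanilla ISTA for min_z 0.5*||x - D z||^2 + lam*||z||_1.

    Every coordinate shares the same fixed step size 1/L -- the
    non-adaptive behaviour the abstract points out.
    """
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    z = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ z - x)           # gradient of the smooth term
        z = soft_threshold(z - grad / L, lam / L)
    return z

def momentum_ista(x, D, lam=0.1, beta=0.9, n_iter=100):
    """Illustrative momentum variant (an assumption, not the paper's method):
    a momentum vector carries historical gradient information across
    iterations, standing in for the learned adaptive momentum of SC2Net.
    """
    L = np.linalg.norm(D, 2) ** 2
    z = np.zeros(D.shape[1])
    m = np.zeros_like(z)                   # per-dimension momentum vector
    for _ in range(n_iter):
        grad = D.T @ (D @ z - x)
        m = beta * m + grad / L            # accumulate historical updates
        z = soft_threshold(z - m, lam / L)
    return z

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    D = rng.standard_normal((64, 128))
    z_true = np.zeros(128)
    z_true[:5] = rng.standard_normal(5)    # sparse ground truth
    x = D @ z_true
    print("nonzeros:", np.count_nonzero(np.abs(ista(x, D)) > 1e-6))
```

Unrolling a fixed number of such iterations as network layers, with the step size and momentum replaced by trainable per-dimension parameters, is what turns the solver into the LSTM-like recurrent unit described above.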