AITopics | Thomas, Anna

Learning Compressed Transforms with Low Displacement Rank

Thomas, Anna, Gu, Albert, Dao, Tri, Rudra, Atri, Ré, Christopher

Neural Information Processing SystemsDec-31-2018

The low displacement rank (LDR) framework for structured matrices represents a matrix through two displacement operators and a low-rank residual. Existing use of LDR matrices in deep learning has applied fixed displacement operators encoding forms of shift invariance akin to convolutions. We introduce a rich class of LDR matrices with more general displacement operators, and explicitly learn over both the operators and the low-rank component. This class generalizes several previous constructions while preserving compression and efficient computation. We prove bounds on the VC dimension of multi-layer neural networks with structured weight matrices and show empirically that our compact parameterization can reduce the sample complexity of learning. When replacing weight layers in fully-connected, convolutional, and recurrent neural networks for image classification and language modeling tasks, our new classes exceed the accuracy of existing compression approaches, and on some tasks even outperform general unstructured layers while using more than 20x fewer parameters.

artificial intelligence, machine learning, matrix, (20 more...)

Neural Information Processing Systems

Country: North America > Canada (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning Compressed Transforms with Low Displacement Rank

Thomas, Anna, Gu, Albert, Dao, Tri, Rudra, Atri, Ré, Christopher

Neural Information Processing SystemsDec-31-2018

The low displacement rank (LDR) framework for structured matrices represents a matrix through two displacement operators and a low-rank residual. Existing use of LDR matrices in deep learning has applied fixed displacement operators encoding forms of shift invariance akin to convolutions. We introduce a rich class of LDR matrices with more general displacement operators, and explicitly learn over both the operators and the low-rank component. This class generalizes several previous constructions while preserving compression and efficient computation. We prove bounds on the VC dimension of multi-layer neural networks with structured weight matrices and show empirically that our compact parameterization can reduce the sample complexity of learning. When replacing weight layers in fully-connected, convolutional, and recurrent neural networks for image classification and language modeling tasks, our new classes exceed the accuracy of existing compression approaches, and on some tasks even outperform general unstructured layers while using more than 20x fewer parameters.

deep learning, matrix, neural network, (19 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
North America > United States > New York > New York County > New York City (0.14)
North America > Canada > Ontario > Toronto (0.14)

Industry: Semiconductors & Electronics (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Neurally-Guided Procedural Models: Amortized Inference for Procedural Graphics Programs using Neural Networks

Ritchie, Daniel, Thomas, Anna, Hanrahan, Pat, Goodman, Noah

Neural Information Processing SystemsDec-31-2016

Probabilistic inference algorithms such as Sequential Monte Carlo (SMC) provide powerful tools for constraining procedural models in computer graphics, but they require many samples to produce desirable results. In this paper, we show how to create procedural models which learn how to satisfy constraints. We augment procedural models with neural networks which control how the model makes random choices based on the output it has generated thus far. We call such models neurally-guided procedural models. As a pre-computation, we train these models to maximize the likelihood of example outputs generated via SMC. They are then used as efficient SMC importance samplers, generating high-quality results with very few samples. We evaluate our method on L-system-like models with image-based constraints. Given a desired quality threshold, neurally-guided models can generate satisfactory results up to 10x faster than unguided models.

artificial intelligence, neural network, procedural model, (14 more...)

Neural Information Processing Systems

Country: Europe > Spain (0.14)

Genre:

Research Report (0.46)
Instructional Material (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Neurally-Guided Procedural Models: Amortized Inference for Procedural Graphics Programs using Neural Networks

Ritchie, Daniel, Thomas, Anna, Hanrahan, Pat, Goodman, Noah D.

arXiv.org Artificial IntelligenceOct-13-2016

Probabilistic inference algorithms such as Sequential Monte Carlo (SMC) provide powerful tools for constraining procedural models in computer graphics, but they require many samples to produce desirable results. In this paper, we show how to create procedural models which learn how to satisfy constraints. We augment procedural models with neural networks which control how the model makes random choices based on the output it has generated thus far. We call such models neurally-guided procedural models. As a pre-computation, we train these models to maximize the likelihood of example outputs generated via SMC. They are then used as efficient SMC importance samplers, generating high-quality results with very few samples. We evaluate our method on L-system-like models with image-based constraints. Given a desired quality threshold, neurally-guided models can generate satisfactory results up to 10x faster than unguided models.

artificial intelligence, neural network, procedural model, (15 more...)

arXiv.org Artificial Intelligence

1603.06143

Country: Europe > Spain (0.14)

Technology: