Goto

Collaborating Authors

 deep generative video compression


Deep Generative Video Compression

Neural Information Processing Systems

The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video. Our approach builds upon variational autoencoder (VAE) models for sequential data and combines them with recent work on neural image compression. The approach jointly learns to transform the original sequence into a lower-dimensional representation as well as to discretize and entropy code this representation according to predictions of the sequential VAE. Rate-distortion evaluations on small videos from public data sets with varying complexity and diversity show that our model yields competitive results when trained on generic video content. Extreme compression performance is achieved when training the model on specialized content.


Reviews: Deep Generative Video Compression

Neural Information Processing Systems

Originality: - Using deep learning methods for video compression is still underexplored and poses an interesting research direction compared to current (handcrafted) methods. Furthermore, the current approach seems to be limited to a small fixed sequence length. Significance: - The combination of local and global feature is well motivated and the global feature is shown to have an significant impact on performance. However, the usability of the approach seems limited (small sequence length, global encoding of complete sequence). Superior results compared to traditional approaches were mainly achieved on special domain videos, the improvement on the diverse set Kinetics600 is relatively low and only evaluated within a small range of image quality scores.


Deep Generative Video Compression

Neural Information Processing Systems

The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video. Our approach builds upon variational autoencoder (VAE) models for sequential data and combines them with recent work on neural image compression. The approach jointly learns to transform the original sequence into a lower-dimensional representation as well as to discretize and entropy code this representation according to predictions of the sequential VAE. Rate-distortion evaluations on small videos from public data sets with varying complexity and diversity show that our model yields competitive results when trained on generic video content.


Deep Generative Video Compression

Neural Information Processing Systems

The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video. Our approach builds upon variational autoencoder (VAE) models for sequential data and combines them with recent work on neural image compression. The approach jointly learns to transform the original sequence into a lower-dimensional representation as well as to discretize and entropy code this representation according to predictions of the sequential VAE. Rate-distortion evaluations on small videos from public data sets with varying complexity and diversity show that our model yields competitive results when trained on generic video content.