Generative AI
AWS previews ultra-efficient AI instances for neural network training - SiliconANGLE
The cloud giant is introducing the Gaudi instances at an opportune time. AI models are getting more complex, partially because enterprise machine learning initiatives are maturing and partially because research conducted by the likes of OpenAI is facilitating bigger neural network architectures. As neural networks grow in complexity, the amount of computing power necessary to train them is increasing and fueling demand for more efficient training infrastructure.
Refining Deep Generative Models via Wasserstein Gradient Flows
Ansari, Abdul Fatir, Ang, Ming Liang, Soh, Harold
Deep generative modeling has seen impressive advances in recent years, to the point where it is now commonplace to see simulated samples (e.g., images) that closely resemble real-world data. However, generation quality is generally inconsistent for any given model and can vary dramatically between samples. We introduce Discriminator Gradient f low (DGf low), a new technique that improves generated samples via the gradient flow of entropy-regularized f-divergences between the real and the generated data distributions. The gradient flow takes the form of a nonlinear Fokker-Plank equation, which can be easily simulated by sampling from the equivalent McKean-Vlasov process. By refining inferior samples, our technique avoids wasteful sample rejection used by previous methods (DRS & MH-GAN). Compared to existing works that focus on specific GAN variants, we show our refinement approach can be applied to GANs with vector-valued critics and even other deep generative models such as VAEs and Normalizing Flows. Empirical results on multiple synthetic, image, and text datasets demonstrate that DGf low leads to significant improvement in the quality of generated samples for a variety of generative models, outperforming the state-of-the-art Discriminator Optimal Transport (DOT) and Discriminator Driven Latent Sampling (DDLS) methods. Deep generative models (DGMs) have excelled at numerous tasks, from generating realistic images (Brock et al., 2019) to learning policies in reinforcement learning (Ho & Ermon, 2016).
Is AI finally closing in on human intelligence?
The company OpenAI has developed an extremely powerful machine-learning system that can rapidly generate text with minimal human input. The system is known as GPT-3 and it does everything from crafting an email to writing advanced fiction. However, the FT's innovation editor, John Thornhill, explains, there are barriers and even a dark side to this tool. A transcript for this podcast is currently unavailable, view our accessibility guide.
Power of AI With Cloud Computing is "Stunning" to Microsoft's Nadella - AI Trends
The Microsoft license is exclusive however, meaning Microsoft's cloud computing competitors cannot access it in the same way. The agreement was seen as important to helping OpenAI with the expense of getting GPT-3 up and running and maintaining it, according to an account in TechTalks. These include an estimated $10 million in expenses to research GPT-3 and train the model, tens of thousands of dollars in monthly cloud computing and electricity costs to run the models, an estimated one million dollars annually to retrain the model to prevent decay, and additional costs of customer support, marketing, IT, legal and other requirements to put a software product on the market.
2020's Top AI & Machine Learning Research Papers
Despite the challenges of 2020, the AI research community produced a number of meaningful technical breakthroughs. GPT-3 by OpenAI may be the most famous, but there are definitely many other research papers worth your attention. For example, teams from Google introduced a revolutionary chatbot, Meena, and EfficientDet object detectors in image recognition. Researchers from Yale introduced a novel AdaBelief optimizer that combines many benefits of existing optimization methods. OpenAI researchers demonstrated how deep reinforcement learning techniques can achieve superhuman performance in Dota 2. To help you catch up on essential reading, we've summarized 10 important machine learning research papers from 2020. These papers will give you a broad overview of AI research advancements this year.
2020's Top AI & Machine Learning Research Papers
Despite the challenges of 2020, the AI research community produced a number of meaningful technical breakthroughs. GPT-3 by OpenAI may be the most famous, but there are definitely many other research papers worth your attention. For example, teams from Google introduced a revolutionary chatbot, Meena, and EfficientDet object detectors in image recognition. Researchers from Yale introduced a novel AdaBelief optimizer that combines many benefits of existing optimization methods. OpenAI researchers demonstrated how deep reinforcement learning techniques can achieve superhuman performance in Dota 2. To help you catch up on essential reading, we've summarized 10 important machine learning research papers from 2020. These papers will give you a broad overview of AI research advancements this year. Of course, there are many more breakthrough papers worth reading as well.
Learning a Deep Generative Model like a Program: the Free Category Prior
Humans surpass the cognitive abilities of most other animals in our ability to "chunk" concepts into words, and then combine the words to combine the concepts. In this process, we make "infinite use of finite means", enabling us to learn new concepts quickly and nest concepts within each-other. While program induction and synthesis remain at the heart of foundational theories of artificial intelligence, only recently has the community moved forward in attempting to use program learning as a benchmark task itself. The cognitive science community has thus often assumed that if the brain has simulation and reasoning capabilities equivalent to a universal computer, then it must employ a serialized, symbolic representation. Here we confront that assumption, and provide a counterexample in which compositionality is expressed via network structure: the free category prior over programs. We show how our formalism allows neural networks to serve as primitives in probabilistic programs. We learn both program structure and model parameters end-to-end.
Artificial Intelligence breakthrough: Expert 'open to idea' new AI tech 'is conscious'
Cutting-edge AI tech has been described as'conscious' by a leading philosophy of mind expert. New York University's Professor David Chalmers made the bombshell claim while discussing the highly-controversial Generative Pre-trained Transformer 3 (GPT-3) - OpenAI's powerful new language generator able to create content better than anything else ever made.
Data-driven Accelerogram Synthesis using Deep Generative Models
Florez, Manuel A., Caporale, Michaelangelo, Buabthong, Pakpoom, Ross, Zachary E., Asimaki, Domniki, Meier, Men-Andrin
Robust estimation of ground motions generated by scenario earthquakes is critical for many engineering applications. We leverage recent advances in Generative Adversarial Networks (GANs) to develop a new framework for synthesizing earthquake acceleration time histories. Our approach extends the Wasserstein GAN formulation to allow for the generation of ground-motions conditioned on a set of continuous physical variables. Our model is trained to approximate the intrinsic probability distribution of a massive set of strong-motion recordings from Japan. We show that the trained generator model can synthesize realistic 3-Component accelerograms conditioned on magnitude, distance, and $V_{s30}$. Our model captures the expected statistical features of the acceleration spectra and waveform envelopes. The output seismograms display clear P and S-wave arrivals with the appropriate energy content and relative onset timing. The synthesized Peak Ground Acceleration (PGA) estimates are also consistent with observations. We develop a set of metrics that allow us to assess the training process's stability and tune model hyperparameters. We further show that the trained generator network can interpolate to conditions where no earthquake ground motion recordings exist. Our approach allows the on-demand synthesis of accelerograms for engineering purposes.
'It's the screams of the damned!' The eerie AI world of deepfake music
The song in question not a genuine track, but a convincing fake created by "research and deployment company" OpenAI, whose Jukebox project uses artificial intelligence to generate music, complete with lyrics, in a variety of genres and artist styles. Along with Sinatra, they've done what are known as "deepfakes" of Katy Perry, Elvis, Simon and Garfunkel, 2Pac, Céline Dion and more. Having trained the model using 1.2m songs scraped from the web, complete with the corresponding lyrics and metadata, it can output raw audio several minutes long based on whatever you feed it. Input, say, Queen or Dolly Parton or Mozart, and you'll get an approximation out the other end.