Generative AI
Robotic hand made by Elon Musk's OpenAI learns to solve Rubik's Cube
Last year we were amazed by the level of dexterity achieved by OpenAI's Dactyl system which was able to learn how to manipulate a cube block to display any commanded side/face.If you missed that article, read about it here. OpenAI then set themselves a harder task of teaching the robotic hand to solve a Rubik's cube. Quite a daunting task made no easier by the fact that it would use one hand which most humans would find it hard to do. OpenAI harnessed the power of neural networks which are trained entirely in simulation. However, one of the main challenges faced was to make the simulations as realistic as possible because physical factors like friction, elasticity etc. are very hard to model.
Concept Saliency Maps to Visualize Relevant Features in Deep Generative Models
Brocki, Lennart, Chung, Neo Christopher
--Evaluating, explaining, and visualizing high-level concepts in generative models, such as variational autoencoders (V AEs), is challenging in part due to a lack of known prediction classes that are required to generate saliency maps in supervised learning. While saliency maps may help identify relevant features (e.g., pixels) in the input for classification tasks of deep neural networks, similar frameworks are understudied in unsupervised learning. Therefore, we introduce a new method of obtaining saliency maps for latent representations of known or novel high-level concepts, often called concept vectors in generative models. Concept scores, analogous to class scores in classification tasks, are defined as dot products between concept vectors and encoded input data, which can be readily used to compute the gradients. The resulting concept saliency maps are shown to highlight input features deemed important for high-level concepts. Our method is applied to the V AE's latent space of CelebA dataset in which known attributes such as "smiles" and "hats" are used to elucidate relevant facial features. Furthermore, our application to spatial transcriptomic (ST) data of a mouse olfactory bulb demonstrates the potential of latent representations of morphological layers and molecular features in advancing our understanding of complex biological systems. By extending the popular method of saliency maps to generative models, the proposed concept saliency maps help improve interpretability of latent variable models in deep learning. I NTRODUCTION A rapidly increasing amount of unlabeled data, such as images and molecular data, has prompted a rise of deep generative models, that can be trained without human supervision. By using a vast amount of unlabeled data, unsupervised learning models such as variational autoencoders (V AEs) [1], [2] extract low-dimensional latent spaces that compactly encode high-dimensional input data and potentially reveal hidden relationships.
A robot hand taught itself to solve a Rubik's Cube after creating its own training regime
Over a year ago, OpenAI, the San Franciscoโbased for-profit AI research lab, announced that it had trained a robotic hand to manipulate a cube with remarkable dexterity. That might not sound earth-shattering. But in the AI world, it was impressive for two reasons. First, the hand had taught itself how to fidget with the cube using a reinforcement-learning algorithm, a technique modeled on the way animals learn. Second, all the training had been done in simulation, but it managed to successfully translate to the real world.
Wavelets to the Rescue: Improving Sample Quality of Latent Variable Deep Generative Models
Gyawali, Prashnna K, Saha, Rudra, Wang, Linwei, Veeravasarapu, VSR, Singh, Maneesh
Variational Autoencoders (VAE) are probabilistic deep generative models underpinned by elegant theory, stable training processes, and meaningful manifold representations. However, they produce blurry images due to a lack of explicit emphasis over high-frequency textural details of the images, and the difficulty to directly model the complex joint probability distribution over the high-dimensional image space. In this work, we approach these two challenges with a novel wavelet space VAE that uses the decoder to model the images in the wavelet coefficient space. This enables the VAE to emphasize over high-frequency components within an image obtained via wavelet decomposition. Additionally, by decomposing the complex function of generating high-dimensional images into inverse wavelet transformation and generation of wavelet coefficients, the latter becomes simpler to model by the VAE. We empirically validate that deep generative models operating in the wavelet space can generate images of higher quality than the image (RGB) space counterparts. Quantitatively, on benchmark natural image datasets, we achieve consistently better FID scores than VAE based architectures and competitive FID scores with a variety of GAN models for the same architectural and experimental setup. Furthermore, the proposed wavelet-based generative model retains desirable attributes like disentangled and informative latent representation without losing the quality in the generated samples.
Artificial Intelligence Research Needs Responsible Publication Norms
After nearly a year of suspense and controversy, any day now the team of artificial intelligence (AI) researchers at OpenAI will release the full and final version of GPT-2, a language model that can "generate coherent paragraphs and perform rudimentary reading comprehension, machine translation, question answering, and summarization--all without task-specific training." When OpenAI first unveiled the program in February, it was capable of impressive feats: Given a two-sentence prompt about unicorns living in the Andes Mountains, for example, the program produced a coherent nine-paragraph news article. At the time, the technical achievement was newsworthy--but it was how OpenAI chose to release the new technology that really caused a firestorm. There is a prevailing norm of openness in the machine learning research community, consciously created by early giants in the field: Advances are expected to be shared, so that they can be evaluated and so that the entire field advances. However, in February, OpenAI opted for a more limited release due to concerns that the program could be used to generate misleading news articles; impersonate people online; or automate the production of abusive, fake or spam content.
Why a robot that can 'solve' Rubik's Cube one-handed has the AI community at war
OpenAI, a non-profit co-founded by Elon Musk, recently unveiled its newest trick: A robot hand that can'solve' Rubik's Cube. Whether this is a feat of science or mere prestidigitation is a matter of some debate in the AI community right now. In case you missed it, OpenAI posted an article on its blog last week titled "Solving Rubik's Cube With a Robot Hand." Based on this title, you'd be forgiven if you thought the research discussed in said article was about solving Rubik's Cube with a robot hand. Don't get me wrong, OpenAI created a software and machine learning pipeline by which a robot hand can physically manipulate a Rubik's Cube from an'unsolved' state to a solved one. But the truly impressive bit here is that a robot hand can hold an object and move it around (to accomplish a goal) without dropping it.
This robot can now solve a Rubik's cube with one hand
Once again, a robot can do something I cannot do. Researchers at the artificial intelligence lab OpenAI just revealed that its humanoid robotic hand can solve a Rubik's cube. The researchers utilized a pair of neural networks to make it happen. The team has been working on this project, named Dactyl, since the middle of 2017, and they felt showing their robotic hand could solve a Rubik's cube would show it had adequate dexterity. It can now solve the cube about 60 percent of the time.
OpenAI's AI-powered robot learned how to solve a Rubik's cube one-handed
Artificial intelligence research organization OpenAI has achieved a new milestone in its quest to build general purpose, self-learning robots. The group's robotics division says Dactyl, its humanoid robotic hand first developed last year, has learned to solve a Rubik's cube one-handed. OpenAI sees the feat as a leap forward both for the dexterity of robotic appendages and its own AI software, which allows Dactyl to learn new tasks using virtual simulations before it is presented with a real, physical challenge to overcome. In a demonstration video showcasing Dactyl's new talent, we can see the robotic hand fumble its way toward a complete cube solve with clumsy yet accurate maneuvers. It takes many minutes, but Dactyl is eventually able to solve the puzzle.
Opinion: How companies can prepare for the disruptive power of AI
U.S.-based artificial intelligence research organization OpenAI have rolled out a robot hand that can take and solve a Rubik's Cube. Joshua Gans is a professor of Strategic Management at the Rotman School of Management and the chief economist at the Creative Destruction Lab. Tiff Macklem is dean of Rotman School of Management at the University of Toronto. Last week, the U.S.-based artificial intelligence research organization, OpenAI, rolled out a robot hand that can take and solve a Rubik's Cube. Creating a robot with visual sense and complex touch and dexterity is an impressive achievement in AI.