Deep Science: Vision plus language could yield capable AI – TechCrunch
Depending on the theory of intelligence to which you subscribe, achieving "human-level" AI will require a system that can leverage multiple modalities -- e.g., sound, vision and text -- to reason about the world. For example, when shown an image of a toppled truck and a police cruiser on a snowy freeway, a human-level AI might infer that dangerous road conditions caused an accident. Or, running on a robot, when asked to grab a can of soda from the refrigerator, it would navigate around people, furniture and pets to retrieve the can and place it within reach of the requester.

Today's systems fall well short of that kind of general reasoning, but new research shows signs of encouraging progress, from robots that can figure out the steps needed to satisfy basic commands (e.g., "get a water bottle") to text-producing systems that learn from explanations. In this revived edition of Deep Science, our weekly series about the latest developments in AI and the broader scientific field, we're covering work out of DeepMind, Google and OpenAI that makes strides toward systems that can -- if not perfectly understand the world -- solve narrow tasks like generating images with impressive robustness.
Apr-10-2022, 17:18:00 GMT