One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning

Guo, Zijian, Işık, İlker, Ahmad, H. M. Sabbir, Li, Wenchao

arXiv.org Artificial Intelligence

Generalizing to complex and temporally extended task objectives and safety constraints remains a critical challenge in reinforcement learning (RL). Linear temporal logic (LTL) offers a unified formalism to specify such requirements, yet existing methods are limited in their ability to handle nested long-horizon tasks and safety constraints, and cannot identify situations when a subgoal is not satisfiable and an alternative should be sought. In this paper, we introduce GenZ-LTL, a method that enables zero-shot generalization to arbitrary LTL specifications. GenZ-LTL leverages the structure of Büchi automata to decompose an LTL task specification into sequences of reach-avoid subgoals. Contrary to the current state-of-the-art method that conditions on subgoal sequences, we show that it is more effective to achieve zero-shot generalization by solving these reach-avoid problems one subgoal at a time through proper safe RL formulations. In addition, we introduce a novel subgoal-induced observation reduction technique that can mitigate the exponential complexity of subgoal-state combinations under realistic assumptions. Empirical results show that GenZ-LTL substantially outperforms existing methods in zero-shot generalization to unseen LTL specifications.
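The reach-avoid decomposition described in the abstract can be made concrete with a minimal sketch. The automaton representation, state names, and label sets below are illustrative assumptions, not the paper's code: each Büchi automaton state offers transitions of the form (reach set, avoid set, successor), and "one subgoal at a time" means the agent only conditions on the options available from its current automaton state, not on whole subgoal sequences.

```python
# Minimal sketch of reach-avoid subgoal extraction from a Büchi automaton.
# Representation is an assumption: a dict mapping each automaton state to
# a list of (reach_labels, avoid_labels, next_state) transitions.
from typing import Dict, List, Tuple, FrozenSet

Transition = Tuple[FrozenSet[str], FrozenSet[str], str]
Buchi = Dict[str, List[Transition]]

def current_subgoals(automaton: Buchi, state: str) -> List[Transition]:
    """Reach-avoid subgoals available from the current automaton state."""
    return automaton.get(state, [])

def step(automaton: Buchi, state: str, observed: FrozenSet[str]) -> str:
    """Advance the automaton given the labels observed this environment step.

    Transitions whose avoid set is hit by the observation are skipped; a
    transition whose reach set is satisfied moves us to its successor.
    """
    for reach, avoid, nxt in automaton.get(state, []):
        if observed & avoid:
            continue  # observation violates this transition's avoid set
        if reach <= observed:
            return nxt
    return state  # no subgoal completed yet; remain in the current state

# Toy spec: "reach A while avoiding B, then reach C".
spec: Buchi = {
    "q0": [(frozenset({"A"}), frozenset({"B"}), "q1")],
    "q1": [(frozenset({"C"}), frozenset(), "q_acc")],
    "q_acc": [],
}
```

Under this sketch, a policy trained to solve a single reach-avoid problem can be re-invoked at each automaton state, which is the essence of handling one subgoal at a time.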


Why do objects have many names? A study on word informativeness in language use and lexical systems

Gualdoni, Eleonora, Boleda, Gemma

arXiv.org Artificial Intelligence

Human lexicons contain many different words that speakers can use to refer to the same object, e.g., "purple" or "magenta" for the same shade of color. On the one hand, studies on language use have explored how speakers adapt their referring expressions to successfully communicate in context, without focusing on properties of the lexical system. On the other hand, studies in language evolution have discussed how competing pressures for informativeness and simplicity shape lexical systems, without tackling in-context communication. We aim at bridging the gap between these traditions, and explore why a soft mapping between referents and words is a good solution for communication, by taking into account both in-context communication and the structure of the lexicon. We propose a simple measure of informativeness for words and lexical systems, grounded in a visual space, and analyze color naming data for English and Mandarin Chinese. We conclude that optimal lexical systems are those where multiple words can apply to the same referent, conveying different amounts of information. Such systems allow speakers to maximize communication accuracy and minimize the amount of information they convey when communicating about referents in contexts.


COMMA: A Communicative Multimodal Multi-Agent Benchmark

Ossowski, Timothy, Chen, Jixuan, Maqbool, Danyal, Cai, Zefan, Bradshaw, Tyler, Hu, Junjie

arXiv.org Artificial Intelligence

The rapid advances of multi-modal agents built on large foundation models have largely overlooked their potential for language-based communication between agents in collaborative tasks. This oversight presents a critical gap in understanding their effectiveness in real-world deployments, particularly when communicating with humans. Existing agentic benchmarks fail to address key aspects of inter-agent communication and collaboration, particularly in scenarios where agents have unequal access to information and must work together to achieve tasks beyond the scope of individual capabilities. To fill this gap, we introduce a novel benchmark designed to evaluate the collaborative performance of multimodal multi-agent systems through language communication. Our benchmark features a variety of scenarios, providing a comprehensive evaluation across four key categories of agentic capability in a communicative collaboration setting. By testing both agent-agent and agent-human collaborations using open-source and closed-source models, our findings reveal surprising weaknesses in state-of-the-art models, including proprietary models like GPT-4o. These models struggle to outperform even a simple random agent baseline in agent-agent collaboration and only surpass the random baseline when a human is involved.


DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications

Jackermeier, Mathias, Abate, Alessandro

arXiv.org Artificial Intelligence

Linear temporal logic (LTL) has recently been adopted as a powerful formalism for specifying complex, temporally extended tasks in reinforcement learning (RL). However, learning policies that efficiently satisfy arbitrary specifications not observed during training remains a challenging problem. Existing approaches suffer from several shortcomings: they are often only applicable to finite-horizon fragments of LTL, are restricted to suboptimal solutions, and do not adequately handle safety constraints. In this work, we propose a novel learning approach to address these concerns. Our method leverages the structure of Büchi automata, which explicitly represent the semantics of LTL specifications, to learn policies conditioned on sequences of truth assignments that lead to satisfying the desired formulae. Experiments in a variety of discrete and continuous domains demonstrate that our approach is able to zero-shot satisfy a wide range of finite- and infinite-horizon specifications, and outperforms existing methods in terms of both satisfaction probability and efficiency.

One of the fundamental challenges in artificial intelligence (AI) is to create agents capable of following arbitrary instructions. While significant research efforts have been devoted to designing reinforcement learning (RL) agents that can complete tasks expressed in natural language (Oh et al., 2017; Goyal et al., 2019; Luketina et al., 2019), recent years have witnessed increased interest in formal languages to specify tasks in RL (Andreas et al., 2017; Camacho et al., 2019; Jothimurugan et al., 2021). Formal specification languages offer several desirable properties over natural language, such as well-defined semantics and compositionality, allowing for the specification of unambiguous, structured tasks (Vaezipoor et al., 2021; León et al., 2022).
Recent works have furthermore shown that it is possible to automatically translate many natural language instructions into a relevant specification language, providing interpretable yet precise representations of tasks, which is especially important in safety-critical domains (León et al., 2021; Pan et al., 2023; Liu et al., 2023; Cohen et al., 2024). Linear temporal logic (LTL) (Pnueli, 1977) in particular has been adopted as a powerful formalism for instructing RL agents (Hasanbeig et al., 2018; Araki et al., 2021; Voloshin et al., 2023). LTL is an appealing specification language that allows for the definition of tasks in terms of high-level features of the environment.
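The idea of "sequences of truth assignments" can be illustrated with a small sketch of finite-trace semantics for a fragment of LTL. This is not DeepLTL's implementation; the trace representation and operator names below are assumptions made for illustration. A trace is a list of sets of atomic propositions, one set per time step.

```python
# Illustrative finite-trace semantics for a small LTL fragment.
# A trace is a list of truth assignments: each step is the set of
# atomic propositions that hold at that time.
from typing import List, Set

Trace = List[Set[str]]

def eventually(trace: Trace, prop: str) -> bool:
    """F prop: prop holds at some step of the trace."""
    return any(prop in step for step in trace)

def always(trace: Trace, prop: str) -> bool:
    """G prop: prop holds at every step of the trace."""
    return all(prop in step for step in trace)

def until(trace: Trace, p: str, q: str) -> bool:
    """p U q: q eventually holds, and p holds at every step before that."""
    for step in trace:
        if q in step:
            return True
        if p not in step:
            return False
    return False

# "Stay safe until the goal is reached": safe U goal.
trace: Trace = [{"safe"}, {"safe"}, {"safe", "goal"}]
```

A policy conditioned on such assignment sequences can, in principle, be queried zero-shot for a new formula by translating the formula into the truth assignments that satisfy it, which is the role the Büchi automaton plays in the approach described above.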


How Color is Represented and Viewed in Computer Vision

#artificialintelligence

The eye is a remarkable creation, able to perceive the color of an object in an aesthetically pleasing and harmonious way. Color models are important for digital visualization.


Soft robotic device stimulates muscles, sparks hope for ALS and MS patients

Engadget

Today, muscle atrophy is often unavoidable when you can't move due to severe injury, old age or diseases like amyotrophic lateral sclerosis (ALS) and multiple sclerosis (MS). However, Harvard researchers see hope in soft robotics that could someday stretch and contract the muscles of patients unable to do so themselves. The Harvard engineers tested a new mechanostimulation system on mice, successfully preventing or assisting in their recovery from muscle atrophy. The team implanted the "soft robotic device" on a mouse's hind limb, which they immobilized in a cast-like enclosure for around two weeks. While the control group's untreated muscles wasted away as expected, the actively stimulated muscles showed reduced degradation.


Google Brain wants creative AI to help humans make "a new kind of art"

#artificialintelligence

Machine-learning algorithms aren't likely to put painters or singer-songwriters out of work anytime soon, to judge from their body of work to date. But Google Brain is developing tools that pair artists with deep-learning tools to develop novel artwork together, said Douglas Eck, senior staff scientist at the search giant's artificial-intelligence research division, during the MIT Technology Review's EmTech Digital conference on Tuesday. He hopes the platform, called Magenta, will allow people to produce completely new kinds of music and art, in much the way that keyboards, drum machines, and cameras did. Eck said that Magenta could serve a role analogous to that of Les Paul, who helped develop the modern electric guitar. But Eck said they want to keep artists in the loop to push the boundaries of the new tool in interesting ways, like a Jimi Hendrix who flips it upside down, bends the strings, and distorts the sound.


Smells like team spirit: Getting 'art' out of artificial intelligence

#artificialintelligence

Well, now we do not have to only imagine it. A project called Lost Tapes of the 27 Club, focused on mental health in the music industry, recently released a song called Drowned in the Sun. It was touted as a never-heard-before Nirvana song. Except that this song was never written by Kurt Cobain or Nirvana, nor discovered in some old musty attic years later; it was written by an artificial intelligence (AI) engine. To be more precise, it was written by a neural network trained on the entire body of Nirvana's work.


Google's AI software used to create 'new' Nirvana song 'Drowned in the Sun'

Daily Mail - Science & tech

Fans of Nirvana may do a double-take when they hear 'Drowned in the Sun,' a new song created by artificial intelligence that simulates the songwriting of late grunge legend Kurt Cobain. Engineers fed Nirvana's back catalog to Google's AI program, Magenta, which analyzed it for recurring components and then developed an entirely new track. The voice on 'Drowned in the Sun' is 100 percent human, though--provided by Eric Hogan, lead singer of the Atlanta Nirvana cover band Nevermind. The song is just one release from The Lost Tapes of the 27 Club, a project developed by the nonprofit Over the Bridge, which spotlights mental health issues in the music industry. Other AI-generated 'lost' tracks have taken their cue from Jim Morrison, Jimi Hendrix and Amy Winehouse, who, like Cobain, died at age 27.


AI software creates "new" Nirvana song "Drowned in the Sun"

#artificialintelligence

The recently launched Lost Tapes of the 27 Club project uses AI software to create songs in the style of musicians who died at the age of 27. One of the featured tracks is called "Drowned in the Sun", and it comes pretty close to replicating a Nirvana song written by Kurt Cobain himself. With opening guitars starting out restrained before reaching a crescendo on the chorus, the track is reminiscent of Nirvana's signature hit, "Come as You Are". Its chorus sounds like something Cobain might have written, too, with lyrics like, "I don't care/ I feel as one, drowned in the sun." As explained in a Rolling Stone feature, Google's AI program Magenta was used to analyze the pioneering grunge band's music and create the instrumental track.