AITopics

2208.12415

Country:

Asia > South Korea > Seoul > Seoul (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Genre: Research Report (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.56)

#artificialintelligenceAug-24-2022, 22:15:48 GMT

AI And The Limits Of Language

Jacob Browning is a postdoc in NYU's Department of Computer Science working on the philosophy of AI. Yann LeCun is a Turing Award-winning machine learning researcher and an NYU Silver professor. When a Google engineer recently declared Google's AI chatbot a person, pandemonium ensued. The chatbot, LaMDA, is a large language model (LLM) that is designed to predict the likely next words to whatever lines of text it is given. Since many conversations are somewhat predictable, these systems can infer how to keep a conversation going productively. LaMDA did this so impressively that the engineer, Blake Lemoine, began to wonder about whether there was a ghost in the machine.

information, knowledge, representational schema, (16 more...)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.72)

#artificialintelligenceAug-24-2022, 21:47:45 GMT

Evaluating Diverse Knowledge Sources for Online One-shot Learning of Novel Tasks

Online autonomous agents are able to draw on a wide variety of potential sources of task knowledge; however current approaches invariably focus on only one or two. Here we investigate the challenges and impact of exploiting diverse knowledge sources to learn, in one-shot, new tasks for a simulated household mobile robot. The resulting agent, developed in the Soar cognitive architecture, uses the following sources of domain and task knowledge: interaction with the environment, task execution and planning knowledge, human natural language instruction, and responses retrieved from a large language model (GPT-3). We explore the distinct contributions of these knowledge sources and evaluate the performance of different combinations in terms of learning correct task knowledge, human workload, and computational costs. The results from combining all sources demonstrate that integration improves one-shot task learning overall in terms of computational costs and human workload.

diverse knowledge source, online one-shot learning, task knowledge, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
(2 more...)

#artificialintelligenceAug-24-2022, 15:15:14 GMT

Assembly AI offers AI-as-a-service API to ease model development

Were you unable to attend Transform 2022? Check out all of the summit sessions in our on-demand library now! Over the last decade, artificial intelligence (AI) technologies have increasingly relied on neural networks to perform pattern recognition, machine learning (ML) and prediction. However, with ML models that consist of billions of parameters, training becomes more complicated as the model is unable to fit on a single GPU. Large language models (LLMs) such as GPT-3 and Gopher cost millions of dollars and require vast amounts of computing resources, making it challenging for cash and resource-constrained organizations to enter the field.

api, fox, integration, (9 more...)

Country: North America > United States > California > San Francisco County > San Francisco (0.15)

Industry: Information Technology (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

#artificialintelligenceAug-24-2022, 08:46:48 GMT

La veille de la cybersécurité

AI adoption may be steadily rising, but a closer examination shows that most enterprise companies may not be quite ready for the big time when it comes to artificial intelligence. Recent data from Palo Alto, California-based AI unicorn SambaNova Systems, for example, shows that more than two-thirds of organizations think using artificial intelligence (AI) will cut costs by automating processes and using employees more efficiently. But only 18% are rolling out large-scale, enterprise-class AI initiatives. The rest are introducing AI individually across multiple programs, rather than risking an investment in big-picture, large-scale adoption. That will create an increasing amount of distance between companies that are AI leaders and innovators and those that fall behind, said Marshall Choy, senior vice president of product at SambaNova, which offers custom-built dataflow-as-a-service (and won VentureBeat's AI Innovation Award for Edge AI in 2021). Companies that are more mature in AI and able to invest in large-scale adoption will reap the rewards, he told VentureBeat, while the ones introducing AI across multiple programs will suffer from information and insight silos.

artificial intelligence, large-scale adoption, multiple program, (2 more...)

Country: North America > United States > California > Santa Clara County > Palo Alto (0.29)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)

#artificialintelligenceAug-24-2022, 04:50:49 GMT

Why some AI companies are securing massive funding despite economic downturn

Were you unable to attend Transform 2022? Check out all of the summit sessions in our on-demand library now! Tech startups are going through tough times as a result of a slowdown in growth capital. Investment firms are advising their portfolio companies to extend their runway. Companies are suffering from valuation markdowns and resorting to layoffs to cut costs.

ai company, economic downturn, startup, (16 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.15)
Europe > Germany (0.05)

Industry:

Banking & Finance > Capital Markets (0.72)
Banking & Finance > Trading (0.56)
Banking & Finance > Economy (0.45)
Government > Regional Government > North America Government > United States Government > FDA (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.30)

#artificialintelligenceAug-24-2022, 01:15:39 GMT

Disney-backed Inworld raises cash for AI-powered characters – TechCrunch

If software is eating the world, AI isn't far behind. AI-powered text-, art- and audio-generating systems will soon make -- and already are making -- their way into the tools people use every day, from programming environments and spellcheck plugins to concept art creation platforms. The video game industry is no exception to this, and that hardly comes as a surprise. As illustrated by games like AI Dungeon, AI -- while imperfect -- can inject surprising creativity and novelty into branching narrative storytelling. Inworld AI was founded on this premise.

gelfenbeyn, inworld, virtual character, (14 more...)

Industry: Leisure & Entertainment > Games > Computer Games (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.31)

arXiv.org Artificial IntelligenceAug-24-2022

Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer

Zhang, Fengji, Liu, Jin, Wan, Yao, Yu, Xiao, Liu, Xiao, Keung, Jacky

Stack Overflow is one of the most popular programming communities where developers can seek help for their encountered problems. Nevertheless, if inexperienced developers fail to describe their problems clearly, it is hard for them to attract sufficient attention and get the anticipated answers. We propose M$_3$NSCT5, a novel approach to automatically generate multiple post titles from the given code snippets. Developers may use the generated titles to find closely related posts and complete their problem descriptions. M$_3$NSCT5 employs the CodeT5 backbone, which is a pre-trained Transformer model having an excellent language understanding and generation ability. To alleviate the ambiguity issue that the same code snippets could be aligned with different titles under varying contexts, we propose the maximal marginal multiple nucleus sampling strategy to generate multiple high-quality and diverse title candidates at a time for the developers to choose from. We build a large-scale dataset with 890,000 question posts covering eight programming languages to validate the effectiveness of M$_3$NSCT5. The automatic evaluation results on the BLEU and ROUGE metrics demonstrate the superiority of M$_3$NSCT5 over six state-of-the-art baseline models. Moreover, a human evaluation with trustworthy results also demonstrates the great potential of our approach for real-world application.

large language model, machine learning, programming language, (18 more...)

doi: 10.1016/j.jss.2023.111672

2208.11523

Country:

Asia > China > Hubei Province > Wuhan (0.04)
Asia > China > Hong Kong (0.04)
Oceania > Australia (0.04)
Asia > India (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Software > Programming Languages (0.88)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

arXiv.org Artificial IntelligenceAug-24-2022

PEER: A Collaborative Language Model

Schick, Timo, Dwivedi-Yu, Jane, Jiang, Zhengbao, Petroni, Fabio, Lewis, Patrick, Izacard, Gautier, You, Qingfei, Nalmpantis, Christoforos, Grave, Edouard, Riedel, Sebastian

Textual content is often the output of a collaborative writing process: We start with an initial draft, ask for suggestions, and repeatedly make changes. Agnostic of this process, today's language models are trained to generate only the final result. As a consequence, they lack several abilities crucial for collaborative writing: They are unable to update existing texts, difficult to control and incapable of verbally planning or explaining their actions. To address these shortcomings, we introduce PEER, a collaborative language model that is trained to imitate the entire writing process itself: PEER can write drafts, add suggestions, propose edits and provide explanations for its actions. Crucially, we train multiple instances of PEER able to infill various parts of the writing process, enabling the use of self-training techniques for increasing the quality, amount and diversity of training data. This unlocks PEER's full potential by making it applicable in domains for which no edit histories are available and improving its ability to follow instructions, to write useful comments, and to explain its actions. We show that PEER achieves strong performance across various domains and editing tasks.

computational linguistic, language model, wikipedia, (15 more...)

2208.11663

Country:

North America > United States > California > Los Angeles County > Inglewood (0.28)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
North America > United States > Mississippi > Harrison County > Gulfport (0.04)
(16 more...)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Primus, Paul, Widmer, Gerhard

Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers

arXiv.org Artificial IntelligenceAug-24-2022

Standard machine learning models for tagging and classifying acoustic signals cannot handle classes that were not seen during training. Zero-Shot (ZS) learning overcomes this restriction by predicting classes based on adaptable class descriptions. This study sets out to investigate the effectiveness of self-attention-based audio embedding architectures for ZS learning. To this end, we compare the very recent patchout spectrogram transformer with two classic convolutional architectures. We evaluate these three architectures on three tasks and on three different benchmark datasets: general-purpose tagging on AudioSet, environmental sound classification on ESC-50, and instrument tagging on OpenMIC. Our results show that the self-attention-based embedding methods outperform both compared convolutional architectures in all of these settings. By designing training and test data accordingly, we observe that prediction performance suffers significantly when the `semantic distance' between training and new test classes is large, an effect that will deserve more detailed investigations.

architecture, classification, conf, (13 more...)

2208.11402

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.05)
Oceania > Australia > Queensland > Brisbane (0.04)
(12 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Media > Music (0.47)
Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)