AITopics

Medical term normalization consists in mapping a piece of text to a large number of output classes. Given the small size of the annotated datasets and the extremely long tail distribution of the concepts, it is of utmost importance to develop models that are capable to generalize to scarce or unseen concepts. An important attribute of most target ontologies is their hierarchical structure. In this paper we introduce a simple and effective learning strategy that leverages such information to enhance the generalizability of both discriminative and generative models. The evaluation shows that the proposed strategy produces state-of-the-art performance on seen concepts and consistent improvements on unseen ones, allowing also for efficient zero-shot knowledge transfer across text typologies and datasets.

large language model, machine learning, natural language, (18 more...)

2210.11947

Country:

Europe > Italy (0.04)
Asia > China > Hong Kong (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New Jersey (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Le, Hung, Wang, Yue, Gotmare, Akhilesh Deepak, Savarese, Silvio, Hoi, Steven C. H.

CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning

Program synthesis or code generation aims to generate a program that satisfies a problem specification. Recent approaches using large-scale pretrained language models (LMs) have shown promising results, yet they have some critical limitations. In particular, they often follow a standard supervised fine-tuning procedure to train a code generation model only from the pairs of natural-language problem descriptions and ground-truth programs. Such paradigm largely ignores some important but potentially useful signals in the problem specification such as unit tests, which thus often results in poor performance when solving complex unseen coding tasks. To address the limitations, we propose "CodeRL", a new framework for program synthesis tasks through pretrained LMs and deep reinforcement learning (RL). Specifically, during training, we treat the code-generating LM as an actor network, and introduce a critic network that is trained to predict the functional correctness of generated programs and provide dense feedback signals to the actor. During inference, we introduce a new generation procedure with a critical sampling strategy that allows a model to automatically regenerate programs based on feedback from example unit tests and critic scores. For the model backbones, we extended the encoder-decoder architecture of CodeT5 with enhanced learning objectives, larger model sizes, and better pretraining data. Our method not only achieves new SOTA results on the challenging APPS benchmark, but also shows strong zero-shot transfer capability with new SOTA results on the simpler MBPP benchmark.

large language model, machine learning, reinforcement learning, (19 more...)

2207.0178

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > Puerto Rico > San Juan > San Juan (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Hernandez-Olivan, Carlos, Hernandez-Olivan, Javier, Beltran, Jose R.

A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives

Music is one of the Gardner's intelligences in his theory of multiple intelligences. How humans perceive and understand music is still being studied and is crucial to develop artificial intelligence models that imitate such processes. Music generation with Artificial Intelligence is an emerging field that is gaining much attention in the recent years. In this paper, we describe how humans compose music and how new AI systems could imitate such process by comparing past and recent advances in the field with music composition techniques. To understand how AI models and algorithms generate music and the potential applications that might appear in the future, we explore, analyze and describe the agents that take part of the music generation process: the datasets, models, interfaces, the users and the generated music. We mention possible applications that might benefit from this field and we also propose new trends and future research directions that could be explored in the future.

large language model, machine learning, natural language, (20 more...)

2210.13944

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
(32 more...)

Genre: Research Report (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
(3 more...)

Diwan, Anuj, Peng, Puyuan, Mooney, Raymond J.

Zero-shot Video Moment Retrieval With Off-the-Shelf Models

For the majority of the machine learning community, the expensive nature of collecting high-quality human-annotated data and the inability to efficiently finetune very large state-of-the-art pretrained models on limited compute are major bottlenecks for building models for new tasks. We propose a zero-shot simple approach for one such task, Video Moment Retrieval (VMR), that does not perform any additional finetuning and simply repurposes off-the-shelf models trained on other tasks. Our three-step approach consists of moment proposal, moment-query matching and postprocessing, all using only off-the-shelf models. On the QVHighlights benchmark for VMR, we vastly improve performance of previous zero-shot approaches by at least 2.5x on all metrics and reduce the gap between zero-shot and state-of-the-art supervised by over 74%. Further, we also show that our zero-shot approach beats non-pretrained supervised models on the Recall metrics and comes very close on mAP metrics; and that it also performs better than the best pretrained supervised model on shorter moments. Finally, we ablate and analyze our results and propose interesting future directions.

large language model, machine learning, natural language, (19 more...)

2211.02178

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceNov-2-2022, 19:40:07 GMT

OpenAI will give roughly 10 AI startups $1M each and early access to its systems

OpenAI, the San Francisco-based lab behind AI systems like GPT-3 and DALL-E 2, today launched a new program to provide early-stage AI startups with capital and access to OpenAI tech and resources. Called Converge, the cohort will be financed by the OpenAI Startup Fund, OpenAI says. The $100 million entrepreneurial tranche was announced last May and was backed by Microsoft and other partners. The 10 or so founders chosen for Converge will receive $1 million each and admission to five weeks of office hours, workshops and events with OpenAI staff, as well as early access to OpenAI models and "programming tailored to AI companies." "We're excited to meet groups across all phases of the seed stage, from pre-idea solo founders to co-founding teams already working on a product," OpenAI writes in a blog post shared with TechCrunch ahead of today's announcement.

ai startup, converge, openai, (10 more...)

Country: North America > United States > California > San Francisco County > San Francisco (0.26)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

#artificialintelligenceNov-2-2022, 18:00:26 GMT

Google plans giant AI language model supporting world's 1,000 most spoken languages

By having a single model that is exposed to and trained on many different languages, we get much better performance on our low resource languages,

plan giant ai language model, world

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.59)

#artificialintelligenceNov-2-2022, 18:00:10 GMT

AlphaFold's new rival? Meta AI predicts shape of 600 million proteins

The ESM Metagenomic Atlas database contains structure predictions for 617 million proteins.Credit: ESM Metagenomic Atlas (CC BY 4.0) When London-based Deep Mind unveiled predicted structures for some 220 million proteins this year, it covered nearly every protein from known organisms in DNA databases. Now, another tech giant is filling in the dark matter of our protein universe. Researchers at Meta (formerly Facebook, headquartered in Menlo Park, California) have used artificial intelligence (AI) to predict the structures of some 600 million proteins from bacteria, viruses and other microbes that haven't been characterized. 'It will change everything': DeepMind's AI makes gigantic leap in solving protein structures "These are the structures we know the least about. These are incredibly mysterious proteins. I think they offer the potential for great insight into biology," says Alexander Rives, the research lead for Meta AI's protein team.

alphafold, prediction, protein, (16 more...)

Country:

North America > United States > California > San Mateo County > Menlo Park (0.25)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Asia > South Korea > Seoul > Seoul (0.05)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.58)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.58)
Information Technology > Artificial Intelligence > Applied AI (0.56)
Information Technology > Communications > Social Media (0.46)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.38)

#artificialintelligenceNov-2-2022, 16:15:30 GMT

A next-gen AI protein folder that could help science? Meta's good for something

AI researchers at Meta say they have developed the largest protein-folding model of its kind to date, and that it is capable of predicting the structure of more than 600 million proteins.… The team released the 15-billion-parameter ESM-2 transformer-based model and a database of its protein structure predictions, dubbed the ESM Metagenomic Atlas, on Tuesday. This database includes protein shapes that haven't been observed yet by scientists. Proteins are complex biological molecules containing up of 20 types of amino acids, and perform all sorts of biological functions in organisms. Crucially, they fold up into intricate 3D structures, the shape of which is vital to how they operate; knowing their shape helps scientists understand how they function, and from that, helps them figure out ways to mimic, alter, or counter that behavior. Unfortunately, you can't just take the amino acid formula and immediately work out the eventual structure.

language model, meta, protein, (10 more...)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education > Health & Safety > School Nutrition (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.38)

#artificialintelligenceNov-2-2022, 12:40:46 GMT

Forget chess, DeepMind's training its new AI to play football

Researchers from DeepMind, the UK's juggernaut AI lab, have forsaken the noble games of chess and Go for a more plebeian delight: football. The Google sister company yesterday published a research paper and accompanying blog post detailing its new neural probabilistic motor primitives (NPMP) -- a method by which artificial intelligence agents can learn to operate physical bodies. An NPMP is a general-purpose motor control module that translates short-horizon motor intentions to low-level control signals, and it's trained offline or via RL by imitating motion capture (MoCap) data, recorded with trackers on humans or animals performing motions of interest. Up front: Essentially, the DeepMind team created an AI system that can learn how to do things inside of a physics simulator by watching videos of other agents performing those tasks. And, of course, if you've got a giant physics engine and an endless supply of curious robots, the only rational thing to do is to teach it how to dribble and shoot: We optimized teams of agents to play simulated football via reinforcement learning, constraining the solution space to that of plausible movements learned using human motion capture data. Background: In order to train AI to operate and control robots in the world, researchers have to prepare the machines for reality.

agent, deepmind, football, (9 more...)

Industry:

Leisure & Entertainment > Sports (1.00)
Leisure & Entertainment > Games > Chess (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

#artificialintelligenceNov-2-2022, 03:55:46 GMT

AI programming tools may mean rethinking compsci education

Analysis While the legal and ethical implications of assistive AI models like GitHub's Copilot continue to be sorted out, computer scientists continue to find uses for large language models and urge educators to adapt. Brett A. Becker, assistant professor at University College Dublin in Ireland, provided The Register with pre-publication copies of two research papers exploring the educational risks and opportunities of AI tools for generating programming code. The papers have been accepted at the 2023 SIGCSE Technical Symposium on Computer Science Education, to be held March 15 to 18 in Toronto, Canada. In June, GitHub Copilot, a machine learning tool that automatically suggests programming code in response to contextual prompts, emerged from a year long technical preview, just as concerns about the way its OpenAI Codex model was trained and the implications of AI models for society coalesced into focused opposition. In "Programming Is Hard – Or at Least It Used to Be: Educational Opportunities And Challenges of AI Code Generation" [PDF], Becker and co-authors Paul Denny (University of Auckland, New Zealand), James Finnie-Ansley (University of Auckland), Andrew Luxton-Reilly (University of Auckland), James Prather (Abilene Christian University, USA), and Eddie Antonio Santos (University College Dublin) argue that the educational community needs to deal with the immediate opportunities and challenges presented by AI-driven code generation tools.

error message, explanation, university, (15 more...)

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.67)
North America > Canada > Ontario > Toronto (0.25)
North America > United States > Massachusetts > Hampshire County > Amherst (0.05)

Genre: Research Report (0.31)

Industry:

Education (0.36)
Law (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)