This Tool Probes Frontier AI Models for Lapses in Intelligence
Executives at artificial intelligence companies may like to tell us that AGI is almost here, but the latest models still need some additional tutoring to help them be as clever as they can. Scale AI, a company that's played a key role in helping frontier AI firms build advanced models, has developed a platform that can automatically test a model across thousands of benchmarks and tasks, pinpoint weaknesses, and flag additional training data that ought to help enhance its skills. Scale, of course, will supply the data required. Scale rose to prominence providing human labor for training and testing advanced AI models. Large language models (LLMs) are trained on oodles of text scraped from books, the web, and other sources.
Databricks Has a Trick That Lets AI Models Improve Themselves
Databricks, a company that helps big businesses build custom artificial intelligence models, has developed a machine learning trick that can boost the performance of an AI model without the need for clean labeled data. Jonathan Frankle, chief AI scientist at Databricks, spent the past year talking to customers about the key challenges they face in getting AI to work reliably. The problem, Frankle says, is dirty data. "Everybody has some data, and has an idea of what they want to do," Frankle says. But the lack of clean data makes it challenging to fine-tune a model to perform a specific task. "Nobody shows up with nice, clean fine-tuning data that you can stick into a prompt or an [application programming interface]" for a model.
Inside the Creation of DBRX, the World's Most Powerful Open Source AI Model
This past Monday, about a dozen engineers and executives at data science and AI company Databricks gathered in conference rooms connected via Zoom to learn if they had succeeded in building a top artificial intelligence language model. The team had spent months, and about $10 million, training DBRX, a large language model similar in design to the one behind OpenAI's ChatGPT. But they wouldn't know how powerful their creation was until results came back from the final tests of its abilities. "We've surpassed everything," Jonathan Frankle, chief neural network architect at Databricks and leader of the team that built DBRX, eventually told the team, which responded with whoops, cheers, and applause emojis. Frankle usually steers clear of caffeine but was taking sips of iced latte after pulling an all-nighter to write up the results.
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)
Shrinking massive neural networks used to model language
You don't need a sledgehammer to crack a nut. Jonathan Frankle is researching artificial intelligence -- not noshing pistachios -- but the same philosophy applies to his "lottery ticket hypothesis." It posits that, hidden within massive neural networks, leaner subnetworks can complete the same task more efficiently. The trick is finding those "lucky" subnetworks, dubbed winning lottery tickets. In a new paper, Frankle and colleagues discovered such subnetworks lurking within BERT, a state-of-the-art neural network approach to natural language processing (NLP). As a branch of artificial intelligence, NLP aims to decipher and analyze human language, with applications like predictive text generation or online chatbots.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)
- North America > United States > Texas > Travis County > Austin (0.05)
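
The excerpt above states the hypothesis but not the recipe for finding winning tickets. The standard procedure from Frankle and Carbin's original lottery-ticket work is iterative magnitude pruning: train the network, prune its smallest-magnitude weights, rewind the surviving weights to their original initialization, and repeat. The PyTorch sketch below is a minimal, hypothetical illustration of that loop; the toy two-layer network, random data, and hyperparameters are stand-ins, not anything from the paper.

```python
# A minimal sketch of iterative magnitude pruning for finding "winning tickets":
# train, prune the smallest-magnitude weights, rewind the survivors to their
# original initialization, and repeat. Toy model and random data are placeholders.
import copy
import torch
import torch.nn as nn

def apply_mask(model, mask):
    """Zero out pruned weights so they stay removed."""
    with torch.no_grad():
        for name, p in model.named_parameters():
            if name in mask:
                p.mul_(mask[name])

def train(model, data, targets, mask, epochs=5, lr=0.1):
    """Train while keeping pruned weights frozen at zero via the mask."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(model(data), targets).backward()
        opt.step()
        apply_mask(model, mask)  # re-zero pruned weights after each update

def prune_by_magnitude(model, mask, fraction=0.2):
    """Remove the smallest-magnitude surviving weights, layer by layer."""
    with torch.no_grad():
        for name, p in model.named_parameters():
            if name not in mask:
                continue
            alive = p[mask[name].bool()].abs()
            if alive.numel() == 0:
                continue
            threshold = alive.quantile(fraction)
            mask[name] = (p.abs() > threshold).float() * mask[name]
    return mask

# Toy setup: a small MLP on random data stands in for a real network/dataset.
torch.manual_seed(0)
model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
data, targets = torch.randn(256, 20), torch.randint(0, 2, (256,))

init_state = copy.deepcopy(model.state_dict())  # saved original initialization
mask = {n: torch.ones_like(p) for n, p in model.named_parameters() if p.dim() > 1}

for _ in range(3):                               # iterative pruning rounds
    train(model, data, targets, mask)
    mask = prune_by_magnitude(model, mask, fraction=0.2)
    model.load_state_dict(init_state)            # rewind weights to original init
    apply_mask(model, mask)                      # this is the candidate "ticket"

train(model, data, targets, mask)                # retrain the sparse ticket
```

The surviving mask together with the original initialization is the candidate "winning ticket"; the hypothesis claims that retraining just that sparse subnetwork from its original starting weights can recover the full network's accuracy.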
Neural Networks: Frankle's Miracle
Jonathan Frankle and Michael Carbin, of Lottery Ticket fame, and Alex Renda, have made the perfect pruner, shrinking neural networks as much as you please, without sacrificing accuracy. And the method is dead simple. When you add up the utility gained at the edge, these researchers are worth their weight in Californium. Paul Erdős, one of the greatest and by far the most prolific mathematicians of the last century, would fall in love with those proofs and theorems which, by their utter simplicity, elegance, breadth of impact, and insightful technique, must have come from "The Book." By that, Erdős meant that scroll of truths God used to make the world.
- Transportation > Passenger (0.31)
- Transportation > Ground > Road (0.31)
A Foolproof Way to Shrink Deep Learning Models
Researchers have proposed a technique for shrinking deep learning models that they say is simpler and produces more accurate results than state-of-the-art methods. Massachusetts Institute of Technology (MIT) researchers have proposed a technique for compressing deep learning models by retraining a smaller model, whose weakest connections have been "pruned," at its faster, initial rate of learning. The technique's groundwork was partly laid by the AutoML for model compression (AMC) algorithm from MIT's Song Han, which automatically removes redundant neurons and connections, and retrains the model to reinstate its initial accuracy. MIT's Jonathan Frankle and Michael Carbin determined that the pruned model could simply be retrained at its early, faster learning rate without tinkering with any other parameters. Although greater shrinkage is accompanied by reduced accuracy, the researchers found their method outperformed both AMC and Frankle's earlier weight-rewinding approach regardless of the amount of compression.
- North America > United States > Massachusetts (0.29)
- North America > United States > District of Columbia > Washington (0.09)
Researchers unveil a pruning algorithm to make artificial intelligence applications run faster
As more artificial intelligence applications move to smartphones, deep learning models are getting smaller to allow apps to run faster and save battery power. Now, MIT researchers have a new and better way to compress models. It's so simple that they unveiled it in a tweet last month: Train the model, prune its weakest connections, retrain the model at its fast, early training rate, and repeat, until the model is as tiny as you want. "That's it," says Alex Renda, a Ph.D. student at MIT. "The standard things people do to prune their models are crazy complicated." Renda discussed the technique when the International Conference on Learning Representations (ICLR) convened remotely this month.
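
For readers who want the recipe above spelled out, here is a minimal, hypothetical PyTorch sketch of it, assuming a step-decay learning-rate schedule: train the model, prune its weakest connections, retrain with the schedule rewound to its fast, early value rather than fine-tuning at the final low rate, and repeat. The toy model, data, schedule, and pruning fraction are illustrative assumptions, not the researchers' released implementation.

```python
# A minimal sketch of the prune-and-retrain recipe quoted above: train with a
# decaying learning-rate schedule, prune the weakest connections, then retrain
# with the schedule rewound to an early, fast epoch, and repeat.
import torch
import torch.nn as nn

def lr_at(epoch, base_lr=0.1, decay=0.5, step=10):
    """An assumed step-decay schedule: fast at the start, slow at the end."""
    return base_lr * (decay ** (epoch // step))

def train_epochs(model, data, targets, mask, start_epoch, end_epoch):
    """Train from start_epoch to end_epoch, keeping pruned weights at zero."""
    loss_fn = nn.CrossEntropyLoss()
    for epoch in range(start_epoch, end_epoch):
        opt = torch.optim.SGD(model.parameters(), lr=lr_at(epoch))
        opt.zero_grad()
        loss_fn(model(data), targets).backward()
        opt.step()
        with torch.no_grad():
            for name, p in model.named_parameters():
                if name in mask:
                    p.mul_(mask[name])

def prune_smallest(model, mask, fraction=0.2):
    """Prune the weakest (smallest-magnitude) surviving connections."""
    with torch.no_grad():
        for name, p in model.named_parameters():
            if name not in mask:
                continue
            alive = p[mask[name].bool()].abs()
            threshold = alive.quantile(fraction)
            mask[name] = (p.abs() > threshold).float() * mask[name]

# Toy stand-ins for a real network and dataset.
torch.manual_seed(0)
model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
data, targets = torch.randn(256, 20), torch.randint(0, 2, (256,))
mask = {n: torch.ones_like(p) for n, p in model.named_parameters() if p.dim() > 1}

total_epochs, rewind_epoch = 30, 5
train_epochs(model, data, targets, mask, 0, total_epochs)   # 1. train to completion
for _ in range(3):                                           # repeat until small enough
    prune_smallest(model, mask, fraction=0.2)                # 2. prune weakest weights
    # 3. retrain with the learning-rate schedule rewound to its fast, early value,
    #    rather than fine-tuning at the final low rate.
    train_epochs(model, data, targets, mask, rewind_epoch, total_epochs)
```

Unlike the lottery-ticket sketch earlier, the trained weights are kept here and only the learning rate is rewound, which is what makes the recipe so simple: no new hyperparameters, just a point in the original schedule to restart from.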