AITopics | Large Language Model

Large Language Model

The NLP Cypher

#artificialintelligence | Dec-24-2020, 05:20:51 GMT

Around five percent of papers from the conference were on graphs, so there is lots to discuss. A recently published paper, with authors from every major big tech company, shows how one can attack language models like GPT-2 and extract information verbatim, such as personally identifiable information, just by querying the model. The extracted information came from the models' training data, which was based on scraped internet text. This is a big problem, especially when you train a language model on a private custom dataset. It also looks like Booking.com wants a new recommendation engine, and they are offering up their dataset of over 1 million anonymized hotel reservations to get you in the game.

dataset, nlp cypher, reservation, (4 more...)

#artificialintelligence

Country: North America > United States (0.17)

Industry: Government > Military > Air Force (0.33)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.37)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

Leveraging GPT-2 for Classifying Spam Reviews with Limited Labeled Data via Adversarial Training

Irissappane, Athirai A., Yu, Hanfei, Shen, Yankun, Agrawal, Anubha, Stanton, Gray

arXiv.org Artificial Intelligence | Dec-24-2020

Online reviews are a vital source of information when purchasing a service or a product. Opinion spammers manipulate these reviews, deliberately altering the overall perception of the service. Though there exists a corpus of online reviews, only a few have been labeled as spam or non-spam, making it difficult to train spam detection models. We propose an adversarial training mechanism leveraging the capabilities of Generative Pre-Training 2 (GPT-2) for classifying opinion spam with limited labeled data and a large set of unlabeled data. Experiments on TripAdvisor and YelpZip datasets show that the proposed model outperforms state-of-the-art techniques by at least 7% in terms of accuracy when labeled data is limited. The proposed model can also generate synthetic spam/non-spam reviews with reasonable perplexity, thereby providing additional labeled data during training.
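The abstract above judges generated reviews by their perplexity. As a rough illustration of that metric only (not the authors' evaluation code), the sketch below computes perplexity as the exponential of the mean negative log-likelihood per token. The function name `perplexity` and the example log-probabilities are hypothetical; in practice the per-token log-probabilities would come from a language model.

```python
import math
from typing import Sequence


def perplexity(token_log_probs: Sequence[float]) -> float:
    """Return exp(mean negative log-likelihood) over the given tokens.

    token_log_probs holds natural-log probabilities, one per token,
    as assigned by some language model (assumed, not provided here).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Hypothetical example: a four-token review where the model gives
# every token a probability of 0.25.
example_log_probs = [math.log(0.25)] * 4
print(perplexity(example_log_probs))
```

Lower values mean the model finds the text less surprising; a model that spreads probability uniformly over k choices per token has a perplexity of about k.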

classifier, generator, proceedings, (15 more...)

arXiv.org Artificial Intelligence

2012.134

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Washington > Pierce County > Tacoma (0.04)
North America > United States > Colorado (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

DeepMind's latest AI can master games without being told their rules

Engadget | Dec-23-2020, 16:00:24 GMT

In 2016, Alphabet's DeepMind came out with AlphaGo, an AI which consistently beat the best human Go players. One year later, the subsidiary went on to refine its work, creating AlphaGo Zero. Where its predecessor learned to play Go by observing amateur and professional matches, AlphaGo Zero mastered the ancient game by simply playing against itself. DeepMind then created AlphaZero, which could play Go, chess and shogi with a single algorithm. What tied all those AIs together is that they knew the rules of the games they had to master going into their training.

algorithm, deepmind, muzero, (9 more...)

Engadget

Industry: Leisure & Entertainment > Games > Go (0.99)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Games (0.79)

DeepMind's AI agent MuZero could turbocharge YouTube

BBC News | Dec-23-2020, 16:00:01 GMT

The successor to AlphaGo is being used to create a more efficient type of video compression.

ai agent muzero, deepmind, turbocharge youtube

BBC News

Technology:

Information Technology > Communications > Social Media (0.76)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.40)

Toward Transformer-Based Object Detection

Beal, Josh, Kim, Eric, Tzeng, Eric, Park, Dong Huk, Zhai, Andrew, Kislyuk, Dmitry

arXiv.org Artificial Intelligence | Dec-17-2020

Transformers have become the dominant model in natural language processing, owing to their ability to pretrain on massive amounts of data, then transfer to smaller, more specific tasks via fine-tuning. The Vision Transformer was the first major attempt to apply a pure transformer model directly to images as input, demonstrating that as compared to convolutional networks, transformer-based architectures can achieve competitive results on benchmark classification tasks. However, the computational complexity of the attention operator means that we are limited to low-resolution inputs. For more complex tasks such as detection or segmentation, maintaining a high input resolution is crucial to ensure that models can properly identify and reflect fine details in their output. This naturally raises the question of whether or not transformer-based architectures such as the Vision Transformer are capable of performing tasks other than classification. In this paper, we determine that Vision Transformers can be used as a backbone by a common detection task head to produce competitive COCO results. The model that we propose, ViT-FRCNN, demonstrates several known properties associated with transformers, including large pretraining capacity and fast fine-tuning performance. We also investigate improvements over a standard detection backbone, including superior performance on out-of-domain images, better performance on large objects, and a lessened reliance on non-maximum suppression. We view ViT-FRCNN as an important stepping stone toward a pure-transformer solution of complex vision tasks such as object detection.
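To make the resolution limit mentioned in the abstract concrete, here is a minimal sketch of how the self-attention score matrix grows with input size. It assumes square, non-overlapping patches of 16 pixels and ignores any extra class token; the function names and the example resolutions are illustrative choices, not values taken from the paper.

```python
def num_patches(height: int, width: int, patch_size: int = 16) -> int:
    """Number of non-overlapping patches (tokens) in an image."""
    return (height // patch_size) * (width // patch_size)


def attention_matrix_entries(height: int, width: int, patch_size: int = 16) -> int:
    """Entries in one self-attention score matrix: tokens squared."""
    n = num_patches(height, width, patch_size)
    return n * n


# Illustrative resolutions only: doubling each side quadruples the
# token count and multiplies the attention matrix size by sixteen.
for side in (224, 448, 896):
    tokens = num_patches(side, side)
    entries = attention_matrix_entries(side, side)
    print(f"{side}x{side}: {tokens} tokens, {entries} attention entries")
```

Because the cost scales with the square of the token count, high-resolution inputs of the kind detection needs become expensive quickly under this simple model.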

backbone, dataset, detection, (14 more...)

arXiv.org Artificial Intelligence

2012.09958

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Indiana > Marion County > Lawrence (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A powerful AI generated some predictions for the future and they're quite outrageous

#artificialintelligence | Dec-16-2020, 13:05:23 GMT

A powerful AI algorithm has some, well, unusual predictions for what lies in store down the road. It's been a weird year, what with monoliths, terrifying animals, and of course a global pandemic dominating the news cycle. Inspired by all that chaos, research scientist and author Janelle Shane asked GPT-3, a powerful text-generating algorithm, to guess the future. With killer orchids, monster toads, and deadly puffballs, the algorithm seems to have missed the mark. But then again, who could have predicted half of the nonsense we've endured lately?

algorithm, gpt-3, prediction, (5 more...)

#artificialintelligence

Country: Asia > China (0.07)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Could GPT-3 Change The Way Future AI Models Are Developed and Deployed?

#artificialintelligence | Dec-16-2020, 05:45:57 GMT

Much has been said about GPT-3 already. Traditionally, we start with data for a problem and develop the model based on the data. The model is specific to the problem. If you want to train a model to predict traffic patterns in New York, you build a model of New York traffic patterns. If you want to model air pollution in New York, that's a different model. With GPT-3, you start with the model instead of the data.

gpt-3, language model, new york, (9 more...)

#artificialintelligence

Country: North America > United States > New York (0.80)

Industry: Health & Medicine > Health Care Providers & Services (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

DeepMind's latest AI breakthrough could turbocharge drug discovery

#artificialintelligence | Dec-15-2020, 21:41:05 GMT

While impressive, the technology wasn't yet capable of replacing the existing expensive and time-consuming experimental methods for determining what these proteins look like. However, its latest software comes close. In November, AlphaFold again outperformed all the other competing groups at CASP. The technology solved protein structures other labs had been working on for years. Scientists think the technology could have immense implications for the way proteins are studied.

deepmind, latest ai breakthrough, turbocharge drug discovery, (4 more...)

#artificialintelligence

Genre: Personal (0.40)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

DeepMind AI Predicts Protein Structure

#artificialintelligence | Dec-15-2020, 19:07:32 GMT

If you are even remotely interested in science, you have probably already heard about DeepMind's latest leap. Their AI system AlphaFold 2 has cracked the prediction of proteins' 3D structure. There are plenty of great articles about it. Since I have written about machine learning/AI in an earlier series of posts, I decided to write a brief post about this development as well. For more details, do check the Nature/New Scientist/DeepMind articles linked above.

alphafold 2, deepmind ai predict protein structure, protein, (9 more...)

#artificialintelligence

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)

Everything Product People Need to Know About Transformers, GPT-3, and HuggingFace (

#artificialintelligence | Dec-15-2020, 13:19:21 GMT

This is Part 1 in the 3-part series on Transformers for Product People. Natural language processing (NLP) has passed an industry-changing inflection point. More than 20 long-standing NLP challenges have been solved with near-human results in the past year, all by a single model: the attention-based transformer. This model was developed and published in December 2017, and has since kicked off an arms race between Google and OpenAI, with both labs shattering state-of-the-art results with each new model release. With models like GPT-3 making a splash in the media, decision makers are wondering just how big this development is.

sequence, transformer, transformer model, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
