AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

Deep Maths - Machine Learning and Mathematics

#artificialintelligenceMar-6-2022, 00:01:18 GMT

Oxford Mathematics Public LectureDeep Maths - Machine Learning and MathematicsIn December 2021 mathematicians at Oxford and Sydney universities together with their collaborators at DeepMind announced that they had successfully used tools from machine learning to discover new patterns in mathematics. But what exactly had they done and what are its implications for the future of mathematics and mathematicians?This online event will feature short talks from each of the four collaborators, explaining their work, followed by a panel discussion addressing its wider implications. If you wish to submit a question, please email external-relations@maths.ox.ac.ukThe speakers:Alex Davies - DeepMindAndras Juhasz - University of OxfordMarc Lackenby - University of OxfordGeordie Williamson - University of SydneyThe panel will be chaired by Jon Keating, Sedleian Professor of Natural Philosophy in Oxford.This is an online only lecture which every one is free to watch: www.youtube.com/c/OxfordMathematicsThe Oxford Mathematics Public Lectures are generously supported by XTX Markets.

deep math, machine learning and mathematics, mathematician, (2 more...)

#artificialintelligence

Genre:

Instructional Material > Online (0.31)
Instructional Material > Course Syllabus & Notes (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

How Good is Your Chatbot? An Introduction to Perplexity in NLP

#artificialintelligenceMar-5-2022, 06:55:59 GMT

New, state-of-the-art language models like DeepMind's Gopher, Microsoft's Megatron, and OpenAI's GPT-3 are driving a wave of innovation in NLP. How do you measure the performance of these language models to see how good they are? In a previous post, we gave an overview of different language model evaluation metrics. This post dives more deeply into one of the most popular: a metric known as perplexity. Surge AI delivers better data, faster.

dataset, perplexity, probability, (16 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

GPT-3: The biggest breakthrough in AI in recent history

#artificialintelligenceMar-5-2022, 03:30:19 GMT

GPT-1 was released on June 11, 2018. When this model was released by OpenAI, there was much excitement. It was the transformer structure combined with unsupervised pre-training with promising results. The key difference between GPT-1 and the other language-based models before it, is that it was fine-tuned, or trained for specific tasks. GPT-2 was introduced in February 2019.

biggest breakthrough, gpt-3, recent history, (14 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.04)

Industry: Media (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Lessons Learned on Language Model Safety and Misuse

#artificialintelligenceMar-3-2022, 23:00:09 GMT

The deployment of powerful AI systems has enriched our understanding of safety and misuse far more than would have been possible through research alone. Here, we describe our latest thinking in the hope of helping other AI developers address safety and misuse of deployed models. Over the past two years, we've learned a lot about how language models can be used and abused--insights we couldn't have gained without the experience of real-world deployment. In June 2020, we began giving access to developers and researchers to the OpenAI API, an interface for accessing and building applications on top of new AI models developed by OpenAI. Deploying GPT-3, Codex, and other models in a way that reduces risks of harm has posed various technical and policy challenges.

deployment, language model, misuse, (10 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.83)

Add feedback

La veille de la cybersécurité

#artificialintelligenceMar-2-2022, 17:15:21 GMT

The Internet of the future might be something completely different from what we know currently. But this change will be for the better good or worse. Experts from Copenhagen Institute for Future Studies (CIFS) have raised questions about AI-generated content, and how it might rule digital locations and the much-hyped metaverse. According to CIFS expert Timothy Shoup, 99% or more of the internet's content will be generated by artificial intelligence by 2025 to 2030, especially if models such as OpenAI's GPT-3 witness a wider use.

internet, shoup, veille

#artificialintelligence

Country: Europe > Denmark > Capital Region > Copenhagen (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.31)

Add feedback

Adversarial Robustness of Neural-Statistical Features in Detection of Generative Transformers

Crothers, Evan, Japkowicz, Nathalie, Viktor, Herna, Branco, Paula

arXiv.org Artificial IntelligenceMar-2-2022

The detection of computer-generated text is an area of rapidly increasing significance as nascent generative models allow for efficient creation of compelling human-like text, which may be abused for the purposes of spam, disinformation, phishing, or online influence campaigns. Past work has studied detection of current state-of-the-art models, but despite a developing threat landscape, there has been minimal analysis of the robustness of detection methods to adversarial attacks. To this end, we evaluate neural and non-neural approaches on their ability to detect computer-generated text, their robustness against text adversarial attacks, and the impact that successful adversarial attacks have on human judgement of text quality. We find that while statistical features underperform neural features, statistical features provide additional adversarial robustness that can be leveraged in ensemble detection models. In the process, we find that previously effective complex phrasal features for detection of computer-generated text hold little predictive power against contemporary generative models, and identify promising statistical features to use instead. Finally, we pioneer the usage of $\Delta$MAUVE as a proxy measure for human judgement of adversarial text quality.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/IJCNN55064.2022.9892269

2203.07983

Country:

North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
Oceania > Australia (0.04)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Research Report > Promising Solution (0.34)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.75)

Add feedback

99% Of Future Internet Content To Be AI-Generated; For Better Or Worse?

#artificialintelligenceMar-1-2022, 09:25:11 GMT

According to CIFS expert Timothy Shoup, 99% or more of the internet's content will be generated by artificial intelligence by 2025 to 2030, especially if models such as OpenAI's GPT-3 witness a wider use. "The internet would be completely unrecognizable," Shoup told colleague Sofie Hvitved. As the capabilities of AI advance, it could start creating entire online worlds, alongside all the things that inhabit them. This will also include all the online material that humans make use of even now. This could give birth to things that we could only imagine right now.

ai-generated, artificial intelligence, future internet content, (5 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.62)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.42)

Add feedback

This new dataset shows that AI still lacks commonsense reasoning

#artificialintelligenceFeb-28-2022, 19:56:56 GMT

Abductive reasoning, frequently misidentified as deductive reasoning, is the process of making a plausible prediction when faced with incomplete information. For example, given a photo showing a toppled truck and a police cruiser on a snowy freeway, abductive reasoning may lead someone to infer that dangerous road conditions caused an accident. Humans can quickly consider this sort of context to arrive at a hypothesis. But AI struggles, despite recent technical advances. Motivated to explore the challenge, researchers at the Allen Institute for Artificial Intelligence, the University of California, Berkeley, and the MIT-IBM Watson AI lab created a dataset called Sherlock, a collection of over 100,000 images of scenes paired with clues a viewer could use to answer questions about the scenes.

commonsense reasoning, hessel, reasoning, (14 more...)

#artificialintelligence

Country: North America > United States > California > Alameda County > Berkeley (0.25)

Genre: Personal (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.44)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.30)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

DeepMind's AI can control superheated plasma inside a fusion reactor

#artificialintelligenceFeb-28-2022, 17:05:31 GMT

Controlling nuclear fusion on Earth is hard, however. The problem is that atomic nuclei repel each other. Smashing them together inside a reactor can only be done at extremely high temperatures, often reaching hundreds of millions of degrees--hotter than the center of the sun. At these temperatures, matter is neither solid, liquid, nor gas. It enters a fourth state, known as plasma: a roiling, superheated soup of particles.

control superheated plasma, plasma, reactor, (5 more...)

#artificialintelligence

Country: Europe > Switzerland > Vaud > Lausanne (0.07)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

Swiss Plasma Center and DeepMind Use AI To Control Plasmas for Nuclear Fusion

#artificialintelligenceFeb-28-2022, 04:50:05 GMT

Scientists at EPFL's Swiss Plasma Center and DeepMind have jointly developed a new method for controlling plasma configurations for use in nuclear fusion research. EPFL's Swiss Plasma Center (SPC) has decades of experience in plasma physics and plasma control methods. DeepMind is a scientific discovery company acquired by Google in 2014 that's committed to'solving intelligence to advance science and humanity. Together, they have developed a new magnetic control method for plasmas based on deep reinforcement learning, and applied it to a real-world plasma for the first time in the SPC's tokamak research facility, TCV. Their study has just been published in Nature.

plasma, plasma configuration, swiss plasma center, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.78)

Add feedback