AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

An Introduction to Working with BERT in Practice

#artificialintelligenceFeb-13-2022, 00:46:20 GMT

Luckily, smaller pretrained BERT or XLNET models are becoming increasingly available for free, and they may well serve as stepping stones for fine-tuning. This means that, in practice, you start from downloading a pre-trained BERT or XLNET model, incorporate it into your network, and fine-tune it with much more manageable, smaller datasets. In this article, we'll see how that works. First, let's start with incorporating existing BERT models in our models. For this to work, we need a dedicated BERT layer: a landing hub for BERT models.

bert model, xlnet model

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.80)

Add feedback

Transformer-based Approaches for Legal Text Processing

Nguyen, Ha-Thanh, Nguyen, Minh-Phuong, Vuong, Thi-Hai-Yen, Bui, Minh-Quan, Nguyen, Minh-Chau, Dang, Tran-Binh, Tran, Vu, Nguyen, Le-Minh, Satoh, Ken

arXiv.org Artificial IntelligenceFeb-13-2022

In this paper, we introduce our approaches using Transformer-based models for different problems of the COLIEE 2021 automatic legal text processing competition. Automated processing of legal documents is a challenging task because of the characteristics of legal documents as well as the limitation of the amount of data. With our detailed experiments, we found that Transformer-based pretrained language models can perform well with automated legal text processing problems with appropriate approaches. We describe in detail the processing steps for each task such as problem formulation, data processing and augmentation, pretraining, finetuning. In addition, we introduce to the community two pretrained models that take advantage of parallel translations in legal domain, NFSP and NMSP. In which, NFSP achieves the state-of-the-art result in Task 5 of the competition. Although the paper focuses on technical reporting, the novelty of its approaches can also be an useful reference in automated legal document processing using Transformer-based models.

ha-thanh nguyen, paragraph, pretrained model, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s12626-022-00102-2

2202.06397

Country:

Asia > Japan > Honshū > Tōhoku (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report (0.64)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

YouTube is now using A.I. on videos that had previously mastered games including chess and Go

#artificialintelligenceFeb-12-2022, 02:13:04 GMT

The CIA has been conducting mass surveillance in the U.S. with minimal oversight--and the program's uncovering is bad news for Big Tech

video, youtube

#artificialintelligence

Country: North America > United States (1.00)

Industry: Leisure & Entertainment > Games > Chess (0.40)

Technology:

Information Technology > Communications > Social Media (0.76)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

Could 'expiration dates' for AI systems help prevent bias?

#artificialintelligenceFeb-11-2022, 19:30:08 GMT

Today's AI technology, much like humans, learns from examples. AI systems are developed on datasets containing text, images, audio, and other information that serve as a ground truth. By figuring out the relationships between these examples, AI systems gradually "learn" to make predictions, like which word is likely to come next in a sentence or whether objects in a picture are inanimate. The technique holds up remarkably well in the language domain, for example, where systems like OpenAI's GPT-3 can write content from essays to advertisements in human-like ways. But similar in character to humans, AI that isn't supplied fresh, new data eventually grows stale in its predictions -- a phenomenon known as model drift.

dataset, expiration, expiration date, (13 more...)

#artificialintelligence

AI-Alerts: 2022 > 2022-02 > AAAI AI-Alert for Feb 16, 2022 (1.00)

Country: North America > Canada > Ontario > Toronto (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.55)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.55)

Add feedback

Passing the Turing Test: AI creates human-like text

#artificialintelligenceFeb-11-2022, 15:36:36 GMT

"The baseball legend Yogi Berra once had a manager tell him to think more when he was up at bat. Berra responded, 'How can a guy hit and think at the same time?' It was a fair question. After all, when a pitcher throws a fastball, the batter has about 400 milliseconds to see the pitch, judge its direction, and swing the bat. "The human eye takes about 80 milliseconds to react to a stimulus.

ai create human-like text, caterpillar, gpt-3, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.58)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.58)
Information Technology > Artificial Intelligence > Issues > Turing's Test (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.39)

Add feedback

OpenAI Chief Scientist Says Advanced AI May Already Be Conscious

#artificialintelligenceFeb-11-2022, 15:35:06 GMT

OpenAI's top researcher has made a startling claim this week: that artificial intelligence may already be gaining consciousness. Ilya Sutskever, chief scientist of the OpenAI research group, tweeted today that "it may be that today's large neural networks are slightly conscious." Needless to say, that's an unusual point of view. The widely accepted idea among AI researchers is that the tech has made great strides over the past decade, but still falls far short of human intelligence, nevermind being anywhere close to experiencing the world consciously. It's possible that Sutskever was speaking facetiously, but it's also conceivable that as the top researcher at one of the foremost AI groups in the world, he's already looking downrange. He's long been preoccupied with artificial general intelligence, or AGI, which would refer to AI that operates at a human or superhuman level.

intelligence, openai, sutskever, (5 more...)

#artificialintelligence

Industry: Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.95)

Add feedback

NLP -- The pioneer for simplifying our lives

#artificialintelligenceFeb-11-2022, 12:01:24 GMT

A remarkable portion of humans' everyday tasks are based on text: from reading to comprehending and generating it. According to a McKinsey study the average American employee spends 28% of their working hours on reading and responding to emails. Naturally, the ability to streamline and automate these processes opens up infinite possibilities for optimizing resources and thereby creating value for the user. While there has been significant progress made in these spaces, widespread business applications, especially of generative models is yet to come. Natural language processing models (NLP) will be productized, commercialized and begin to be more widely implemented.

application, generative model, lives, (3 more...)

#artificialintelligence

Industry: Information Technology (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

GitHub Copilot Crushes Data Science And ML Tasks: Ultimate Review

#artificialintelligenceFeb-10-2022, 23:50:36 GMT

I've noticed a few things while using Copilot. First of all, it has got an excellent sense of context. The suggestions I got changed often when I included/excluded specific libraries or some code in a script. On another personal project, I realized that Copilot could pick up particular references from other files to help me write the rest of the code. It could also adapt to my coding style and commenting.

data science and ml task, github copilot crush data science, ultimate review

#artificialintelligence

Technology:

Information Technology > Data Science (0.58)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)

Add feedback

La veille de la cybersécurité

#artificialintelligenceFeb-10-2022, 10:19:24 GMT

For the past few decades, anxious parents, educators, and politicians have latched onto the idea that teaching kids to code would be a surefire way to prepare them for "the workforce of tomorrow." But artificial intelligence is now starting to slowly but surely deflate the economic life preserver that coding was supposed to represent. At first, this may seem counterintuitive. After all, A.I. is just software, and someone still has to write that software, right? Well, the answer is increasingly likely to be no.

human coder, software, veille

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.09)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.43)

Add feedback

DeepMind's AlphaCode AI writes code at a competitive level – TechCrunch

#artificialintelligenceFeb-10-2022, 00:27:44 GMT

DeepMind has created an AI capable of writing code to solve arbitrary problems posed to it, as proven by participating in a coding challenge and placing -- well, somewhere in the middle. It won't be taking any software engineers' jobs just yet, but it's promising and may help automate basic tasks. The team at DeepMind, a subsidiary of Alphabet, is aiming to create intelligence in as many forms as it can, and of course these days the task to which many of our great minds are bent is coding. Code is a fusion of language, logic and problem-solving that is both a natural fit for a computer's capabilities and a tough one to crack. Of course it isn't the first to attempt something like this: OpenAI has its own Codex natural-language coding project, and it powers both GitHub Copilot and a test from Microsoft to let GPT-3 finish your lines.

alphacode ai write code, deepmind, techcrunch, (2 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback