AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

OpenAI's hunger for data is coming back to bite it

MIT Technology ReviewApr-19-2023, 09:50:32 GMT

In AI development, the dominant paradigm is that the more training data, the better. OpenAI's GPT-2 model had a data set consisting of 40 gigabytes of text. GPT-3, which ChatGPT is based on, was trained on 570 GB of data. OpenAI has not shared how big the data set for its latest model, GPT-4, is. But that hunger for larger models is now coming back to bite the company. In the past few weeks, several Western data protection authorities have started investigations into how OpenAI collects and processes the data powering ChatGPT.

chatgpt, hunger, openai, (5 more...)

MIT Technology Review

Country:

Europe > Italy (0.18)
South America > Brazil (0.06)
North America > United States > California (0.06)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Demand rising amid AI frenzy, says chip-testing maker Advantest

The Japan TimesApr-19-2023, 06:06:18 GMT

Advantest is seeing a spike in demand for its chip-testing devices, catalyzed by the frenzy of interest in OpenAI's ChatGPT and other novel uses of artificial intelligence. A global race to develop powerful computing clusters and next-generation AI-training systems is spurring chipmakers to buy many more of the Tokyo-based company's testing tools, Advantest Co-Chief Strategy Officer Yasuo Mihashi said in an interview. The executive's outlook comes against a backdrop of muted demand for consumer electronics and a U.S.-led campaign to restrict trade of advanced semiconductors to China. Japan said last month it would expand export controls on its chip technology. Still, the AI surge is driving up orders from Nvidia and Advanced Micro Devices, two companies that provide the key AI-training semiconductors and which rely on Advantest as their main testing tools supplier, according to data compiled by Bloomberg.

advantest, ai frenzy, chip-testing maker advantest, (1 more...)

The Japan Times

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.29)
Asia > China (0.29)

Industry:

Semiconductors & Electronics (1.00)
Information Technology > Hardware (0.63)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.63)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback

Yes, AI is a cybersecurity 'nuclear' threat. That's why companies have to dare to do this

FOX NewsApr-19-2023, 06:00:06 GMT

Fox News correspondent Grady Trimble has the latest on fears the technology will spiral out of control on'Special Report.' Microsoft just announced Security Copilot, their AI-powered assistant that will revolutionize cybersecurity defense by increasing efficiency and productivity. The tool will incorporate ChatGPT4 technology from OpenAI and a proprietary security specific model created by Microsoft from all the data they have. The Security Copilot is currently available to a small number of selected companies for testing with the official launch date still unknown. However, hackers are not waiting and have already started utilizing widely available AI tools to launch attacks.

chatgpt, information, security copilot, (15 more...)

FOX News

Country: North America > United States > District of Columbia > Washington (0.05)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.77)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

Add feedback

Scaling Transformer to 1M tokens and beyond with RMT

Bulatov, Aydar, Kuratov, Yuri, Burtsev, Mikhail S.

arXiv.org Artificial IntelligenceApr-19-2023

This technical report presents the application of a recurrent memory to extend the context length of BERT, one of the most effective Transformer-based models in natural language processing. By leveraging the Recurrent Memory Transformer architecture, we have successfully increased the model's effective context length to an unprecedented two million tokens, while maintaining high memory retrieval accuracy. Our method allows for the storage and processing of both local and global information and enables information flow between segments of the input sequence through the use of recurrence. Our experiments demonstrate the effectiveness of our approach, which holds significant potential to enhance long-term dependency handling in natural language understanding and generation tasks as well as enable large-scale context processing for memory-intensive applications.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2304.11062

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.05)
North America > United States > Washington > King County > Seattle (0.04)
(5 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Low-resource Bilingual Dialect Lexicon Induction with Large Language Models

Artemova, Ekaterina, Plank, Barbara

arXiv.org Artificial IntelligenceApr-19-2023

Bilingual word lexicons are crucial tools for multilingual natural language understanding and machine translation tasks, as they facilitate the mapping of words in one language to their synonyms in another language. To achieve this, numerous papers have explored bilingual lexicon induction (BLI) in high-resource scenarios, using a typical pipeline consisting of two unsupervised steps: bitext mining and word alignment, both of which rely on pre-trained large language models~(LLMs). In this paper, we present an analysis of the BLI pipeline for German and two of its dialects, Bavarian and Alemannic. This setup poses several unique challenges, including the scarcity of resources, the relatedness of the languages, and the lack of standardization in the orthography of dialects. To evaluate the BLI outputs, we analyze them with respect to word frequency and pairwise edit distance. Additionally, we release two evaluation datasets comprising 1,500 bilingual sentence pairs and 1,000 bilingual word pairs. They were manually judged for their semantic similarity for each Bavarian-German and Alemannic-German language pair.

computational linguistic, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2304.09957

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(19 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.89)

Add feedback

Catch Me If You Can: Identifying Fraudulent Physician Reviews with Large Language Models Using Generative Pre-Trained Transformers

Shukla, Aishwarya Deep, Agarwal, Laksh, Mein, Jie, Goh, null, Guodong, null, Gao, null, Agarwal, Ritu

arXiv.org Artificial IntelligenceApr-19-2023

The proliferation of fake reviews of doctors has potentially detrimental consequences for patient well-being and has prompted concern among consumer protection groups and regulatory bodies. Yet despite significant advancements in the fields of machine learning and natural language processing, there remains limited comprehension of the characteristics differentiating fraudulent from authentic reviews. This study utilizes a novel pre-labeled dataset of 38048 physician reviews to establish the effectiveness of large language models in classifying reviews. Specifically, we compare the performance of traditional ML models, such as logistic regression and support vector machines, to generative pre-trained transformer models. Furthermore, we use GPT4, the newest model in the GPT family, to uncover the key dimensions along which fake and genuine physician reviews differ. Our findings reveal significantly superior performance of GPT-3 over traditional ML models in this context. Additionally, our analysis suggests that GPT3 requires a smaller training sample than traditional models, suggesting its appropriateness for tasks with scarce training data. Moreover, the superiority of GPT3 performance increases in the cold start context i.e., when there are no prior reviews of a doctor. Finally, we employ GPT4 to reveal the crucial dimensions that distinguish fake physician reviews. In sharp contrast to previous findings in the literature that were obtained using simulated data, our findings from a real-world dataset show that fake reviews are generally more clinically detailed, more reserved in sentiment, and have better structure and grammar than authentic ones.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2304.09948

Country:

Europe (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > India (0.04)
(11 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Health Care Providers & Services (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Is ChatGPT Equipped with Emotional Dialogue Capabilities?

Zhao, Weixiang, Zhao, Yanyan, Lu, Xin, Wang, Shilong, Tong, Yanpeng, Qin, Bing

arXiv.org Artificial IntelligenceApr-19-2023

This report presents a study on the emotional dialogue capability of ChatGPT, an advanced language model developed by OpenAI. The study evaluates the performance of ChatGPT on emotional dialogue understanding and generation through a series of experiments on several downstream tasks. Our findings indicate that while ChatGPT's performance on emotional dialogue understanding may still lag behind that of supervised models, it exhibits promising results in generating emotional responses. Furthermore, the study suggests potential avenues for future research directions.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2304.09582

Country:

North America > United States > Washington > King County > Seattle (0.04)
Europe > Italy > Tuscany > Florence (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
(2 more...)

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

OpenAI CEO says era of giant AI models is over

FOX NewsApr-18-2023, 20:32:52 GMT

Russell Wald, director of the Stanford Institute for Human-Centered AI, sounds off on'The Story.' OpenAI CEO Sam Altman says the age of the giant artificial intelligence model is already over. "I think we're at the end of the era where it's going to be these, like, giant, giant models," he told an audience at the Massachusetts Institute of Technology over Zoom last week. "We'll make them better in other ways." During the same event, Altman also confirmed that his company is not developing Chat GPT-5. "An earlier version of the letter claimed OpenAI is training GPT-5 right now," he said, referencing a letter from billionaire Elon Musk and Apple co-founder Steve Wozniak.

getty image, giant ai model, sam altman, (9 more...)

FOX News

Country:

North America > United States > Massachusetts (0.26)
North America > United States > Washington > King County > Redmond (0.06)
North America > United States > Nevada (0.06)
North America > United States > California > Los Angeles County > Hawthorne (0.06)

Industry:

Media > News (0.36)
Government (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.87)

Add feedback

You Should Ask a Chatbot to Make You a Drink

The Atlantic - TechnologyApr-18-2023, 19:52:19 GMT

Two weeks in a row, ChatGPT botched my grocery list. I thought that I had found a really solid, practical use for AI--automating one of my least favorite Sunday chores--but the bot turned out to be pretty darn bad at it. I fed it a link to a recipe for cauliflower shawarma with a spicy sauce and asked it to compile the ingredients in a list. It forgot the pita, so I forgot the pita, and then I had to use tortillas instead. The following week, I gave it a link to a taco recipe.

chatgpt, ingredient, recipe, (15 more...)

The Atlantic - Technology

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.05)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

Add feedback

The merger of TruthGPT and OpenAI by Alex Hammer Podcast

#artificialintelligenceApr-18-2023, 18:51:18 GMT

Is it possible that some of the greatest AI inventions are overlooking something major? In this episode, I dive into what seems to be the missing piece in artificial intelligence and AI research. Listen in to learn how a different approach could bridge the gap between the world of AI and everyday function and usability. "Steve Jobs showed that with Apple. That they were not just science and engineering creations they were theater, they were art.

alex hammer podcast, podcast, truthgpt and openai, (1 more...)

#artificialintelligence

Industry:

Media > Music (0.40)
Leisure & Entertainment (0.40)
Banking & Finance > Mergers & Acquisitions (0.40)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback