Goto

Collaborating Authors

 Large Language Model


OpenAI's hunger for data is coming back to bite it

MIT Technology Review

In AI development, the dominant paradigm is that the more training data, the better. OpenAI's GPT-2 model had a data set consisting of 40 gigabytes of text. GPT-3, which ChatGPT is based on, was trained on 570 GB of data. OpenAI has not shared how big the data set for its latest model, GPT-4, is. But that hunger for larger models is now coming back to bite the company. In the past few weeks, several Western data protection authorities have started investigations into how OpenAI collects and processes the data powering ChatGPT.


Demand rising amid AI frenzy, says chip-testing maker Advantest

The Japan Times

Advantest is seeing a spike in demand for its chip-testing devices, catalyzed by the frenzy of interest in OpenAI's ChatGPT and other novel uses of artificial intelligence. A global race to develop powerful computing clusters and next-generation AI-training systems is spurring chipmakers to buy many more of the Tokyo-based company's testing tools, Advantest Co-Chief Strategy Officer Yasuo Mihashi said in an interview. The executive's outlook comes against a backdrop of muted demand for consumer electronics and a U.S.-led campaign to restrict trade of advanced semiconductors to China. Japan said last month it would expand export controls on its chip technology. Still, the AI surge is driving up orders from Nvidia and Advanced Micro Devices, two companies that provide the key AI-training semiconductors and which rely on Advantest as their main testing tools supplier, according to data compiled by Bloomberg.


Yes, AI is a cybersecurity 'nuclear' threat. That's why companies have to dare to do this

FOX News

Fox News correspondent Grady Trimble has the latest on fears the technology will spiral out of control on'Special Report.' Microsoft just announced Security Copilot, their AI-powered assistant that will revolutionize cybersecurity defense by increasing efficiency and productivity. The tool will incorporate ChatGPT4 technology from OpenAI and a proprietary security specific model created by Microsoft from all the data they have. The Security Copilot is currently available to a small number of selected companies for testing with the official launch date still unknown. However, hackers are not waiting and have already started utilizing widely available AI tools to launch attacks.


Scaling Transformer to 1M tokens and beyond with RMT

arXiv.org Artificial Intelligence

This technical report presents the application of a recurrent memory to extend the context length of BERT, one of the most effective Transformer-based models in natural language processing. By leveraging the Recurrent Memory Transformer architecture, we have successfully increased the model's effective context length to an unprecedented two million tokens, while maintaining high memory retrieval accuracy. Our method allows for the storage and processing of both local and global information and enables information flow between segments of the input sequence through the use of recurrence. Our experiments demonstrate the effectiveness of our approach, which holds significant potential to enhance long-term dependency handling in natural language understanding and generation tasks as well as enable large-scale context processing for memory-intensive applications.


Low-resource Bilingual Dialect Lexicon Induction with Large Language Models

arXiv.org Artificial Intelligence

Bilingual word lexicons are crucial tools for multilingual natural language understanding and machine translation tasks, as they facilitate the mapping of words in one language to their synonyms in another language. To achieve this, numerous papers have explored bilingual lexicon induction (BLI) in high-resource scenarios, using a typical pipeline consisting of two unsupervised steps: bitext mining and word alignment, both of which rely on pre-trained large language models~(LLMs). In this paper, we present an analysis of the BLI pipeline for German and two of its dialects, Bavarian and Alemannic. This setup poses several unique challenges, including the scarcity of resources, the relatedness of the languages, and the lack of standardization in the orthography of dialects. To evaluate the BLI outputs, we analyze them with respect to word frequency and pairwise edit distance. Additionally, we release two evaluation datasets comprising 1,500 bilingual sentence pairs and 1,000 bilingual word pairs. They were manually judged for their semantic similarity for each Bavarian-German and Alemannic-German language pair.


Catch Me If You Can: Identifying Fraudulent Physician Reviews with Large Language Models Using Generative Pre-Trained Transformers

arXiv.org Artificial Intelligence

The proliferation of fake reviews of doctors has potentially detrimental consequences for patient well-being and has prompted concern among consumer protection groups and regulatory bodies. Yet despite significant advancements in the fields of machine learning and natural language processing, there remains limited comprehension of the characteristics differentiating fraudulent from authentic reviews. This study utilizes a novel pre-labeled dataset of 38048 physician reviews to establish the effectiveness of large language models in classifying reviews. Specifically, we compare the performance of traditional ML models, such as logistic regression and support vector machines, to generative pre-trained transformer models. Furthermore, we use GPT4, the newest model in the GPT family, to uncover the key dimensions along which fake and genuine physician reviews differ. Our findings reveal significantly superior performance of GPT-3 over traditional ML models in this context. Additionally, our analysis suggests that GPT3 requires a smaller training sample than traditional models, suggesting its appropriateness for tasks with scarce training data. Moreover, the superiority of GPT3 performance increases in the cold start context i.e., when there are no prior reviews of a doctor. Finally, we employ GPT4 to reveal the crucial dimensions that distinguish fake physician reviews. In sharp contrast to previous findings in the literature that were obtained using simulated data, our findings from a real-world dataset show that fake reviews are generally more clinically detailed, more reserved in sentiment, and have better structure and grammar than authentic ones.


Is ChatGPT Equipped with Emotional Dialogue Capabilities?

arXiv.org Artificial Intelligence

This report presents a study on the emotional dialogue capability of ChatGPT, an advanced language model developed by OpenAI. The study evaluates the performance of ChatGPT on emotional dialogue understanding and generation through a series of experiments on several downstream tasks. Our findings indicate that while ChatGPT's performance on emotional dialogue understanding may still lag behind that of supervised models, it exhibits promising results in generating emotional responses. Furthermore, the study suggests potential avenues for future research directions.


OpenAI CEO says era of giant AI models is over

FOX News

Russell Wald, director of the Stanford Institute for Human-Centered AI, sounds off on'The Story.' OpenAI CEO Sam Altman says the age of the giant artificial intelligence model is already over. "I think we're at the end of the era where it's going to be these, like, giant, giant models," he told an audience at the Massachusetts Institute of Technology over Zoom last week. "We'll make them better in other ways." During the same event, Altman also confirmed that his company is not developing Chat GPT-5. "An earlier version of the letter claimed OpenAI is training GPT-5 right now," he said, referencing a letter from billionaire Elon Musk and Apple co-founder Steve Wozniak.


You Should Ask a Chatbot to Make You a Drink

The Atlantic - Technology

Two weeks in a row, ChatGPT botched my grocery list. I thought that I had found a really solid, practical use for AI--automating one of my least favorite Sunday chores--but the bot turned out to be pretty darn bad at it. I fed it a link to a recipe for cauliflower shawarma with a spicy sauce and asked it to compile the ingredients in a list. It forgot the pita, so I forgot the pita, and then I had to use tortillas instead. The following week, I gave it a link to a taco recipe.


The merger of TruthGPT and OpenAI by Alex Hammer Podcast

#artificialintelligence

Is it possible that some of the greatest AI inventions are overlooking something major? In this episode, I dive into what seems to be the missing piece in artificial intelligence and AI research. Listen in to learn how a different approach could bridge the gap between the world of AI and everyday function and usability. "Steve Jobs showed that with Apple. That they were not just science and engineering creations they were theater, they were art.