Goto

Collaborating Authors

 Large Language Model


Will ChatGPT take your job -- and millions of others?

Al Jazeera

It is the whiz-kid of the artificial intelligence (AI) world that others are trying to emulate. In the four months since its November 30 launch, ChatGPT has shown the ability to perform a wide range of tasks, from cracking the bar and medical licensing exams in the United States to writing emails and songs, building apps, and more. The fact that it is freely available for public use has opened up a plethora of opportunities previously thought beyond the realm of possibility of AI -- even though the app's makers have faced criticism for opacity around the programming they have used to train it. Developed by OpenAI, a company backed by Microsoft, ChatGPT became the fastest-growing consumer app in the world two months after its launch, with more than 100 million users by January. That early success has prompted Microsoft to integrate its Bing search engine and Edge browser with the technology running ChatGPT in the hope of improving the experience of users.


5 ways OpenAI's ChatGPT plugins could change the AI game

#artificialintelligence

Join top executives in San Francisco on July 11-12, to hear how leaders are integrating and optimizing AI investments for success. Last week felt like a lifetime in AI. Sure, the week before was fast-paced and so was the week before that, but this one -- really, a lifetime. And somehow I just couldn't let it go. While the rest of the world went about its business, I noodled about the implications of OpenAI's latest ChatGPT chess move.



What YouTube Hustlers Can Teach Us About AI

#artificialintelligence

The tech industry is all-in on AI. Tech giants are pumping massive resources into research and new products. Microsoft and Google are suggesting they'll revamp their entire product lines. AI start-ups have collectively raised tens of billions of dollars in the past year, during a tech slowdown. The buzz has broken Silicon Valley containment, as regular people have had the chance to interact with new and surprising tools.


Ecosystem Graphs: The Social Footprint of Foundation Models

arXiv.org Artificial Intelligence

Foundation models (e.g. ChatGPT, StableDiffusion) pervasively influence society, warranting immediate social attention. While the models themselves garner much attention, to accurately characterize their impact, we must consider the broader sociotechnical ecosystem. We propose Ecosystem Graphs as a documentation framework to transparently centralize knowledge of this ecosystem. Ecosystem Graphs is composed of assets (datasets, models, applications) linked together by dependencies that indicate technical (e.g. how Bing relies on GPT-4) and social (e.g. how Microsoft relies on OpenAI) relationships. To supplement the graph structure, each asset is further enriched with fine-grained metadata (e.g. the license or training emissions). We document the ecosystem extensively at https://crfm.stanford.edu/ecosystem-graphs/. As of March 16, 2023, we annotate 262 assets (64 datasets, 128 models, 70 applications) from 63 organizations linked by 356 dependencies. We show Ecosystem Graphs functions as a powerful abstraction and interface for achieving the minimum transparency required to address myriad use cases. Therefore, we envision Ecosystem Graphs will be a community-maintained resource that provides value to stakeholders spanning AI researchers, industry professionals, social scientists, auditors and policymakers.


Evaluation of ChatGPT for NLP-based Mental Health Applications

arXiv.org Artificial Intelligence

Large language models (LLM) have been successful in several natural language understanding tasks and could be relevant for natural language processing (NLP)-based mental health application research. In this work, we report the performance of LLM-based ChatGPT (with gpt-3.5-turbo backend) in three text-based mental health classification tasks: stress detection (2-class classification), depression detection (2-class classification), and suicidality detection (5-class classification). We obtained annotated social media posts for the three classification tasks from public datasets. Then ChatGPT API classified the social media posts with an input prompt for classification. We obtained F1 scores of 0.73, 0.86, and 0.37 for stress detection, depression detection, and suicidality detection, respectively. A baseline model that always predicted the dominant class resulted in F1 scores of 0.35, 0.60, and 0.19. The zero-shot classification accuracy obtained with ChatGPT indicates a potential use of language models for mental health classification tasks.


Synthetically generated text for supervised text analysis

arXiv.org Artificial Intelligence

This article proposes a partial solution to these three issues, in the form of controlled generation of synthetic text with large language models. I provide a conceptual overview of text generation, guidance on when researchers should prefer different techniques for generating synthetic text, a discussion of ethics, and a simple technique for improving the quality of synthetic text. I demonstrate the usefulness of synthetic text with three applications: generating synthetic tweets describing the fighting in Ukraine, synthetic news articles describing specified political events for training an event detection system, and a multilingual corpus of populist manifesto statements for training a sentence-level populism classifier.


Two-stage Pipeline for Multilingual Dialect Detection

arXiv.org Artificial Intelligence

Dialect Identification is a crucial task for localizing various Large Language Models. This paper outlines our approach to the VarDial 2023 shared task. Here we have to identify three or two dialects from three languages each which results in a 9-way classification for Track-1 and 6-way classification for Track-2 respectively. Our proposed approach consists of a two-stage system and outperforms other participants' systems and previous works in this domain. We achieve a score of 58.54% for Track-1 and 85.61% for Track-2. Our codebase is available publicly (https://github.com/ankit-vaidya19/EACL_VarDial2023).


ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

arXiv.org Artificial Intelligence

Large language models (LLMs) such as ChatGPT and GPT-4 have made significant progress in NLP. However, their ability to memorize, represent, and leverage commonsense knowledge has been a well-known pain point for LLMs. It remains unclear that: (1) Can GPTs effectively answer commonsense questions? (2) Are GPTs knowledgeable in commonsense? (3) Are GPTs aware of the underlying commonsense knowledge for answering a specific question? (4) Can GPTs effectively leverage commonsense for answering questions? To evaluate the above commonsense problems, we conduct a series of experiments to evaluate ChatGPT's commonsense abilities, and the experimental results show that: (1) GPTs can achieve good QA accuracy in commonsense tasks, while they still struggle with certain types of knowledge. (2) ChatGPT is knowledgeable, and can accurately generate most of the commonsense knowledge using knowledge prompts. (3) Despite its knowledge, ChatGPT is an inexperienced commonsense problem solver, which cannot precisely identify the needed commonsense knowledge for answering a specific question, i.e., ChatGPT does not precisely know what commonsense knowledge is required to answer a question. The above findings raise the need to investigate better mechanisms for utilizing commonsense knowledge in LLMs, such as instruction following, better commonsense guidance, etc.


When Brain-inspired AI Meets AGI

arXiv.org Artificial Intelligence

Artificial General Intelligence (AGI) has been a long-standing goal of humanity, with the aim of creating machines capable of performing any intellectual task that humans can do. To achieve this, AGI researchers draw inspiration from the human brain and seek to replicate its principles in intelligent machines. Brain-inspired artificial intelligence is a field that has emerged from this endeavor, combining insights from neuroscience, psychology, and computer science to develop more efficient and powerful AI systems. In this article, we provide a comprehensive overview of brain-inspired AI from the perspective of AGI. We begin with the current progress in brain-inspired AI and its extensive connection with AGI. We then cover the important characteristics for both human intelligence and AGI (e.g., scaling, multimodality, and reasoning). We discuss important technologies toward achieving AGI in current AI systems, such as in-context learning and prompt tuning. We also investigate the evolution of AGI systems from both algorithmic and infrastructural perspectives. Finally, we explore the limitations and future of AGI.