AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

Fine-tuning ChatGPT for Automatic Scoring

arXiv.org Artificial IntelligenceDec-25-2023

This study highlights the potential of fine-tuned ChatGPT (GPT-3.5) for automatically scoring student written constructed responses using example assessment tasks in science education. Recent studies on OpenAI's generative model GPT-3.5 proved its superiority in predicting the natural language with high accuracy and human-like responses. GPT-3.5 has been trained over enormous online language materials such as journals and Wikipedia; therefore, more than direct usage of pre-trained GPT-3.5 is required for automatic scoring as students utilize a different language than trained material. These imply that a domain-specific model, fine-tuned over data for specific tasks, can enhance model performance. In this study, we fine-tuned GPT-3.5 on six assessment tasks with a diverse dataset of middle-school and high-school student responses and expert scoring. The six tasks comprise two multi-label and four multi-class assessment tasks. We compare the performance of fine-tuned GPT-3.5 with the fine-tuned state-of-the-art Google's generated language model, BERT. The results show that in-domain training corpora constructed from science questions and responses for BERT achieved average accuracy = 0.838, SD = 0.069. GPT-3.5 shows a remarkable average increase (9.1%) in automatic scoring accuracy (mean = 9.15, SD = 0.042) for the six tasks, p =0.001 < 0.05. Specifically, for multi-label tasks (item 1 with 5 labels; item 2 with 10 labels), GPT-3.5 achieved significantly higher scoring accuracy than BERT across all the labels, with the second item achieving a 7.1% increase. The average scoring increase for the four multi-class items for GPT-3.5 was 10.6% compared to BERT. Our study confirmed the effectiveness of fine-tuned GPT-3.5 for automatic scoring of student responses on domain-specific data in education with high accuracy. We have released fine-tuned models for public use and community engagement.

arxiv preprint arxiv, gpt-3, student, (15 more...)

arXiv.org Artificial Intelligence

2310.10072

Country:

North America > United States > Georgia > Clarke County > Athens (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Education > Curriculum > Subject-Specific Education (1.00)
Education > Assessment & Standards (1.00)
Education > Educational Technology > Educational Software (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

AI Is Telling Bedtime Stories to Your Kids Now

WIREDDec-24-2023, 12:00:00 GMT

The problem with Bluey is there's not enough of it. Even with 151 seven-minute-long episodes of the popular children's animated show out there, parents of toddlers still desperately wait for Australia's Ludo Studio to release another season. The only way to get more Bluey more quickly is if they create their own stories starring the Brisbane-based family of blue heeler dogs. The London-based developer and father used OpenAI's latest tool, customizable bots called GPTs, to create a story generator for his young daughter. The bot, which he calls Bluey-GPT, begins each session by asking people their name, age, and a bit about their day, then churns out personalized tales starring Bluey and her sister Bingo.

bedtime story, openai, warner, (5 more...)

WIRED

Country:

Oceania > Australia (0.30)
Europe > United Kingdom (0.08)
North America > United States (0.06)

Industry: Law (0.33)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

Add feedback

Sam Altman's Knack for Dodging Bullets---With a Little Help From Bigshot Friends

WSJ.com: WSJD - TechnologyDec-24-2023, 10:29:00 GMT

The OpenAI CEO lost the confidence of top leaders in the three organizations he has directed, yet each time he's rebounded to greater heights

bigshot friend, dodging bullet, sam altman, (1 more...)

WSJ.com: WSJD - Technology

Country: North America > United States > California (0.40)

Industry: Information Technology (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.85)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.85)

Add feedback

An In-depth Look at Gemini's Language Abilities

Akter, Syeda Nahida, Yu, Zichun, Muhamed, Aashiq, Ou, Tianyue, Bäuerle, Alex, Cabrera, Ángel Alexander, Dholakia, Krish, Xiong, Chenyan, Neubig, Graham

arXiv.org Artificial IntelligenceDec-24-2023

The recently released Google Gemini class of models are the first to comprehensively report results that rival the OpenAI GPT series across a wide variety of tasks. In this paper, we do an in-depth exploration of Gemini's language abilities, making two contributions. First, we provide a third-party, objective comparison of the abilities of the OpenAI GPT and Google Gemini models with reproducible code and fully transparent results. Second, we take a closer look at the results, identifying areas where one of the two model classes excels. We perform this analysis over 10 datasets testing a variety of language abilities, including reasoning, answering knowledge-based questions, solving math problems, translating between languages, generating code, and acting as instruction-following agents. From this analysis, we find that Gemini Pro achieves accuracy that is close but slightly inferior to the corresponding GPT 3.5 Turbo on all English-language tasks that we benchmarked, but find that Gemini Pro excels in translation into other languages for the languages that it supports. We further provide explanations for some of the under-performing tasks, including failures in mathematical reasoning with many digits, sensitivity to multiple-choice answer ordering, and others. We also identify areas where Gemini Pro demonstrates comparably high performance, such as handling longer and more complex reasoning chains.

gemini, gpt 3, turbo, (12 more...)

arXiv.org Artificial Intelligence

2312.11444

Country: North America > United States > California > San Diego County > San Diego (0.04)

Genre: Research Report (0.64)

Industry: Education > Curriculum > Subject-Specific Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.55)

Add feedback

Elon Musk promised an anti-'woke' chatbot. It's not going as planned.

Washington Post - Technology NewsDec-23-2023, 13:00:47 GMT

Artificial intelligence systems of all kinds are prone to biases ingrained in their design or the data they've learned from. In the past year, the rise of OpenAI's ChatGPT and other AI chatbots and image generators has sparked debate over how they represent minority groups or respond to prompts about politics and culture-war issues such as race and gender identity. While many tech ethicists and AI experts warn that these systems can absorb and reinforce harmful stereotypes, efforts by tech firms to counter those tendencies have provoked a backlash from some on the right who see them as overly censorial.

chatbot, elon musk

Washington Post - Technology News

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

"King of the cannibals": How Sam Altman took over Silicon Valley

Washington Post - Technology NewsDec-23-2023, 11:02:42 GMT

He and Elon Musk, the founder of Tesla and owner of what used to be Twitter, created OpenAI as a nonprofit with the aim of warning and protecting the world against a technology Musk believed could wipe out humanity by accident. Altman appeared to agree: "Development of superhuman machine intelligence is probably the greatest threat to the continued existence of humanity," he wrote on his personal blog before the company's launch in 2015, adding that it "does not have to be the inherently evil sci-fi version to kill us all." But the technology's promise was too brilliant to pass up. It just needed the right regulation, and he wanted to set up a global governing board to erect boundaries for the tool's use.

cannibal, sam altman, silicon valley, (3 more...)

Washington Post - Technology News

Country: North America > United States > California (0.40)

Industry: Information Technology (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.68)

Add feedback

OpenAI founder Sam Altman's sprawling network of investments

Washington Post - Technology NewsDec-23-2023, 11:00:42 GMT

The self-driving car company went through Y Combinator when Altman worked there, and he made a personal investment in 2015. The next year, General Motors acquired the start-up. Cruise is now one of the most prominent self-driving car companies, and it was the first to provide a driverless ride-hailing service in San Francisco. But the company is now in crisis. In October, a human driver hit a pedestrian, flinging her into the path of a Cruise car, which then rolled over the person and dragged her for 20 feet. California authorities accused Cruise of trying to cover up the details of the accident.

investment, openai founder sam altman, self-driving car company, (1 more...)

Washington Post - Technology News

Country: North America > United States > California > San Francisco County > San Francisco (0.31)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks > Manufacturer (0.96)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

Apple is reportedly looking to team up with news publishers to train its AI

EngadgetDec-23-2023, 07:43:48 GMT

Apple has been noticeably missing in the list of companies with their own generative AI product, but based on a new report by The New York Times, it's looking to change that real soon. In recent weeks, Apple has reportedly started negotiating with major publishers and news organizations to ask for permission to use their content to train the generative AI system it's developing. The company doesn't expect to get its hands on their content for free, though, and The Times says it's offering them multi-year deals worth at least $50 million for access to their news archives. Apparently, some of the publishers it approached are concerned about the repercussions of letting Apple use their news articles throughout the years. They think a broad licensing deal for their archives could lead to legal issues along the way.

apple, news publisher, publisher, (3 more...)

Engadget

Industry:

Law (1.00)
Media > Publishing (0.80)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.69)

Add feedback

A Survey on Generative Diffusion Model

Cao, Hanqun, Tan, Cheng, Gao, Zhangyang, Xu, Yilun, Chen, Guangyong, Heng, Pheng-Ann, Li, Stan Z.

arXiv.org Artificial IntelligenceDec-23-2023

Deep generative models have unlocked another profound realm of human creativity. By capturing and generalizing patterns within data, we have entered the epoch of all-encompassing Artificial Intelligence for General Creativity (AIGC). Notably, diffusion models, recognized as one of the paramount generative models, materialize human ideation into tangible instances across diverse domains, encompassing imagery, text, speech, biology, and healthcare. To provide advanced and comprehensive insights into diffusion, this survey comprehensively elucidates its developmental trajectory and future directions from three distinct angles: the fundamental formulation of diffusion, algorithmic enhancements, and the manifold applications of diffusion. Each layer is meticulously explored to offer a profound comprehension of its evolution. Structured and summarized approaches are presented in https://github.com/chq1155/A-Survey-on-Generative-Diffusion-Model.

diffusion model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2209.02646

Country:

Asia > China (0.46)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)
Energy > Oil & Gas > Upstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.66)

Add feedback

Dual Use Concerns of Generative AI and Large Language Models

Grinbaum, Alexei, Adomaitis, Laurynas

arXiv.org Artificial IntelligenceDec-23-2023

Gif-sur-Yvette 91191 Abstract We suggest the implementation of the Dual Use Research of Concern (DURC) framework, originally designed for life sciences, to the domain of generative AI, with a specific focus on Large Language Models (LLMs). With its demonstrated advantages and drawbacks in biological research, we believe the DURC criteria can be effectively redefined for LLMs, potentially contributing to improved AI governance. Acknowledging the balance that must be struck when employing the DURC framework, we highlight its crucial political role in enhancing societal awareness of the impact of generative AI. As a final point, we offer a series of specific recommendations for applying the DURC approach to LLM research. Keywords: Dual Use Research of Concern (DURC), Generative AI, Large Language Models (LLMs), AI Ethics Conflict of interest No conflict of interest to report. Funding This research was supported through projects TechEthos (grant number 101006249) and MultiRATE (grant number 101073929) funded by the European Commission Horizon program. Ethics approval No human subjects were involved in the study. Consent No data needing consent has been used. Data availability statement In this article, we do not analyze or generate any datasets. Author Contribution All authors contributed to the study conception and design. Sections 1 and 4 were written with equal contribution. Sections 2 and 3 were conceived by Adomaitis and later edited by Grinbaum.

language model, llm, responsibility, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1080/23299460.2024.2304381

2305.07882

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > New York > Monroe County > Rochester (0.04)
North America > United States > District of Columbia > Washington (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry:

Media (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback