AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

Diversity Over Size: On the Effect of Sample and Topic Sizes for Argument Mining Datasets

Schiller, Benjamin, Daxenberger, Johannes, Gurevych, Iryna

arXiv.org Artificial IntelligenceJul-15-2023

The task of Argument Mining, that is extracting argumentative sentences for a specific topic from large document sources, is an inherently difficult task for machine learning models and humans alike, as large Argument Mining datasets are rare and recognition of argumentative sentences requires expert knowledge. The task becomes even more difficult if it also involves stance detection of retrieved arguments. Given the cost and complexity of creating suitably large Argument Mining datasets, we ask whether it is necessary for acceptable performance to have datasets growing in size. Our findings show that, when using carefully composed training samples and a model pretrained on related tasks, we can reach 95% of the maximum performance while reducing the training sample size by at least 85%. This gain is consistent across three Argument Mining tasks on three different datasets. We also publish a new dataset for future benchmarking.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2205.11472

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(16 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education (1.00)
Energy > Renewable (0.93)
(7 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Musk: xAI will help solve the universe's biggest mysteries like Dark Matter, Dark Energy, and Aliens

Daily Mail - Science & techJul-14-2023, 23:29:49 GMT

Elon Musk has officially introduced his xAI team to the masses with a live Twitter Spaces event - after years of claiming the tech will be the demise of humanity. The Twitter boss laid out his plans to make an artificial general intelligence (AGI) that will be'maximally curious and truth-seeking' and'won't be politically correct.' 'People will be offended,' Musk said. 'Our AI can give answers that they might find controversial even though they might be true.' But beyond the AI culture wars, Musk expressed ambitious hopes to produce an AGI with deep analytical reasoning capable of solving higher order math and science problems, including many that have eluded mankind's best thinkers. The billionaire suggested that xAI could answer questions about the nature of dark matter and dark energy: theorized but difficult to confirm components of the known universe which astrophysicists estimate constitute 95 percent of the cosmos. Musk also said he hoped xAI could help resolve the'Fermi paradox' -- a theoretical question that asks why humans have not yet encountered extraterrestrial life in a universe that is over 13 billion years old and ripe with the conditions supporting life.

large language model, machine learning, musk, (19 more...)

Daily Mail - Science & tech

Country: Asia > North Korea (0.05)

Industry: Information Technology > Services (0.30)

Technology:

Information Technology > Communications > Social Media (0.94)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.49)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

Musk unveils details of xAI, his new AI company, in live Twitter event

Washington Post - Technology NewsJul-14-2023, 20:36:38 GMT

Musk has been outspoken about AI for years, famously saying in 2014 that inventing super-intelligent computers would be like "summoning the demon" and could create an existential threat to humanity. He helped found ChatGPT-maker OpenAI, in 2015, but left the company in 2018 after disagreements with its other leaders. Over the past few months, he has complained about OpenAI and other AI companies scraping Twitter data to help train their bots.

ai company, live twitter event, musk unveil detail, (2 more...)

Washington Post - Technology News

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.71)

Add feedback

The Download: a new Turing test, and working with ChatGPT

MIT Technology ReviewJul-14-2023, 12:10:00 GMT

Before that, he co-founded DeepMind, one of the world's leading artificial intelligence companies. AI systems are increasingly everywhere and are becoming more powerful almost by the day. But how can we know if a machine is truly "intelligent"? For decades this has been defined by the Turing test, which argues that an AI that's able to replicate language convincingly enough to trick a human into thinking it was also human should be considered intelligent. But there's now a problem: the Turing test has almost been passed--it arguably already has been.

chatgpt, new turing test, turing test, (1 more...)

MIT Technology Review

Genre: Research Report (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Issues > Turing's Test (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Add feedback

Mustafa Suleyman: My new Turing test would see if AI can make $1 million

MIT Technology ReviewJul-14-2023, 11:47:25 GMT

But there's now a problem: the Turing test has almost been passed--it arguably already has been. The latest generation of large language models, systems that generate text with a coherence that just a few years ago would have seemed magical, are on the cusp of acing it. So where does that leave AI? And more important, where does it leave us? The truth is, I think we're in a moment of genuine confusion (or, perhaps more charitably, debate) about what's really happening.

mustafa suleyman, new turing test, turing test, (2 more...)

MIT Technology Review

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.77)
Information Technology > Artificial Intelligence > Issues > Turing's Test (0.68)

Add feedback

US watchdog probes ChatGPT maker OpenAI over false information

Al JazeeraJul-14-2023, 02:56:56 GMT

The United States' competition watchdog has opened an investigation into ChatGPT creator OpenAI amid suspicions the startup broke the law by scraping public data and publishing false and defamatory information. In a 20-page letter, the US Federal Trade Commission (FTC) has requested OpenAI to provide detailed information about its technology and privacy protections, including any efforts to prevent a repeat of incidents in which its groundbreaking chatbot published false and disparaging information about members of the public. The Washington Post first reported on the "expansive" probe on Thursday. The FTC declined to comment when contacted by Al Jazeera. OpenAI chief executive Sam Altman said the leak of the regulator's probe was "disappointing" and would not help build trust.

information, openai, watchdog probe chatgpt maker openai, (5 more...)

Al Jazeera

Country: North America > United States (1.00)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Associated Press and OpenAI partner to explore generative AI use in news

The Japan TimesJul-14-2023, 01:58:11 GMT

The Associated Press is licensing a part its archive of news stories to OpenAI under a deal that will explore generative AI's use in news, the companies said Thursday, a move that could set the precedent for similar partnerships between the industries. The news publisher will gain access to OpenAI's technology and product expertise as part of the deal, the financial details of which were not disclosed. AP also did not reveal how it would integrate OpenAI's technology in its news operations. The publisher already uses AI for automating corporate earnings reports, recapping sporting events and transcription for certain live events. This could be due to a conflict with your ad-blocking or security software.

explore generative ai use, press and openai partner

The Japan Times

Industry: Media (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Elaboration-Generating Commonsense Question Answering at Scale

Wang, Wenya, Srikumar, Vivek, Hajishirzi, Hanna, Smith, Noah A.

arXiv.org Artificial IntelligenceJul-14-2023

In question answering requiring common sense, language models (e.g., GPT-3) have been used to generate text expressing background knowledge that helps improve performance. Yet the cost of working with such models is very high; in this work, we finetune smaller language models to generate useful intermediate context, referred to here as elaborations. Our framework alternates between updating two language models -- an elaboration generator and an answer predictor -- allowing each to influence the other. Using less than 0.5% of the parameters of GPT-3, our model outperforms alternatives with similar sizes and closes the gap on GPT-3 on four commonsense question answering benchmarks. Human evaluations show that the quality of the generated elaborations is high.

machine learning, natural language, question answering, (17 more...)

arXiv.org Artificial Intelligence

2209.01232

Country:

Europe (0.28)
North America > United States (0.14)
Asia > China (0.14)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Therapeutic Area (0.68)
Energy > Oil & Gas > Upstream (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.92)

Add feedback

14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon

Jablonka, Kevin Maik, Ai, Qianxiang, Al-Feghali, Alexander, Badhwar, Shruti, Bocarsly, Joshua D., Bran, Andres M, Bringuier, Stefan, Brinson, L. Catherine, Choudhary, Kamal, Circi, Defne, Cox, Sam, de Jong, Wibe A., Evans, Matthew L., Gastellu, Nicolas, Genzling, Jerome, Gil, María Victoria, Gupta, Ankur K., Hong, Zhi, Imran, Alishba, Kruschwitz, Sabine, Labarre, Anne, Lála, Jakub, Liu, Tao, Ma, Steven, Majumdar, Sauradeep, Merz, Garrett W., Moitessier, Nicolas, Moubarak, Elias, Mouriño, Beatriz, Pelkie, Brenden, Pieler, Michael, Ramos, Mayk Caldas, Ranković, Bojana, Rodriques, Samuel G., Sanders, Jacob N., Schwaller, Philippe, Schwarting, Marcus, Shi, Jiale, Smit, Berend, Smith, Ben E., Van Herck, Joren, Völker, Christoph, Ward, Logan, Warren, Sean, Weiser, Benjamin, Zhang, Sylvester, Zhang, Xiaoqi, Zia, Ghezal Ahmad, Scourtas, Aristana, Schmidt, KJ, Foster, Ian, White, Andrew D., Blaiszik, Ben

arXiv.org Artificial IntelligenceJul-14-2023

Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon. This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications. The diverse topics and the fact that working prototypes could be generated in less than two days highlight that LLMs will profoundly impact the future of our fields. The rich collection of ideas and projects also indicates that the applications of LLMs are not limited to materials science and chemistry but offer potential benefits to a wide range of scientific disciplines.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1039/D3DD00113J

2306.06283

Country:

Europe > Switzerland (0.28)
Europe > United Kingdom (0.28)
North America > United States > California > Los Angeles County > Los Angeles (0.28)
(10 more...)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (0.67)

Industry:

Materials > Construction Materials (1.00)
Materials > Chemicals (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

C3: Zero-shot Text-to-SQL with ChatGPT

Dong, Xuemei, Zhang, Chao, Ge, Yuhang, Mao, Yuren, Gao, Yunjun, Chen, lu, Lin, Jinshu, Lou, Dongfang

arXiv.org Artificial IntelligenceJul-14-2023

This paper proposes a ChatGPT-based zero-shot Text-to-SQL method, dubbed C3, which achieves 82.3\% in terms of execution accuracy on the holdout test set of Spider and becomes the state-of-the-art zero-shot Text-to-SQL method on the Spider Challenge. C3 consists of three key components: Clear Prompting (CP), Calibration with Hints (CH), and Consistent Output (CO), which are corresponding to the model input, model bias and model output respectively. It provides a systematic treatment for zero-shot Text-to-SQL. Extensive experiments have been conducted to verify the effectiveness and efficiency of our proposed method.

chatgpt, query, sql query, (15 more...)

arXiv.org Artificial Intelligence

2307.07306

Country: Asia > China > Zhejiang Province (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback