AITopics | Large Language Model

SAINE: Scientific Annotation and Inference Engine of Scientific Research

Rao, Susie Xi, Tu, Yilei, Egger, Peter H.

arXiv.org Artificial Intelligence | Jul-11-2023

We present SAINE, a Scientific Annotation and Inference ENgine based on a set of standard open-source software, such as Label Studio and MLflow. We show that our annotation engine can support the further development of more accurate classification. Building on our previous work on hierarchical discipline classifications, we demonstrate SAINE's application in understanding the space of scholarly publications. The user study of our annotation results shows that user input collected with the help of our system can help us better understand the classification process. We believe that our work will foster greater transparency and a better understanding of scientific research. Our annotation and inference engine can further support downstream meta-science projects. We welcome collaboration and feedback from the scientific community on these projects. The demonstration video can be accessed from https://youtu.be/yToO-G9YQK4. A live demo website is available at https://app.heartex.com/user/signup/?token=e2435a2f97449fa1 upon free registration.

category, large language model, machine learning, (23 more...)

arXiv.org Artificial Intelligence

2302.14468

Country:

Europe > Switzerland > Zürich > Zürich (0.05)
Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.82)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)

Explanation Regeneration via Information Bottleneck

Li, Qintong, Wu, Zhiyong, Kong, Lingpeng, Bi, Wei

arXiv.org Artificial Intelligence | Jul-11-2023

Explaining the black-box predictions of NLP models naturally and accurately is an important open problem in natural language generation. These free-text explanations are expected to contain sufficient and carefully selected evidence to form supportive arguments for predictions. Due to the superior generative capacity of large pretrained language models, recent work built on prompt engineering enables explanation generation without specific training. However, explanations generated through single-pass prompting often lack sufficiency and conciseness. To address this problem, we develop an information bottleneck method, EIB, to produce refined explanations that are sufficient and concise. Our approach regenerates the free-text explanation by polishing the single-pass output from the pretrained language model while retaining the information that supports the contents being explained. Experiments on two out-of-domain tasks verify the effectiveness of EIB through automatic evaluation and thoroughly conducted human evaluation.

explanation, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2212.09603

Country:

North America > United States > Michigan (0.04)
North America > United States > Arkansas > Cross County (0.04)
Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.64)

Industry: Transportation (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.54)

Meta's Threads tops 100 million users in just 5 days, Zuckerberg says

Washington Post - Technology News | Jul-10-2023, 21:53:02 GMT

Despite not being available in Europe yet because of European Union data privacy regulations, Threads has reached 100 million users faster than any other app. The speed of its growth handily beat artificial intelligence app ChatGPT, which took two months to reach that mark, according to a UBS study.

just 5, thread top 100 million, zuckerberg, (1 more...)

Washington Post - Technology News

Country: Europe (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.44)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.44)
(2 more...)

Sarah Silverman sues OpenAI and Meta over copyright infringement

Engadget | Jul-10-2023, 17:53:22 GMT

On Friday, the comedian and author, alongside novelists Christopher Golden and Richard Kadrey, filed a pair of complaints against OpenAI and Meta (via Gizmodo). Everyday pirates can access these materials through direct downloads, but perhaps more usefully for those generating large language models, many shadow libraries also make written material available in bulk torrent packages. One exhibit from Silverman's lawsuit involves an exchange between the comedian's lawyers and ChatGPT. Silverman's legal team asked the chatbot to summarize The Bedwetter, a memoir she published in 2010. The chatbot was not only able to outline entire parts of the book, but some passages it relayed appear to have been reproduced verbatim.

infringement, sarah silverman sue openai, silverman sue openai and meta, (3 more...)

Engadget

Industry:

Law > Intellectual Property & Technology Law (0.83)
Law > Litigation (0.79)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.94)

Programs to detect AI discriminate against non-native English speakers, shows study

The Guardian | Jul-10-2023, 15:00:17 GMT

Computer programs that are used to detect essays, job applications and other work generated by artificial intelligence can discriminate against people who are non-native English speakers, researchers say. Tests on seven popular AI text detectors found that articles written by people who did not speak English as a first language were often wrongly flagged as AI-generated, a bias that could have a serious impact on students, academics and job applicants. With the rise of ChatGPT, a generative AI program that can write essays, solve problems and create computer code, many teachers now consider AI detection as a "critical countermeasure to deter a 21st-century form of cheating", the researchers say, but they warn that the 99% accuracy claimed by some detectors is "misleading at best." Scientists led by James Zou, an assistant professor of biomedical data science at Stanford University, ran 91 English essays written by non-native English speakers through seven popular GPT detectors to see how well the programs performed. More than half of the essays, which were written for a widely recognised English proficiency test known as the Test of English as a Foreign Language, or TOEFL, were flagged as AI-generated, with one program flagging 98% of the essays as composed by AI.

detector, gpt detector, non-native english speaker, (13 more...)

The Guardian

Country:

Europe > Middle East > Cyprus (0.07)
North America > United States (0.06)

Genre: Research Report (0.80)

Industry: Education > Educational Setting (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.57)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.41)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.40)

Sarah Silverman sues OpenAI and Meta for copyright infringement

The Guardian | Jul-10-2023, 12:41:46 GMT

Silverman has filed the suits along with two authors, Christopher Golden and Richard Kadrey, claiming that the AI models developed by OpenAI and Meta used their work as part of their training data. Tools like ChatGPT, a highly popular chatbot, are based on large language models that are fed vast amounts of data taken from the internet in order to train them to give convincing responses to text prompts from users. The suits claim the authors' works were obtained from "shadow library" sites that have "long been of interest to the AI-training community". The OpenAI suit includes exhibits claiming that, when prompted, ChatGPT summarised three books: Silverman's The Bedwetter, Ararat by Golden, and Kadrey's Sandman Slim. The Meta suit cites multiple works by Kadrey and Golden, alongside The Bedwetter, and flags a Meta paper that indicates LLaMA's training datasets included material taken from shadow libraries the suit describes as "flagrantly illegal".

infringement, openai and meta, silverman sue openai and meta, (9 more...)

The Guardian

Country: North America > United States > Georgia (0.06)

Industry:

Law > Litigation (0.92)
Law > Intellectual Property & Technology Law (0.81)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.96)

Meta's Twitter-killer app Threads passes 100 million users in five days

Daily Mail - Science & tech | Jul-10-2023, 12:17:26 GMT

Meta Inc's Threads app, launched by Instagram and billed as a Twitter-killer, has signed up more than 100 million users in less than five days. That is according to data tracking websites on Monday, suggesting the app has smashed the record of AI tool ChatGPT for fastest-growing consumer app. While ChatGPT took two months to hit the 100 million user mark and video-sharing app TikTok took nine months, Instagram itself took two and a half years to reach that mark after its 2010 launch. Threads went live on Apple and Android app stores in 100 countries late on Wednesday (July 5), though it is not available in Europe because parent company Meta is unsure how to navigate the European Union's data privacy legislation. Meanwhile, experts have described the traffic of Elon Musk-owned Twitter as 'tanking' in the face of the new competition.

musk, twitter, zuckerberg, (9 more...)

Daily Mail - Science & tech

Country: Europe (0.56)

Industry:

Information Technology > Security & Privacy (0.91)
Law > Statutes (0.56)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Threads hits 100 million users in five-day record surge

Al Jazeera | Jul-10-2023, 11:10:12 GMT

The Threads app launched by Instagram as a rival to Twitter has seen more than 100 million users sign up in less than five days, data tracking websites said on Monday, smashing the record of artificial intelligence tool ChatGPT for the fastest-growing consumer app. While ChatGPT took two months to hit the 100-million-user mark and video-sharing app TikTok took nine months, Instagram itself took two and a half years to reach the same mark after its 2010 launch. Threads went live on Apple and Android app stores in 100 countries late on Wednesday, though it is not available in Europe due to legal issues the parent company Meta has had with the European Union's data privacy legislation. Twitter is thought to have around 200 million regular users but it has suffered repeated technical failures since Elon Musk bought the platform last year and sacked thousands of staff. Musk, who also serves as the boss of Tesla and SpaceX, has also alienated many users by introducing charges for previously free services and allowing banned right-wing accounts back on the platform.

five-day record surge, instagram, twitter, (6 more...)

Al Jazeera

Country: Europe (0.60)

Industry:

Law (1.00)
Information Technology > Services (1.00)
Information Technology > Security & Privacy (0.97)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

How AI Can Make Gaming Better for All Players

WIRED | Jul-10-2023, 11:00:00 GMT

When Google revealed Project Gameface, the company was proud to show off a hands-free, AI-powered gaming mouse that, according to its announcement, "enables people to control a computer's cursor using their head movement and facial gestures." While this may not be the first AI-based gaming tool, it was certainly one of the first to put AI in the hands of players, rather than developers. The project was inspired by Lance Carr, a quadriplegic video game streamer who utilizes a head-tracking mouse as part of his gaming setup. After his existing hardware was lost in a fire, Google stepped in to create an open source, highly configurable, low-cost alternative to expensive replacement hardware, powered by machine learning. While AI's broader existence is proving divisive, we set out to discover whether AI, when used for good, could be the future of gaming accessibility.

accessibility, gameface, implementation, (9 more...)

WIRED

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Google is testing its medical AI chatbot at the Mayo Clinic

Engadget | Jul-10-2023, 10:20:55 GMT

Google is already testing its Med-PaLM 2 AI chat technology at the Mayo Clinic and other hospitals, The Wall Street Journal has reported. It's based on the company's PaLM 2 large language model (LLM), which underpins Bard, Google's ChatGPT rival, and was launched just months ago at Google I/O. Unlike the base model, Med-PaLM 2 has been trained on questions and answers from medical licensing exams, along with a curated set of medical expert demonstrations. That gives it expertise in answering health-related questions, and it can also do labor-intensive tasks like summarizing documents and organizing research data, according to the report. During I/O, Google released a paper detailing its work on Med-PaLM 2.

google, mayo clinic, medical ai chatbot, (2 more...)

Engadget

Industry: Health & Medicine > Consumer Health (0.63)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)
