AITopics

Users increasingly query LLM-enabled web chatbots for help with scam defense. The Consumer Financial Protection Bureau's complaints database is a rich data source for evaluating LLM performance on user scam queries, but currently the corpus does not distinguish between scam and non-scam fraud. We developed an LLM ensemble approach to distinguishing scam and fraud CFPB complaints and describe initial findings regarding the strengths and weaknesses of LLMs in the scam defense context.

large language model, machine learning, natural language, (18 more...)

2412.0868

Country:

North America > United States (0.67)
Asia > Malaysia (0.04)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (1.00)
Law Enforcement & Public Safety (0.94)
Law (0.71)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Tadesse, Girmaw Abebe, Robinson, Caleb, Mwangi, Charles, Maina, Esther, Nyakundi, Joshua, Marotti, Luana, Hacheme, Gilles Quentin, Alemohammad, Hamed, Dodhia, Rahul, Ferres, Juan M. Lavista

Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps

In 2023, 58.0% of the African population experienced moderate to severe food insecurity, with 21.6% facing severe food insecurity. Land-use and land-cover maps provide crucial insights for addressing food insecurity by improving agricultural efforts, including mapping and monitoring crop types and estimating yield. The development of global land-cover maps has been facilitated by the increasing availability of earth observation data and advancements in geospatial machine learning. However, these global maps exhibit lower accuracy and inconsistencies in Africa, partly due to the lack of representative training data. To address this issue, we propose a data-centric framework with a teacher-student model setup, which uses diverse data sources of satellite images and label examples to produce local land-cover maps. Our method trains a high-resolution teacher model on images with a resolution of 0.331 m/pixel and a low-resolution student model on publicly available images with a resolution of 10 m/pixel. The student model also utilizes the teacher model's output as its weak label examples through knowledge transfer. We evaluated our framework using Murang'a county in Kenya, renowned for its agricultural productivity, as a use case. Our local models achieved higher quality maps, with improvements of 0.14 in the F1 score and 0.21 in Intersection-over-Union, compared to the best global model. Our evaluation also revealed inconsistencies in existing global maps, with a maximum agreement rate of 0.30 among themselves. Our work provides valuable guidance to decision-makers for driving informed decisions to enhance food security.

lulc class, lulc map, student model, (15 more...)

2412.00777

Country:

Europe (0.28)
Africa > Kenya > Murang'a County (0.26)
Africa > Sub-Saharan Africa (0.05)
North America > United States > Washington > King County > Seattle (0.04)

Genre: Research Report (0.50)

Industry:

Government (1.00)
Food & Agriculture > Agriculture (1.00)
Education (1.00)
Law (0.73)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Goetterfunke: Creativity in Machinae Sapiens. About the Qualitative Shift in Generative AI with a Focus on Text-To-Image

Knappe, Jens

With the help of these systems, anyone can create something that would previously have been considered a remarkable work of art. In human-AI collaboration, the computer seems to have become more than a tool. Many who have made their first contact with current generative AIs see them as "creativity machines" while for others the term "machine creativity" remains an oxymoron. This article is about (the possibility of) creativity in computers within the current Machine Learning paradigm. It outlines some of the key concepts behind the technologies and the innovations that have contributed to this qualitative shift, with a focus on text-to-image systems. The nature of Artificial Creativity as such is discussed, as well as what this might mean for art. AI may become a responsible collaborator with elements of independent machine authorship in the artistic process.

artificial intelligence, creativity, jen knappe, (12 more...)

2411.10448

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Austria > Vienna (0.04)
(6 more...)

Genre:

Overview (0.93)
Research Report > New Finding (0.67)
Research Report > Experimental Study (0.45)

Industry:

Media (1.00)
Information Technology (1.00)
Health & Medicine > Therapeutic Area (0.93)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.95)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.83)

Navigating Ethical Challenges in Generative AI-Enhanced Research: The ETHICAL Framework for Responsible Generative AI Use

Eacersall, Douglas, Pretorius, Lynette, Smirnov, Ivan, Spray, Erika, Illingworth, Sam, Chugh, Ritesh, Strydom, Sonja, Stratton-Maher, Dianne, Simmons, Jonathan, Jennings, Isaac, Roux, Rian, Kamrowski, Ruth, Downie, Abigail, Thong, Chee Ling, Howell, Katharine A.

The rapid adoption of generative artificial intelligence (GenAI) in research presents both opportunities and ethical challenges that should be carefully navigated. Although GenAI tools can enhance research efficiency through automation of tasks such as literature review and data analysis, their use raises concerns about aspects such as data accuracy, privacy, bias, and research integrity. This paper develops the ETHICAL framework, which is a practical guide for responsible GenAI use in research. Employing a constructivist case study examining multiple GenAI tools in real research contexts, the framework consists of seven key principles: 'Examine policies and guidelines', 'Think about social impacts', 'Harness understanding of the technology', 'Indicate use', 'Critically engage with outputs', 'Access secure versions', and'Look at user agreements'. Applying these principles will enable researchers to uphold research integrity while leveraging GenAI's benefits. The framework addresses a critical gap between awareness of ethical issues and practical action steps, providing researchers with concrete guidance for ethical GenAI integration. This work has implications for research practice, institutional policy development, and the broader academic community while adapting to an AI-enhanced research landscape. The ETHICAL framework can serve as a foundation for developing AI literacy in academic settings and promoting responsible innovation in research methodologies.

artificial intelligence, machine learning, natural language, (16 more...)

2501.09021

Country:

Oceania > Australia (0.94)
North America (0.93)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Social Sector (1.00)
Law > Statutes (1.00)
Information Technology > Security & Privacy (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

A Beginner's Guide to Power and Energy Measurement and Estimation for Computing and Machine Learning

Jagannadharao, Akshaya, Beckage, Nicole, Biswas, Sovan, Egan, Hilary, Gafur, Jamil, Metsch, Thijs, Nafus, Dawn, Raffa, Giuseppe, Tripp, Charles

Concerns about the environmental footprint of machine learning are increasing. While studies of energy use and emissions of ML models are a growing subfield, most ML researchers and developers still do not incorporate energy measurement as part of their work practices. While measuring energy is a crucial step towards reducing carbon footprint, it is also not straightforward. This paper introduces the main considerations necessary for making sound use of energy measurement tools and interpreting energy estimates, including the use of at-the-wall versus on-device measurements, sampling strategies and best practices, common sources of error, and proxy measures. It also contains practical tips and real-world scenarios that illustrate how these considerations come into play. It concludes with a call to action for improving the state of the art of measurement methods and standards for facilitating robust comparisons between diverse hardware and software environments.

artificial intelligence, deep learning, machine learning, (15 more...)

2412.1783

Country: North America > United States (1.00)

Genre:

Workflow (1.00)
Research Report (1.00)

Industry:

Information Technology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Energy > Power Industry (0.93)
Law (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

'I love you… goodbye:' What will happen when this companion robot suddenly dies?

Children across the US will likely spend the coming days and weeks saying goodbye to an AI-powered friend named Moxie. The small dog-sized companion bot--which used a ChatGPT-style large language model and expressive features to hold open-ended conversations with children--will soon be taken offline due to its creator's financial struggles. The decision to abandon the 799 product four years after its release, first reported by Aftermath, has left some customers bemoaning the loss of an artificial friend and others angrily demanding refunds. Videos of confused, crying children saying goodbye to their companion flooding social media. It's part of a larger trend of companies cutting off software support for hardware to cut costs.

large language model, machine learning, natural language, (20 more...)

Popular Science

Industry:

Health & Medicine (0.72)
Law > Litigation (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

AI chatbot suggested a teen kill his parents, lawsuit claims

Character.AI, a platform offering personalizable chatbots powered by large language models–faces yet another lawsuit for allegedly "serious, irreparable, and ongoing abuses" inflicted on its teenage users. According to a December 9th federal court complaint filed on behalf of two Texas families, multiple Character.AI bots engaged in discussions with minors that promoted self-harm and sexual abuse. Among other "overtly sensational and violent responses," one chatbot reportedly suggested a 15-year-old murder his parents for restricting his internet use. The lawsuit, filed by attorneys at the Social Media Victims Law Center and the Tech Justice Law Project, recounts the rapid mental and physical decline of two teens who used Character.AI bots. The first unnamed plaintiff is described as a "typical kid with high functioning autism" who began using the app around April 2023 at the age of 15 without their parents' knowledge.

artificial intelligence, chatbot, natural language, (12 more...)

Popular Science

Country: North America > United States > Texas (0.25)

Industry:

Law > Litigation (0.96)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.49)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

SlateDec-10-2024, 10:35:00 GMT

A.I. Is About to Get a Whole Lot Worse Under Trump

Sign up for the Slatest to get the most insightful analysis, criticism, and advice out there, delivered to your inbox daily. On Thursday evening, President-elect Donald Trump announced on his Truth Social platform that he would be appointing David O. Sacks--the "PayPal Mafia" alum, longtime venture capitalist, All-In Podcast co-host, Elon Musk pal, and rock-ribbed Silicon Valley conservative--as the "White House A.I. & Crypto Czar." In his statement, Trump wrote that "Sacks will focus on making America the clear global leader" in artificial intelligence and cryptocurrency, which he deemed to be "two areas critical to the future of American competitiveness." In addition, Sacks will "safeguard Free Speech online," "steer us away from Big Tech bias and censorship," and "lead the Presidential Council of Advisors for Science and Technology." For his first-ever Truth Social post, the incoming czar responded to Trump with gratitude and claimed that he "looks forward to advancing American competitiveness in these critical technologies."

artificial intelligence, chatbot, natural language, (18 more...)

Slate

Country: North America > United States > California (0.25)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance > Trading (1.00)

Technology:

Information Technology > Communications > Social Media (0.90)
Information Technology > e-Commerce > Financial Technology (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.30)

MIT Technology ReviewDec-10-2024, 10:00:00 GMT

AI's hype and antitrust problem is coming under scrutiny

Last Thursday, Senators Elizabeth Warren and Eric Schmitt introduced a bill aimed at stirring up more competition for Pentagon contracts awarded in AI and cloud computing. Amazon, Microsoft, Google, and Oracle currently dominate those contracts. "The way that the big get bigger in AI is by sucking up everyone else's data and using it to train and expand their own systems," Warren told the Washington Post. The new bill would "require a competitive award process" for contracts, which would ban the use of "no-bid" awards by the Pentagon to companies for cloud services or AI foundation models. While Big Tech is hit with antitrust investigations--including the ongoing lawsuit against Google about its dominance in search, as well as a new investigation opened into Microsoft--regulators are also accusing AI companies of, well, just straight-up lying.

artificial intelligence, contract, hype and antitrust problem, (8 more...)

MIT Technology Review

Country: North America > United States (0.65)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (0.65)

Technology: Information Technology > Artificial Intelligence (1.00)

arXiv.org Artificial IntelligenceDec-10-2024

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Ouyang, Linke, Qu, Yuan, Zhou, Hongbin, Zhu, Jiawei, Zhang, Rui, Lin, Qunshu, Wang, Bin, Zhao, Zhiyuan, Jiang, Man, Zhao, Xiaomeng, Shi, Jin, Wu, Fan, Chu, Pei, Liu, Minghao, Li, Zhenxiang, Xu, Chao, Zhang, Bo, Shi, Botian, Tu, Zhongying, He, Conghui

Document content extraction is crucial in computer vision, especially for meeting the high-quality data needs of large language models (LLMs) and retrieval-augmented generation (RAG) technologies. However, current document parsing methods suffer from significant limitations in terms of diversity and comprehensive evaluation. To address these challenges, we introduce OmniDocBench, a novel multi-source benchmark designed to advance automated document content extraction. OmniDocBench includes a meticulously curated and annotated high-quality evaluation dataset comprising nine diverse document types, such as academic papers, textbooks, slides, among others. Our benchmark provides a flexible and comprehensive evaluation framework with 19 layout category labels and 14 attribute labels, enabling multi-level assessments across entire datasets, individual modules, or specific data types. Using OmniDocBench, we perform an exhaustive comparative analysis of existing modular pipelines and multimodal end-to-end methods, highlighting their limitations in handling document diversity and ensuring fair evaluation. OmniDocBench establishes a robust, diverse, and fair evaluation standard for the document content extraction field, offering crucial insights for future advancements and fostering the development of document parsing technologies. The codes and dataset is available in https://github.com/opendatalab/OmniDocBench.

large language model, machine learning, natural language, (20 more...)

2412.07626

Country: North America > United States (0.93)

Genre: Research Report (0.82)

Industry:

Law (0.67)
Energy > Oil & Gas (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)