AITopics

2503.00407

Country:

Europe > United Kingdom (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning > Rote Learning (0.41)

WIREDFeb-28-2025, 20:00:00 GMT

So You Bought a Humane Ai Pin. Here's What You Can Do Next

As of today, the Humane Ai Pin is dead--less than a year since its launch. Following an acquisition by HP, Humane shut down many of the core features of the artificial intelligence-powered wearable and deleted user data, rendering it useless. Yes, some functions remain, like checking battery life (useful!), but you can't access the voice assistant. If you spent 700 on the Ai Pin, you might be wondering what you can do now. These are the risks of being an early adopter, but not getting a refund on a device bricked before the warranty is even up feels like a rip-off.

artificial intelligence, consumer, humane ai pin, (7 more...)

WIRED

Country: North America > United States (0.21)

Industry: Law > Business Law (0.38)

Technology: Information Technology > Artificial Intelligence (0.93)

The Japan TimesFeb-28-2025, 08:46:00 GMT

New bill lets government publicize names of firms who maliciously use AI

The government at a Cabinet meeting Friday adopted a bill allowing it to investigate businesses, give them guidance and disclose, as needed, their names in cases of human rights abuses and other malicious activities related to the use of artificial intelligence (AI). The government hopes that the bill, which is aimed at balancing AI development and measures to deal with risks related to the new technology, will be passed into law during the current ordinary session of parliament. The legislation is expected to "enhance the effectiveness of risk countermeasures, including through investigations into cases where people's rights and interests have been infringed," science and technology policy minister Minoru Kiuchi told a news conference while noting that the bill does not include "excessive regulations" that could impede technological innovation.

artificial intelligence, government publicize name, maliciously use ai

The Japan Times

Country: Asia > Japan (0.40)

Industry:

Law > Statutes (1.00)
Government (1.00)
Law > Civil Rights & Constitutional Law (0.69)

Technology: Information Technology > Artificial Intelligence (1.00)

More of the Same: Persistent Representational Harms Under Increased Representation

Mickel, Jennifer, De-Arteaga, Maria, Liu, Leqi, Tian, Kevin

To recognize and mitigate the harms of generative AI systems, it is crucial to consider who is represented in the outputs of generative AI systems and how people are represented. A critical gap emerges when naively improving who is represented, as this does not imply bias mitigation efforts have been applied to address how people are represented. We critically examined this by investigating gender representation in occupation across state-of-the-art large language models. We first show evidence suggesting that over time there have been interventions to models altering the resulting gender distribution, and we find that women are more represented than men when models are prompted to generate biographies or personas. We then demonstrate that representational biases persist in how different genders are represented by examining statistically significant word differences across genders. This results in a proliferation of representational harms, stereotypes, and neoliberalism ideals that, despite existing interventions to increase female representation, reinforce existing systems of oppression.

large language model, machine learning, natural language, (16 more...)

2503.00333

Country:

North America > United States > Texas > Travis County > Austin (0.14)
Europe > United Kingdom > Scotland (0.14)
South America (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area (0.68)
Government > Regional Government > North America Government > United States Government (0.68)
Education > Educational Setting > Higher Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.54)

SuperRAG: Beyond RAG with Layout-Aware Graph Modeling

Yang, Jeff, Vu, Duy-Khanh, Nguyen, Minh-Tien, Nguyen, Xuan-Quang, Nguyen, Linh, Le, Hung

This paper introduces layout-aware graph modeling for multimodal RAG. Different from traditional RAG methods that mostly deal with flat text chunks, the proposed method takes into account the relationship of multimodalities by using a graph structure. To do that, a graph modeling structure is defined based on document layout parsing. The structure of an input document is retained with the connection of text chunks, tables, and figures. This representation allows the method to handle complex questions that require information from multimodalities. To confirm the efficiency of the graph modeling, a flexible RAG pipeline is developed using robust components. Experimental results on four benchmark test sets confirm the contribution of the layout-aware modeling for performance improvement of the RAG pipeline.

arxiv preprint arxiv, information, pipeline, (14 more...)

2503.0479

Country:

Oceania > Australia (0.04)
Europe > Middle East > Cyprus (0.04)
Asia > Vietnam > Hanoi > Hanoi (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry: Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Sesodia, Magnus, Petrova, Alina, Armour, John, Lukasiewicz, Thomas, Camburu, Oana-Maria, Dokania, Puneet K., Torr, Philip, de Witt, Christian Schroeder

AnnoCaseLaw: A Richly-Annotated Dataset For Benchmarking Explainable Legal Judgment Prediction

Legal systems worldwide continue to struggle with overwhelming caseloads, limited judicial resources, and growing complexities in legal proceedings. Artificial intelligence (AI) offers a promising solution, with Legal Judgment Prediction (LJP) -- the practice of predicting a court's decision from the case facts -- emerging as a key research area. However, existing datasets often formulate the task of LJP unrealistically, not reflecting its true difficulty. They also lack high-quality annotation essential for legal reasoning and explainability. To address these shortcomings, we introduce AnnoCaseLaw, a first-of-its-kind dataset of 471 meticulously annotated U.S. Appeals Court negligence cases. Each case is enriched with comprehensive, expert-labeled annotations that highlight key components of judicial decision making, along with relevant legal concepts. Our dataset lays the groundwork for more human-aligned, explainable LJP models. We define three legally relevant tasks: (1) judgment prediction; (2) concept identification; and (3) automated case annotation, and establish a performance baseline using industry-leading large language models (LLMs). Our results demonstrate that LJP remains a formidable task, with application of legal precedent proving particularly difficult. Code and data are available at https://github.com/anonymouspolar1/annocaselaw.

computational linguistic, dataset, prediction, (14 more...)

2503.00128

Country:

North America > United States > Illinois (0.05)
North America > United States > New Mexico (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
(18 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law > Litigation (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

Kumar, Komal, Ashraf, Tajamul, Thawakar, Omkar, Anwer, Rao Muhammad, Cholakkal, Hisham, Shah, Mubarak, Yang, Ming-Hsuan, Torr, Phillip H. S., Khan, Salman, Khan, Fahad Shahbaz

Large Language Models (LLMs) have transformed the natural language processing landscape and brought to life diverse applications. Pretraining on vast web-scale data has laid the foundation for these models, yet the research community is now increasingly shifting focus toward post-training techniques to achieve further breakthroughs. While pretraining provides a broad linguistic foundation, post-training methods enable LLMs to refine their knowledge, improve reasoning, enhance factual accuracy, and align more effectively with user intents and ethical considerations. Fine-tuning, reinforcement learning, and test-time scaling have emerged as critical strategies for optimizing LLMs performance, ensuring robustness, and improving adaptability across various real-world tasks. This survey provides a systematic exploration of post-training methodologies, analyzing their role in refining LLMs beyond pretraining, addressing key challenges such as catastrophic forgetting, reward hacking, and inference-time trade-offs. We highlight emerging directions in model alignment, scalable adaptation, and inference-time reasoning, and outline future research directions. We also provide a public repository to continually track developments in this fast-evolving field: https://github.com/mbzuai-oryx/Awesome-LLM-Post-training.

arxiv preprint arxiv, language model, reasoning, (14 more...)

2502.21321

Country:

North America > United States > Florida > Orange County > Orlando (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(11 more...)

Genre:

Research Report (1.00)
Overview (1.00)
Workflow (0.93)
Instructional Material (0.92)

Industry:

Leisure & Entertainment > Games (1.00)
Health & Medicine (1.00)
Information Technology > Security & Privacy (0.92)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Identifying Emerging Concepts in Large Corpora

Ma, Sibo, Nyarko, Julian

We introduce a new method to identify emerging concepts in large text corpora. By analyzing changes in the heatmaps of the underlying embedding space, we are able to detect these concepts with high accuracy shortly after they originate, in turn outperforming common alternatives. We further demonstrate the utility of our approach by analyzing speeches in the U.S. Senate from 1941 to 2015. Our results suggest that the minority party is more active in introducing new concepts into the Senate discourse. We also identify specific concepts that closely correlate with the Senators' racial, ethnic, and gender identities. An implementation of our method is publicly available.

computational linguistic, new concept, proceedings, (16 more...)

2502.21315

Country:

Asia > Russia (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Spain (0.04)
(20 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law > Statutes (1.00)
Law > Environmental Law (1.00)
Law > Civil Rights & Constitutional Law (1.00)
(5 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Security & Privacy (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
(3 more...)

Birti, Mattia, Osborne, Francesco, Maurino, Andrea

Optimizing Large Language Models for ESG Activity Detection in Financial Texts

The integration of Environmental, Social, and Governance (ESG) factors into corporate decision-making is a fundamental aspect of sustainable finance. However, ensuring that business practices align with evolving regulatory frameworks remains a persistent challenge. AI-driven solutions for automatically assessing the alignment of sustainability reports and non-financial disclosures with specific ESG activities could greatly support this process. Y et, this task remains complex due to the limitations of general-purpose Large Language Models (LLMs) in domain-specific contexts and the scarcity of structured, high-quality datasets. In this paper, we investigate the ability of current-generation LLMs to identify text related to environmental activities. Furthermore, we demonstrate that their performance can be significantly enhanced through fine-tuning on a combination of original and synthetically generated data. T o this end, we introduce ESG-Activities, a benchmark dataset containing 1,325 labeled text segments classified according to the EU ESG taxonomy. Our experimental results show that fine-tuning on ESG-Activities significantly enhances classification accuracy, with open models such as Llama 7B and Gemma 7B outperforming large proprietary solutions in specific configurations. These findings have important implications for financial analysts, policymakers, and AI researchers seeking to enhance ESG transparency and compliance through advanced natural language processing techniques. N recent years, driven by the widespread adoption of the Sustainable Development Goals (SDGs), the European Union has introduced principles and regulations aimed at helping organizations integrate environmental, social, and governance (ESG) factors into their operations and strategic decision-making. These initiatives encourage businesses and investors to assess and improve their environmental impact, fostering a more sustainable approach to economic activity [1]. This resource enables companies to evaluate their activities in alignment with its criteria and report their performance in non-financial disclosures and sustainability reports.

dataset, fine-tuning, language model, (12 more...)

2502.21112

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Lombardy > Milan (0.04)
Europe > United Kingdom > England > Buckinghamshire > Milton Keynes (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Social Sector (1.00)
Banking & Finance > Trading (0.68)
Information Technology > Security & Privacy (0.67)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Bertolotti, Francesco, Mari, Luca

An LLM-based Delphi Study to Predict GenAI Evolution

Predicting the future trajectory of complex and rapidly evolving systems remains a significant challenge, particularly in domains where data is scarce or unreliable. This study introduces a novel approach to qualitative forecasting by leveraging Large Language Models to conduct Delphi studies. The methodology was applied to explore the future evolution of Generative Artificial Intelligence, revealing insights into key factors such as geopolitical tensions, economic disparities, regulatory frameworks, and ethical considerations. The results highlight how LLM-based Delphi studies can facilitate structured scenario analysis, capturing diverse perspectives while mitigating issues such as respondent fatigue. However, limitations emerge in terms of knowledge cutoffs, inherent biases, and sensitivity to initial conditions. While the approach provides an innovative means for structured foresight, this method could be also considered as a novel form of reasoning. further research is needed to refine its ability to manage heterogeneity, improve reliability, and integrate external data sources.

agent, experiment, genai, (16 more...)

2502.21092

Country:

Europe > Italy (0.04)
North America > United States > Massachusetts > Middlesex County > Reading (0.04)
Europe > Greece > Central Macedonia > Thessaloniki (0.04)

Genre:

Research Report > Promising Solution (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)