AITopics

2509.01301

Country:

Asia (1.00)
North America > United States (0.94)
Europe (0.67)

Genre:

Overview (0.68)
Research Report (0.64)
Questionnaire & Opinion Survey (0.46)

Industry: Education (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Wang, Jennifer, Huang, Kayla, Klyman, Kevin, Bommasani, Rishi

Do AI Companies Make Good on Voluntary Commitments to the White House?

arXiv.org Artificial Intelligence, Sep-25-2025

Voluntary commitments are central to international AI governance, as demonstrated by recent voluntary guidelines from the White House to the G7, from Bletchley Park to Seoul. How do major AI companies make good on their commitments? We score companies based on their publicly disclosed behavior by developing a detailed rubric based on their eight voluntary commitments to the White House in 2023. We find significant heterogeneity: while the highest-scoring company (OpenAI) scores 83% overall on our rubric, the average score across all companies is just 53%. The companies demonstrate systemically poor performance for their commitment to model weight security with an average score of 17%: 11 of the 16 companies receive 0% for this commitment. Our analysis highlights a clear structural shortcoming that future AI governance initiatives should correct: when companies make public commitments, they should proactively disclose how they meet their commitments to provide accountability, and these disclosures should be verifiable. To advance policymaking on corporate AI governance, we provide three directed recommendations that address underspecified commitments, the role of complex AI supply chains, and public transparency that could be applied towards AI governance initiatives worldwide.

information, large language model, machine learning, (21 more...)

2508.08345

Country:

North America > United States > California (0.28)
Asia > South Korea > Seoul > Seoul (0.24)
Europe > United Kingdom > England > Buckinghamshire > Milton Keynes (0.24)

Genre:

Overview (0.92)
Research Report > New Finding (0.45)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.70)

Communications of the ACM, Sep-24-2025, 14:36:48 GMT

Quantum Computing Research in the Arab World

Quantum computing research topics from the Arab world include quantum machine learning and location-tracking and spatial systems. Quantum computing (QC) is one of the most transformative scientific and technological advances of the 21st century, introducing entirely new paradigms for solving computational problems that have long been considered intractable for classical systems. By using the principles of quantum mechanics (superposition, entanglement, and interference), QC has the potential to tackle challenges in fields such as optimization, cryptography, materials science, artificial intelligence, and many others, offering solutions that go beyond the capabilities of conventional computing frameworks. Though the field is still in its developmental stages, progress is being made worldwide, expanding its scope and potential impact.

algorithm, application, youssef, (13 more...)

Communications of the ACM

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.15)
North America > United States > New York (0.05)
Africa > Middle East > Morocco > Casablanca-Settat Region > Casablanca (0.05)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.05)

Genre: Overview (0.69)

Industry: Information Technology > Security & Privacy (0.92)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Neural Information Processing Systems, Sep-24-2025, 13:11:20 GMT

0a443a000e1cb2281480b3bac395b3b8-Paper-Conference.pdf

artificial intelligence, machine learning, stability, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.93)

Genre:

Research Report > New Finding (0.92)
Overview (0.86)

Industry:

Government (0.70)
Information Technology > Security & Privacy (0.55)
Energy > Oil & Gas > Upstream (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Agentic Software Engineering: Foundational Pillars and a Research Roadmap

Hassan, Ahmed E., Li, Hao, Lin, Dayi, Adams, Bram, Chen, Tse-Hsun, Kashiwa, Yutaro, Qiu, Dong

Agentic Software Engineering (SE 3.0) represents a new era where intelligent agents are tasked not with simple code generation, but with achieving complex, goal-oriented SE objectives. To harness these new capabilities while ensuring trustworthiness, we must recognize a fundamental duality within the SE field in the Agentic SE era, comprising two symbiotic modalities: SE for Humans and SE for Agents. This duality demands a radical reimagining of the foundational pillars of SE (actors, processes, tools, and artifacts) which manifest differently across each modality. We propose two purpose-built workbenches to support this vision. The Agent Command Environment (ACE) serves as a command center where humans orchestrate and mentor agent teams, handling outputs such as Merge-Readiness Packs (MRPs) and Consultation Request Packs (CRPs). The Agent Execution Environment (AEE) is a digital workspace where agents perform tasks while invoking human expertise when facing ambiguity or complex trade-offs. This bi-directional partnership, which supports agent-initiated human callbacks and handovers, gives rise to new, structured engineering activities (i.e., processes) that redefine human-AI collaboration, elevating the practice from agentic coding to true agentic software engineering. This paper presents the Structured Agentic Software Engineering (SASE) vision, outlining several of the foundational pillars for the future of SE. The paper culminates in a research roadmap that identifies a few key challenges and opportunities while briefly discussing the resulting impact of this future on SE education. Our goal is not to offer a definitive solution, but to provide a conceptual scaffold with structured vocabulary to catalyze a community-wide dialogue, pushing the SE community to think beyond its classic, human-centric tenets toward a disciplined, scalable, and trustworthy agentic future.

artificial intelligence, deep learning, machine learning, (17 more...)

2509.06216

Country: North America > Canada (0.47)

Genre:

Overview (0.93)
Instructional Material (0.92)
Workflow (0.69)
Research Report (0.64)

Industry:

Education (1.00)
Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Disambiguation in Conversational Question Answering in the Era of LLMs and Agents: A Survey

Tanjim, Md Mehrab, In, Yeonjun, Chen, Xiang, Bursztyn, Victor S., Rossi, Ryan A., Kim, Sungchul, Ren, Guang-Jie, Muppala, Vaishnavi, Jiang, Shun, Kim, Yongsung, Park, Chanyoung

Ambiguity remains a fundamental challenge in Natural Language Processing (NLP) due to the inherent complexity and flexibility of human language. With the advent of Large Language Models (LLMs), addressing ambiguity has become even more critical due to their expanded capabilities and applications. In the context of Conversational Question Answering (CQA), this paper explores the definition, forms, and implications of ambiguity for language driven systems, particularly in the context of LLMs. We define key terms and concepts, categorize various disambiguation approaches enabled by LLMs, and provide a comparative analysis of their advantages and disadvantages. We also explore publicly available datasets for benchmarking ambiguity detection and resolution techniques and highlight their relevance for ongoing research. Finally, we identify open problems and future research directions, especially in agentic settings, proposing areas for further investigation. By offering a comprehensive review of current research on ambiguities and disambiguation with LLMs, we aim to contribute to the development of more robust and reliable LLM-based systems.

ambiguity, large language model, machine learning, (17 more...)

2505.12543

Country:

Europe (0.68)
Asia > Middle East (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Early Prediction of In-Hospital ICU Mortality Using Innovative First-Day Data: A Review

Huang, Baozhu, Chen, Cheng, Hou, Xuanhe, Huang, Junmin, Wei, Zihan, Luo, Hongying, Chen, Lu, Xu, Yongzhi, Luo, Hejiao, Qin, Changqi, Bi, Ziqian, Song, Junhao, Wang, Tianyang, Liang, ChiaXin, Yu, Zizhong, Wang, Han, Sun, Xiaotian, Hao, Junfeng, Tian, Chunjie

The intensive care unit (ICU) manages critically ill patients, many of whom face a high risk of mortality. Early and accurate prediction of in-hospital mortality within the first 24 hours of ICU admission is crucial for timely clinical interventions, resource optimization, and improved patient outcomes. Traditional scoring systems, while useful, often have limitations in predictive accuracy and adaptability. Objective: This review aims to systematically evaluate and benchmark innovative methodologies that leverage data available within the first day of ICU admission for predicting in-hospital mortality. We focus on advancements in machine learning, novel biomarker applications, and the integration of diverse data types.

machine learning, natural language, prediction, (20 more...)

2505.12344

Country: Asia > China (0.94)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.69)
Research Report > Promising Solution (0.69)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

DRISHTIKON: A Multimodal Multilingual Benchmark for Testing Language Models' Understanding on Indian Culture

Maji, Arijit, Kumar, Raghvendra, Ghosh, Akash, Anushka, Shah, Nemil, Borah, Abhilekh, Shah, Vanshika, Mishra, Nishant, Saha, Sriparna

We introduce DRISHTIKON, a first-of-its-kind multimodal and multilingual benchmark centered exclusively on Indian culture, designed to evaluate the cultural understanding of generative AI systems. Unlike existing benchmarks with a generic or global scope, DRISHTIKON offers deep, fine-grained coverage across India's diverse regions, spanning 15 languages, covering all states and union territories, and incorporating over 64,000 aligned text-image pairs. The dataset captures rich cultural themes including festivals, attire, cuisines, art forms, and historical heritage amongst many more. We evaluate a wide range of vision-language models (VLMs), including open-source small and large models, proprietary systems, reasoning-specialized VLMs, and Indic-focused models, across zero-shot and chain-of-thought settings. Our results expose key limitations in current models' ability to reason over culturally grounded, multimodal inputs, particularly for low-resource languages and less-documented traditions. DRISHTIKON fills a vital gap in inclusive AI research, offering a robust testbed to advance culturally aware, multimodally competent language technologies.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

2509.19274

Country:

Asia > India (1.00)
North America > United States > Minnesota (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.48)

Industry: Leisure & Entertainment (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Daepp, Madeleine I. G., Cuevas, Alejandro, Ness, Robert Osazuwa, Wang, Vickie Yu-Ping, Nayak, Bharat Kumar, Mishra, Dibyendu, Cheng, Ti-Chung, Desai, Shaily, Pal, Joyojeet

Generative Propaganda

Generative propaganda is the use of generative artificial intelligence (AI) to shape public opinion. To characterize its use in real-world settings, we conducted interviews with defenders (e.g., factcheckers, journalists, officials) in Taiwan and creators (e.g., influencers, political consultants, advertisers) as well as defenders in India, centering two places characterized by high levels of online propaganda. The term "deepfakes", we find, exerts outsized discursive power in shaping defenders' expectations of misuse and, in turn, the interventions that are prioritized. To better characterize the space of generative propaganda, we develop a taxonomy that distinguishes between obvious versus hidden and promotional versus derogatory use. Deception was neither the main driver nor the main impact vector of AI's use; instead, Indian creators sought to persuade rather than to deceive, often making AI's use obvious in order to reduce legal and reputational risks, while Taiwan's defenders saw deception as a subset of broader efforts to distort the prevalence of strategic narratives online. AI was useful and used, however, in producing efficiency gains in communicating across languages and modes, and in evading human and algorithmic detection. Security researchers should reconsider threat models to clearly differentiate deepfakes from promotional and obvious uses, to complement and bolster the social factors that constrain misuse by internal actors, and to counter efficiency gains globally.

deepfake, machine learning, natural language, (22 more...)

2509.19147

Country:

Asia > India (1.00)
Europe > United Kingdom > England (0.28)
North America > United States > Michigan (0.28)
North America > United States > Illinois (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.93)

Industry:

Media > News (1.00)
Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Regional Government > Asia Government (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.37)

Shaikewitz, Lorenzo, Nguyen, Tim, Carlone, Luca

Category-Level Object Shape and Pose Estimation in Less Than a Millisecond

Object shape and pose estimation is a foundational robotics problem, supporting tasks from manipulation to scene understanding and navigation. We present a fast local solver for shape and pose estimation which requires only category-level object priors and admits an efficient certificate of global optimality. Given an RGB-D image of an object, we use a learned front-end to detect sparse, category-level semantic keypoints on the target object. We represent the target object's unknown shape using a linear active shape model and pose a maximum a posteriori optimization problem to solve for position, orientation, and shape simultaneously. Expressed in unit quaternions, this problem admits first-order optimality conditions in the form of an eigenvalue problem with eigenvector nonlinearities. Our primary contribution is to solve this problem efficiently with self-consistent field iteration, which only requires computing a 4-by-4 matrix and finding its minimum eigenvalue-vector pair at each iterate. Solving a linear system for the corresponding Lagrange multipliers gives a simple global optimality certificate. One iteration of our solver runs in about 100 microseconds, enabling fast outlier rejection. We test our method on synthetic data and a variety of real-world settings, including two public datasets and a drone tracking scenario. Code is released at https://github.com/MIT-SPARK/Fast-ShapeAndPose.
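As a rough illustration of the self-consistent field (SCF) idea mentioned in the abstract above, the sketch below runs SCF on a toy 4-by-4 problem: at each iterate it rebuilds a symmetric matrix that depends on the current unit vector and replaces the vector with the eigenvector of the smallest eigenvalue. The matrix `A0`, the nonlinearity in `build_matrix`, and the function names are all assumptions made for illustration; this is not the authors' solver, and the real matrix would be derived from the keypoints and shape model rather than from these placeholder values.

```python
import numpy as np

# Hypothetical fixed part of the 4x4 matrix (placeholder values, not from the paper).
A0 = np.array([
    [1.0, 0.1, 0.0, 0.0],
    [0.1, 2.0, 0.1, 0.0],
    [0.0, 0.1, 3.0, 0.1],
    [0.0, 0.0, 0.1, 4.0],
])

def build_matrix(q):
    """Toy eigenvector nonlinearity: the matrix depends on the current iterate q."""
    return A0 + 0.1 * np.diag(q ** 2)

def scf_iteration(q0, tol=1e-12, max_iters=100):
    """Repeat: build M(q), take its minimum eigenpair, stop when q stops changing."""
    q = q0 / np.linalg.norm(q0)
    for it in range(1, max_iters + 1):
        # eigh returns eigenvalues in ascending order with orthonormal eigenvectors.
        eigvals, eigvecs = np.linalg.eigh(build_matrix(q))
        lam, q_new = eigvals[0], eigvecs[:, 0]
        # Eigenvectors are defined up to sign; keep the sign consistent across iterates.
        if q_new @ q < 0:
            q_new = -q_new
        step = np.linalg.norm(q_new - q)
        q = q_new
        if step < tol:
            return q, lam, it
    raise RuntimeError("SCF did not converge within max_iters")

q, lam, iters = scf_iteration(np.array([0.5, 0.5, 0.5, 0.5]))
residual = np.linalg.norm(build_matrix(q) @ q - lam * q)
print(f"converged in {iters} iterations, residual = {residual:.2e}")
```

At a fixed point, `q` is a unit vector satisfying `build_matrix(q) @ q = lam * q`, which is the "eigenvalue problem with eigenvector nonlinearities" form the abstract describes. The toy nonlinearity here is deliberately weak so the iteration contracts; convergence is not guaranteed for an arbitrary choice of `build_matrix`.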

artificial intelligence, optimization problem, rotation, (18 more...)

2509.18979

Country: North America > United States > Massachusetts (0.46)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.82)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)