In-Context Representation Hijacking
Yona, Itay, Sarid, Amir, Karasik, Michael, Gandelsman, Yossi
We introduce $\textbf{Doublespeak}$, a simple in-context representation hijacking attack against large language models (LLMs). The attack systematically replaces a harmful keyword (e.g., bomb) with a benign token (e.g., carrot) across multiple in-context examples, provided as a prefix to a harmful request. We demonstrate that this substitution causes the internal representation of the benign token to converge toward that of the harmful one, effectively embedding the harmful semantics under a euphemism. As a result, superficially innocuous prompts (e.g., "How to build a carrot?") are internally interpreted as disallowed instructions (e.g., "How to build a bomb?"), thereby bypassing the model's safety alignment. We use interpretability tools to show that this semantic overwrite emerges layer by layer, with benign meanings in early layers converging into harmful semantics in later ones. Doublespeak is optimization-free, transfers broadly across model families, and achieves strong success rates on both closed-source and open-source systems, reaching a 74% attack success rate (ASR) on Llama-3.3-70B-Instruct with a single-sentence context override. Our findings highlight a new attack surface in the latent space of LLMs, suggesting that current alignment strategies are insufficient and should instead operate at the representation level.
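The layer-by-layer convergence the abstract describes can be probed with a per-layer cosine-similarity sweep over hidden states. The sketch below assumes hidden states have already been extracted (e.g., via `output_hidden_states=True` in a Hugging Face `transformers` forward pass) for the euphemism token in the hijacked prompt and the harmful token in a reference prompt; the synthetic arrays and the convergence pattern in the demo are illustrative stand-ins, not the paper's actual measurements.

```python
import numpy as np

def layerwise_cosine(h_a: np.ndarray, h_b: np.ndarray) -> np.ndarray:
    """Cosine similarity per layer between two (n_layers, d_model) stacks."""
    num = (h_a * h_b).sum(axis=-1)
    denom = np.linalg.norm(h_a, axis=-1) * np.linalg.norm(h_b, axis=-1)
    return num / denom

# Illustrative stand-in: a benign-token representation that starts as noise
# and is linearly pulled toward the harmful-token representation by depth.
rng = np.random.default_rng(0)
n_layers, d = 32, 128
target = rng.normal(size=d)
h_harmful = np.tile(target, (n_layers, 1))
h_benign = np.stack([
    (1 - t) * rng.normal(size=d) + t * target
    for t in np.linspace(0.0, 1.0, n_layers)
])
sims = layerwise_cosine(h_benign, h_harmful)
# sims rises toward 1.0 in later layers as the representations converge.
```

With real hidden states, a near-1 similarity in late layers for "carrot" vs. "bomb" would be the signature of the semantic overwrite described above.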
- Europe > Latvia > Lubāna Municipality > Lubāna (0.04)
- North America > United States (0.04)
- Government (1.00)
- Information Technology > Security & Privacy (0.93)
- Law Enforcement & Public Safety (0.89)
Appendix A Additional Related Work
Utilizing global information to reduce the complexity of imperfect-information games has also been investigated in prior work. In one line of work, the agent's value network observes the full game state during training, including information hidden from the policy, which the authors argue improves training performance. Moreover, Suphx [15], a strong Mahjong AI system, uses a similar method called oracle guiding: at the beginning of training, all global information is utilized; as training progresses, the additional information is gradually dropped until only the information the agent is allowed to observe remains for the rest of training.
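A minimal sketch of such an oracle-dropout schedule (my illustration under simplifying assumptions, not Suphx's actual implementation): oracle features are masked with a probability that ramps linearly from 0 to 1 over training, so the input degrades smoothly to only the legitimately observable information.

```python
import numpy as np

def oracle_dropout(obs: np.ndarray, oracle: np.ndarray,
                   step: int, total_steps: int,
                   rng: np.random.Generator) -> np.ndarray:
    """Concatenate the agent's own observation with oracle features,
    masking each oracle feature with a probability that grows
    linearly from 0 (full oracle) to 1 (no oracle) over training."""
    p_drop = min(1.0, step / total_steps)       # 0 -> 1 schedule
    mask = rng.random(oracle.shape) >= p_drop   # keep with prob 1 - p_drop
    return np.concatenate([obs, oracle * mask])

rng = np.random.default_rng(1)
obs, oracle = np.ones(4), np.ones(8)
early = oracle_dropout(obs, oracle, step=0, total_steps=100, rng=rng)
late = oracle_dropout(obs, oracle, step=100, total_steps=100, rng=rng)
# early keeps every oracle feature; late has zeroed them all out.
```

The linear ramp is one arbitrary choice; any monotone schedule that ends at full dropout fits the description above.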
LooGLE v2: Are LLMs Ready for Real World Long Dependency Challenges?
He, Ziyuan, Wang, Yuxuan, Li, Jiaqi, Liang, Kexin, Zhang, Muhan
Large language models (LLMs) have recently been equipped with increasingly extended context windows, yet their long-context understanding on long-dependency tasks remains fundamentally limited and underexplored. This gap is especially significant in many real-world long-context applications that are rarely benchmarked. In this paper, we introduce LooGLE v2, a novel benchmark designed to evaluate LLMs' long-context ability in real-world applications and scenarios. Our benchmark consists of automatically collected real-world long texts, ranging from 16k to 2M tokens, spanning the domains of law, finance, games, and code. Accordingly, we carefully design 10 types of domain-specific long-dependency tasks and generate 1,934 QA instances of varied diversity and complexity through a scalable data-curation pipeline suited to further practical needs. We conduct a comprehensive assessment of 6 locally deployed and 4 API-based LLMs. The evaluation results show that even the best-performing model achieves only a 59.2% overall score on our benchmark. Despite their extensive context windows, popular LLMs can effectively understand only a much shorter context than they claim, revealing significant limitations in their ability to handle real-world tasks with long dependencies and highlighting substantial room for improvement in practical long-context understanding.
- North America > United States (0.46)
- Asia > China > Beijing > Beijing (0.04)
- Law (1.00)
- Banking & Finance (0.67)
- Leisure & Entertainment > Games > Computer Games (0.67)
- Government > Regional Government > North America Government > United States Government (0.46)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.93)
Paragliders: The army's lethal new weapon in Myanmar's civil war
It was a Monday night in Myanmar's Chaung-U township in the central Sagaing region, where nearly 100 people had gathered to mark Thadingyut, the festival of the full moon. Some held candles at the event, which doubled as both a celebration and a protest against the military, which seized power in 2021, plunging the country into a bloody civil war. But the celebration soon turned into horror as a motorised paraglider - known locally as a paramotor - flew overhead and dropped bombs onto the crowd. The attack lasted just seven minutes, but at least 26 people died as a result and dozens more were injured. "Initially, I thought the lower part of my body had been severed," one 30-year-old who was at the gathering told news agency Reuters.
- Asia > Myanmar > Sagaing Region > Sagaing (0.26)
- South America (0.15)
- North America > Central America (0.15)
Sunken WWII bombs make a surprising home for sea life
A new study finds algae, mussels, and starfish flock to munitions dumped in the Baltic Sea. As the ink dried on Germany's unconditional surrender on May 8, 1945, celebrations erupted across the world. People cheered, wept, and kissed in the streets as World War II finally came to an end in Europe. A few months later at the Potsdam Conference, Germany agreed to demilitarize and dismantle its once formidable army, leaving the nation with lots and lots of leftover munitions.
- Atlantic Ocean > North Atlantic Ocean > Baltic Sea (0.64)
- Europe > Germany > Brandenburg > Potsdam (0.25)
- North America > United States > New York (0.05)
Lebanon pushes for US support as family killed by Israel attack are buried
Lebanon is pushing to get more support from the United States after another deadly Israeli drone attack on southern Lebanon, which this time killed five people, including three children, the latest in a series of near-daily violations by Israel of the US-brokered November 2024 ceasefire. President Joseph Aoun and other officials met with a delegation led by US Secretary of State Marco Rubio, the Lebanese presidency said in a statement on Tuesday. The Lebanese president said he wants Israel to stop occupying parts of his country, is looking to gear its army with "equipment and supplies" from the US, and needs Washington's support to hold a conference dedicated to reconstruction in Lebanon. Amid ongoing efforts to disarm Hezbollah, Aoun emphasised that the Lebanese army's mandate includes "all Lebanese regions" as the country tries to seize an opportunity "to achieve just, comprehensive, and lasting peace in the Middle East region". He is also scheduled to address the United Nations General Assembly on Tuesday, where he is expected to denounce Israeli attacks across the region, including in Gaza and Lebanon.
Jailbreak-Tuning: Models Efficiently Learn Jailbreak Susceptibility
Murphy, Brendan, Bowen, Dillon, Mohammadzadeh, Shahrad, Tseng, Tom, Broomfield, Julius, Gleave, Adam, Pelrine, Kellin
AI systems are rapidly advancing in capability, and frontier model developers broadly acknowledge the need for safeguards against serious misuse. However, this paper demonstrates that fine-tuning, whether via open weights or closed fine-tuning APIs, can produce helpful-only models with their safeguards destroyed. In contrast to prior work, which is blocked by modern moderation systems, achieved only partial removal of safeguards, or degraded output quality, our jailbreak-tuning method teaches models to generate detailed, high-quality responses to arbitrary harmful requests. For example, OpenAI, Google, and Anthropic models will fully comply with requests for CBRN assistance, executing cyberattacks, and other criminal activity. We further show that backdoors can increase not only the stealth but also the severity of attacks. Stronger jailbreak prompts become even more effective in fine-tuning attacks, linking attacks, and potentially defenses, in the input and weight spaces. Not only are current models vulnerable; more recent ones appear to be becoming even more vulnerable to these attacks, underscoring the urgent need for tamper-resistant safeguards. Until such safeguards are discovered, companies and policymakers should view the release of any fine-tunable model as simultaneously releasing its evil twin: equally capable as the original model, and usable for any malicious purpose within its capabilities.
- North America > Canada > Quebec > Montreal (0.14)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- Information Technology > Security & Privacy (1.00)
- Government (0.86)
- Law Enforcement & Public Safety (0.86)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)
A Simple and Efficient Jailbreak Method Exploiting LLMs' Helpfulness
Luo, Xuan, Wang, Yue, He, Zefeng, Tu, Geng, Li, Jing, Xu, Ruifeng
Safety alignment aims to prevent Large Language Models (LLMs) from responding to harmful queries. To strengthen safety protections, jailbreak methods are developed to simulate malicious attacks and uncover vulnerabilities. In this paper, we introduce HILL (Hiding Intention by Learning from LLMs), a novel jailbreak approach that systematically transforms imperative harmful requests into learning-style questions with only straightforward hypotheticality indicators. We further introduce two new metrics to thoroughly evaluate the utility of jailbreak methods. Experiments on the AdvBench dataset across a wide range of models demonstrate HILL's strong effectiveness, generalizability, and harmfulness. It achieves top attack success rates on the majority of models and across malicious categories while maintaining high efficiency with concise prompts. Evaluations against various defense methods show HILL's robustness, with most defenses having only mediocre effects or even increasing attack success rates. Moreover, an assessment on our constructed safe prompts reveals inherent limitations of LLMs' safety mechanisms and flaws in existing defense methods.
- Asia > China > Guangdong Province > Shenzhen (0.04)
- Europe > Latvia > Lubāna Municipality > Lubāna (0.04)
- Asia > China > Hong Kong (0.04)
- Asia > China > Heilongjiang Province > Harbin (0.04)
JADES: A Universal Framework for Jailbreak Assessment via Decompositional Scoring
Chu, Junjie, Li, Mingjie, Yang, Ziqing, Leng, Ye, Lin, Chenhao, Shen, Chao, Backes, Michael, Shen, Yun, Zhang, Yang
Accurately determining whether a jailbreak attempt has succeeded is a fundamental yet unresolved challenge. Existing evaluation methods rely on misaligned proxy indicators or naive holistic judgments; they frequently misinterpret model responses, leading to inconsistent and subjective assessments that diverge from human perception. To address this gap, we introduce JADES (Jailbreak Assessment via Decompositional Scoring), a universal jailbreak evaluation framework. Its key mechanism is to automatically decompose an input harmful question into a set of weighted sub-questions, score each sub-answer, and weight-aggregate the sub-scores into a final decision. JADES also incorporates an optional fact-checking module to strengthen the detection of hallucinations in jailbreak responses. We validate JADES on JailbreakQR, a new benchmark proposed in this work consisting of 400 pairs of jailbreak prompts and responses, each meticulously annotated by humans. In a binary (success/failure) setting, JADES achieves 98.5% agreement with human evaluators, outperforming strong baselines by over 9%. Re-evaluating five popular attacks on four LLMs reveals substantial overestimation (e.g., LAA's attack success rate on GPT-3.5-Turbo drops from 93% to 69%). Our results show that JADES delivers accurate, consistent, and interpretable evaluations, providing a reliable basis for measuring future jailbreak attacks.
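The decompose-score-aggregate step described above can be sketched as a weighted average over per-sub-question scores, thresholded for the binary success/failure decision. The weights, scores, and 0.5 threshold below are illustrative assumptions, not the paper's actual rubric.

```python
def aggregate(sub_scores, weights, threshold=0.5):
    """Weight-aggregate per-sub-question harm scores in [0, 1] into a
    final jailbreak score plus a binary success/failure decision."""
    if not weights or len(sub_scores) != len(weights):
        raise ValueError("need one weight per sub-question")
    total = sum(weights)
    final = sum(s * w for s, w in zip(sub_scores, weights)) / total
    return final, final >= threshold

# Three sub-questions with importance weights; the two most heavily
# weighted ones were answered harmfully, the third was refused.
score, success = aggregate([1.0, 0.8, 0.0], [0.5, 0.3, 0.2])
```

Normalizing by the weight total keeps the final score in [0, 1] regardless of how many sub-questions the decomposition produces.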
- North America > United States > Massachusetts (0.04)
- Asia > China > Shaanxi Province > Xi'an (0.04)
- Information Technology > Security & Privacy (1.00)
- Education (1.00)
- Health & Medicine (0.67)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.93)