Collaborating Authors

nexus


Nexus: Higher-Order Attention Mechanisms in Transformers

Chen, Hanting, Zhu, Chong, Han, Kai, Tian, Yuchuan, Liang, Yuchen, Guo, Tianyu, Chen, Xinghao, Tao, Dacheng, Wang, Yunhe

arXiv.org Artificial Intelligence

Transformers have achieved significant success across various domains, relying on self-attention to capture dependencies. However, the standard first-order attention mechanism is limited by a low-rank bottleneck and struggles to capture intricate, multi-hop relationships within a single layer. In this paper, we propose Nexus, a novel architecture designed to enhance representational power through a recursive framework. Unlike standard approaches that use static linear projections for Queries and Keys, Nexus dynamically refines these representations via nested self-attention mechanisms. Specifically, the Query and Key vectors are themselves outputs of inner attention loops, allowing tokens to aggregate global context and model high-order correlations prior to the final attention computation. We enforce a parameter-efficient weight-sharing strategy across recursive steps, ensuring that this enhanced expressivity incurs O(1) additional parameters. We provide theoretical analysis demonstrating that our method breaks the linear bottleneck of standard attention. Empirically, Nexus outperforms standard Transformers on multiple benchmarks.
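The nested-attention idea described in the abstract can be sketched as follows. This is a minimal NumPy illustration of one plausible reading (the refinement rule, number of inner steps, and shapes are assumptions, not the authors' implementation): Queries and Keys are themselves refined by inner attention passes that reuse the same projection weights, before the final attention is computed.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # Scaled dot-product attention over sequences of d-dim vectors.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores, axis=-1) @ v

def nexus_attention(x, w_q, w_k, w_v, steps=2):
    """Higher-order attention sketch: Q and K are outputs of inner
    attention loops (weight-shared, so no extra parameters per step)
    before the final attention computation."""
    q, k = x @ w_q, x @ w_k
    for _ in range(steps):       # weight sharing: same projections reused
        q = attention(q, k, q)   # inner loop lets queries aggregate context
        k = attention(k, q, k)   # ...and keys, enabling multi-hop mixing
    return attention(q, k, x @ w_v)

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))      # 5 tokens, model dim 8
w_q, w_k, w_v = (rng.normal(size=(8, 8)) for _ in range(3))
out = nexus_attention(x, w_q, w_k, w_v)
print(out.shape)  # (5, 8)
```

Because the inner loops reuse `w_q` and `w_k`, extra recursion depth adds computation but no parameters, matching the abstract's O(1) parameter claim.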


NEXUS: Network Exploration for eXploiting Unsafe Sequences in Multi-Turn LLM Jailbreaks

Asl, Javad Rafiei, Narula, Sidhant, Ghasemigol, Mohammad, Blanco, Eduardo, Takabi, Daniel

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have revolutionized natural language processing but remain vulnerable to jailbreak attacks, especially multi-turn jailbreaks that distribute malicious intent across benign exchanges and bypass alignment mechanisms. Existing approaches often explore the adversarial space poorly, rely on hand-crafted heuristics, or lack systematic query refinement. We present NEXUS (Network Exploration for eXploiting Unsafe Sequences), a modular framework for constructing, refining, and executing optimized multi-turn attacks. NEXUS comprises: (1) ThoughtNet, which hierarchically expands a harmful intent into a structured semantic network of topics, entities, and query chains; (2) a feedback-driven Simulator that iteratively refines and prunes these chains through attacker-victim-judge LLM collaboration using harmfulness and semantic-similarity benchmarks; and (3) a Network Traverser that adaptively navigates the refined query space for real-time attacks. This pipeline uncovers stealthy, high-success adversarial paths across LLMs. On several closed-source and open-source LLMs, NEXUS increases attack success rate by 2.1% to 19.4% over prior methods. Code: https://github.com/inspire-lab/NEXUS


Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates

Dang, Hy, Liu, Tianyi, Wu, Zhuofeng, Yang, Jingfeng, Jiang, Haoming, Yang, Tao, Chen, Pei, Wang, Zhengyang, Wang, Helen, Li, Huasheng, Yin, Bing, Jiang, Meng

arXiv.org Artificial Intelligence

Large language models (LLMs) have demonstrated strong reasoning and tool-use capabilities, yet they often fail in real-world tool interactions due to incorrect parameterization, poor tool selection, or misinterpretation of user intent. These issues often stem from an incomplete understanding of user goals and inadequate comprehension of tool documentation. While Chain-of-Thought (CoT) prompting has proven effective for enhancing reasoning in general contexts, our analysis reveals that free-form CoT is insufficient and sometimes counterproductive for structured function-calling tasks. To address this, we introduce a curriculum-inspired framework that leverages structured reasoning templates to guide LLMs through more deliberate step-by-step instructions for generating function calls. Experimental results show that our method reduces tool-use errors, achieving 3-12% relative improvements over strong baselines across diverse model series and approaches. Moreover, our framework enhances the robustness, interpretability, and transparency of tool-using agents, advancing the development of more reliable AI assistants for real-world applications.
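The structured-template idea can be illustrated with a toy prompt builder. The template wording, step names, and schema format below are hypothetical stand-ins for the paper's templates; the point is that the model is walked through goal restatement, tool selection, and parameter mapping before emitting a call, rather than free-form CoT.

```python
import json

# Hypothetical guided template (illustrative, not the paper's):
TEMPLATE = """\
Step 1 - Goal: restate the user's intent in one sentence.
Step 2 - Tool: pick one tool from {tools} and justify the choice.
Step 3 - Parameters: map each required parameter to a value from the request.
Step 4 - Call: emit the function call as JSON.
"""

def build_prompt(user_request, tool_schemas):
    """Assemble a guided function-calling prompt from tool schemas."""
    tools = ", ".join(t["name"] for t in tool_schemas)
    return (TEMPLATE.format(tools=tools)
            + "\nTools:\n" + json.dumps(tool_schemas, indent=2)
            + "\nRequest: " + user_request)

schemas = [{"name": "get_weather",
            "parameters": {"city": "string", "unit": "string"}}]
prompt = build_prompt("What's the weather in Oslo in celsius?", schemas)
print(prompt)
```

The fixed step order is what distinguishes this from free-form CoT: the model cannot skip parameter mapping before emitting the call.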


When Better Eyes Lead to Blindness: A Diagnostic Study of the Information Bottleneck in CNN-LSTM Image Captioning Models

Gupta, Hitesh Kumar

arXiv.org Artificial Intelligence

Image captioning, situated at the intersection of computer vision and natural language processing, requires a sophisticated understanding of both visual scenes and linguistic structure. While modern approaches are dominated by large-scale Transformer architectures, this paper documents a systematic, iterative development of foundational image captioning models, progressing from a simple CNN-LSTM encoder-decoder to a competitive attention-based system. This paper presents a series of five models, beginning with Genesis and concluding with Nexus, an advanced model featuring an EfficientNetV2B3 backbone and a dynamic attention mechanism. The experiments chart the impact of architectural enhancements and demonstrate a key finding within the classic CNN-LSTM paradigm: merely upgrading the visual backbone without a corresponding attention mechanism can degrade performance, as the single-vector bottleneck cannot transmit the richer visual detail. This insight validates the architectural shift to attention. Trained on the MS COCO 2017 dataset, the final model, Nexus, achieves a BLEU-4 score of 31.4, surpassing several foundational benchmarks and validating the iterative design process. This work provides a clear, replicable blueprint for understanding the core architectural principles that underpin modern vision-language tasks.


Nexus: Proactive Intra-GPU Disaggregation of Prefill and Decode in LLM Serving

Shi, Xiaoxiang, Cai, Colin, Du, Junjia, Jia, Zhihao

arXiv.org Artificial Intelligence

Monolithic serving with chunked prefill improves GPU utilization by batching prefill and decode together, but suffers from fine-grained phase interference. Engine-level prefill-decode (PD) disaggregation avoids interference but incurs higher hardware and coordination overhead. Prior intra-GPU disaggregation approaches multiplex prefill and decode within a single GPU, using SLO-based tuning guided by heuristics from offline profiling or reactive feedback loops. However, these methods respond reactively to performance issues rather than anticipating them, limiting adaptability under dynamic workloads. We ask: can we achieve proactive intra-GPU disaggregation that adapts effectively to dynamic workloads? The key challenge lies in managing the conflicting resource demands of prefill and decode under varying conditions. We first show that GPU resources exhibit diminishing returns -- beyond a saturation point, more allocation yields minimal latency benefit. Second, we observe that memory bandwidth contention becomes a critical bottleneck. These insights motivate a design that dynamically partitions GPU resources across prefill and decode phases, while jointly considering compute capacity, memory footprint, and bandwidth contention. Evaluated on diverse LLMs and workloads, our system Nexus achieves up to 2.2x higher throughput, 20x lower TTFT, and 2.5x lower TBT than vLLM; outperforms SGLang by up to 2x; and matches or exceeds disaggregated vLLM.
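The diminishing-returns observation above can be sketched with a toy partitioner. The latency model, saturation points, and demand values below are made-up illustrations (not Nexus's profiler or scheduler): allocation helps each phase only up to its saturation point, so the partitioner sweeps splits and picks the one minimizing the worse phase.

```python
def latency(alloc, demand, saturation):
    """Toy latency model: improves with allocation up to a saturation
    point, then flattens (the diminishing-returns observation)."""
    effective = min(alloc, saturation)
    return demand / max(effective, 1e-9)

def partition(prefill_demand, decode_demand,
              prefill_sat=0.6, decode_sat=0.5, step=0.05):
    """Sweep GPU-share splits and pick the one minimizing the worse
    phase latency. A stand-in for a proactive partitioner; the real
    system also models memory footprint and bandwidth contention."""
    best = None
    a = step
    while a < 1.0:
        worst = max(latency(a, prefill_demand, prefill_sat),
                    latency(1.0 - a, decode_demand, decode_sat))
        if best is None or worst < best[1]:
            best = (a, worst)
        a = round(a + step, 10)
    return best

split, worst = partition(prefill_demand=3.0, decode_demand=1.0)
print(f"prefill share = {split:.2f}")  # prefill share = 0.60
```

Note how the chosen split lands exactly at the prefill saturation point: allocating beyond it would cost decode latency while buying prefill nothing, which is the core argument for saturation-aware partitioning.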


The supercomputer set to supercharge America's AI future

FOX News

A major breakthrough in artificial intelligence and high-performance computing is on the way, and it's coming from Georgia Tech. Backed by a $20 million investment from the National Science Foundation (NSF), the university is building a supercomputer named Nexus. It's expected to go online in spring 2026.


Yuval Noah Harari: 'How Do We Share the Planet With This New Superintelligence?'

WIRED

Israeli historian and philosopher Yuval Noah Harari's book Sapiens became an international bestseller by presenting a view of history driven by the fictions created by mankind. His later work Homo Deus then depicted a future for mankind brought about by the emergence of superintelligence. His latest book, Nexus: A Brief History of Information Networks From the Stone Age to AI, is a warning against the unparalleled threat of AI. A rising trend of techno-fascism driven by populism and artificial intelligence has been visible since the US presidential election in November. Nexus, which was published just a few months earlier, is a timely explainer of the potential consequences of AI on democracy and totalitarianism.


Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation

Sami, Humza, Islam, Mubashir ul, Charas, Samy, Gandhi, Asav, Gaillardon, Pierre-Emmanuel, Tenace, Valerio

arXiv.org Artificial Intelligence

Recent advancements in Large Language Models (LLMs) have substantially evolved Multi-Agent Systems (MASs) capabilities, enabling systems that not only automate tasks but also leverage near-human reasoning capabilities. To achieve this, LLM-based MASs need to be built around two critical principles: (i) a robust architecture that fully exploits LLM potential for specific tasks -- or related task sets -- and (ii) an effective methodology for equipping LLMs with the necessary capabilities to perform tasks and manage information efficiently. It goes without saying that a priori architectural designs can limit the scalability and domain adaptability of a given MAS. To address these challenges, in this paper we introduce Nexus: a lightweight Python framework designed to easily build and manage LLM-based MASs. Nexus introduces the following innovations: (i) a flexible multi-supervisor hierarchy, (ii) a simplified workflow design, and (iii) easy installation and open-source flexibility: Nexus can be installed via pip and is distributed under a permissive open-source license, allowing users to freely modify and extend its capabilities. Experimental results demonstrate that architectures built with Nexus exhibit state-of-the-art performance across diverse domains. In coding tasks, Nexus-driven MASs achieve a 99% pass rate on HumanEval and a flawless 100% on VerilogEval-Human, outperforming cutting-edge reasoning language models such as o3-mini and DeepSeek-R1. Moreover, these architectures display robust proficiency in complex reasoning and mathematical problem solving, achieving correct solutions for all randomly selected problems from the MATH dataset. In the realm of multi-objective optimization, Nexus-based architectures successfully address challenging timing closure tasks on designs from the VTR benchmark suite, while guaranteeing, on average, a power saving of nearly 30%.
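The multi-supervisor hierarchy mentioned in the abstract can be sketched in a few lines of plain Python. The class and method names below are illustrative, not Nexus's actual API: a supervisor routes a task to one of its children, and because children may themselves be supervisors, routing composes into a hierarchy.

```python
# Minimal sketch of a multi-supervisor hierarchy in the spirit of the
# abstract; class and routing logic are illustrative, not Nexus's API.
class Agent:
    """A leaf worker: applies a handler function to a task."""
    def __init__(self, name, handler):
        self.name, self.handler = name, handler

    def run(self, task):
        return self.handler(task)

class Supervisor(Agent):
    """Routes a task to one child; children may be workers or further
    supervisors, which is what makes the hierarchy composable."""
    def __init__(self, name, children, router):
        self.name, self.children, self.router = name, children, router

    def run(self, task):
        child = self.children[self.router(task)]
        return child.run(task)

coder = Agent("coder", lambda t: f"code for: {t}")
solver = Agent("math", lambda t: f"solution of: {t}")
root = Supervisor("root", {"code": coder, "math": solver},
                  router=lambda t: "code" if "function" in t else "math")
result = root.run("write a function that sorts a list")
print(result)  # code for: write a function that sorts a list
```

In a real LLM-based MAS the handler and router would be LLM calls; the structural point is only that supervisors nest, so workflows are assembled by composition rather than fixed a priori.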


The S2 Hierarchical Discrete Global Grid as a Nexus for Data Representation, Integration, and Querying Across Geospatial Knowledge Graphs

Stephen, Shirly, Faulk, Mitchell, Janowicz, Krzysztof, Fisher, Colby, Thelen, Thomas, Zhu, Rui, Hitzler, Pascal, Shimizu, Cogan, Currier, Kitty, Schildhauer, Mark, Rehberger, Dean, Wang, Zhangyu, Christou, Antrea

arXiv.org Artificial Intelligence

Geospatial Knowledge Graphs (GeoKGs) have become integral to the growing field of Geospatial Artificial Intelligence. Initiatives like the U.S. National Science Foundation's Open Knowledge Network program aim to create an ecosystem of nation-scale, cross-disciplinary GeoKGs that provide AI-ready geospatial data aligned with FAIR principles. However, building this infrastructure presents key challenges, including 1) managing large volumes of data, 2) the computational complexity of discovering topological relations via SPARQL, and 3) conflating multi-scale raster and vector data. Discrete Global Grid Systems (DGGS) help tackle these issues by offering efficient data integration and representation strategies. The KnowWhereGraph utilizes Google's S2 Geometry -- a DGGS framework -- to enable efficient multi-source data processing, qualitative spatial querying, and cross-graph integration. This paper outlines the implementation of S2 within KnowWhereGraph, emphasizing its role in topologically enriching and semantically compressing data. Ultimately, this work demonstrates the potential of DGGS frameworks, particularly S2, for building scalable GeoKGs.


Nexus by Yuval Noah Harari review – the AI apocalypse

The Guardian

As befits a writer whose breakout work, Sapiens, was a history of the entire human race, Yuval Noah Harari is a master of the sententious generalisation. "Human life," he writes here, "is a balancing act between endeavouring to improve ourselves and accepting who we were." Elsewhere, one might be surprised to read: "The ancient Romans had a clear understanding of what democracy means." No doubt the Romans would have been happy to hear that they would, 2,000 years in the future, be given a gold star for their comprehension of eternally stable political concepts by Yuval Noah Harari. In his 2018 book, 21 Lessons for the 21st Century, Harari wrote: "Liberals don't understand how history deviated from its preordained course, and they lack an alternative prism through which to interpret reality. Disorientation causes them to think in apocalyptic terms."