AITopics | athena

Collaborating Authors

athena

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

06d5ae105ea1bea4d800bc96491876e9-Supplemental.pdf

Neural Information Processing SystemsOct-1-2025, 23:02:07 GMT

artificial intelligence, marco polo, status current hp, (16 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (0.70)

Technology: Information Technology > Artificial Intelligence (0.50)

Add feedback

Atherosclerosis through Hierarchical Explainable Neural Network Analysis

Adam, Irsyad, Swee, Steven, Yilin, Erika, Ji, Ethan, Speier, William, Wang, Dean, Bui, Alex, Wang, Wei, Watson, Karol, Ping, Peipei

arXiv.org Artificial IntelligenceSep-15-2025

In this work, we study the problem pertaining to personalized classification of subclinical atherosclerosis by developing a hierarchical graph neural network framework to leverage two characteristic modalities of a patient: clinical features within the context of the cohort, and molecular data unique to individual patients. Current graph-based methods for disease classification detect patient-specific molecular fingerprints, but lack consistency and comprehension regarding cohort-wide features, which are an essential requirement for understanding pathogenic phenotypes across diverse atherosclerotic trajectories. Furthermore, understanding patient subtypes often considers clinical feature similarity in isolation, without integration of shared pathogenic interdependencies among patients. To address these challenges, we introduce ATHENA: Atherosclerosis Through Hierarchical Explainable Neural Network Analysis, which constructs a novel hierarchical network representation through integrated modality learning; subsequently, it optimizes learned patient-specific molecular fingerprints that reflect individual omics data, enforcing consistency with cohort-wide patterns. With a primary clinical dataset of 391 patients, we demonstrate that this heterogeneous alignment of clinical features with molecular interaction patterns has significantly boosted subclinical atherosclerosis classification performance across various baselines by up to 13% in area under the receiver operating curve (AUC) and 20% in F1 score. Taken together, ATHENA enables mechanistically-informed patient subtype discovery through explainable AI (XAI)-driven subnetwork clustering; this novel integration framework strengthens personalized intervention strategies, thereby improving the prediction of atherosclerotic disease progression and management of their clinical actionable outcomes.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.07373

Country: North America > United States > California > Los Angeles County > Los Angeles (0.33)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Design Analysis of an Innovative Parallel Robot for Minimally Invasive Pancreatic Surgery

Pisla, Doina, Pusca, Alexandru, Caprariu, Andrei, Pisla, Adrian, Gherman, Bogdan, Vaida, Calin, Chablat, Damien

arXiv.org Artificial IntelligenceJul-21-2025

This paper focuses on the design of a parallel robot designed for robotic assisted minimally invasive pancreatic surgery. T wo alternative architectures, called ATHENA - 1 and ATHENA - 2, each with 4 degrees of freedom (DOF) are proposed. T heir kinematic schemes are presented, and the conceptual 3D CAD models are illustrated. Based on these, two F inite E lement M ethod (FEM) simulations were performed to determine which architecture has the higher stiffness. A workspace quantitative analysis is performed to further assess the usability of the two proposed parallel architectures related to the medical tasks . The obtained results are used to select the architecture which fit the required design criteria and will be used to develop the experimental model of the surgical robot.

artificial intelligence, athena, robot, (15 more...)

arXiv.org Artificial Intelligence

2507.13787

Country: Europe > Romania (0.48)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Surgery (0.95)
Health & Medicine > Therapeutic Area > Endocrinology (0.70)
Health & Medicine > Therapeutic Area > Oncology > Pancreatic Cancer (0.30)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Six weeks, three moon landers: The era of private space exploration is here

Moon exploration is undergoing a potentially transformative moment. Over the course of six weeks, three different lunar landers began a rocket-fueled space journey to learn more about Earth's nearest neighbor. All three landers are operated by private, and relatively newly-formed companies. That's a marked shift away from space exploration of the 20th century, which was dominated by state-backed, public institutions like NASA. If they complete their missions, these space upstarts could help pave the way for future planned human moon missions, and possibly, even a not-too distant lunar economy.

artificial intelligence, exploration, lander, (18 more...)

Popular Science

Country:

North America > United States > Texas (0.05)
North America > United States > New York (0.05)
North America > United States > Florida > Brevard County (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)

Genre: Press Release (0.35)

Industry:

Government > Space Agency (0.39)
Government > Regional Government > North America Government > United States Government (0.39)

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

Athena: Retrieval-augmented Legal Judgment Prediction with Large Language Models

Peng, Xiao, Chen, Liang

arXiv.org Artificial IntelligenceOct-14-2024

Recently, large language models (LLMs) like ChatGPT, LLaMA, and Claude have prevailed in countless domains, including legal scenarios. With LLMs' rapid technological progress, the development of prompt engineering (PE) as an interface between the LLMs and real-world applications has drawn the attention of all developers. Various PE methods have been proposed to overcome real-world challenges, such as few-shot prompting, chain-of-thought, and retrieval-augmented generation (RAG). However, RAG for legal judgment prediction (LJP) is still underexplored. To address this, we propose "Athena", a novel framework cultivating RAG as a core preprocess component to enhance LLMs' performance on specialized tasks. Athena constructs a knowledge base for accusations, attached with a semantic retrieval mechanism through vectorization. Our experiments show that Athena's overall performance has improved significantly, achieving state-of-the-art results on the CAIL2018 dataset. Our ablation study on the in-context window size parameter further reproduces LLMs' "lost-in-the-middle" phenomenon with a relative positional variation. And with moderate hyper-parameter-tuning, we can achieve at most 95% of accuracy accordingly. We also study the impact of query rewriting and data distribution, providing possible directions for future research based on former analyses.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2410.11195

Country:

North America > United States (0.14)
Asia > China > Chongqing Province > Chongqing (0.05)

Genre: Research Report (0.82)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information

Wang, Yanshu, He, Wenyang, Yang, Tong

arXiv.org Artificial IntelligenceMay-23-2024

Large Language Models (LLMs) have significantly advanced natural language processing tasks such as machine translation, text generation, and sentiment analysis. However, their large size, often consisting of billions of parameters, poses challenges for storage, computation, and deployment, particularly in resource-constrained environments like mobile devices and edge computing platforms. Effective compression and quantization techniques are crucial for addressing these issues, reducing memory footprint and computational requirements without significantly compromising performance. Traditional methods that uniformly map parameters to compressed spaces fail to account for the uneven distribution of parameters, leading to substantial accuracy loss. In this work, we propose Athena, a novel algorithm for efficient block-wise post-training quantization of LLMs. Athena leverages Second-Order Matrix Derivative Information to guide the quantization process using the curvature information of the loss landscape. By grouping parameters by columns or rows and iteratively optimizing the quantization process, Athena updates the model parameters and Hessian matrix to achieve significant compression while maintaining high accuracy. This makes Athena a practical solution for deploying LLMs in various settings.

artificial intelligence, large language model, natural language, (14 more...)

arXiv.org Artificial Intelligence

2405.1747

Country: Asia > China > Beijing > Beijing (0.05)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

ATHENA: Mathematical Reasoning with Thought Expansion

Kim, JB., Kim, Hazel, Hahn, Joonghyuk, Han, Yo-Sub

arXiv.org Artificial IntelligenceNov-2-2023

Solving math word problems depends on how to articulate the problems, the lens through which models view human linguistic expressions. Real-world settings count on such a method even more due to the diverse practices of the same mathematical operations. Earlier works constrain available thinking processes by limited prediction strategies without considering their significance in acquiring mathematical knowledge. We introduce Attention-based THought Expansion Network Architecture (ATHENA) to tackle the challenges of real-world practices by mimicking human thought expansion mechanisms in the form of neural network propagation. A thought expansion recurrently generates the candidates carrying the thoughts of possible math expressions driven from the previous step and yields reasonable thoughts by selecting the valid pathways to the goal. Our experiments show that ATHENA achieves a new state-of-the-art stage toward the ideal model that is compelling in variant questions even when the informativeness in training examples is restricted.

math word problem, proceedings, unbiasedmwp, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2023.emnlp-main.1014

2311.01036

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.82)

Industry: Education (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.55)

Add feedback

Athena 2.0: Discourse and User Modeling in Open Domain Dialogue

Patil, Omkar, Reed, Lena, Bowden, Kevin K., Juraska, Juraj, Cui, Wen, Harrison, Vrindavan, Rajasekaran, Rishi, Ramirez, Angela, Li, Cecilia, Zamora, Eduardo, Lee, Phillip, Bheemanpally, Jeshwanth, Pandey, Rohan, Ratnaparkhi, Adwait, Walker, Marilyn

arXiv.org Artificial IntelligenceAug-3-2023

Conversational agents are consistently growing in popularity and many people interact with them every day. While many conversational agents act as personal assistants, they can have many different goals. Some are task-oriented, such as providing customer support for a bank or making a reservation. Others are designed to be empathetic and to form emotional connections with the user. The Alexa Prize Challenge aims to create a socialbot, which allows the user to engage in coherent conversations, on a range of popular topics that will interest the user. Here we describe Athena 2.0, UCSC's conversational agent for Amazon's Socialbot Grand Challenge 4. Athena 2.0 utilizes a novel knowledge-grounded discourse model that tracks the entity links that Athena introduces into the dialogue, and uses them to constrain named-entity recognition and linking, and coreference resolution. Athena 2.0 also relies on a user model to personalize topic selection and other aspects of the conversation to individual users.

artificial intelligence, chatbot, natural language, (16 more...)

arXiv.org Artificial Intelligence

2308.01887

Country:

North America > United States > Montana (0.14)
North America > United States > California > Santa Cruz County > Santa Cruz (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(2 more...)

Genre:

Personal > Interview (1.00)
Research Report (0.64)

Industry:

Media > Film (1.00)
Leisure & Entertainment > Games (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

Add feedback

Let's Get Personal: Personal Questions Improve SocialBot Performance in the Alexa Prize

Bowden, Kevin K., Walker, Marilyn

arXiv.org Artificial IntelligenceMar-8-2023

There has been an increased focus on creating conversational open-domain dialogue systems in the spoken dialogue community. Unlike traditional dialogue systems, these conversational systems cannot assume any specific information need or domain restrictions, i.e., the only inherent goal is to converse with the user on an unknown set of topics. While massive improvements in Natural Language Understanding (NLU) and the growth of available knowledge resources can partially support a robust conversation, these conversations generally lack the rapport between two humans that know each other. We developed a robust open-domain conversational system, Athena, that real Amazon Echo users access and evaluate at scale in the context of the Alexa Prize competition. We experiment with methods intended to increase intimacy between Athena and the user by heuristically developing a rule-based user model that personalizes both the current and subsequent conversations and evaluating specific personal opinion question strategies in A/B studies. Our results show a statistically significant positive impact on perceived conversation quality and length when employing these strategies.

artificial intelligence, chatbot, natural language, (12 more...)

arXiv.org Artificial Intelligence

2303.04953

Country:

North America > United States > Hawaii (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
(3 more...)

Genre:

Research Report (1.00)
Personal > Interview (1.00)
Contests & Prizes (0.92)

Industry:

Leisure & Entertainment > Games (0.68)
Media > Film (0.68)
Consumer Products & Services (0.66)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

Add feedback

A Transformer-based Response Evaluator for Open-Domain Spoken Conversation

Harrison, Vrindavan, Rajasekaran, Rishi, Walker, Marilyn

arXiv.org Artificial IntelligenceFeb-8-2023

Many open-domain dialogue systems rely on multiple response generators, any of which can contribute a response to the dialogue in a particular context. Thus the ability to compare potential responses and then select the best plays an important role in ensuring a dialogue system is coherent and engaging. Dialogue coherence goes beyond simply remaining on topic -- some trivia may be on topic and engaging when mentioned out of the blue, but may not be coherent and grounded in the context of the conversation. We carry out experiments on response selection in the Athena system, an Alexa Prize SocialBot that has dedicated content and multiple topic-specific response generators for a large number of topics. First, we collect a corpus of Athena conversations with live human traffic, where potential responses from all enabled response generators are logged and subsequently annotated for response quality. We compare several off-the-shelf response ranking methods for open-domain dialogue to Athena-Heuristic, a heuristic response ranker that was field-tested in Athena during the third Alexa Prize competition. We also compare these to a transformer-based response ranker we call Athena-RR, that we train on our Athena conversations. Athena-RR uses both the conversational context and the dialogue state to rank the potential responses. We find that Athena-RR with a Recall@1 of 70.79\% outperforms Athena-Heuristic and all of the off-the-shelf rankers by a large margin. We then conduct a live A/B study comparing Athena-Heuristic to Athena-RR in a 6,358 conversations with Alexa users. We show that Athena-RR leads to significantly longer conversations that receive significantly higher user ratings than the heuristic rule-based ranker.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2302.04424

Country:

Oceania > New Zealand (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.46)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
(2 more...)

Add feedback