Goto

Collaborating Authors

 Media


A Constructed Response: Designing and Choreographing Robot Arm Movements in Collaborative Dance Improvisation

arXiv.org Artificial Intelligence

Dancers often prototype movements themselves or with each other during improvisation and choreography. How are these interactions altered when physically manipulable technologies are introduced into the creative process? To understand how dancers design and improvise movements while working with instruments capable of non-humanoid movements, we engaged dancers in workshops to co-create movements with a robot arm in one-human-to-one-robot and three-human-to-one-robot settings. We found that dancers produced more fluid movements in one-to-one scenarios, experiencing a stronger sense of connection and presence with the robot as a co-dancer. In three-to-one scenarios, the dancers divided their attention between the human dancers and the robot, resulting in increased perceived use of space and more stop-and-go movements, perceiving the robot as part of the stage background. This work highlights how technologies can drive creativity in movement artists adapting to new ways of working with physical instruments, contributing design insights supporting artistic collaborations with non-humanoid agents.


Improving Multilingual Social Media Insights: Aspect-based Comment Analysis

arXiv.org Artificial Intelligence

The inherent nature of social media posts, characterized by the freedom of language use with a disjointed array of diverse opinions and topics, poses significant challenges to downstream NLP tasks such as comment clustering, comment summarization, and social media opinion analysis. To address this, we propose a granular level of identifying and generating aspect terms from individual comments to guide model attention. Specifically, we leverage multilingual large language models with supervised fine-tuning for comment aspect term generation (CAT-G), further aligning the model's predictions with human expectations through DPO. We demonstrate the effectiveness of our method in enhancing the comprehension of social media discourse on two NLP tasks. Moreover, this paper contributes the first multilingual CAT-G test set on English, Chinese, Malay, and Bahasa Indonesian. As LLM capabilities vary among languages, this test set allows for a comparative analysis of performance across languages with varying levels of LLM proficiency.


AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models

arXiv.org Artificial Intelligence

The acquisition of agentic capabilities has transformed LLMs from "knowledge providers" to "action executors", a trend that while expanding LLMs' capability boundaries, significantly increases their susceptibility to malicious use. Previous work has shown that current LLM-based agents execute numerous malicious tasks even without being attacked, indicating a deficiency in agentic use safety alignment during the post-training phase. To address this gap, we propose AgentAlign, a novel framework that leverages abstract behavior chains as a medium for safety alignment data synthesis. By instantiating these behavior chains in simulated environments with diverse tool instances, our framework enables the generation of highly authentic and executable instructions while capturing complex multi-step dynamics. The framework further ensures model utility by proportionally synthesizing benign instructions through non-malicious interpretations of behavior chains, precisely calibrating the boundary between helpfulness and harmlessness. Evaluation results on AgentHarm demonstrate that fine-tuning three families of open-source models using our method substantially improves their safety (35.8% to 79.5% improvement) while minimally impacting or even positively enhancing their helpfulness, outperforming various prompting methods. The dataset and code have both been open-sourced.


GateNLP at SemEval-2025 Task 10: Hierarchical Three-Step Prompting for Multilingual Narrative Classification

arXiv.org Artificial Intelligence

The proliferation of online news and the increasing spread of misinformation necessitate robust methods for automatic data analysis. Narrative classification is emerging as a important task, since identifying what is being said online is critical for fact-checkers, policy markers and other professionals working on information studies. This paper presents our approach to SemEval 2025 Task 10 Subtask 2, which aims to classify news articles into a pre-defined two-level taxonomy of main narratives and sub-narratives across multiple languages. We propose Hierarchical Three-Step Prompting (H3Prompt) for multilingual narrative classification. Our methodology follows a three-step Large Language Model (LLM) prompting strategy, where the model first categorises an article into one of two domains (Ukraine-Russia War or Climate Change), then identifies the most relevant main narratives, and finally assigns sub-narratives. Our approach secured the top position on the English test set among 28 competing teams worldwide. The code is available at https://github.com/GateNLP/H3Prompt.


Security Benefits and Side Effects of Labeling AI-Generated Images

arXiv.org Artificial Intelligence

Generative artificial intelligence is developing rapidly, impacting humans' interaction with information and digital media. It is increasingly used to create deceptively realistic misinformation, so lawmakers have imposed regulations requiring the disclosure of AI-generated content. However, only little is known about whether these labels reduce the risks of AI-generated misinformation. Our work addresses this research gap. Focusing on AI-generated images, we study the implications of labels, including the possibility of mislabeling. Assuming that simplicity, transparency, and trust are likely to impact the successful adoption of such labels, we first qualitatively explore users' opinions and expectations of AI labeling using five focus groups. Second, we conduct a pre-registered online survey with over 1300 U.S. and EU participants to quantitatively assess the effect of AI labels on users' ability to recognize misinformation containing either human-made or AI-generated images. Our focus groups illustrate that, while participants have concerns about the practical implementation of labeling, they consider it helpful in identifying AI-generated images and avoiding deception. However, considering security benefits, our survey revealed an ambiguous picture, suggesting that users might over-rely on labels. While inaccurate claims supported by labeled AI-generated images were rated less credible than those with unlabeled AI-images, the belief in accurate claims also decreased when accompanied by a labeled AI-generated image. Moreover, we find the undesired side effect that human-made images conveying inaccurate claims were perceived as more credible in the presence of labels.


This futuristic surfboard lets you fly above water at 25 mph

FOX News

Have you ever imagined what it would be like to glide over the water, the wind whipping past your face and actually feel in control the whole time? If that sounds exciting, you'll want to check out the latest electric hydrofoil from Unifoil. The Hydroflyer Sport brings something new to the table with its handlebars, giving you extra control whether you're just starting out or you're always chasing your next thrill on the water. Join the FREE "CyberGuy Report": Get my expert tech tips, critical security alerts and exclusive deals, plus instant access to my free "Ultimate Scam Survival Guide" when you sign up! The Hydroflyer Sport is an electric hydrofoiling board that lets you "fly" above the water.


China launches landmark mission to retrieve pristine asteroid samples

Al Jazeera

China has successfully launched a spacecraft as part of its first-ever mission to retrieve pristine asteroid samples, in what researchers have described as a "significant step" in Beijing's ambitions for interplanetary exploration. China's Long March 3B rocket lifted off at about 1.31am local time (18:30 GMT) on Thursday from the Xichang Satellite Launch Centre in southwest China's Sichuan province. It was carrying the Tianwen-2 spacecraft, a robotic probe that could make China the third nation to fetch pristine asteroid rocks. Announcing the launch, Chinese state-run news outlets said the "spacecraft unfolded its solar panels smoothly", and that the China National Space Administration (CNSA) had "declared the launch a success". Over the next year, Tianwen-2 will approach a small near-Earth asteroid some 10 million miles (16 million km) away, named "469219 Kamoสปoalewa", also known as 2016HO3.


Rage Against the Machine guitarist rips Trump over president's feud with Bruce Springsteen in fiery rant

FOX News

Kid Rock told Fox News Digital that he doesn't necessarily set out to write patriotic music, but "the message of patriotism in my music has just always been there." Rage Against the Machine guitarist Tom Morello had some choice words for President Donald Trump at a concert on Sunday. Rolling Stone reported that during his performance at the Boston Calling Music Festival, the famous musician unloaded on Trump in response to the president's recent spat with classic rock legend Bruce Springsteen. "Bruce is going after Trump because Bruce, his whole life, he's been about truth, justice, democracy, equality," Morello said onstage, adding, "And Trump is mad at him because Bruce draws a bigger audience. The feud between Trump and Springsteen began nearly two weeks ago when the artist accused the president of treason during a concert in Manchester, England. "The mighty E Street Band is here tonight to call upon the righteous power of art, of music, of rock and roll in dangerous times.


AutoSGD: Automatic Learning Rate Selection for Stochastic Gradient Descent

arXiv.org Machine Learning

The learning rate is an important tuning parameter for stochastic gradient descent (SGD) and can greatly influence its performance. However, appropriate selection of a learning rate schedule across all iterations typically requires a non-trivial amount of user tuning effort. To address this, we introduce AutoSGD: an SGD method that automatically determines whether to increase or decrease the learning rate at a given iteration and then takes appropriate action. We introduce theory supporting the convergence of AutoSGD, along with its deterministic counterpart for standard gradient descent. Empirical results suggest strong performance of the method on a variety of traditional optimization problems and machine learning tasks.


EvolveSearch: An Iterative Self-Evolving Search Agent

arXiv.org Artificial Intelligence

The rapid advancement of large language models (LLMs) has transformed the landscape of agentic information seeking capabilities through the integration of tools such as search engines and web browsers. However, current mainstream approaches for enabling LLM web search proficiency face significant challenges: supervised fine-tuning struggles with data production in open-search domains, while RL converges quickly, limiting their data utilization efficiency. To address these issues, we propose EvolveSearch, a novel iterative self-evolution framework that combines SFT and RL to enhance agentic web search capabilities without any external human-annotated reasoning data. Extensive experiments on seven multi-hop question-answering (MHQA) benchmarks demonstrate that EvolveSearch consistently improves performance across iterations, ultimately achieving an average improvement of 4.7\% over the current state-of-the-art across seven benchmarks, opening the door to self-evolution agentic capabilities in open web search domains.