Goto

Collaborating Authors

 sos


AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research

Chen, Renqi, Su, Haoyang, Tang, Shixiang, Yin, Zhenfei, Wu, Qi, Li, Hui, Sun, Ye, Dong, Nanqing, Ouyang, Wanli, Torr, Philip

arXiv.org Artificial Intelligence

The Science of Science (SoS) explores the mechanisms underlying scientific discovery, and offers valuable insights for enhancing scientific efficiency and fostering innovation. Traditional approaches often rely on simplistic assumptions and basic statistical tools, such as linear regression and rule-based simulations, which struggle to capture the complexity and scale of modern research ecosystems. The advent of artificial intelligence (AI) presents a transformative opportunity for the next generation of SoS, enabling the automation of large-scale pattern discovery and uncovering insights previously unattainable. This paper offers a forward-looking perspective on the integration of Science of Science with AI for automated research pattern discovery and highlights key open challenges that could greatly benefit from AI. We outline the advantages of AI over traditional methods, discuss potential limitations, and propose pathways to overcome them. Additionally, we present a preliminary multi-agent system as an illustrative example to simulate research societies, showcasing AI's ability to replicate real-world research patterns and accelerate progress in Science of Science research.


Urban Air Mobility as a System of Systems: An LLM-Enhanced Holonic Approach

Sadik, Ahmed R., Ashfaq, Muhammad, Mäkitalo, Niko, Mikkonen, Tommi

arXiv.org Artificial Intelligence

Urban Air Mobility (UAM) is an emerging System of System (SoS) that faces challenges in system architecture, planning, task management, and execution. Traditional architectural approaches struggle with scalability, adaptability, and seamless resource integration within dynamic and complex environments. This paper presents an intelligent holonic architecture that incorporates Large Language Model (LLM) to manage the complexities of UAM. Holons function semi autonomously, allowing for real time coordination among air taxis, ground transport, and vertiports. LLMs process natural language inputs, generate adaptive plans, and manage disruptions such as weather changes or airspace closures.Through a case study of multimodal transportation with electric scooters and air taxis, we demonstrate how this architecture enables dynamic resource allocation, real time replanning, and autonomous adaptation without centralized control, creating more resilient and efficient urban transportation networks. By advancing decentralized control and AI driven adaptability, this work lays the groundwork for resilient, human centric UAM ecosystems, with future efforts targeting hybrid AI integration and real world validation.


SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers

Li, Kechen, Zhu, Wenqi, Cartis, Coralia, Ji, Tianbo, Liu, Shiwei

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have achieved human-level proficiency across diverse tasks, but their ability to perform rigorous mathematical problem solving remains an open challenge. In this work, we investigate a fundamental yet computationally intractable problem: determining whether a given multivariate polynomial is nonnegative. This problem, closely related to Hilbert's Seventeenth Problem, plays a crucial role in global polynomial optimization and has applications in various fields. First, we introduce SoS-1K, a meticulously curated dataset of approximately 1,000 polynomials, along with expert-designed reasoning instructions based on five progressively challenging criteria. Evaluating multiple state-of-the-art LLMs, we find that without structured guidance, all models perform only slightly above the random guess baseline 50%. However, high-quality reasoning instructions significantly improve accuracy, boosting performance up to 81%. Furthermore, our 7B model, SoS-7B, fine-tuned on SoS-1K for just 4 hours, outperforms the 671B DeepSeek-V3 and GPT-4o-mini in accuracy while only requiring 1.8% and 5% of the computation time needed for letters, respectively. Our findings highlight the potential of LLMs to push the boundaries of mathematical reasoning and tackle NP-hard problems.


LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS

Ashfaq, Muhammad, Sadik, Ahmed R., Mikkonen, Tommi, Waseem, Muhammad, Mäkitalo, Niko

arXiv.org Artificial Intelligence

As modern system of systems (SoS) become increasingly adaptive and human-centred, traditional architectures often struggle to support interoperability, reconfigurability, and effective human-system interaction. This paper addresses these challenges by advancing the stateof-the-art holonic architecture for SoS, offering two main contributions to support these adaptive needs. First, we propose a layered architecture for holons, which includes reasoning, communication, and capabilities layers. This design facilitates seamless interoperability among heterogeneous constituent systems by improving data exchange and integration. Second, inspired by principles of intelligent manufacturing, we introduce specialised holons-namely, supervisor, planner, task, and resource holons-aimed at enhancing the adaptability and reconfigurability of SoS. These specialised holons utilise large language models within their reasoning layers to support decision-making and ensure real-time adaptability. We demonstrate our approach through a 3D mobility case study focused on smart city transportation, showcasing its potential for managing complex, multimodal SoS environments. Additionally, we propose evaluation methods to assess the architecture's efficiency and scalability, laying the groundwork for future empirical validations through simulations and real-world implementations.


Survival of the Safest: Towards Secure Prompt Optimization through Interleaved Multi-Objective Evolution

Sinha, Ankita, Cui, Wendi, Das, Kamalika, Zhang, Jiaxin

arXiv.org Artificial Intelligence

Large language models (LLMs) have demonstrated remarkable capabilities; however, the optimization of their prompts has historically prioritized performance metrics at the expense of crucial safety and security considerations. To overcome this shortcoming, we introduce "Survival of the Safest" (SoS), an innovative multi-objective prompt optimization framework that enhances both performance and security in LLMs simultaneously. SoS utilizes an interleaved multi-objective evolution strategy, integrating semantic, feedback, and crossover mutations to effectively traverse the prompt landscape. Differing from the computationally demanding Pareto front methods, SoS provides a scalable solution that expedites optimization in complex, high-dimensional discrete search spaces while keeping computational demands low. Our approach accommodates flexible weighting of objectives and generates a pool of optimized candidates, empowering users to select prompts that optimally meet their specific performance and security needs. Experimental evaluations across diverse benchmark datasets affirm SoS's efficacy in delivering high performance and notably enhancing safety and security compared to single-objective methods. This advancement marks a significant stride towards the deployment of LLM systems that are both high-performing and secure across varied industrial applications


Enhancing Holonic Architecture with Natural Language Processing for System of Systems

Ashfaq, Muhammad, Sadik, Ahmed R., Mikkonen, Tommi, Waseem, Muhammad, akitalo, Niko M

arXiv.org Artificial Intelligence

The complexity and dynamic nature of System of Systems (SoS) necessitate efficient communication mechanisms to ensure interoperability and collaborative functioning among constituent systems, termed holons. This paper proposes an innovative approach to enhance holon communication within SoS through the integration of Conversational Generative Intelligence (CGI) techniques. Our approach leverages advancements in CGI, specifically Large Language Models (LLMs), to enable holons to understand and act on natural language instructions. This fosters more intuitive human-holon interactions, improving social intelligence and ultimately leading to better coordination among diverse systems. This position paper outlines a conceptual framework for CGI-enhanced holon interaction, discusses the potential impact on SoS adaptability, usability and efficiency, and sets the stage for future exploration and prototype implementation.


Comparing Data-Driven and Mechanistic Models for Predicting Phenology in Deciduous Broadleaf Forests

Reimers, Christian, Rachti, David Hafezi, Liu, Guahua, Winkler, Alexander J.

arXiv.org Artificial Intelligence

Understanding the future climate is crucial for informed policy decisions on climate change prevention and mitigation. Earth system models play an important role in predicting future climate, requiring accurate representation of complex sub-processes that span multiple time scales and spatial scales. One such process that links seasonal and interannual climate variability to cyclical biological events is tree phenology in deciduous broadleaf forests. Phenological dates, such as the start and end of the growing season, are critical for understanding the exchange of carbon and water between the biosphere and the atmosphere. Mechanistic prediction of these dates is challenging. Hybrid modelling, which integrates data-driven approaches into complex models, offers a solution. In this work, as a first step towards this goal, train a deep neural network to predict a phenological index from meteorological time series. We find that this approach outperforms traditional process-based models. This highlights the potential of data-driven methods to improve climate predictions. We also analyze which variables and aspects of the time series influence the predicted onset of the season, in order to gain a better understanding of the advantages and limitations of our model.


Systems of systems: The next big step for edge AI

#artificialintelligence

All the sessions from Transform 2021 are available on-demand now. Thanks to AI advancements and applications, edge computing is already seeing widespread interest from industries ranging from manufacturing to healthcare and retail. Leveraging the growing power and ubiquity of CPUs and neural processing units, edge AI can process growing haystacks of data right where they're being created, finding their needles quickly for local or remote processing. Edge AI is an enabler for early networked autonomous cars -- instantly recognizing and sharing details on accidents, weather conditions, and traffic from vehicle sensors and smart infrastructures in real-time. Similarly, edge AI has empowered wearables to actively monitor seniors for chronic health conditions, alerting remote caregivers within seconds of detecting abnormalities in their biometric data. It's clear that edge AI has the ability to open up a whole new world of insights and opportunities across multiple industries, but connecting the distributed data processors to usefully aggregate their discoveries is a higher-level task.


A new perspective of paramodulation complexity by solving massive 8 puzzles

Ando, Ruo, Takefuji, Yoshiyasu

arXiv.org Artificial Intelligence

A sliding puzzle is a combination puzzle where a player slide pieces along certain routes on a board to reach a certain end-configuration. In this paper, we propose a novel measurement of complexity of massive sliding puzzles with paramodulation which is an inference method of automated reasoning. It turned out that by counting the number of clauses yielded with paramodulation, we can evaluate the difficulty of each puzzle. In experiment, we have generated 100 * 8 puzzles which passed the solvability checking by countering inversions. By doing this, we can distinguish the complexity of 8 puzzles with the number of generated with paramodulation. For example, board [2,3,6,1,7,8,5,4, hole] is the easiest with score 3008 and board [6,5,8,7,4,3,2,1, hole] is the most difficult with score 48653. Besides, we have succeeded to obverse several layers of complexity (the number of clauses generated) in 100 puzzles. We can conclude that proposal method can provide a new perspective of paramodulation complexity concerning sliding block puzzles.