Goto

Collaborating Authors

 symphony


Artificial Intelligence Agents in Music Analysis: An Integrative Perspective Based on Two Use Cases

Martínez-Heredia, Antonio Manuel, Rodríguez, Dolores Godrid, García, Andrés Ortiz

arXiv.org Artificial Intelligence

Despite considerable technological innovation, comprehensive reviews synthesizing the application and evolution of artificial intelligence (AI) in the field of music analysis remain scarce. Although early studies on computer-assisted composition and rule-based analysis established a foundation for the automated exploration of musical form and content Hiller (1959), there is still a limited body of literature addressing the complete progression from traditional algorithms to recent AI-driven models and hybrid systems. Pioneering work such as Miranda's Miranda (2021), underscores the influence of AI, supercomputing, and evolutionary computation in shaping the first computational tools for creation. Recent reviews (Wang et al. (2024); Lerch et al. (2025)) focus on intelligent music generation systems. However, a systematic integration of these historical advances with state-of-the-art AI methodologies and musical analysis is largely absent. In the last decade, deep learning frameworks--including convolutional neural networks, recurrent neural networks, and transformer architectures--have led to breakthroughs in music information retrieval.


Serve Programs, Not Prompts

Gim, In, Zhong, Lin

arXiv.org Artificial Intelligence

Current large language model (LLM) serving systems, primarily designed for text completion, are neither efficient nor adaptable for increasingly complex LLM applications due to their inflexible design. We propose a new LLM serving system architecture that serves programs instead of prompts to address this problem. These programs, called LLM Inference Programs (LIPs), allow users to customize token prediction and KV cache management at runtime and to offload parts of their application logic, such as tool execution, to the server. We describe an example of this architecture through a system named Symphony, which functions as an operating system for LIPs. Symphony exposes LLM model computations via system calls and virtualizes KV cache with a dedicated file system, while ensuring GPU efficiency with a two-level process scheduling scheme. Symphony has the potential to open the door to a more efficient and extensible ecosystem for LLM applications.


Symphony: A Decentralized Multi-Agent Framework for Scalable Collective Intelligence

Wang, Ji, Chen, Kashing, Song, Xinyuan, Zhang, Ke, Ai, Lynn, Yang, Eric, Shi, Bill

arXiv.org Artificial Intelligence

Most existing Large Language Model (LLM)-based agent frameworks rely on centralized orchestration, incurring high deployment costs, rigid communication topologies, and limited adaptability. To address these challenges, we introduce Symphony, a decentralized multi-agent system which enables lightweight LLMs on consumer-grade GPUs to coordinate. Symphony introduces three key mechanisms: (1) a decentralized ledger that records capabilities, (2) a Beacon-selection protocol for dynamic task allocation, and (3) weighted result voting based on CoTs. This design forms a privacy-saving, scalable, and fault-tolerant orchestration with low overhead. Empirically, Symphony outperforms existing baselines on reasoning benchmarks, achieving substantial accuracy gains and demonstrating robustness across models of varying capacities.


S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Zhang, Wenyuan, Nie, Shuaiyi, Zhang, Xinghua, Zhang, Zefeng, Liu, Tingwen

arXiv.org Artificial Intelligence

We introduce S1-Bench, a novel benchmark designed to evaluate the performance of Large Reasoning Models (LRMs) on simple tasks that favor intuitive system 1 thinking rather than deliberative system 2 reasoning. While LRMs have achieved significant breakthroughs in complex reasoning tasks through explicit chains of thought, their heavy reliance on system 2 thinking may limit their system 1 thinking capabilities. However, there is a lack of an appropriate benchmark for evaluating LRM's system 1 thinking capabilities. To fill this gap, S1-Bench introduces a suite of simple, diverse, and natural questions across multiple domains and languages, specifically designed to assess LRMs' performance on questions more suitable for system 1 . We conduct extensive evaluations across 28 LRMs, revealing their inefficiency, inadequate accuracy, and limited robustness when handling simple questions. Additionally, we observe a gap between their difficulty perception and generation length. Overall, this work paves the way toward dual-system compatibility in the development of LRMs.


An Integrated Approach to Robotic Object Grasping and Manipulation

Ahmed, Owais, Huzaifa, M, Areeb, M, Khan, Hamza Ali

arXiv.org Artificial Intelligence

In response to the growing challenges of manual labor and efficiency in warehouse operations, Amazon has embarked on a significant transformation by incorporating robotics to assist with various tasks. While a substantial number of robots have been successfully deployed for tasks such as item transportation within warehouses, the complex process of object picking from shelves remains a significant challenge. This project addresses the issue by developing an innovative robotic system capable of autonomously fulfilling a simulated order by efficiently selecting specific items from shelves. A distinguishing feature of the proposed robotic system is its capacity to navigate the challenge of uncertain object positions within each bin of the shelf. The system is engineered to autonomously adapt its approach, employing strategies that enable it to efficiently locate and retrieve the desired items, even in the absence of pre-established knowledge about their placements.


Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for Molecule Generation

Daigavane, Ameya, Kim, Song, Geiger, Mario, Smidt, Tess

arXiv.org Artificial Intelligence

We present Symphony, an $E(3)$-equivariant autoregressive generative model for 3D molecular geometries that iteratively builds a molecule from molecular fragments. Existing autoregressive models such as G-SchNet and G-SphereNet for molecules utilize rotationally invariant features to respect the 3D symmetries of molecules. In contrast, Symphony uses message-passing with higher-degree $E(3)$-equivariant features. This allows a novel representation of probability distributions via spherical harmonic signals to efficiently model the 3D geometry of molecules. We show that Symphony is able to accurately generate small molecules from the QM9 dataset, outperforming existing autoregressive models and approaching the performance of diffusion models.


Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

Jiang, Haoyi, Cheng, Tianheng, Gao, Naiyu, Zhang, Haoyang, Lin, Tianwei, Liu, Wenyu, Wang, Xinggang

arXiv.org Artificial Intelligence

`3D Semantic Scene Completion (SSC) has emerged as a nascent and pivotal undertaking in autonomous driving, aiming to predict voxel occupancy within volumetric scenes. However, prevailing methodologies primarily focus on voxel-wise feature aggregation, while neglecting instance semantics and scene context. In this paper, we present a novel paradigm termed Symphonies (Scene-from-Insts), that delves into the integration of instance queries to orchestrate 2D-to-3D reconstruction and 3D scene modeling. Leveraging our proposed Serial Instance-Propagated Attentions, Symphonies dynamically encodes instance-centric semantics, facilitating intricate interactions between image-based and volumetric domains. Simultaneously, Symphonies enables holistic scene comprehension by capturing context through the efficient fusion of instance queries, alleviating geometric ambiguity such as occlusion and perspective errors through contextual scene reasoning. Experimental results demonstrate that Symphonies achieves state-of-the-art performance on challenging benchmarks SemanticKITTI and SSCBench-KITTI-360, yielding remarkable mIoU scores of 15.04 and 18.58, respectively. These results showcase the paradigm's promising advancements. The code is available at https://github.com/hustvl/Symphonies.


Symphony of experts: orchestration with adversarial insights in reinforcement learning

Jonckheere, Matthieu, Mignacco, Chiara, Stoltz, Gilles

arXiv.org Machine Learning

Structured reinforcement learning leverages policies with advantageous properties to reach better performance, particularly in scenarios where exploration poses challenges. We explore this field through the concept of orchestration, where a (small) set of expert policies guides decision-making; the modeling thereof constitutes our first contribution. We then establish value-functions regret bounds for orchestration in the tabular setting by transferring regret-bound results from adversarial settings. We generalize and extend the analysis of natural policy gradient in Agarwal et al. [2021, Section 5.3] to arbitrary adversarial aggregation strategies. We also extend it to the case of estimated advantage functions, providing insights into sample complexity both in expectation and high probability. A key point of our approach lies in its arguably more transparent proofs compared to existing methods. Finally, we present simulations for a stochastic matching toy model.


Symphony: Optimized Model Serving using Centralized Orchestration

Chen, Lequn, Deng, Weixin, Canumalla, Anirudh, Xin, Yu, Philipose, Matthai, Krishnamurthy, Arvind

arXiv.org Artificial Intelligence

The orchestration of deep neural network (DNN) model inference on GPU clusters presents two significant challenges: achieving high accelerator efficiency given the batching properties of model inference while meeting latency service level objectives (SLOs), and adapting to workload changes both in terms of short-term fluctuations and long-term resource allocation. To address these challenges, we propose Symphony, a centralized scheduling system that can scale to millions of requests per second and coordinate tens of thousands of GPUs. Our system utilizes a non-work-conserving scheduling algorithm capable of achieving high batch efficiency while also enabling robust autoscaling. Additionally, we developed an epoch-scale algorithm that allocates models to sub-clusters based on the compute and memory needs of the models. Through extensive experiments, we demonstrate that Symphony outperforms prior systems by up to 4.7x higher goodput.


Free AI Tools: Unleash Artistic Potential with AI Animation

#artificialintelligence

Welcome, dear art connoisseurs and digital wizards! Today, I present to you an exhilarating fusion of classical art and modern technology – a symphony of sight and sound that promises to captivate your senses. As the world of art continues to evolve, so too does the potential to breathe new life into the masterpieces of yore. Can you imagine the enchanting scenes of Van Gogh's Starry Night swirling in motion, accompanied by the soothing melodies of a nocturne? For the creative minds and generative AI enthusiasts among you, I invite you to embark on a journey to animate the strokes of revered artists and compose a harmonious soundscape that will make their paintings sing.