remi
ReMI: A Dataset for Reasoning with Multiple Images
With the continuous advancement of large language models (LLMs), it is essential to create new benchmarks to evaluate their expanding capabilities and identify areas for improvement. This work focuses on multi-image reasoning, an emerging capability in state-of-the-art LLMs. We introduce ReMI, a dataset designed to assess LLMs' ability to reason with multiple images. This dataset encompasses a diverse range of tasks, spanning various reasoning domains such as math, physics, logic, code, table/chart understanding, and spatial and temporal reasoning. It also covers a broad spectrum of characteristics found in multi-image reasoning scenarios. We have benchmarked several cutting-edge LLMs using ReMI and found a substantial gap between their performance and human-level proficiency. This highlights the challenges in multi-image reasoning and the need for further research. Our analysis also reveals the strengths and weaknesses of different models, shedding light on the types of reasoning that are currently attainable and areas where future models require improvement. We anticipate that ReMI will be a valuable resource for developing and evaluating more sophisticated LLMs capable of handling real-world multi-image understanding tasks.
ReMI: A Dataset for Reasoning with Multiple Images -- Supplementary Material
In this section, we follow the recommendations in Gebru et al. For what purpose was the dataset created? Who created the dataset ( e.g., which team, research group) and on behalf of which Who funded the creation of the dataset? What do the instances that comprise the dataset represent ( e.g., documents, photos, How many instances are there in total (of each type, if appropriate)? Parts of the dataset have been created programatically.
- Europe > Switzerland > Zürich > Zürich (0.14)
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
- North America > Dominican Republic (0.04)
- (3 more...)
- Consumer Products & Services (0.68)
- Transportation (0.47)
- Europe > Switzerland > Zürich > Zürich (0.14)
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
- North America > Dominican Republic (0.04)
- (3 more...)
- Consumer Products & Services (0.68)
- Transportation (0.47)
REMI: A Novel Causal Schema Memory Architecture for Personalized Lifestyle Recommendation Agents
Raman, Vishal, R, Vijai Aravindh, Ragav, Abhijith
Personalized AI assistants often struggle to incorporate complex personal data and causal knowledge, leading to generic advice that lacks explanatory power. We propose REMI, a Causal Schema Memory architecture for a multimodal lifestyle agent that integrates a personal causal knowledge graph, a causal reasoning engine, and a schema based planning module. The idea is to deliver explainable, personalized recommendations in domains like fashion, personal wellness, and lifestyle planning. Our architecture uses a personal causal graph of the user's life events and habits, performs goal directed causal traversals enriched with external knowledge and hypothetical reasoning, and retrieves adaptable plan schemas to generate tailored action plans. A Large Language Model orchestrates these components, producing answers with transparent causal explanations. We outline the CSM system design and introduce new evaluation metrics for personalization and explainability, including Personalization Salience Score and Causal Reasoning Accuracy, to rigorously assess its performance. Results indicate that CSM based agents can provide more context aware, user aligned recommendations compared to baseline LLM agents. This work demonstrates a novel approach to memory augmented, causal reasoning in personalized agents, advancing the development of transparent and trustworthy AI lifestyle assistants.
- North America > United States (0.04)
- Asia > India > Tamil Nadu > Chennai (0.04)
- Health & Medicine > Consumer Health (0.48)
- Information Technology > Security & Privacy (0.35)
ReMi: A Random Recurrent Neural Network Approach to Music Production
Chateau-Laurent, Hugo, Vanhatalo, Tara, Pan, Wei-Tung, Hinaut, Xavier
W e show that randomly initialized recurrent neural networks can produce arpeggios and low-frequency oscillations that are rich and configurable. In contrast to end-to-end music generation that aims to replace musicians, our approach expands their creativity while requiring no data and much less computational power . More information can be found at: https://allendia.com/ 1. INTRODUCTION Artificial intelligence continues to drive significant changes in music production. However, current methods often require vast amounts of high-quality data, which are not always readily available.
ReMI: A Dataset for Reasoning with Multiple Images
With the continuous advancement of large language models (LLMs), it is essential to create new benchmarks to evaluate their expanding capabilities and identify areas for improvement. This work focuses on multi-image reasoning, an emerging capability in state-of-the-art LLMs. We introduce ReMI, a dataset designed to assess LLMs' ability to reason with multiple images. This dataset encompasses a diverse range of tasks, spanning various reasoning domains such as math, physics, logic, code, table/chart understanding, and spatial and temporal reasoning. It also covers a broad spectrum of characteristics found in multi-image reasoning scenarios. We have benchmarked several cutting-edge LLMs using ReMI and found a substantial gap between their performance and human-level proficiency.
Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional Representation
Huang, Jingyue, Chen, Ke, Yang, Yi-Hsuan
Managing the emotional aspect remains a challenge in automatic music generation. Prior works aim to learn various emotions at once, leading to inadequate modeling. This paper explores the disentanglement of emotions in piano performance generation through a two-stage framework. The first stage focuses on valence modeling of lead sheet, and the second stage addresses arousal modeling by introducing performance-level attributes. To further capture features that shape valence, an aspect less explored by previous approaches, we introduce a novel functional representation of symbolic music. This representation aims to capture the emotional impact of major-minor tonality, as well as the interactions among notes, chords, and key signatures. Objective and subjective experiments validate the effectiveness of our framework in both emotional valence and arousal modeling. We further leverage our framework in a novel application of emotional controls, showing a broad potential in emotion-driven music generation.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > Taiwan (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation
Huang, Jingyue, Yang, Yi-Hsuan
Emotion-driven melody harmonization aims to generate diverse harmonies for a single melody to convey desired emotions. Previous research found it hard to alter the perceived emotional valence of lead sheets only by harmonizing the same melody with different chords, which may be attributed to the constraints imposed by the melody itself and the limitation of existing music representation. In this paper, we propose a novel functional representation for symbolic music. This new method takes musical keys into account, recognizing their significant role in shaping music's emotional character through major-minor tonality. It also allows for melodic variation with respect to keys and addresses the problem of data scarcity for better emotion modeling. A Transformer is employed to harmonize key-adaptable melodies, allowing for keys determined in rule-based or model-based manner. Experimental results confirm the effectiveness of our new representation in generating key-aware harmonies, with objective and subjective evaluations affirming the potential of our approach to convey specific valence for versatile melody.
- North America > United States > California > San Diego County > San Diego (0.04)
- Asia > Taiwan (0.04)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning
Sula, Nexhi, Kumar, Abhinav, Hou, Jie, Wang, Han, Tourani, Reza
With the continued advancement and widespread adoption of machine learning (ML) models across various domains, ensuring user privacy and data security has become a paramount concern. In compliance with data privacy regulations, such as GDPR, a secure machine learning framework should not only grant users the right to request the removal of their contributed data used for model training but also facilitates the elimination of sensitive data fingerprints within machine learning models to mitigate potential attack - a process referred to as machine unlearning. In this study, we present a novel unlearning mechanism designed to effectively remove the impact of specific data samples from a neural network while considering the performance of the unlearned model on the primary task. In achieving this goal, we crafted a novel loss function tailored to eliminate privacy-sensitive information from weights and activation values of the target model by combining target classification loss and membership inference loss. Our adaptable framework can easily incorporate various privacy leakage approximation mechanisms to guide the unlearning process. We provide empirical evidence of the effectiveness of our unlearning approach with a theoretical upper-bound analysis through a membership inference mechanism as a proof of concept. Our results showcase the superior performance of our approach in terms of unlearning efficacy and latency as well as the fidelity of the primary task, across four datasets and four deep learning architectures.
- North America > United States > Missouri > St. Louis County > St. Louis (0.04)
- Europe > Spain (0.04)
- North America > United States > Kansas (0.04)
- North America > United States > California (0.04)