takahashi
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
While the direction of arrival (DOA) of sound events is generally estimated from multichannel audio data recorded with a microphone array, sound events usually derive from visually perceptible source objects, e.g., sounds of footsteps come from the feet of a walker. This paper proposes an audio-visual sound event localization and detection (SELD) task, which uses multichannel audio and video information to estimate the temporal activation and DOA of target sound events. Audio-visual SELD systems can detect and localize sound events using signals from a microphone array and audio-visual correspondence. We also introduce an audio-visual dataset, Sony-TAU Realistic Spatial Soundscapes 2023 (STARSS23), which consists of multichannel audio data recorded with a microphone array, video data, and spatiotemporal annotation of sound events. Sound scenes in STARSS23 are recorded with instructions, which guide recording participants to ensure adequate activity and occurrences of sound events. STARSS23 also provides human-annotated temporal activation labels and human-confirmed DOA labels, which are based on tracking results of a motion capture system. Our benchmark results demonstrate the benefits of using visual object positions in audio-visual SELD tasks. The data is available at https://zenodo.org/record/7880637.
Scalable Satellite Swarm Deployment via Distance-based Orbital Transition Under $J_2$ Perturbation
Takahashi, Yuta, Sakai, Shin-ichiro
This paper presents an autonomous guidance and control strategy for a satellite swarm that enables scalable distributed space structures for innovative science and business opportunities. The averaged $J_2$ orbital parameters that describe the drift and periodic orbital motion were derived along with their target values to achieve a distributed space structure in a decentralized manner. This enabled the design of a distance-based orbital stabilizer to ensure autonomous deployment into a monolithic formation of a coplanar equidistant configuration on a user-defined orbital plane. Continuous formation control was assumed to be achieved through fuel-free actuation, such as satellite magnetic field interaction and differential aerodynamic forces, thereby maintaining long-term formation stability without thruster usage. A major challenge for such actuation systems is the potential loss of control capability due to increasing inter-satellite distances resulting from unstable orbital dynamics, particularly for autonomous satellite swarms. To mitigate this risk, our decentralized deployment controller minimized drift distance during unexpected communication outages. As a case study, we consider the deployment of palm-sized satellites into a coplanar equidistant formation in a $J_2$-perturbed orbit. Moreover, centralized grouping strategies are presented.
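For readers unfamiliar with $J_2$-induced drift, the following is a minimal sketch of the textbook secular nodal regression rate, not the averaged parameters derived in the paper. The function name and the constant values (standard Earth values) are assumptions introduced here for illustration.

```python
import math

# Assumed standard Earth constants (not taken from the abstract).
MU_EARTH = 398600.4418  # gravitational parameter, km^3/s^2
R_EARTH = 6378.137      # equatorial radius, km
J2 = 1.08263e-3         # second zonal harmonic coefficient


def raan_drift_deg_per_day(a_km, ecc, inc_deg):
    """Secular drift of the right ascension of the ascending node under J2.

    Uses the first-order textbook formula
        dOmega/dt = -1.5 * J2 * (R_e / p)^2 * n * cos(i),
    with mean motion n = sqrt(mu / a^3) and semi-latus rectum p = a (1 - e^2).
    """
    n = math.sqrt(MU_EARTH / a_km**3)  # mean motion, rad/s
    p = a_km * (1.0 - ecc**2)          # semi-latus rectum, km
    rate = -1.5 * J2 * (R_EARTH / p) ** 2 * n * math.cos(math.radians(inc_deg))
    return math.degrees(rate) * 86400.0


if __name__ == "__main__":
    # A roughly 700 km circular orbit at 98.19 deg inclination drifts eastward
    # at close to the Sun-synchronous rate of about one degree per day.
    print(raan_drift_deg_per_day(7078.137, 0.0, 98.19))
```

Two satellites with slightly different semi-major axes or inclinations have different drift rates, which is one way uncontrolled inter-satellite distances can grow over time.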
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- North America > United States > Massachusetts (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Certified Coil Geometry Learning for Short-Range Magnetic Actuation and Spacecraft Docking Application
Takahashi, Yuta, Tajima, Hayate, Sakai, Shin-ichiro
This paper presents a learning-based framework for approximating an exact magnetic-field interaction model, supported by both numerical and experimental validation. High-fidelity magnetic-field interaction modeling is essential for achieving exceptional accuracy and responsiveness across a wide range of fields, including transportation, energy systems, medicine, biomedical robotics, and aerospace robotics. In aerospace engineering, magnetic actuation has been investigated as a fuel-free solution for multi-satellite attitude and formation control. Although the exact magnetic field can be computed from the Biot-Savart law, the associated computational cost is prohibitive, and prior studies have therefore relied on dipole approximations to improve efficiency. However, these approximations lose accuracy during proximity operations, leading to unstable behavior and even collisions. To address this limitation, we develop a learning-based approximation framework that faithfully reproduces the exact field while dramatically reducing computational cost. The proposed method additionally provides a certified error bound, derived from the number of training samples, ensuring reliable prediction accuracy. The learned model can also accommodate interactions between coils of different sizes through appropriate geometric transformations, without retraining. To verify the effectiveness of the proposed framework under challenging conditions, a spacecraft docking scenario is examined through both numerical simulations and experimental validation.
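To illustrate why dipole approximations degrade at short range, here is a minimal sketch that compares the exact on-axis field of a single circular current loop (the closed-form Biot-Savart result) with its far-field dipole approximation. It covers only one loop on its own axis, not the coil-to-coil interaction model or the learned approximation in the paper; all function names are hypothetical.

```python
import math

MU0 = 4e-7 * math.pi  # vacuum permeability, T*m/A


def b_axis_loop(current_a, radius_m, z_m):
    """Exact on-axis field of a circular loop: mu0 I R^2 / (2 (R^2 + z^2)^(3/2))."""
    return MU0 * current_a * radius_m**2 / (2.0 * (radius_m**2 + z_m**2) ** 1.5)


def b_axis_dipole(current_a, radius_m, z_m):
    """On-axis dipole approximation: mu0 m / (2 pi z^3), with m = I pi R^2."""
    moment = current_a * math.pi * radius_m**2
    return MU0 * moment / (2.0 * math.pi * z_m**3)


def dipole_overestimate(radius_m, z_m):
    """Ratio of the dipole field to the exact field; equals (1 + (R/z)^2)^(3/2)."""
    return b_axis_dipole(1.0, radius_m, z_m) / b_axis_loop(1.0, radius_m, z_m)


if __name__ == "__main__":
    radius = 0.1  # assumed coil radius in metres, for illustration only
    for z in (0.1, 0.2, 0.5, 1.0):
        print(f"z = {z:.1f} m: dipole / exact = {dipole_overestimate(radius, z):.3f}")
```

At ten coil radii the two expressions agree to within about 1.5 percent, while at one coil radius the dipole model overestimates the on-axis field by a factor of roughly 2.8.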
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- North America > United States > Massachusetts (0.04)
- Europe > Germany > Bremen > Bremen (0.04)
- Aerospace & Defense (0.88)
- Education > Curriculum > Subject-Specific Education (0.40)
NODA-MMH: Certified Learning-Aided Nonlinear Control for Magnetically-Actuated Swarm Experiment Toward On-Orbit Proof
Takahashi, Yuta, Ochi, Atsuki, Tomioka, Yoichi, Sakai, Shin-Ichiro
This study experimentally validates the principle of large-scale satellite swarm control through learning-aided magnetic field interactions generated by satellite-mounted magnetorquers. This actuation presents a promising solution for the long-term formation maintenance of multiple satellites and has primarily been demonstrated in ground-based testbeds for two-satellite position control. However, as the number of satellites increases beyond three, fundamental challenges coupled with the high nonlinearity arise: 1) nonholonomic constraints, 2) underactuation, 3) scalability, and 4) computational cost. Previous studies have shown that time-integrated current control theoretically solves these problems, where the average actuator outputs align with the desired command, and a learning-based technique further enhances their performance. Through multiple experiments, we validate critical aspects of learning-aided time-integrated current control: (1) enhanced controllability of the averaged system dynamics, with a theoretically guaranteed error bound, and (2) decentralized current management. We design two-axis coils and a ground-based experimental setup utilizing an air-bearing platform, enabling a mathematical replication of orbital dynamics. Based on the effectiveness of the learned interaction model, we introduce NODA-MMH (Neural power-Optimal Dipole Allocation for certified learned Model-based Magnetically swarm control Harness) for model-based power-optimal swarm control. This study complements our tutorial paper on magnetically actuated swarms for the long-term formation maintenance problem.
- North America > United States > Massachusetts (0.04)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Oceania > Australia > New South Wales > Sydney (0.04)
Top Bananza! Donkey Kong's long-awaited return is a literal smash-hit
When you think of Nintendo, it's almost impossible not to picture Donkey Kong. Yet despite Donkers' undeniable place in gaming history – and obligatory appearances in Smash Bros and Mario Kart – for the last few console generations, Donkey Kong platformers have been MIA. Enter DK's first standalone adventure in 11 years, Donkey Kong Bananza. While Mario's recent adventures saw him exploring the reaches of outer space or deftly possessing enemies with an anthropomorphic hat, DK's grand return is all about primal rage. As you smash and punch your way through walls, floors and ceilings, you can burrow all the way to the ground below, forging new paths and unearthing hidden treasures.
- North America > Canada > Ontario > Middlesex County > London (0.05)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation
Dong, Yuxuan, Wang, Qing, Hong, Hengyi, Jiang, Ya, Cheng, Shi
Traditional sound event localization and detection (SELD) tasks typically focus on sound event detection (SED) and direction-of-arrival (DOA) estimation, which fall short of providing full spatial information about the sound source. The 3D SELD task addresses this limitation by integrating source distance estimation (SDE), allowing for complete spatial localization. We propose three approaches to tackle this challenge: a novel method with independent training and joint prediction, which first treats DOA and distance estimation as separate tasks and then combines them to solve 3D SELD; a dual-branch representation with source Cartesian coordinates used for simultaneous DOA and distance estimation; and a three-branch structure that jointly models SED, DOA, and SDE within a unified framework. Our proposed method ranked first in the DCASE 2024 Challenge Task 3, demonstrating the effectiveness of joint modeling for addressing the 3D SELD task. The relevant code for this paper will be open-sourced in the future.
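The Cartesian representation mentioned above can be sketched as follows: a single 3D vector encodes both DOA (its direction) and distance (its norm). This is a minimal illustration under an assumed axis convention (azimuth measured from the x-axis in the horizontal plane, elevation from that plane); the function names are hypothetical and the authors' actual convention and network outputs may differ.

```python
import math


def to_cartesian(az_deg, el_deg, dist_m):
    """Encode azimuth, elevation (degrees) and distance as one (x, y, z) vector."""
    az, el = math.radians(az_deg), math.radians(el_deg)
    return (
        dist_m * math.cos(el) * math.cos(az),
        dist_m * math.cos(el) * math.sin(az),
        dist_m * math.sin(el),
    )


def from_cartesian(x, y, z):
    """Recover (azimuth_deg, elevation_deg, distance) from an (x, y, z) vector."""
    dist = math.sqrt(x * x + y * y + z * z)
    if dist == 0.0:
        raise ValueError("direction is undefined for a zero-length vector")
    az = math.degrees(math.atan2(y, x))
    el = math.degrees(math.atan2(z, math.hypot(x, y)))
    return az, el, dist


if __name__ == "__main__":
    vec = to_cartesian(30.0, 45.0, 2.0)
    print(vec, from_cartesian(*vec))
```

A regression head that predicts such a vector yields DOA and distance together, whereas a unit-norm vector carries direction only.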
- Research Report > New Finding (0.64)
- Research Report > Experimental Study (0.40)
Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets
Tsuboya, Akane, Kono, Yu, Takahashi, Tatsuji
The objective of a reinforcement learning agent is to discover better actions through exploration. However, typical exploration techniques aim to maximize rewards, often incurring high costs in both exploration and learning processes. We propose a novel deep reinforcement learning method, which prioritizes achieving an aspiration level over maximizing expected return. This method flexibly adjusts the degree of exploration based on the proportion of target achievement. In experiments on a motion control task and a navigation task, this method achieved returns equal to or greater than those of other standard methods. The analysis showed two things: our method flexibly adjusts the exploration scope, and it has the potential to enable the agent to adapt to non-stationary environments. These findings indicate that the method may improve exploration efficiency in practical applications of reinforcement learning.
The surreal, colourful Katamari Damacy is 20 – and still the weirdest game I have ever loved
My parents were somewhat sceptical of video games when I was growing up. I did have a SNES and then an N64 as a child, but I was allowed to play them only at weekends, so on Fridays I would come home from school and binge on Mario 64 with a huge pack of Haribo Tangfastics. My gaming horizons didn't broaden until I was a teenager, when I started earning enough of my own money to buy myself a PlayStation 2 and I started hanging out on forums with other nerds whose gaming worlds were significantly broader than mine. And the PlayStation 2 had some weird games. The N64 did to an extent – I nurture an enduring fondness for Mystical Ninja Starring Goemon – but not like Sony's console.
Rats can bop their heads in time to the beat of music, study reveals
Most of us love to have a bit of a boogie, and some - but not all - can also keep to the beat! It turns out we're not alone in that, as a new study has found that rats can nod their heads in time to music. Researchers from the University of Tokyo played the rodents clips of Lady Gaga, Queen and Michael Jackson as well as a Mozart Sonata at four different tempos. Any bopping was recorded both on camera and with a miniature sensor strapped to their heads. 'Rats displayed innate - that is, without any training or prior exposure to music - beat synchronization most distinctly within 120-140 beats per minute (bpm),' said Associate Professor Hirokazu Takahashi.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.26)
- North America > United States > Missouri (0.05)
- North America > United States > California > San Diego County > San Diego (0.05)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
CoLoC: Conditioned Localizer and Classifier for Sound Event Localization and Detection
Kapka, Sławomir, Tkaczuk, Jakub
In this article, we describe the Conditioned Localizer and Classifier (CoLoC), a novel solution for Sound Event Localization and Detection (SELD). The solution consists of two stages: localization is done first and is followed by classification conditioned on the output of the localizer. To resolve the problem of the unknown number of sources, we incorporate an idea borrowed from Sequential Set Generation (SSG). Models from both stages are SELDnet-like CRNNs, but with single outputs. Our reasoning shows that two such single-output models are fit for the SELD task. We show that our solution improves on the baseline system in most metrics on the STARSS22 Dataset.
- Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.05)
- Europe > Poland > Masovia Province > Warsaw (0.04)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)