rehearsal
b3b43aeeacb258365cc69cdaf42a68af-Paper.pdf
We present an approach for lifelong/continual learning of convolutional neural networks (CNN) that does not suffer from the problem of catastrophic forgetting when moving from onetask totheother. Weshowthat theactivation maps generated by the CNN trained on the old task can be calibrated using very few calibration parameters, to become relevant to the new task.
- North America > United States > California (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Asia > India (0.04)
- Oceania > New Zealand > North Island > Waikato (0.05)
- Europe > France > Île-de-France > Paris > Paris (0.04)
A simple but strong baseline for online continual learning: Repeated Augmented Rehearsal
Online continual learning (OCL) aims to train neural networks incrementally from a non-stationary data stream with a single pass through data. Rehearsal-based methods attempt to approximate the observed input distributions over time with a small memory and revisit them later to avoid forgetting. Despite their strong empirical performance, rehearsal methods still suffer from a poor approximation of past data's loss landscape with memory samples. This paper revisits the rehearsal dynamics in online settings. We provide theoretical insights on the inherent memory overfitting risk from the viewpoint of biased and dynamic empirical risk minimization, and examine the merits and limits of repeated rehearsal.Inspired by our analysis, a simple and intuitive baseline, repeated augmented rehearsal (RAR), is designed to address the underfitting-overfitting dilemma of online rehearsal. Surprisingly, across four rather different OCL benchmarks,this simple baseline outperforms vanilla rehearsal by 9\%-17\% and also significantly improves the state-of-the-art rehearsal-based methods MIR, ASER, and SCR. We also demonstrate that RAR successfully achieves an accurate approximation of the loss landscape of past data and high-loss ridge aversion in its learning trajectory. Extensive ablation studies are conducted to study the interplay between repeated and augmented rehearsal, and reinforcement learning (RL) is applied to dynamically adjust the hyperparameters of RAR to balance the stability-plasticity trade-off online.
PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback
Chen, Sirui, Zhou, Jinsong, Xu, Xinli, Yang, Xiaoyu, Guo, Litao, Chen, Ying-Cong
Effective presentation skills are essential in education, professional communication, and public speaking, yet learners often lack access to high-quality exemplars or personalized coaching. Existing AI tools typically provide isolated functionalities such as speech scoring or script generation without integrating reference modeling and interactive feedback into a cohesive learning experience. We introduce a dual-agent system that supports presentation practice through two complementary roles: the Ideal Presentation Agent and the Coach Agent. The Ideal Presentation Agent converts user-provided slides into model presentation videos by combining slide processing, visual-language analysis, narration script generation, personalized voice synthesis, and synchronized video assembly. The Coach Agent then evaluates user-recorded presentations against these exemplars, conducting multimodal speech analysis and delivering structured feedback in an Observation-Impact-Suggestion (OIS) format. To enhance the authenticity of the learning experience, the Coach Agent incorporates an Audience Agent, which simulates the perspective of a human listener and provides humanized feedback reflecting audience reactions and engagement. Together, these agents form a closed loop of observation, practice, and feedback. Implemented on a robust backend with multi-model integration, voice cloning, and error handling mechanisms, the system demonstrates how AI-driven agents can provide engaging, human-centered, and scalable support for presentation skill development in both educational and professional contexts.
- Asia > China > Hong Kong (0.41)
- Asia > China > Guangdong Province > Guangzhou (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.68)
- Education (1.00)
- Health & Medicine > Therapeutic Area (0.47)
- Information Technology > Security & Privacy (0.35)
New AI technique sounding out audio deepfakes
Researchers from Australia's national science agency CSIRO, Federation University Australia and RMIT University have developed a method to improve the detection of audio deepfakes. The new technique, Rehearsal with Auxiliary-Informed Sampling (RAIS), is designed for audio deepfake detection -- a growing threat in cybercrime risks such as bypassing voice-based biometric authentication systems, impersonation and disinformation. It determines whether an audio clip is real or artificially generated (a'deepfake') and maintains performance over time as attack types evolve. In Italy earlier this year, an AI-cloned voice of its Defence Minister requested a €1M'ransom' from prominent business leaders, convincing some to pay. This is just one of many examples, highlighting the need for audio deepfake detectors.
Taming Modality Entanglement in Continual Audio-Visual Segmentation
Hong, Yuyang, Yang, Qi, Zhang, Tao, Wang, Zili, Fu, Zhaojin, Ding, Kun, Fan, Bin, Xiang, Shiming
Recently, significant progress has been made in multi-modal continual learning, aiming to learn new tasks sequentially in multi-modal settings while preserving performance on previously learned ones. However, existing methods mainly focus on coarse-grained tasks, with limitations in addressing modality entanglement in fine-grained continual learning settings. To bridge this gap, we introduce a novel Continual Audio-Visual Segmentation (CAVS) task, aiming to continuously segment new classes guided by audio. Through comprehensive analysis, two critical challenges are identified: 1) multi-modal semantic drift, where a sounding objects is labeled as background in sequential tasks; 2) co-occurrence confusion, where frequent co-occurring classes tend to be confused. In this work, a Collision-based Multi-modal Rehearsal (CMR) framework is designed to address these challenges. Specifically, for multi-modal semantic drift, a Multi-modal Sample Selection (MSS) strategy is proposed to select samples with high modal consistency for rehearsal. Meanwhile, for co-occurence confusion, a Collision-based Sample Rehearsal (CSR) mechanism is designed, allowing for the increase of rehearsal sample frequency of those confusable classes during training process. Moreover, we construct three audio-visual incremental scenarios to verify effectiveness of our method. Comprehensive experiments demonstrate that our method significantly outperforms single-modal continual learning methods.
Woman, 84, horrified after stomach-churning discovery in Morrisons juice: 'Dead snake' slithered out of carton
Jon Stewart leads defense of Jimmy Kimmel as late night hosts unite to mock Trump's'censorship' Trump just humiliated Harry and Meghan with two brutal words... but even more embarrassing is the reason they're having to stay silent: MAUREEN CALLAHAN Disturbing full story of singer D4vd's relationship with girl, 13, found dismembered in his Tesla... as creepy messages, songs and links to stars are exposed Utter chaos breaks out backstage at The View over Jimmy Kimmel: Hosts at war and staff in fear... as network bosses impose strict new'rule' The strain shows on Jimmy Kimmel as he emerges for first time after show's shock cancellation Queen Camilla appears to'pull rank' as Kate chats animatedly with Melania during State visit - and ushers Princess back towards William Seth Meyers responds to Jimmy Kimmel cancellation with dose of mockery for Trump: 'A great president, an even better golfer' President of America's biggest university forced to step down over'transgender indoctrination' Starbucks responds after barista refuses to write'Charlie Kirk' on customer's cup due to'policy' Millions under tsunami threat as fallout from monster 7.8 earthquake threatens US Woman, 84, horrified after stomach-churning discovery in Morrisons juice: 'Dead snake' slithered out of carton Two elderly women were left horrified and upset when they found a'dead snake' in a carton of fruit juice--and refuse to believe supermarket bosses' claim that the foot-long gelatinous entity is merely a string of mould. Betty Richards, 84, bought a carton of apple and mango juice from the Armthorpe branch of Morrisons as a treat for her best friend of twenty years, Julie Bircumshaw, also 84. The BBC reports that Ms Bircumshaw noticed some'bits of black' around the opening of the 1L carton, but after tasting the juice, thought it was fine to drink. When Ms Richards popped over to see her friend at home in Doncaster a week later, she was told about the discolouration around the nozzle. She was concerned, and decided to take a closer look at the £1.35 carton--which was almost empty.
- North America > United States > New York (0.05)
- South America (0.04)
- North America > United States > California > Los Angeles County > Los Angeles (0.04)
- (6 more...)
- Media > Television (1.00)
- Leisure & Entertainment (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.94)
- (5 more...)
A Theoretical Analysis
Note 3: Consider a balanced continual learning dataset (e.g., Split-CIFAR100, Split-Mini-ImageNet) Note 4: Consider general continual learning datasets. The hyperparameter settings are summarized in Table 4. All models are optimized using vanilla SGD. For all experiments, we use the learning rate of 0.1 following the same setting as in Aljundi et al. Mai et al. reported (2021) considerable and consistent performance gains when replacing the Softmax classifier with the NCM classifier.
- Oceania > New Zealand > North Island > Waikato (0.05)
- Europe > France > Île-de-France > Paris > Paris (0.04)
Avoiding Undesired Future with Minimal Cost in Non-Stationary Environments
Machine learning (ML) has achieved remarkable success in prediction tasks. In many real-world scenarios, rather than solely predicting an outcome using an ML model, the crucial concern is how to make decisions to prevent the occurrence of undesired outcomes, known as the avoiding undesired future (AUF) problem. To this end, a new framework called rehearsal learning has been proposed recently, which works effectively in stationary environments by leveraging the influence relations among variables. In real tasks, however, the environments are usually non-stationary, where the influence relations may be dynamic, leading to the failure of AUF by the existing method. In this paper, we introduce a novel sequential methodology that effectively updates the estimates of dynamic influence relations, which are crucial for rehearsal learning to prevent undesired outcomes in non-stationary environments.