AITopics | adaptive knowledge distillation

Collaborating Authors

adaptive knowledge distillation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Adaptive Knowledge Distillation for Device-Directed Speech Detection

Chi, Hyung Gun, Pesce, Florian, Chang, Wonil, Rudovic, Oggi, Argueta, Arturo, Braun, Stefan, Garg, Vineet, Abdelaziz, Ahmed Hussen

arXiv.org Artificial IntelligenceAug-6-2025

Device-directed speech detection (DDSD) is a binary classification task that separates the user's queries to a voice assistant (V A) from background speech or side conversations. This is important for achieving naturalistic user experience. To this end, we propose knowledge distillation (KD) to enhance DDSD accuracy while ensuring efficient deployment. Specifically, we introduce a novel adaptive KD method that transfers knowledge from general representations of an ASR large pre-trained acoustic encoder ( teacher). We apply task-specific adapters, on top of the (frozen) teacher encoder, trained jointly with the student model on DDSD. We demonstrate that the proposed adaptive KD outperforms the student model without distillation in the keyword and keyword-free (follow-up) invocations, with an improvement of +26% and +19% in terms of Equal Error Rate, respectively. We also show that this approach generalizes across the transformer and conformer-based model architectures.

artificial intelligence, distillation, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2508.02801

Genre: Research Report (1.00)

Industry: Education (0.74)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.49)

Add feedback

Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers

Nguyen, Thanh Thi, Wilson, Campbell, Dalins, Janis

arXiv.org Artificial IntelligenceAug-19-2024

Assessing the forensic value of hand images involves the use of unique features and patterns present in an individual's hand. The human hand has distinct characteristics, such as the pattern of veins, fingerprints, and the geometry of the hand itself. This paper investigates the use of vision transformers (ViTs) for classification of hand images. We use explainability tools to explore the internal representations of ViTs and assess their impact on the model outputs. Utilizing the internal understanding of ViTs, we introduce distillation methods that allow a student model to adaptively extract knowledge from a teacher model while learning on data of a different domain to prevent catastrophic forgetting. Two publicly available hand image datasets are used to conduct a series of experiments to evaluate performance of the ViTs and our proposed adaptive distillation methods. The experimental results demonstrate that ViT models significantly outperform traditional machine learning methods and the internal states of ViTs are useful for explaining the model outputs in the classification task. By averting catastrophic forgetting, our distillation methods achieve excellent performance on data from both source and target domains, particularly when these two domains exhibit significant dissimilarity. The proposed approaches therefore can be developed and implemented effectively for real-world applications such as access control, identity verification, and authentication systems.

adaptive knowledge distillation, classification, explainable vision transformer, (1 more...)

arXiv.org Artificial Intelligence

2408.10503

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (0.87)
Commercial Services & Supplies > Security & Alarm Services (0.53)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.87)
Information Technology > Artificial Intelligence > Vision (0.60)

Add feedback