DeepKnowledge: Generalisation-Driven Deep Learning Testing
Missaoui, Sondess, Gerasimou, Simos, Matragkas, Nikolaos
arXiv.org Artificial Intelligence
Despite their unprecedented success, deep neural networks (DNNs) are notoriously fragile to small shifts in data distribution, demanding effective testing techniques that can assess their dependability. Although DNN testing has advanced recently, systematic testing approaches that assess a DNN's capability to generalise and operate comparably beyond data in its training distribution are still lacking. We address this gap with DeepKnowledge, a systematic testing methodology for DNN-based systems founded on the theory of knowledge generalisation, which aims to enhance DNN robustness and reduce the residual risk of 'black box' models. Conforming to this theory, DeepKnowledge posits that core computational DNN units, termed Transfer Knowledge neurons, can generalise under domain shift. DeepKnowledge provides an objective confidence measurement on the testing activities of a DNN under data distribution shifts and uses this information to instrument a generalisation-informed test adequacy criterion that checks the transfer knowledge capacity of a test set. Our empirical evaluation of several DNNs, across multiple datasets and state-of-the-art adversarial generation techniques, demonstrates the usefulness and effectiveness of DeepKnowledge and its ability to support the engineering of more dependable DNNs. We report improvements of up to 10 percentage points over state-of-the-art coverage criteria for detecting adversarial attacks on several benchmarks, including MNIST, SVHN, and CIFAR.
Mar-25-2024
- Country:
  - Europe
    - France (0.04)
    - Germany > Baden-Württemberg
      - Karlsruhe Region > Heidelberg (0.04)
    - United Kingdom > England
      - North Yorkshire > York (0.04)
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - United States
      - District of Columbia > Washington (0.05)
      - Massachusetts > Suffolk County
        - Boston (0.04)
- Genre:
  - Research Report > New Finding (0.68)
- Industry:
  - Automobiles & Trucks (0.92)
  - Information Technology > Robotics & Automation (0.67)
  - Transportation > Ground
    - Road (0.92)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning
        - Neural Networks > Deep Learning (1.00)
        - Statistical Learning (1.00)
      - Natural Language (0.93)
      - Representation & Reasoning (0.93)
      - Robots > Autonomous Vehicles (1.00)
    - Sensing and Signal Processing > Image Processing (0.93)
    - Software (0.93)