Informed deep hierarchical classification: a non-standard analysis inspired approach
Fiaschi, Lorenzo, Cococcioni, Marco
–arXiv.org Artificial Intelligence
This work proposes a novel approach to the deep hierarchical classification task, i.e., the problem of classifying data according to multiple labels organized in a rigid parent-child structure. It consists in a multi-output deep neural network equipped with specific projection operators placed before each output layer. The design of such an architecture, called lexicographic hybrid deep neural network (LH-DNN), has been possible by combining tools from different and quite distant research fields: lexicographic multi-objective optimization, non-standard analysis, and deep learning. To assess the efficacy of the approach, the resulting network is compared against the B-CNN, a convolutional neural network tailored for hierarchical classification tasks, on the CIFAR10, CIFAR100 (where it has been originally and recently proposed before being adopted and tuned for multiple real-world applications) and Fashion-MNIST benchmarks. Evidence states that an LH-DNN can achieve comparable if not superior performance, especially in the learning of the hierarchical relations, in the face of a drastic reduction of the learning parameters, training epochs, and computational time, without the need for ad-hoc loss functions weighting values.
arXiv.org Artificial Intelligence
Oct-4-2024
- Country:
- Asia
- Middle East > Jordan (0.04)
- Singapore (0.04)
- Europe
- Greece (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy (0.04)
- Slovakia > Bratislava
- Bratislava (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Ontario
- Toronto (0.14)
- United States
- Colorado (0.04)
- Washington > King County
- Seattle (0.04)
- Canada > Ontario
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Technology: