Mariani, Giovanni
Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces
Moons, Bert, Noorzad, Parham, Skliar, Andrii, Mariani, Giovanni, Mehta, Dushyant, Lott, Chris, Blankevoort, Tijmen
This work presents DONNA (Distilling Optimal Neural Network Architectures), a novel pipeline for rapid neural architecture search and search-space exploration targeting multiple hardware platforms and user scenarios. In DONNA, a search consists of three phases. First, an accuracy predictor is built for a diverse search space using blockwise knowledge distillation. This predictor enables searching across diverse macro-architectural network parameters such as layer types, attention mechanisms, and channel widths, as well as across micro-architectural parameters such as block repeats, kernel sizes, and expansion rates. Second, a rapid evolutionary-search phase uses the predictor and on-device measurements to find a Pareto-optimal set of architectures in terms of accuracy and latency for any scenario. Third, the Pareto-optimal models can be quickly finetuned to full accuracy. With this approach, DONNA finds architectures that outperform the state of the art. On ImageNet classification, architectures found by DONNA are 20% faster than EfficientNet-B0 and MobileNetV2 at similar accuracy on an Nvidia V100 GPU, and 10% faster with 0.5% higher accuracy than MobileNetV2-1.4x on a Samsung S20 smartphone. Beyond neural architecture search, DONNA is also used for search-space exploration and hardware-aware model compression.
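To make the first phase concrete, below is a minimal sketch of blockwise knowledge distillation and predictor fitting, assuming a PyTorch setup. The block modules, data loader, and the ridge-regression predictor are illustrative stand-ins, not the authors' implementation: each candidate block is trained to reproduce a teacher block's output features, and the resulting per-block distillation errors become features for a simple accuracy predictor.

```python
# Sketch of DONNA-style blockwise distillation (phase 1); names are illustrative.
import torch
import torch.nn as nn
from sklearn.linear_model import Ridge

def distill_block(teacher_block, student_block, loader, epochs=1, lr=1e-3):
    """Train one candidate block to reproduce the teacher block's output
    features from the teacher block's input features."""
    opt = torch.optim.Adam(student_block.parameters(), lr=lr)
    mse = nn.MSELoss()
    for _ in range(epochs):
        for x_in in loader:                   # teacher-block input features
            with torch.no_grad():
                target = teacher_block(x_in)  # teacher-block output features
            loss = mse(student_block(x_in), target)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return loss.item()                        # final distillation error

def fit_accuracy_predictor(block_losses, accuracies):
    """block_losses: (num_archs, num_blocks) distillation errors per sampled
    architecture; accuracies: measured top-1 after brief finetuning."""
    return Ridge(alpha=1.0).fit(block_losses, accuracies)
```

Once fitted, such a predictor can score unseen architectures from their block-level distillation errors alone, which is what makes the subsequent evolutionary search rapid.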
NeuNetS: An Automated Synthesis Engine for Neural Network Design
Sood, Atin, Elder, Benjamin, Herta, Benjamin, Xue, Chao, Bekas, Costas, Malossi, A. Cristiano I., Saha, Debashish, Scheidegger, Florian, Venkataraman, Ganesh, Thomas, Gegi, Mariani, Giovanni, Strobelt, Hendrik, Samulowitz, Horst, Wistuba, Martin, Manica, Matteo, Choudhury, Mihir, Yan, Rong, Istrate, Roxana, Puri, Ruchir, Pedapati, Tejaswini
Neural networks are transforming the way AI is applied in practice. Pre-trained models available through APIs, and the ability to train pre-built architectures on customer data, have made AI much easier for developers to consume and have driven broad adoption of these complex models. While pre-built network models exist for certain scenarios, AI teams often need to develop custom neural network architectures that trade off accuracy against memory footprint in order to meet the tight constraints of their unique use cases. However, only a small proportion of data science teams have the skills and experience needed to create a neural network from scratch, and demand far exceeds supply. In this paper, we present NeuNetS: an automated neural network synthesis engine for custom neural network design, available as part of IBM's AI OpenScale product. NeuNetS supports both text and image domains and can build neural networks for specific tasks in a fraction of the time it takes today with human effort, with accuracy similar to that of human-designed AI models.
BAGAN: Data Augmentation with Balancing GAN
Mariani, Giovanni, Scheidegger, Florian, Istrate, Roxana, Bekas, Costas, Malossi, Cristiano
Image classification datasets are often imbalanced, a characteristic that negatively affects the accuracy of deep-learning classifiers. In this work we propose the balancing GAN (BAGAN) as an augmentation tool to restore balance in imbalanced datasets. This is challenging because the few minority-class images may not be enough to train a GAN. We overcome this issue by including all available images of majority and minority classes during training. The generative model learns useful features from the majority classes and uses these to generate images for the minority classes. We apply class conditioning in the latent space to drive the generation process towards a target class. Additionally, we couple GANs with autoencoding techniques to reduce the risk of collapsing toward the generation of a few foolish examples. We compare the proposed methodology with state-of-the-art GANs and demonstrate that BAGAN generates images of superior quality when trained with an imbalanced dataset.
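The class-conditioning step can be sketched as follows, assuming a PyTorch-style pretrained encoder (function and variable names here are illustrative, not the paper's code): each class is modeled as a Gaussian over the latent codes of its images, and generation toward a target class is driven by sampling latent vectors from that class's distribution.

```python
# Sketch of BAGAN-style class conditioning in the latent space.
import numpy as np

def fit_class_conditionals(encoder, images_by_class):
    """Fit a per-class multivariate Gaussian over the encoder's latent codes."""
    conditionals = {}
    for cls, images in images_by_class.items():
        z = encoder(images).detach().cpu().numpy()  # latent codes, shape (n, d)
        conditionals[cls] = (z.mean(axis=0), np.cov(z, rowvar=False))
    return conditionals

def sample_latent(conditionals, cls, n):
    """Draw n latent vectors that steer the generator toward class `cls`."""
    mean, cov = conditionals[cls]
    return np.random.multivariate_normal(mean, cov, size=n)
```

Feeding these class-conditional latent samples to a generator initialized from the autoencoder's decoder lets the minority classes borrow features learned from the majority classes.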