South America
Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder
Chadebec, Clément, Thibeau-Sutre, Elina, Burgos, Ninon, Allassonnière, Stéphanie
In this paper, we propose a new method to perform data augmentation in a reliable way in the High Dimensional Low Sample Size (HDLSS) setting using a geometry-based variational autoencoder. Our approach combines a proper latent space modeling of the VAE seen as a Riemannian manifold with a new generation scheme which produces more meaningful samples especially in the context of small data sets. The proposed method is tested through a wide experimental study where its robustness to data sets, classifiers and training samples size is stressed. It is also validated on a medical imaging classification task on the challenging ADNI database where a small number of 3D brain MRIs are considered and augmented using the proposed VAE framework. In each case, the proposed method allows for a significant and reliable gain in the classification metrics. For instance, balanced accuracy jumps from 66.3% to 74.3% for a state-of-the-art CNN classifier trained with 50 MRIs of cognitively normal (CN) and 50 Alzheimer disease (AD) patients and from 77.7% to 86.3% when trained with 243 CN and 210 AD while improving greatly sensitivity and specificity metrics.
Revisiting Citizen Science Through the Lens of Hybrid Intelligence
Rafner, Janet, Gajdacz, Miroslav, Kragh, Gitte, Hjorth, Arthur, Gander, Anna, Palfi, Blanka, Berditchevskaia, Aleks, Grey, François, Gal, Kobi, Segal, Avi, Walmsley, Mike, Miller, Josh Aaron, Dellerman, Dominik, Haklay, Muki, Michelucci, Pietro, Sherson, Jacob
Artificial Intelligence (AI) can augment and sometimes even replace human cognition. Inspired by efforts to value human agency alongside productivity, we discuss the benefits of solving Citizen Science (CS) tasks with Hybrid Intelligence (HI), a synergetic mixture of human and artificial intelligence. Currently there is no clear framework or methodology on how to create such an effective mixture. Due to the unique participant-centered set of values and the abundance of tasks drawing upon both human common sense and complex 21st century skills, we believe that the field of CS offers an invaluable testbed for the development of HI and human-centered AI of the 21st century, while benefiting CS as well. In order to investigate this potential, we first relate CS to adjacent computational disciplines. Then, we demonstrate that CS projects can be grouped according to their potential for HI-enhancement by examining two key dimensions: the level of digitization and the amount of knowledge or experience required for participation. Finally, we propose a framework for types of human-AI interaction in CS based on established criteria of HI. This "HI lens" provides the CS community with an overview of several ways to utilize the combination of AI and human intelligence in their projects. It also allows the AI community to gain ideas on how developing AI in CS projects can further their own field.
PyTorch Geometric Temporal: Spatiotemporal Signal Processing with Neural Machine Learning Models
Rozemberczki, Benedek, Scherer, Paul, He, Yixuan, Panagopoulos, George, Astefanoaei, Maria, Kiss, Oliver, Beres, Ferenc, Collignon, Nicolas, Sarkar, Rik
We present PyTorch Geometric Temporal a deep learning framework combining state-of-the-art machine learning algorithms for neural spatiotemporal signal processing. The main goal of the library is to make temporal geometric deep learning available for researchers and machine learning practitioners in a unified easy-to-use framework. PyTorch Geometric Temporal was created with foundations on existing libraries in the PyTorch eco-system, streamlined neural network layer definitions, temporal snapshot generators for batching, and integrated benchmark datasets. These features are illustrated with a tutorial-like case study. Experiments demonstrate the predictive performance of the models implemented in the library on real world problems such as epidemiological forecasting, ridehail demand prediction and web-traffic management. Our sensitivity analysis of runtime shows that the framework can potentially operate on web-scale datasets with rich temporal features and spatial structure.
More Than 80% of Financial Institutions Believe AI is the Key Competitive Driver to Success NTT DATA Study Reveals : Media
TOKYO – April 22, 2021 – According to research findings by NTT DATA, senior financial services executives overwhelmingly agree that implementing Artificial Intelligence (AI) will be the key competitive driver of success over the next few years. In fact, 83% agree that AI is creating new ways to differentiate offerings and win customers, driven by access to unique data sets; however, obstacles remain and adoption lags. Respondents report technology implementation (55%), creating new business startup culture in an established business (51%), and organizational skill changes (43%) are all top AI challenges to implementing personalized proactive services. Despite these very real obstacles, financial institutions (FIs) must find a way to overcome them. Especially considering the COVID-19 pandemic, consumers are increasingly looking to digital finance solutions and apps that anticipate their needs and proactively offer financial guidance.
L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
Guizzo, Eric, Gramaccioni, Riccardo F., Jamili, Saeid, Marinoni, Christian, Massaro, Edoardo, Medaglia, Claudia, Nachira, Giuseppe, Nucciarelli, Leonardo, Paglialunga, Ludovica, Pennese, Marco, Pepe, Sveva, Rocchi, Enrico, Uncini, Aurelio, Comminiello, Danilo
The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD). Alongside with the challenge, we release the L3DAS21 dataset, a 65 hours 3D audio corpus, accompanied with a Python API that facilitates the data usage and results submission stage. Usually, machine learning approaches to 3D audio tasks are based on single-perspective Ambisonics recordings or on arrays of single-capsule microphones. We propose, instead, a novel multichannel audio configuration based multiple-source and multiple-perspective Ambisonics recordings, performed with an array of two first-order Ambisonics microphones. To the best of our knowledge, it is the first time that a dual-mic Ambisonics configuration is used for these tasks. We provide baseline models and results for both tasks, obtained with state-of-the-art architectures: FaSNet for SE and SELDNet for SELD. This report is aimed at providing all needed information to participate in the L3DAS21 Challenge, illustrating the details of the L3DAS21 dataset, the challenge tasks and the baseline models.
A neural anisotropic view of underspecification in deep learning
Ortiz-Jimenez, Guillermo, Salazar-Reque, Itamar Franco, Modas, Apostolos, Moosavi-Dezfooli, Seyed-Mohsen, Frossard, Pascal
The underspecification of most machine learning pipelines means that we cannot rely solely on validation performance to assess the robustness of deep learning systems to naturally occurring distribution shifts. Instead, making sure that a neural network can generalize across a large number of different situations requires to understand the specific way in which it solves a task. In this work, we propose to study this problem from a geometric perspective with the aim to understand two key characteristics of neural network solutions in underspecified settings: how is the geometry of the learned function related to the data representation? And, are deep networks always biased towards simpler solutions, as conjectured in recent literature? We show that the way neural networks handle the underspecification of these problems is highly dependent on the data representation, affecting both the geometry and the complexity of the learned predictors. Our results highlight that understanding the architectural inductive bias in deep learning is fundamental to address the fairness, robustness, and generalization of these systems.
Podcast 12: Real world tech: Edge AI drives car-making, healthcare and retail - VanillaPlus - The global voice of Telecoms IT
Artificial intelligence (AI) at the edge is changing healthcare, retail and Audi cars, as Intel's IoT Group vice president, John Healy tells Jeremy Cowan and George Malim. Plus we learn how chipmakers globally are tackling supply problems that have halted vehicle production. The semiconductor industry is facing an "awakening", says Healy, as it shape-shifts to meet "insatiable demand" for silicone. Finally, we hear which African country is a leader in satellite cartography, and how Amazon is playing games with its warehouse staff. Hi, and welcome to the latest Trending Tech Podcast brought to you by The Evolving Enterprise, IoT Now, and VanillaPlus.com. This is Jeremy Cowan, and I want to thank you for joining the latest, sometimes serious, sometimes light-hearted look at enterprise digital transformation. I am delighted to welcome today two guests, who are John Healy, from California-based international technology company, Intel, known among other things, for the processors that power so many of our devices. John is vice president of the IoT Group. John, thank you very much for making the time to be here. Good to have you on again, George. Okay, today, we'll be looking at some key tech news stories that deserve a bit of a deeper dive.
Beyond digital transformations: Modernizing core technology for the AI bank of the future
An artificial-intelligence (AI) bank leapfrogs the competition by organizing talent, technology, and ways of working around an AI-first vision for empowering customers with intelligent value propositions delivered through compelling journeys and experiences. Making this vision a reality requires capabilities in four areas: an engagement layer, decisioning layer, core technology layer, and platform operating model. This article was a collaborative effort by Sven Blumberg, Rich Isenberg, Dave Kerr, Milan Mitra, and Renny Thomas. Previous articles in this series have explored the first two areas. The current article identifies capabilities needed in the third area, the core technology and data infrastructure of the modern capability stack. Deploying AI capabilities across the organization requires a scalable, resilient, and adaptable set of core-technology components. When implemented successfully, this foundational layer can enable a bank to accelerate technology innovations, improve the quality and reliability of operations, reduce operating costs, and strengthen customer engagement.
End-to-End Approach for Recognition of Historical Digit Strings
Zhao, Mengqiao, Hochuli, Andre G., Cheddad, Abbas
The plethora of digitalised historical document datasets released in recent years has rekindled interest in advancing the field of handwriting pattern recognition. In the same vein, a recently published data set, known as ARDIS, presents handwritten digits manually cropped from 15.000 scanned documents of Swedish church books and exhibiting various handwriting styles. To this end, we propose an end-to-end segmentation-free deep learning approach to handle this challenging ancient handwriting style of dates present in the ARDIS dataset (4-digits long strings). We show that with slight modifications in the VGG-16 deep model, the framework can achieve a recognition rate of 93.2%, resulting in a feasible solution free of heuristic methods, segmentation, and fusion methods. Moreover, the proposed approach outperforms the well-known CRNN method (a model widely applied in handwriting recognition tasks).
Linear Convergence of the Subspace Constrained Mean Shift Algorithm: From Euclidean to Directional Data
This paper studies linear convergence of the subspace constrained mean shift (SCMS) algorithm, a well-known algorithm for identifying a density ridge defined by a kernel density estimator. By arguing that the SCMS algorithm is a special variant of a subspace constrained gradient ascent (SCGA) algorithm with an adaptive step size, we derive linear convergence of such SCGA algorithm. While the existing research focuses mainly on density ridges in the Euclidean space, we generalize density ridges and the SCMS algorithm to directional data. In particular, we establish the stability theorem of density ridges with directional data and prove the linear convergence of our proposed directional SCMS algorithm.