Goto

Collaborating Authors

 Africa


Deep Clustering with a Dynamic Autoencoder

arXiv.org Machine Learning

In unsupervised learning, there is no obvious straightforward loss function which can capture the major factors of variations and similarities. Since natural systems have smooth dynamics, an opportunity is lost if an unsupervised loss function remains static during the training process. The absence of concrete supervision suggests that smooth complex dynamics should be integrated as a substitute to the classical static loss functions to better make use of the gradual and uncertain knowledge acquired through self-supervision. In this paper, we propose Dynamic Autoencoder (DynAE), a new model for deep clustering that allows to solve a clustering-reconstruction trade-off by gradually and smoothly eliminating the reconstruction objective in favor of a construction one while preserving the space topology. Experimental evaluations on benchmark datasets show that our approach achieves state-of-the-art results compared to all the other autoencoder-based clustering methods.


Incremental Principal Component Analysis Exact implementation and continuity corrections

arXiv.org Machine Learning

This paper describes some applications of an incremental implementation of the principal component analysis (PCA). The algorithm updates the transformation coefficients matrix on-line for each new sample, without the need to keep all the samples in memory. The algorithm is formally equivalent to the usual batch version, in the sense that given a sample set the transformation coefficients at the end of the process are the same. The implications of applying the PCA in real time are discussed with the help of data analysis examples. In particular we focus on the problem of the continuity of the PCs during an on-line analysis.


Google Gives Wikimedia Millions--Plus Machine Learning Tools

WIRED

Google is pouring an additional $3.1 million into Wikipedia, bringing its total contribution to the free encyclopedia over the past decade to more than $7.5 million, the company announced at the World Economic Forum Tuesday. A little over a third of those funds will go toward sustaining current efforts at the Wikimedia Foundation, the nonprofit that runs Wikipedia, and the remaining $2 million will focus on long-term viability through the organization's endowment. Google will also begin allowing Wikipedia editors to use several of its machine learning tools for free, the tech giant said. And Wikimedia and Google will soon broaden Project Tiger, a joint initiative they launched in 2017 to increase the number of Wikipedia articles written in underrepresented languages in India, to include 10 new languages in a handful of countries and regions. It will now be called GLOW, Growing Local Language Content on Wikipedia.


#BizTrends2019: 5 tech trends that will disappoint in 2019

#artificialintelligence

Beyond the lip service, I feel like we are in a trough of disillusionment on so many pivotal technologies. In particular, there are five emerging fields which have garnered tremendous excitement over the past couple of years, and yet this excitement is likely to die down in the next year as the reality sets in that the real impact of these technologies is still a good few years away. It is useful to understand these fading trends because while rising trends can be obvious, to take advantage of many of them in 2019, you needed to have acted a while ago. However, it is not too late to avoid wasting time and resources on these non-trends, at least for 2019, unless you are making significant long-term bets in which case, load up. It's useful to understand why we have hype and then disappointment before the real impact kicks in.


Machine learning in action for the humanitarian sector

#artificialintelligence

Governments across the world came together in Marrakesh this past December to ratify a pact to improve cooperation on international migration. Among other objectives, the Global Compact for Migration seeks to use "accurate and disaggregated data as a basis for evidence-based policies." How can machine learning technologies help with deeply polarizing societal issues like migration? In early 2018, with support from IBM Corporate Citizenship and the Danish Ministry for Foreign Affairs, IBM and the Danish Refugee Council (DRC) embarked on a partnership aimed squarely at the need to better understand migration drivers and evidence-based policy guidance for a range of stakeholders. At the recent THINK Copenhagen keynote, the Secretary General of the DRC, Christian Friis Bach, presented the first results of this effort.


CommunityGAN: Community Detection with Generative Adversarial Nets

arXiv.org Artificial Intelligence

Community detection refers to the task of discovering groups of vertices sharing similar properties or functions so as to understand the network data. With the recent development of deep learning, graph representation learning techniques are also utilized for community detection. However, the communities can only be inferred by applying clustering algorithms based on learned vertex embeddings. These general cluster algorithms like K-means and Gaussian Mixture Model cannot output much overlapped communities, which have been proved to be very common in many real-world networks. In this paper, we propose CommunityGAN, a novel community detection framework that jointly solves overlapping community detection and graph representation learning. First, unlike the embedding of conventional graph representation learning algorithms where the vector entry values have no specific meanings, the embedding of CommunityGAN indicates the membership strength of vertices to communities. Second, a specifically designed Generative Adversarial Net (GAN) is adopted to optimize such embedding. Through the minimax competition between the motif-level generator and discriminator, both of them can alternatively and iteratively boost their performance and finally output a better community structure. Extensive experiments on synthetic data and real-world tasks demonstrate that CommunityGAN achieves substantial community detection performance gains over the state-of-the-art methods.


Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning

arXiv.org Machine Learning

Rapid identification of bacteria is essential to prevent the spread of infectious disease, help combat antimicrobial resistance, and improve patient outcomes. Raman optical spectroscopy promises to combine bacterial detection, identification, and antibiotic susceptibility testing in a single step. However, achieving clinically relevant speeds and accuracies remains challenging due to the weak Raman signal from bacterial cells and the large number of bacterial species and phenotypes. By amassing the largest known dataset of bacterial Raman spectra, we are able to apply state-of-the-art deep learning approaches to identify 30 of the most common bacterial pathogens from noisy Raman spectra, achieving antibiotic treatment identification accuracies of 99.0$\pm$0.1%. This novel approach distinguishes between methicillin-resistant and -susceptible isolates of Staphylococcus aureus (MRSA and MSSA) as well as a pair of isogenic MRSA and MSSA that are genetically identical apart from deletion of the mecA resistance gene, indicating the potential for culture-free detection of antibiotic resistance. Results from initial clinical validation are promising: using just 10 bacterial spectra from each of 25 isolates, we achieve 99.0$\pm$1.9% species identification accuracy. Our combined Raman-deep learning system represents an important proof-of-concept for rapid, culture-free identification of bacterial isolates and antibiotic resistance and could be readily extended for diagnostics on blood, urine, and sputum.


Driver Distraction Identification with an Ensemble of Convolutional Neural Networks

arXiv.org Machine Learning

The World Health Organization (WHO) reported 1.25 million deaths yearly due to road traffic accidents worldwide and the number has been continuously increasing over the last few years. Nearly fifth of these accidents are caused by distracted drivers. Existing work of distracted driver detection is concerned with a small set of distractions (mostly, cell phone usage). Unreliable ad-hoc methods are often used.In this paper, we present the first publicly available dataset for driver distraction identification with more distraction postures than existing alternatives. In addition, we propose a reliable deep learning-based solution that achieves a 90% accuracy. The system consists of a genetically-weighted ensemble of convolutional neural networks, we show that a weighted ensemble of classifiers using a genetic algorithm yields in a better classification confidence. We also study the effect of different visual elements in distraction detection by means of face and hand localizations, and skin segmentation. Finally, we present a thinned version of our ensemble that could achieve 84.64% classification accuracy and operate in a real-time environment.


Modeling Human Motion with Quaternion-based Neural Networks

arXiv.org Artificial Intelligence

Previous work on predicting or generating 3D human pose sequences regresses either joint rotations or joint positions. The former strategy is prone to error accumulation along the kinematic chain, as well as discontinuities when using Euler angles or exponential maps as parameterizations. The latter requires re-projection onto skeleton constraints to avoid bone stretching and invalid configurations. This work addresses both limitations. QuaterNet represents rotations with quaternions and our loss function performs forward kinematics on a skeleton to penalize absolute position errors instead of angle errors. We investigate both recurrent and convolutional architectures and evaluate on short-term prediction and long-term generation. For the latter, our approach is qualitatively judged as realistic as recent neural strategies from the graphics literature. Our experiments compare quaternions to Euler angles as well as exponential maps and show that only a very short context is required to make reliable future predictions. Finally, we show that the standard evaluation protocol for Human3.6M produces high variance results and we propose a simple solution.


Jacques Ludik on LinkedIn: "#ai #innovation #machinelearning #digitaltransformation #artificialintelligence Cortex Logic"

#artificialintelligence

Dr Jacques Ludik is a smart technology entrepreneur, AI expert, investor & ecosystem builder and currently Founder & President of Machine Intelligence Institute of Africa (MIIA), Founder & CEO of Cortex Logic, Founder of Bennit AI, Founder of SynerG, The Talent Index, & aiTRADE Systems, and investor in The Student Hub (ERAOnline). He holds a Ph.D. in Computer Science (AI) with many publications and has 25 years' experience in Machine / Artificial Intelligence (AI) & Data Science and its applications. MIIA is an innovative community & accelerator for Machine Intelligence & Data Science Research & Applications to help transform Africa, whereas Cortex Logic is an AI company that provides an AI Engine for Business, advances AI and builds end-to-end AI solutions for a range of industries. Bennit A.I. is an intelligent virtual production assistant/advisor for manufacturing. For businesses to thrive in the smart technology era, they need to be agile, innovative and adapt quickly to stay relevant, given the swift pace of change and disruption to business and society.