Goto

Collaborating Authors

 Overview


Artificial Intelligence, Machine Learning and Deep Learning: A Primer - CEOWORLD magazine

#artificialintelligence

Dr. Dorel Iosif is a Board Director and CEO with Cognisium, a tech executive marketplace headquartered in Australia. He held senior executive roles with KBR, WorleyParsons, PwC and Advisian Management Consulting. Dr Iosif started his career in Israel with the Technion Institute of Technology and continued in Australia with BHPBilliton and the University of Melbourne. He holds a Ph.D in applied mathematics from the University of Melbourne and studied Corporate Level Strategy - Executive Program at Harvard Business School. Dorel worked in Australia, USA, Europe and the Middle East.


Automating Data Science

Communications of the ACM

Data science covers the full spectrum of deriving insight from data, from initial data gathering and interpretation, via processing and engineering of data, and exploration and modeling, to eventually producing novel insights and decision support systems. Data science can be viewed as overlapping or broader in scope than other data-analytic methodological disciplines, such as statistics, machine learning, databases, or visualization.10 To illustrate the breadth of data science, consider, for example, the problem of recommending items (movies, books, or other products) to customers. While the core of these applications can consist of algorithmic techniques such as matrix factorization, a deployed system will involve a much wider range of technological and human considerations. These range from scalable back-end transaction systems that retrieve customer and product data in real time, experimental design for evaluating system changes, causal analysis for understanding the effect of interventions, to the human factors and psychology that underlie how customers react to visual information displays and make decisions. As another example, in areas such as astronomy, particle physics, and climate science, there is a rich tradition of building computational pipelines to support data-driven discovery and hypothesis testing. For instance, geoscientists use monthly global landcover maps based on satellite imagery at sub-kilometer resolutions to better understand how the Earth's surface is changing over time.50 These maps are interactive and browsable, and they are the result of a complex data-processing pipeline, in which terabytes to petabytes of raw sensor and image data are transformed into databases of a6utomatically detected and annotated objects and information. This type of pipeline involves many steps, in which human decisions and insight are critical, such as instrument calibration, removal of outliers, and classification of pixels. The breadth and complexity of these and many other data science scenarios means the modern data scientist requires broad knowledge and experience across a multitude of topics. Together with an increasing demand for data analysis skills, this has led to a shortage of trained data scientists with appropriate background and experience, and significant market competition for limited expertise. Considering this bottleneck, it is not surprising there is increasing interest in automating parts, if not all, of the data science process.


65 Competencies

Communications of the ACM

Analyzing data is now essential to success in education, employment, and other areas of activity in the knowledge society. Even though several frameworks describe the competencies and skills needed to meet current and future challenges, no data analytics competency framework exists to describe the importance of specific skills to succeed in data analytics assignments.


Bayesian Deep Learning for Graphs

arXiv.org Machine Learning

The adaptive processing of structured data is a long-standing research topic in machine learning that investigates how to automatically learn a mapping from a structured input to outputs of various nature. Recently, there has been an increasing interest in the adaptive processing of graphs, which led to the development of different neural network-based methodologies. In this thesis, we take a different route and develop a Bayesian Deep Learning framework for graph learning. The dissertation begins with a review of the principles over which most of the methods in the field are built, followed by a study on graph classification reproducibility issues. We then proceed to bridge the basic ideas of deep learning for graphs with the Bayesian world, by building our deep architectures in an incremental fashion. This framework allows us to consider graphs with discrete and continuous edge features, producing unsupervised embeddings rich enough to reach the state of the art on several classification tasks. Our approach is also amenable to a Bayesian nonparametric extension that automatizes the choice of almost all model's hyper-parameters. Two real-world applications demonstrate the efficacy of deep learning for graphs. The first concerns the prediction of information-theoretic quantities for molecular simulations with supervised neural models. After that, we exploit our Bayesian models to solve a malware-classification task while being robust to intra-procedural code obfuscation techniques. We conclude the dissertation with an attempt to blend the best of the neural and Bayesian worlds together. The resulting hybrid model is able to predict multimodal distributions conditioned on input graphs, with the consequent ability to model stochasticity and uncertainty better than most works. Overall, we aim to provide a Bayesian perspective into the articulated research field of deep learning for graphs.


How to Decide on a Dataset for Detecting Cyber-Attacks

#artificialintelligence

You create an amazing machine learning algorithm. You take a novel approach and apply techniques that prove to be highly accurate. Your results demonstrate a very high true positive rate and a very low false positive rate. You write a paper that articulates your outstanding results and submit it to a leading academic conference. You expect that this research will be well received, and you will receive many citations of your work.


Tech Developments In Every Sector, And The Innovators Leading The Way

#artificialintelligence

Despite market disruptions and unprecedented global events, the evolution of technology in every sector will continue because leaders worldwide actively develop solutions, overcome obstacles, and create new products. As this rate of advancement accelerates, technology will continue to be an essential component of success on any terms. Moreover, the leaders spearheading growth in this area also exercise best-in-class corporate practices, create healthy cultures, and achieve record-breaking revenues, concludes Dr Lebene Soga of Henley Business School. As the interplay between humans and technology develops, the prevalence of Artificial Intelligence (AI), intuitive interfaces, and predictive capabilities also grow. This asynchronous development has the net impact of making life easier, businesses more profitable, and infrastructure more enduring.


Deep Neural Networks and Tabular Data: A Survey

#artificialintelligence

Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. On homogeneous data sets, deep neural networks have repeatedly shown excellent performance and have therefore been widely adopted. However, their application to modeling tabular data (inference or generation) remains highly challenging. This work provides an overview of state of the art deep learning methods for tabular data. We start by categorizing them into three groups: data transformations, specialized architectures, and regularization models. We then provide a comprehensive overview of the main approaches in each group. A discussion of deep learning approaches for generating tabular data is complemented by strategies for explaining deep models on tabular data. Our primary contribution is to address the main research streams and existing methodologies in this area, while highlighting relevant challenges and open research questions. We also provide an empirical comparison of traditional machine learning methods with deep learning approaches on real tabular data sets of different sizes and with different learning objectives. Our results indicate that algorithms based on gradient-boosted tree ensembles still outperform the deep learning models. To the best of our knowledge, this is the first in-depth look at deep learning approaches for tabular data. This work can serve as a valuable starting point and guide for researchers and practitioners interested in deep learning with tabular data.


Perspectives in machine learning for wildlife conservation - Nature Communications

#artificialintelligence

Inexpensive and accessible sensors are accelerating data acquisition in animal ecology. These technologies hold great potential for large-scale ecological understanding, but are limited by current processing approaches which inefficiently distill data into relevant information. We argue that animal ecologists can capitalize on large datasets generated by modern sensors by combining machine learning approaches with domain knowledge. Incorporating machine learning into ecological workflows could improve inputs for ecological models and lead to integrated hybrid modeling tools. This approach will require close interdisciplinary collaboration to ensure the quality of novel approaches and train a new generation of data scientists in ecology and conservation. Animal ecologists are increasingly limited by constraints in data processing. Here, Tuia and colleagues discuss how collaboration between ecologists and data scientists can harness machine learning to capitalize on the data generated from technological advances and lead to novel modeling approaches.


Deploy Your First Jupyter Notebook to Docker

#artificialintelligence

There are only two ways to live your life. One is as though nothing is a miracle. The other is as though everything is a miracle. In this article, we will talk about what Docker is, how it works and how to deploy a Jupyter notebook to a Docker Container. In other words, Docker is a platform that provides a container for you to run host, and run your applications in without bothering about things like platform dependence, it provides infrastructure called a container where your platforms can be held and run.


Artificial Intelligence, Machine Learning and Deep Learning: A Primer.

#artificialintelligence

The term "artificial intelligence is coined by John McCarthy (Dartmouth), Marvin Minsky (Harvard) and Nathaniel Rochester (IBM), initially in a proposal for a 2-month, 10-man study of artificial intelligence.