Goto

Collaborating Authors

 Instructional Material


Neural Networks are Function Approximation Algorithms

#artificialintelligence

Supervised learning in machine learning can be described in terms of function approximation. Given a dataset comprised of inputs and outputs, we assume that there is an unknown underlying function that is consistent in mapping inputs to outputs in the target domain and resulted in the dataset. We then use supervised learning algorithms to approximate this function. Neural networks are an example of a supervised machine learning algorithm that is perhaps best understood in the context of function approximation. This can be demonstrated with examples of neural networks approximating simple one-dimensional functions that aid in developing the intuition for what is being learned by the model.


Imbalanced Multiclass Classification with the E.coli Dataset

#artificialintelligence

Multiclass classification problems are those where a label must be predicted, but there are more than two labels that may be predicted. These are challenging predictive modeling problems because a sufficiently representative number of examples of each class is required for a model to learn the problem. It is made challenging when the number of examples in each class is imbalanced, or skewed toward one or a few of the classes with very few examples of other classes. Problems of this type are referred to as imbalanced multiclass classification problems and they require both the careful design of an evaluation metric and test harness and choice of machine learning models. The E.coli protein localization sites dataset is a standard dataset for exploring the challenge of imbalanced multiclass classification. In this tutorial, you will discover how to develop and evaluate a model for the imbalanced multiclass E.coli dataset. Imbalanced Multiclass Classification with the E.coli Dataset Photo by Marcus, some rights reserved.


Top 10 books on Artificial Intelligence Master Data Science

#artificialintelligence

In this post, you will discover the top 10 books available right now on Artificial Intelligence. There are quite a few available online in which you may purchase. Artificial Intelligence: A Modern Approach, 3e offers the most comprehensive, up-to-date introduction to the theory and practice of artificial intelligence. Number one in its field, this textbook is ideal for one or two-semester, undergraduate or graduate-level courses in Artificial Intelligence. Dr. Peter Norvig, contributing Artificial Intelligence author and Professor Sebastian Thrun, a Pearson author are offering a free online course at Stanford University on artificial intelligence.


6 Pretrained Models to Master Text Classification

#artificialintelligence

Though ERNIE 1.0 (released in March 2019) has been a popular model for text classification, it was ERNIE 2.0 which became the talk of the town in the latter half of 2019. Developed by tech-giant Baidu, ERNIE outperformed Google XLNet and BERT on the GLUE benchmark for English. ERNIE stands for Enhanced Representation through kNowledge IntEgration, and ERNIE 2.0 is an upgraded version of ERNIE 1.0. ERNIE 1.0 was pathbreaking in its own way โ€“ it was one of the first models to leverage Knowledge Graphs. This incorporation further enhanced training the model for advanced tasks like Relation Classification and NamedEntityRecognition (NER). Like its predecessor, ERNIE 2.0 brings another innovation to the table in the form of Continual Incremental Multi-task Learning.


My Journey from Physics into Data Science

#artificialintelligence

I still learn new knowledge everyday with my growing passion in Data Science field. To pursue different career track as a graduating physics student there must be'Why' and'How' questions to be answered. Having been asked by a number of people about my transition from academia -- Physics to Data Science, I hope my story could answer the questions on why I decided to become a Data Scientist and how I pursued the goal, and ultimately encourage as well as inspire more people to pursue their passion. The CERN Summer Student Programme offers once-in-a-lifetime opportunity for undergraduate students of physics, computing and engineering to join one of their research projects with top scientists in multicultural teams at CERN in Geneva, Switzerland. In June 2017, I was very fortunate to be accepted to join the programme.


Artificial Intelligence and RPA: Keys to Digital Transformation

#artificialintelligence

Register for this live video webcast - Friday, March 27 at 10 AM PT Ask the expert - get your AI/RPA questions answered by an industry expert. One of the keys to digital transformation โ€“ that most fashionable term โ€“ is creating a management structure in which everything is accountable to data analytics. In a related trend, robotic process automation (RPA) helps automate a company's work flow and business processes. At its most optimum, RPA is driven by an AI-based analytics platform. These key emerging technologies are the focus on this webinar.


Prob2Vec: Mathematical Semantic Embedding for Problem Retrieval in Adaptive Tutoring

arXiv.org Machine Learning

We propose a new application of embedding techniques for problem retrieval in adaptive tutoring. The objective is to retrieve problems whose mathematical concepts are similar. There are two challenges: First, like sentences, problems helpful to tutoring are never exactly the same in terms of the underlying concepts. Instead, good problems mix concepts in innovative ways, while still displaying continuity in their relationships. Second, it is difficult for humans to determine a similarity score that is consistent across a large enough training set. We propose a hierarchical problem embedding algorithm, called Prob2Vec, that consists of abstraction and embedding steps. Prob2Vec achieves 96.88\% accuracy on a problem similarity test, in contrast to 75\% from directly applying state-of-the-art sentence embedding methods. It is interesting that Prob2Vec is able to distinguish very fine-grained differences among problems, an ability humans need time and effort to acquire. In addition, the sub-problem of concept labeling with imbalanced training data set is interesting in its own right. It is a multi-label problem suffering from dimensionality explosion, which we propose ways to ameliorate. We propose the novel negative pre-training algorithm that dramatically reduces false negative and positive ratios for classification, using an imbalanced training data set.


aphBO-2GP-3B: A budgeted asynchronously-parallel multi-acquisition for known/unknown constrained Bayesian optimization on high-performing computing architecture

arXiv.org Machine Learning

High-fidelity complex engineering simulations are highly predictive, but also computationally expensive and often require substantial computational efforts. The mitigation of computational burden is usually enabled through parallelism in high-performance cluster (HPC) architecture. In this paper, an asynchronous constrained batch-parallel Bayesian optimization method is proposed to efficiently solve the computationally-expensive simulation-based optimization problems on the HPC platform, with a budgeted computational resource, where the maximum number of simulations is a constant. The advantages of this method are three-fold. First, the efficiency of the Bayesian optimization is improved, where multiple input locations are evaluated massively parallel in an asynchronous manner to accelerate the optimization convergence with respect to physical runtime. This efficiency feature is further improved so that when each of the inputs is finished, another input is queried without waiting for the whole batch to complete. Second, the method can handle both known and unknown constraints. Third, the proposed method considers several acquisition functions at the same time and sample based on an evolving probability mass distribution function using GP-Hedge scheme, where parameters are corresponding to the performance of each acquisition function. The proposed framework is termed aphBO-2GP-3B, which corresponds to asynchronous parallel hedge Bayesian optimization with two Gaussian processes and three batches. The aphBO-2GP-3B framework is demonstrated using two high-fidelity expensive industrial applications, where the first one is based on finite element analysis (FEA) and the second one is based on computational fluid dynamics (CFD) simulations.


Artificial Intelligence With TensorFlow

#artificialintelligence

Artificial Intelligence with TensorFlow is a four-week, part time, online training course that offers an introduction to neural networks, deep learning, machine learning, artificial intelligence and their many applications. This immersive, hands-on course teaches students the data science tools and techniques needed to build and test neural networks in TensorFlow using real-world data. This course is optimal for those who have taken Essential Data Tools & Practical Machine Learning and desire to expand their knowledge of modeling and data science with TensorFlow.


The 6 Best Free Online Artificial Intelligence Courses Available Today

#artificialintelligence

A basic grounding in the principles and practices around artificial intelligence (AI), automation and cognitive systems is something which is likely to become increasingly valuable, regardless of your field of business, expertise or profession. Fortunately, today you don't have to take years out of your life studying at university to become familiar with this seemingly hugely complex technology. A growing number of online courses have sprung up in recent years covering everything from the basics to advanced implementation. Some are aimed at people who want to dive straight into coding their own artificial neural networks, and understandably assume a certain level of technical ability. Others are useful for those who want to learn how this technology can be applied by anyone, regardless of prior technical expertise, to solving real-word problems.