AITopics | Performance Analysis


Performance Analysis


50 Questions to Test True Data Science Knowledge

@machinelearnbot, Dec-21-2016, 00:35:04 GMT

Explain what regularization is and why it is useful. What are the benefits and drawbacks of specific methods, such as ridge regression and LASSO? Explain what a local optimum is and why it is important in a specific context, such as k-means clustering. What are specific ways for determining if you have a local optimum problem? What can be done to avoid local optima?
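One way to make the local-optimum question concrete is a minimal sketch, assuming plain Lloyd's-algorithm k-means on 2-D points with random restarts. The function names (`kmeans`, `kmeans_restarts`) and parameters are hypothetical, chosen for illustration only. Running the algorithm from several random initializations and comparing the final inertia (within-cluster sum of squared distances) is one common way both to detect and to reduce the impact of local optima.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_point(cluster):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(cluster)
    return tuple(sum(coord) / n for coord in zip(*cluster))

def kmeans(points, k, rng, n_iter=100):
    """One run of Lloyd's algorithm; returns (centroids, inertia)."""
    centroids = rng.sample(points, k)  # random initialization from the data
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: sq_dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        new_centroids = [mean_point(c) if c else centroids[j]
                         for j, c in enumerate(clusters)]
        if new_centroids == centroids:
            break  # converged, possibly to a local optimum
        centroids = new_centroids
    inertia = sum(min(sq_dist(p, c) for c in centroids) for p in points)
    return centroids, inertia

def kmeans_restarts(points, k, n_restarts=10, seed=0):
    """Run k-means several times and keep the run with the lowest inertia."""
    rng = random.Random(seed)
    runs = [kmeans(points, k, rng) for _ in range(n_restarts)]
    inertias = [inertia for _, inertia in runs]
    best_centroids, best_inertia = min(runs, key=lambda run: run[1])
    return best_centroids, best_inertia, inertias
```

If the inertias returned across restarts differ noticeably, at least some runs ended in a local optimum; keeping the lowest-inertia run is a simple mitigation, not a guarantee of reaching the global optimum.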

artificial intelligence, false negative, machine learning, (15 more...)


Genre: Research Report (0.39)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.83)


4 trends in security data science

#artificialintelligence, Dec-20-2016, 12:05:20 GMT

In 2015, we saw graphs dominate security data science. Graphs permeated all areas--everything from visualizations to graphical inference. It's quite easy to write about security trends for 2016--the hard part is trying to interpret what the trends could potentially mean to organizations on a day-to-day basis. This article is not the wishlist of a deluded security data scientist. Rather, these are strategic trends that you can expect to see in the field, mixed with tactical steps to capitalize on them.

artificial intelligence, detection, machine learning, (16 more...)


Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)


A tree-based kernel for graphs with continuous attributes

Martino, Giovanni Da San, Navarin, Nicolò, Sperduti, Alessandro

arXiv.org Artificial Intelligence, Dec-20-2016

The availability of graph data with node attributes that can be either discrete or real-valued is constantly increasing. While existing kernel methods are effective techniques for dealing with graphs having discrete node labels, their adaptation to non-discrete or continuous node attributes has been limited, mainly because of computational issues. Recently, a few kernels especially tailored for this domain, which trade predictive performance for computational efficiency, have been proposed. In this paper, we propose a graph kernel for complex and continuous node attributes, whose features are tree structures extracted from specific graph visits. The kernel manages to keep the same complexity as state-of-the-art kernels while implicitly using a larger feature space. We further present an approximated variant of the kernel which reduces its complexity significantly. Experimental results obtained on six real-world datasets show that the kernel is the best performing one on most of them. Moreover, in most cases the approximated version reaches performance comparable to current state-of-the-art kernels in terms of classification accuracy while greatly shortening the running times.

artificial intelligence, kernel, machine learning, (18 more...)


doi: 10.1109/TNNLS.2017.2705694

arXiv: 1509.01116

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Kernel Methods (0.34)


Student and Faculty Guide – 10 easy steps to get up and running with Azure Machine Learning

#artificialintelligence, Dec-18-2016, 18:00:37 GMT

My colleague Amy Nicholson is the UK expert on Azure Machine Learning; the following blog post came out of a quizzing session on how to get started with Azure Machine Learning. Each student receives $100 of Azure credit per month for 6 months. The faculty member receives $250 per month for 12 months. The Azure Machine Learning team provided a very nice walkthrough tutorial which covers a lot of the basics. This tutorial is really useful as it takes you through the entire process of creating an AzureML workspace, uploading data, creating an experiment to predict someone's credit risk, building, training, and evaluating the models, publishing your best model as a web service, and calling that web service. Now you need to learn how to import a data set into Azure Machine Learning, and where to find interesting data to build something amazing.

artificial intelligence, azure machine learning, machine learning, (16 more...)


Country: Europe > United Kingdom (0.25)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education (1.00)
Banking & Finance > Credit (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)


109 Commonly Asked Data Science Interview Questions

#artificialintelligence, Dec-18-2016, 14:35:33 GMT

What is the Central Limit Theorem and why is it important? How many sampling methods do you know? What is the difference between Type I and Type II error? What do the terms p-value, coefficient, and R-squared value mean? What is the significance of each of these components? What are the assumptions required for linear regression? There are four major assumptions: 1. There is a linear relationship between the variables, meaning the model you are creating actually fits the data; 2. The errors or residuals of the data are normally distributed and independent of each other; 3. There is minimal multicollinearity between explanatory variables; and 4. Homoscedasticity, meaning the variance around the regression line is the same for all values of the predictor variable. What is an example of a dataset with a non-Gaussian distribution?
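As an illustration of the homoscedasticity assumption, here is a minimal sketch, assuming a single predictor and an ordinary least-squares line. The helper names (`fit_line`, `residual_variance_ratio`) are hypothetical. It is a crude diagnostic rather than a formal statistical test: it compares the residual variance for the upper half of the predictor values against the lower half.

```python
from statistics import mean, pvariance

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

def residual_variance_ratio(xs, ys):
    """Residual variance for large x divided by residual variance for small x.

    A ratio near 1 is consistent with homoscedasticity; a ratio far from 1
    suggests the spread around the regression line changes with x.
    """
    slope, intercept = fit_line(xs, ys)
    pairs = sorted(zip(xs, ys))  # order observations by predictor value
    residuals = [y - (slope * x + intercept) for x, y in pairs]
    half = len(residuals) // 2
    return pvariance(residuals[half:]) / pvariance(residuals[:half])
```

Plotting residuals against fitted values is the more usual first check; the ratio above only summarizes the same idea in one number.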

artificial intelligence, data mining, machine learning, (17 more...)


Country: North America > United States (0.04)

Genre: Personal > Interview (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.33)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.30)


Weekly Digest, December 19

@machinelearnbot, Dec-18-2016, 03:45:02 GMT

Data Science for IoT vs Classic Data Science: 10 Differences; Enterprise AI insights from the AI Europe event in London; Is it time to consider data in motion in your big data projects?

artificial intelligence, data mining, machine learning, (15 more...)


Country: Europe (0.30)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)


Optimal tuning for divide-and-conquer kernel ridge regression with massive data

Xu, Ganggang, Shang, Zuofeng, Cheng, Guang

arXiv.org Machine Learning, Dec-18-2016

We propose a first data-driven tuning procedure for divide-and-conquer kernel ridge regression (Zhang et al., 2015). While the proposed criterion is computationally scalable for massive data sets, it is also shown to be asymptotically optimal under mild conditions. The effectiveness of our method is illustrated by extensive simulations and an application to the Million Song Dataset. Key words: distributed GCV, divide-and-conquer, kernel ridge regression, optimal tuning.
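For orientation, the divide-and-conquer estimator that the abstract tunes can be sketched roughly as follows: split the data at random into subsets, fit kernel ridge regression on each subset, and average the resulting predictors. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes 1-D inputs and a Gaussian (RBF) kernel, the function names and the `gamma` parameter are hypothetical, and the regularization parameter `lam` is fixed by hand. The paper's distributed GCV tuning criterion is not shown.

```python
import math
import random

def rbf(a, b, gamma=1.0):
    """Gaussian (RBF) kernel on scalars; the kernel choice is an assumption."""
    return math.exp(-gamma * (a - b) ** 2)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b_i] for row, b_i in zip(A, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def fit_krr(xs, ys, lam, gamma=1.0):
    """Kernel ridge regression on one subset: alpha = (K + n*lam*I)^-1 y."""
    n = len(xs)
    K = [[rbf(a, b, gamma) + (n * lam if i == j else 0.0)
          for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    alpha = solve(K, ys)
    return lambda x: sum(a * rbf(x, xi, gamma) for a, xi in zip(alpha, xs))

def fit_dc_krr(xs, ys, n_parts, lam, gamma=1.0, seed=0):
    """Randomly partition the data, fit KRR per subset, average the predictors."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    parts = [idx[i::n_parts] for i in range(n_parts)]
    models = [fit_krr([xs[i] for i in p], [ys[i] for i in p], lam, gamma)
              for p in parts]
    return lambda x: sum(m(x) for m in models) / len(models)
```

Each subset requires solving a linear system of only its own size, which is where the computational saving over fitting a single model on all the data comes from.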

artificial intelligence, dgcv, machine learning, (18 more...)


arXiv: 1612.05907

Country: North America > United States > New York (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.82)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.82)


11 Important Model Evaluation Techniques Everyone Should Know

@machinelearnbot, Dec-16-2016, 02:55:03 GMT

Model evaluation metrics are used to assess goodness of fit between model and data, to compare different models in the context of model selection, and to predict how accurate predictions (associated with a specific model and data set) are expected to be. Confidence intervals are used to assess how reliable a statistical estimate is. Wide confidence intervals mean either that your model is poor (and it is worth investigating other models) or, if the intervals don't improve when you change the model (that is, when you test a different theoretical statistical distribution for your observations), that your data is very noisy. Modern confidence intervals are model-free and data-driven. A more general framework to assess and reduce sources of variance is called analysis of variance.
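One widely used model-free, data-driven interval is the percentile bootstrap; whether it is the method the article has in mind is an assumption. The sketch below resamples the data with replacement, recomputes the statistic each time, and reads the interval off the sorted estimates. The function name `bootstrap_ci` and its parameters are hypothetical.

```python
import random
from statistics import mean

def bootstrap_ci(data, stat=mean, n_resamples=2000, level=0.95, seed=0):
    """Percentile bootstrap confidence interval for stat(data)."""
    rng = random.Random(seed)
    n = len(data)
    # Recompute the statistic on n_resamples samples drawn with replacement.
    estimates = sorted(
        stat([data[rng.randrange(n)] for _ in range(n)])
        for _ in range(n_resamples)
    )
    # Take the empirical quantiles that leave (1 - level) / 2 in each tail.
    lo_idx = int((1 - level) / 2 * n_resamples)
    hi_idx = int((1 + level) / 2 * n_resamples) - 1
    return estimates[lo_idx], estimates[hi_idx]
```

No distributional form is assumed for the observations, so a wide interval here reflects noise or a small sample rather than a mis-specified theoretical distribution.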

artificial intelligence, machine learning, test data, (14 more...)


Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)


How To Implement Machine Learning Algorithm Performance Metrics From Scratch With Python

@machinelearnbot, Dec-15-2016, 10:05:02 GMT

After you make predictions, you need to know if they are any good. There are standard measures that we can use to summarize how good a set of predictions actually is. Knowing how good a set of predictions is allows you to estimate how good a given machine learning model of your problem is. In this tutorial, you will discover how to implement four standard prediction evaluation metrics from scratch in Python. How to implement and interpret a confusion matrix. How to implement mean absolute error for regression.
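A from-scratch version of such metrics might look like the sketch below. The summary names only the confusion matrix and mean absolute error explicitly; including classification accuracy and root mean squared error as the other two is an assumption, and all function names are illustrative rather than taken from the tutorial.

```python
from math import sqrt

def accuracy(actual, predicted):
    """Fraction of predictions that match the actual class labels."""
    correct = sum(a == p for a, p in zip(actual, predicted))
    return correct / len(actual)

def confusion_matrix(actual, predicted):
    """Return (labels, matrix) with rows = actual class, columns = predicted."""
    labels = sorted(set(actual) | set(predicted))
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return labels, matrix

def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def root_mean_squared_error(actual, predicted):
    """Square root of the average squared prediction error."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return sqrt(squared / len(actual))
```

In the confusion matrix, the diagonal counts correct predictions per class, and each off-diagonal cell shows which class was mistaken for which.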

artificial intelligence, data science, machine learning, (12 more...)


Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.40)


Dynamical Kinds and their Discovery

Jantzen, Benjamin C.

arXiv.org Machine Learning, Dec-15-2016

We demonstrate the possibility of classifying causal systems into kinds that share a common structure without first constructing an explicit dynamical model or using prior knowledge of the system dynamics. The algorithmic ability to determine whether arbitrary systems are governed by causal relations of the same form offers significant practical applications in the development and validation of dynamical models. It is also of theoretical interest as an essential stage in the scientific inference of laws from empirical data. The algorithm presented is based on the dynamical symmetry approach to dynamical kinds. A dynamical symmetry with respect to time is an intervention on one or more variables of a system that commutes with the time evolution of the system. A dynamical kind is a class of systems sharing a set of dynamical symmetries. The algorithm presented classifies deterministic, time-dependent causal systems by directly comparing their exhibited symmetries. Using simulated, noisy data from a variety of nonlinear systems, we show that this algorithm correctly sorts systems into dynamical kinds. It is robust under significant sampling error, is immune to violations of normality in sampling error, and fails gracefully with increasing dynamical similarity. The algorithm we demonstrate is the first to address this aspect of automated scientific discovery.

artificial intelligence, machine learning, symmetry, (16 more...)


arXiv: 1612.04933

Country: North America > United States (0.28)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.30)
