Collaborating Authors

Growth in Insights vs. Growth in Big Data


Plato's original definition of knowledge was "justified true belief". The first thing to notice is that knowledge doesn't exist outside the human mind (belief) and its byproducts like algorithms. Not any belief qualifies as knowledge, for example "I believe it rains outside" doesn't include any justification. However "I believe it rains outside because I can hear it" includes a justification and MAY become knowledge. The reason why "may" is because I could be wrong, for example I may hear a recording of a rain or someone washing the office windows with a hose (hoping I don't have an early form of tinnitus).

Windows 10 tip: Point and click to edit the system path variable


Use this well-hidden editor to quickly add a folder to the Windows 10 system path. The system path has been part of Microsoft operating systems since the earliest days of MS-DOS. This environment variable lives on in Windows 10 as a way to tell the system where to look when you try to run a command. Normally, the system looks in the Windows folder and its System32 subfolder. But you might want to add a folder to the path so that you can run custom utilities stored in that folder.

Network Fragments: Representing Knowledge for Constructing Probabilistic Models Artificial Intelligence

In most current applications of belief networks, domain knowledge is represented by a single belief network that applies to all problem instances in the domain. In more complex domains, problem-specific models must be constructed from a knowledge base encoding probabilistic relationships in the domain. Most work in knowledge-based model construction takes the rule as the basic unit of knowledge. We present a knowledge representation framework that permits the knowledge base designer to specify knowledge in larger semantically meaningful units which we call network fragments. Our framework provides for representation of asymmetric independence and canonical intercausal interaction. We discuss the combination of network fragments to form problem-specific models to reason about particular problem instances. The framework is illustrated using examples from the domain of military situation awareness.

Make Your Data Mean More With Derived Variables


Derived Variables: Making the Data Mean More Download this chapter from Data Mining Techniques, Third Edition, by Gordon Linoff and Michael Berry, and learn how to create derived variables, which allow the statistical modeling process to incorporate human insights. As much art as science, selecting variables for modeling is "one of the most creative parts of the data mining process," according to the authors. The chapter begins with a story about modeling customer attrition in the cell phone industry, moves to a review of several classic variable combinations, and then offers guidelines for the creation of derived variables.

109 Commonly Asked Data Science Interview Questions


What is the Central Limit Theorem and why is it important? How many sampling methods do you know? What is the difference between Type I vs Type II error? What do the terms P-value, coefficient, R-Squared value mean? What is the significance of each of these components? What are the assumptions required for linear regression? There are four major assumptions: 1. There is a linear relationship between the variables, meaning the model you are creating actually fits the data, 2. The errors or residuals of the data are normally distributed and independent from each other, 3. There is minimal multicollinearity between explanatory variables, and 4. Homoscedasticity. This means the variance around the regression line is the same for all values of the predictor variable. What is an example of a dataset with a non-Gaussian distribution?