Named Entity Recognition: Milestone Models, Papers and Technologies


Named Entity Recognition (NER), or entity extraction is an NLP technique which locates and classifies the named entities present in the text. Named Entity Recognition classifies the named entities into pre-defined categories such as the names of persons, organizations, locations, quantities, monetary values, specialized terms, product terminology and expressions of times. Named Entity Recognition is a part of a broader field called Information Extraction. According to Wikipedia, Information Extraction is the task of automatically extracting structured information from any kind of text, structured and/or unstructured. Natural Language Processing has observed a paradigm shift in accuracy through past few years.

Data Science Simplified Part 7: Log-Log Regression Models


In the last few blog posts of this series, we discussed simple linear regression model. We discussed multivariate regression model and methods for selecting the right model. Fernando has now created a better model. In this article will address that question. This article will elaborate about Log-Log regression models.

R Linear Regression


Regression analysis is a statistical tool to determine relationships between different types of variables. Variables that remain unaffected by changes made in other variables are known as independent variables, also known as a predictor or explanatory variables while those that are affected are known as dependent variables also known as the response variable. Linear regression is a statistical procedure which is used to predict the value of a response variable, on the basis of one or more predictor variables. Some common examples of linear regression are calculating GDP, CAPM, oil and gas prices, medical diagnosis, capital asset pricing etc. R Simple linear regression enables us to find a relationship between a continuous dependent variable Y and a continuous independent variable X. It is assumed that values of X are controlled and not subject to measurement error and corresponding values of Y are observed.

Introduction about Logistic Regression Model


Hello guys, we have learnt about Linear Regression model in my previous article. Today, in this article we will get to learn the basics of Logistic Regression and some tricks to find the relation between the variables. Do you know what type of variable is used in logistic regression… Don't worry, if you don't know then let me teach the variables: In simple linear regression the variables are one dependent and one independent, In multiple linear regression there are more than one independent variable. Understand one thing if your data is in continuous form then use only linear regression model, while on the other hand, if your data is in categorical form(e.g. In this model the data been code in binary form.

Time series modeling by a regression approach based on a latent process Machine Learning

Time series are used in many domains including finance, engineering, economics and bioinformatics generally to represent the change of a measurement over time. Modeling techniques may then be used to give a synthetic representation of such data. A new approach for time series modeling is proposed in this paper. It consists of a regression model incorporating a discrete hidden logistic process allowing for activating smoothly or abruptly different polynomial regression models. The model parameters are estimated by the maximum likelihood method performed by a dedicated Expectation Maximization (EM) algorithm. The M step of the EM algorithm uses a multi-class Iterative Reweighted Least-Squares (IRLS) algorithm to estimate the hidden process parameters. To evaluate the proposed approach, an experimental study on simulated data and real world data was performed using two alternative approaches: a heteroskedastic piecewise regression model using a global optimization algorithm based on dynamic programming, and a Hidden Markov Regression Model whose parameters are estimated by the Baum-Welch algorithm. Finally, in the context of the remote monitoring of components of the French railway infrastructure, and more particularly the switch mechanism, the proposed approach has been applied to modeling and classifying time series representing the condition measurements acquired during switch operations.