AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

Datasets for practicing Logistic Regression – Sushrut Tendulkar

#artificialintelligenceSep-3-2020, 22:30:53 GMT

I was looking for a list of Machine Learning datasets for comparing Logistic Regression model but I couldn't find it easily. I spent some time curating it based on my need. This post is collection of such datasets which you can download for your use. The data set contains 3 classes of 50 instances each, where each class refers to a type of iris plant. One class is linearly separable from the other 2; the latter are NOT linearly separable from each other.

artificial intelligence, dataset, machine learning, (3 more...)

#artificialintelligence

Country:

North America > United States > Illinois > Cook County > Chicago (0.07)
Europe > Portugal (0.07)
Asia > Taiwan (0.07)

Genre:

Research Report > New Finding (0.63)
Research Report > Experimental Study (0.63)

Industry: Health & Medicine (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.96)

Add feedback

Interpretable Signal Analysis with Knockoffs Enhances Classification of Bacterial Raman Spectra

Chia, Charmaine, Sesia, Matteo, Ho, Chi-Sing, Jeffrey, Stefanie S., Dionne, Jennifer, Candès, Emmanuel J., Howe, Roger T.

arXiv.org Machine LearningSep-3-2020

EW sensor technologies have contributed to the advent of "big data" in biomedicine, of which signal data are for example, saliency methods help visualize the activation of an important modality. From one-dimensional electrocardiography individual input features [5], while attribution methods like and electroencephalography signals from the heart and LIME [6] and SHAP [7] quantify the impact of each feature brain, to two-dimensional tissue images of tumor histology, to on the output predictions. However, these post hoc techniques three-dimensional magnetic resonance images, these consist are inadequate for developing simpler models. of sequential measures of an observable along one or more With regard to relevancy, studies report that people favor independent axes such as time, distance, or frequency. Signal explanations that are short, contrast instances with different data differ from structured forms of data in that the meaning outcomes, and highlight abnormal causes [8]. In other words, of each independent variable is not as distinctively and intuitively we seek to understand which features are important, and definable. Informative features must be extracted from how these affect the outcome. Data scientists often pursue these raw data using signal processing and machine learning these goals through feature selection, in addition to feature (ML) techniques before useful patterns can be detected and extraction, to ensure that their conclusions are based on leveraged to make predictions.

artificial intelligence, data mining, machine learning, (16 more...)

arXiv.org Machine Learning

2006.04937

Country:

North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)

Genre: Research Report > Experimental Study (0.35)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.94)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.95)

Add feedback

Large Dimensional Analysis and Improvement of Multi Task Learning

Tiomoko, Malik, Couillet, Romain, Tiomoko, Hafiz

arXiv.org Machine LearningSep-3-2020

Multi Task Learning (MTL) efficiently leverages useful information c ontained in multiple related tasks to help improve the generalization performance of all tasks. This article conducts a large dimensional analysis of a simple but, as we shall see, extremely powerful when carefully tuned, Least Square Support Vector Machine (LSS VM) version of MTL, in the regime where the dimension p of the data and their number n grow large at the same rate. Under mild assumptions on the input data, the theoretical analysis o f the MTL-LSSVM algorithm first reveals the "sufficient statistics" exploited by the alg orithm and their interaction at work. These results demonstrate, as a striking consequ ence, that the standard approach to MTL-LSSVM is largely suboptimal, can lead to severe effe cts of negative transfer but that these impairments are easily corrected. These correctio ns are turned into an improved MTL-LSSVM algorithm which can only benefit from additional data, and the theoretical performance of which is also analyzed. As evidenced and theoretically sustained in numerous recent works, these large dimensional results are robust to broad ranges of data distributions, w hich our present experiments corroborate. Specifically, the article reports a systematic ally close behavior between theoretical and empirical performances on popular datasets, wh ich is strongly suggestive of the applicability of the proposed carefully tuned MTL-LSSVM method to real data. This fine-tuning is fully based on the theoretical analysis and does not in p articular require any cross validation procedure. Besides, the reported performance s on real datasets almost systematically outperform much more elaborate and less intuitive state -of-the-art multi-task and transfer learning methods.

artificial intelligence, machine learning, qa 1 2, (17 more...)

arXiv.org Machine Learning

2009.01591

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
North America > United States > New York > New York County > New York City (0.04)
(7 more...)

Genre: Research Report > New Finding (0.87)

Industry: Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.45)

Add feedback

Machine Learning for Finance: How To Implement Bayesian Regression with Python

#artificialintelligenceSep-2-2020, 15:10:43 GMT

Wikipedia: "In statistics, Bayesian linear regression is an approach to linear regression in which the statistical analysis is undertaken within the context of Bayesian inference. When the regression model has errors that have a normal distribution, and if a particular form of the prior distribution is assumed, explicit results are available for the posterior probability distributions of the model's parameters." The most common interpretation of Bayes' formula in finance is the diachronic interpretation. This mainly states that over time we learn new information about certain variables or parameters of interest, like the mean return of a time series. Here, H stands for an event, the hypothesis, and D represents the data an experiment or the real world might present.

artificial intelligence, bayesian regression, machine learning, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.77)

Add feedback

A Bayesian Approach with Type-2 Student-tMembership Function for T-S Model Identification

Singh, Vikas, Bharadhwaj, Homanga, Verma, Nishchal K

arXiv.org Artificial IntelligenceSep-2-2020

Clustering techniques have been proved highly suc-cessful for Takagi-Sugeno (T-S) fuzzy model identification. Inparticular, fuzzyc-regression clustering based on type-2 fuzzyset has been shown the remarkable results on non-sparse databut their performance degraded on sparse data. In this paper, aninnovative architecture for fuzzyc-regression model is presentedand a novel student-tdistribution based membership functionis designed for sparse data modelling. To avoid the overfitting,we have adopted a Bayesian approach for incorporating aGaussian prior on the regression coefficients. Additional noveltyof our approach lies in type-reduction where the final output iscomputed using Karnik Mendel algorithm and the consequentparameters of the model are optimized using Stochastic GradientDescent method. As detailed experimentation, the result showsthat proposed approach outperforms on standard datasets incomparison of various state-of-the-art methods.

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2009.00822

Country: Asia > India > Uttar Pradesh > Kanpur (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.89)
(2 more...)

Add feedback

Travel time prediction for congested freeways with a dynamic linear model

Kwak, Semin, Geroliminis, Nikolas

arXiv.org Machine LearningSep-2-2020

Accurate prediction of travel time is an essential feature to support Intelligent Transportation Systems (ITS). The non-linearity of traffic states, however, makes this prediction a challenging task. Here we propose to use dynamic linear models (DLMs) to approximate the non-linear traffic states. Unlike a static linear regression model, the DLMs assume that their parameters are changing across time. We design a DLM with model parameters defined at each time unit to describe the spatio-temporal characteristics of time-series traffic data. Based on our DLM and its model parameters analytically trained using historical data, we suggest an optimal linear predictor in the minimum mean square error (MMSE) sense. We compare our prediction accuracy of travel time for freeways in California (I210-E and I5-S) under highly congested traffic conditions with those of other methods: the instantaneous travel time, k-nearest neighbor, support vector regression, and artificial neural network. We show significant improvements in the accuracy, especially for short-term prediction.

artificial intelligence, machine learning, travel time, (15 more...)

arXiv.org Machine Learning

doi: 10.1109/TITS.2020.3006910

2009.01016

Country:

North America > United States > California (0.24)
Europe > Switzerland > Vaud > Lausanne (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Industry:

Transportation (1.00)
Consumer Products & Services > Travel (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.55)

Add feedback

Learn Regression Analysis for Business

#artificialintelligenceSep-1-2020, 16:40:33 GMT

A complete hands on practical exercises to build regression models that are highly used for business analysis. This course is designed to start with the very basics then add up information gradually. Accordingly students who have fair background in regression analysis can choose to jump to the practical part of the course to learn building regression models in detail. In this course you will learn about different types of regression models and learn to build and use the ones used in business analysis. You will learn step by step how to understand a business problem from data observations and determine the variables you need to include in regression analysis.

artificial intelligence, machine learning, regression analysis, (5 more...)

#artificialintelligence

Genre:

Research Report > Experimental Study (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

13 Algorithms and 4 Learning Methods of Machine Learning

#artificialintelligenceSep-1-2020, 06:25:21 GMT

According to the similarity of the function and form of the algorithm, we can classify the algorithm, such as tree-based algorithm, neural network-based algorithm, and so on. Of course, the scope of machine learning is very large, and it is difficult for some algorithms to be clearly classified into a certain category. Regression algorithm is a type of algorithm that tries to explore the relationship between variables by using a measure of error. Regression algorithm is a powerful tool for statistical machine learning. In the field of machine learning, when people talk about regression, sometimes they refer to a type of problem and sometimes a type of algorithm.

artificial intelligence, bayesian inference, machine learning, (15 more...)

#artificialintelligence

Country: North America > United States > Oregon > Jackson County > Central Point (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
(2 more...)

Add feedback

Continuous Artificial Prediction Markets as a Syndromic Surveillance Technique

Jahedpari, Fatemeh

arXiv.org Artificial IntelligenceSep-1-2020

According to the World Health Organisation (WHO) [World Health Organization, 2013], the United Nations directing and coordinating health authority, public health surveillance is: The continuous, systematic collection, analysis and interpretation of health-related data needed for the planning, implementation, and evaluation of public health practice. Public health surveillance practice has evolved over time. Although it was limited to pen and paper at the beginning of 20th century, it is now facilitated by huge advances in informatics. Information technology enhancements have changed the traditional approaches of capturing, storing, sharing and analysing of data and resulted efficient and reliable health surveillance techniques [Lombardo and Buckeridge, 2007]. The main objective and challenge of a health surveillance system is the earliest possible detection of a disease outbreak within a society for the purpose of protecting community health. In the past, before the widespread deployment of computers, health surveillance was based on reports received from medical care centres and laboratories.

bioinformatics, data mining, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2009.00394

Country:

North America > United States > District of Columbia > Washington (0.14)
Europe > United Kingdom > England (0.04)
Europe > Netherlands (0.04)
(5 more...)

Genre: Research Report > Experimental Study (0.47)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
(3 more...)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
(4 more...)

Add feedback

Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems

Nakka, Yashwanth Kumar, Liu, Anqi, Shi, Guanya, Anandkumar, Anima, Yue, Yisong, Chung, Soon-Jo

arXiv.org Artificial IntelligenceSep-1-2020

Learning-based control algorithms require data collection with abundant supervision for training. Safe exploration algorithms ensure the safety of this data collection process even when only partial knowledge is available. We present a new approach for optimal motion planning with safe exploration that integrates chance-constrained stochastic optimal control with dynamics learning and feedback control. We derive an iterative convex optimization algorithm that solves an \underline{Info}rmation-cost \underline{S}tochastic \underline{N}onlinear \underline{O}ptimal \underline{C}ontrol problem (Info-SNOC). The optimization objective encodes both optimal performance and exploration for learning, and the safety is incorporated as distributionally robust chance constraints. The dynamics are predicted from a robust regression model that is learned from data. The Info-SNOC algorithm is used to compute a sub-optimal pool of safe motion plans that aid in exploration for learning unknown residual dynamics under safety constraints. A stable feedback controller is used to execute the motion plan and collect data for model learning. We prove the safety of rollout from our exploration method and reduction in uncertainty over epochs, thereby guaranteeing the consistency of our learning method. We validate the effectiveness of Info-SNOC by designing and implementing a pool of safe trajectories for a planar robot. We demonstrate that our approach has higher success rate in ensuring safety when compared to a deterministic trajectory optimization approach.

artificial intelligence, machine learning, trajectory, (19 more...)

arXiv.org Artificial Intelligence

2005.04374

Country: North America > United States > California (0.04)

Genre: Research Report (0.64)

Industry:

Energy (0.47)
Aerospace & Defense (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Add feedback