Goto

Collaborating Authors

 computational statistic


Unsupervised Learning of Density Estimates with Topological Optimization

arXiv.org Machine Learning

Kernel density estimation is a key component of a wide variety of algorithms in machine learning, Bayesian inference, stochastic dynamics and signal processing. However, the unsupervised density estimation technique requires tuning a crucial hyperparameter: the kernel bandwidth. The choice of bandwidth is critical as it controls the bias-variance trade-off by over- or under-smoothing the topological features. Topological data analysis provides methods to mathematically quantify topological characteristics, such as connected components, loops, voids et cetera, even in high dimensions where visualization of density estimates is impossible. In this paper, we propose an unsupervised learning approach using a topology-based loss function for the automated and unsupervised selection of the optimal bandwidth and benchmark it against classical techniques -- demonstrating its potential across different dimensions.


Machine Learning for Signal Processing: Data Science, Algorithms, and Computational Statistics: Little, Max A.: 9780198714934: Amazon.com: Books

#artificialintelligence

"This book provides an excellent pathway for gaining first-class expertise in machine learning. It provides both the technical background that explains why certain approaches, but not others, are best practice in real world problems, and a framework for how to think about and approach new problems. I highly recommend it for people with a signal processing background who are seeking to become an expert in machine learning." With this book, Prof. Little has taken an important step in unifying ร‚machine learning and signal processing. As a whole, this book covers many topics, new and old, that are important in their own right and equips the reader with a broader perspective than traditional signal processing textbooks.


Measure Inducing Classification and Regression Trees for Functional Data

arXiv.org Machine Learning

We propose a tree-based algorithm for classification and regression problems in the context of functional data analysis, which allows to leverage representation learning and multiple splitting rules at the node level, reducing generalization error while retaining the interpretability of a tree. This is achieved by learning a weighted functional $L^{2}$ space by means of constrained convex optimization, which is then used to extract multiple weighted integral features from the input functions, in order to determine the binary split for each internal node of the tree. The approach is designed to manage multiple functional inputs and/or outputs, by defining suitable splitting rules and loss functions that can depend on the specific problem and can also be combined with scalar and categorical data, as the tree is grown with the original greedy CART algorithm. We focus on the case of scalar-valued functional inputs defined on unidimensional domains and illustrate the effectiveness of our method in both classification and regression tasks, through a simulation study and four real world applications.


Machine Learning and Data Science Hands-on with Python and R

#artificialintelligence

Learn from well designed, well-crafted study materials on Machine Learning ML, Statistics, Python, Artificial Intelligence AI, Tensorflow, AWS, Deep Learning, R Programming, NLP, Bayesian Methods, A/B Testing, Face Detection, Business Intelligence BI, Regression, Hypothesis Testing, Algebra, Adaboost Regressor, Gaussian, Heuristic, Numpy, Pandas, Metplotlit, Seaborn, Forecasting, Distribution, Normalization, Trend Analysis, Predictive Modeling, Fraud Detection, Neural Network, Sequential Model, Data Visualization, Data Analysis, Data Manipulation, KNN Algorithm, Decision Tree, Random Forests, Kmeans Clustering, Vector Machine, Time Series Analysis, Market Basket Analysis. Get the skills to work with implementations and develop capabilities that you can use to deliver results in a machine learning project. This program will help you build the foundation for a solid career in Machine learning Tools. Machine learning is a scientific discipline that explores the construction and study of algorithms that can learn from data. Such algorithms operate by building a model from example inputs and using that to make predictions or decisions, rather than following strictly static program instructions.


Machine Learning for Signal Processing: Data Science, Algorithms, and Computational Statistics: Max A. Little: 9780198714934: Amazon.com: Books

#artificialintelligence

This book provides an excellent pathway for gaining first-class expertise in machine learning. It provides both the technical background that explains why certain approaches, but not others, are best practice in real world problems, and a framework for how to think about and approach new problems. I highly recommend it for people with a signal processing background who are seeking to become an expert in machine learning.


Why AI is a Fear-Driven Discipline

#artificialintelligence

People are scared of AI. Another 18% were uncertain of the impact, which means that 64% of people have an uncertain or negative view of AI. "AI in military applications could give rise to a nuclear war by 2040." "Data-driven algorithms that automate applications by using that data -- could hold ethical implications over the privacy of patients." "Fear that AI could be used for mass surveillance." "Machine-learning that threatens to bake in racial, sexual, and other biases."


Ask a Data Scientist: What's Machine Learning?

#artificialintelligence

For as popular as the term "machine learning" has come to be, it's surprising to me how often it's equated to robots taking over the world. Phrases like "neural nets" and "deep learning" tap into our sense of fantasy, but when we jump from new tech to robot takeover, we miss the beauty and power of what machine learning actually is, and the groundbreaking new developments that are pushing industries forward. With this in mind, I sat down with our team's data scientist, Hillary Green-Lerman, to shed light on the buzzword. I asked the questions Wikipedia failed to fully answer: what is machine learning, who should be learning it and how soon can I visit Westworld? "Machine Learning is about using the data you already have to make predictions. This sounds really fancy, but most of the time, the'prediction' is really just a label," Hillary told us.


AI isn't very smart yet. But we need to get moving to make sure automation works for more people

#artificialintelligence

You've probably heard versions of each of the following ideas. With computers becoming remarkably adept at driving, understanding speech, and other tasks, more jobs could soon be automated than society is prepared to handle. This "superintelligence" will largely make human labor unnecessary. In fact, we'd better hope that machines don't eliminate us altogether, either accidentally or on purpose. Even though the first scenario is already under way, it won't necessarily lead to the second one.


Vincent Granville

@machinelearnbot

Granville V., Rasson J.P. Multivariate discriminate analysis and maximum penalized likelihood.... Journal of the Royal Statistical Society, Series B, 57 (1995), 501-517.


Robust mixture modelling using sub-Gaussian stable distribution

arXiv.org Machine Learning

Heavy-tailed distributions are widely used in robust mixture modelling due to possessing thick tails. As a computationally tractable subclass of the stable distributions, sub-Gaussian $\alpha$-stable distribution received much interest in the literature. Here, we introduce a type of expectation maximization algorithm that estimates parameters of a mixture of sub-Gaussian stable distributions. A comparative study, in the presence of some well-known mixture models, is performed to show the robustness and performance of the mixture of sub-Gaussian $\alpha$-stable distributions for modelling, simulated, synthetic, and real data.