AITopics | split value

Collaborating Authors

split value

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SMART: A Flexible Approach to Regression using Spline-Based Multivariate Adaptive Regression Trees

Pattie, William, Krishna, Arvind

arXiv.org Machine LearningOct-7-2024

Decision trees are powerful for predictive modeling but often suffer from high variance when modeling continuous relationships. While algorithms like Multivariate Adaptive Regression Splines (MARS) excel at capturing such continuous relationships, they perform poorly when modeling discontinuities. To address the limitations of both approaches, we introduce Spline-based Multivariate Adaptive Regression Trees (SMART), which uses a decision tree to identify subsets of data with distinct continuous relationships and then leverages MARS to fit these relationships independently. Unlike other methods that rely on the tree structure to model interaction and higher-order terms, SMART leverages MARS's native ability to handle these terms, allowing the tree to focus solely on identifying discontinuities in the relationship. We test SMART on various datasets, demonstrating its improvement over state-of-the-art methods in such cases. Additionally, we provide an open-source implementation of our method to be used by practitioners.

algorithm, dataset, leaf node, (15 more...)

arXiv.org Machine Learning

2410.05597

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > India (0.04)

Genre: Research Report > Promising Solution (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback

Analysis of the Evolution of Parametric Drivers of High-End Sea-Level Hazards

Hough, Alana, Wong, Tony E.

arXiv.org Artificial IntelligenceJun-10-2021

Climate models are critical tools for developing strategies to manage the risks posed by sea-level rise to coastal communities. While these models are necessary for understanding climate risks, there is a level of uncertainty inherent in each parameter in the models. This model parametric uncertainty leads to uncertainty in future climate risks. Consequently, there is a need to understand how those parameter uncertainties impact our assessment of future climate risks and the efficacy of strategies to manage them. Here, we use random forests to examine the parametric drivers of future climate risk and how the relative importances of those drivers change over time. We find that the equilibrium climate sensitivity and a factor that scales the effect of aerosols on radiative forcing are consistently the most important climate model parametric uncertainties throughout the 2020 to 2150 interval for both low and high radiative forcing scenarios. The near-term hazards of high-end sea-level rise are driven primarily by thermal expansion, while the longer-term hazards are associated with mass loss from the Antarctic and Greenland ice sheets. Our results highlight the practical importance of considering time-evolving parametric uncertainties when developing strategies to manage future climate risks.

artificial intelligence, machine learning, random forest, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.5194/ascmo-8-117-2022

2106.12041

Country:

North America > Greenland (0.25)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Austria > Vienna (0.14)
(10 more...)

Genre: Research Report > New Finding (0.48)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.38)

Add feedback

Isolation Forest Algorithm for Anomaly Detection

#artificialintelligenceOct-27-2020, 15:50:14 GMT

Did you ever wonder how credit card fraud detection is caught in real-time? Do you want to know how to catch an intruder program if it is trying to access your system? This is all possible by the application of the anomaly detection machine learning model. Anomaly detection is one of the most popular machine learning techniques. In this article, we will learn concepts related to anomaly detection and how to implement it as a machine learning model.

artificial intelligence, data mining, machine learning, (9 more...)

#artificialintelligence

Industry: Law Enforcement & Public Safety > Fraud (0.56)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

What is Gradient Boosting and How is it different from AdaBoost?

#artificialintelligenceAug-6-2020, 02:50:02 GMT

Ensemble methods is a machine learning technique that combines several base models in order to produce one optimal predictive model. There are various ensemble methods such as stacking, blending, bagging, and boosting. Gradient Boosting, as the name suggests is a boosting method. Boosting is loosely-defined as a strategy that combines multiple simple models into a single composite model. With the introduction of more simple models, the overall model becomes a stronger predictor.

artificial intelligence, learner, machine learning, (16 more...)

#artificialintelligence

Industry: Health & Medicine (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

Conservative Q-Improvement: Reinforcement Learning for an Interpretable Decision-Tree Policy

Roth, Aaron M., Topin, Nicholay, Jamshidi, Pooyan, Veloso, Manuela

arXiv.org Artificial IntelligenceJul-2-2019

There is a growing desire in the field of reinforcement learning (and machine learning in general) to move from black-box models toward more "interpretable AI." We improve interpretability of reinforcement learning by increasing the utility of decision tree policies learned via reinforcement learning. These policies consist of a decision tree over the state space, which requires fewer parameters to express than traditional policy representations. Existing methods for creating decision tree policies via reinforcement learning focus on accurately representing an action-value function during training, but this leads to much larger trees than would otherwise be required. To address this shortcoming, we propose a novel algorithm which only increases tree size when the estimated discounted future reward of the overall policy would increase by a sufficient amount. Through evaluation in a simulated environment, we show that its performance is comparable or superior to traditional tree-based approaches and that it yields a more succinct policy. Additionally, we discuss tuning parameters to control the tradeoff between optimizing for smaller tree size or for overall reward.

decision tree, leaf node, node, (14 more...)

arXiv.org Artificial Intelligence

1907.0118

Country:

North America > United States > South Carolina (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Cuba (0.04)
Asia > Vietnam > Long An Province (0.04)

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Lossless (and Lossy) Compression of Random Forests

Painsky, Amichai, Rosset, Saharon

arXiv.org Machine LearningOct-26-2018

Ensemble methods are among the state-of-the-art predictive modeling approaches. Applied to modern big data, these methods often require a large number of sub-learners, where the complexity of each learner typically grows with the size of the dataset. This phenomenon results in an increasing demand for storage space, which may be very costly. This problem mostly manifests in a subscriber based environment, where a user-specific ensemble needs to be stored on a personal device with strict storage limitations (such as a cellular device). In this work we introduce a novel method for lossless compression of tree-based ensemble methods, focusing on random forests. Our suggested method is based on probabilistic modeling of the ensemble's trees, followed by model clustering via Bregman divergence. This allows us to find a minimal set of models that provides an accurate description of the trees, and at the same time is small enough to store and maintain. Our compression scheme demonstrates high compression rates on a variety of modern datasets. Importantly, our scheme enables predictions from the compressed format and a perfect reconstruction of the original ensemble. In addition, we introduce a theoretically sound lossy compression scheme, which allows us to control the trade-off between the distortion and the coding rate.

artificial intelligence, decision tree learning, machine learning, (18 more...)

arXiv.org Machine Learning

1810.11197

Country: Asia > Middle East > Israel (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback

CatBoost vs. Light GBM vs. XGBoost – Towards Data Science

#artificialintelligenceApr-7-2018, 23:29:58 GMT

I recently participated in this Kaggle competition (WIDS Datathon by Stanford) where I was able to land up in Top 10 using various boosting algorithms. Since then, I have been very curious about the fine workings of each model including parameter tuning, pros and cons and hence decided to write this blog. Despite the recent re-emergence and popularity of neural networks, I am focusing on boosting algorithms because they are still more useful in the regime of limited training data, little training time and little expertise for parameter tuning. Since XGBoost (often called GBM Killer) has been in the machine learning world for a longer time now with lots of articles dedicated to it, this post will focus more on CatBoost & LGBM. LightGBM uses a novel technique of Gradient-based One-Side Sampling (GOSS) to filter out the data instances for finding a split value while XGBoost uses pre-sorted algorithm & Histogram-based algorithm for computing the best split.

algorithm, feature value, split value, (11 more...)

#artificialintelligence

Genre: Contests & Prizes (0.56)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.86)

Add feedback

CatBoost vs. Light GBM vs. XGBoost

@machinelearnbotMar-25-2018, 23:05:42 GMT

accuracy, artificial intelligence, machine learning, (19 more...)

@machinelearnbot

Genre: Contests & Prizes (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.87)

Add feedback