AITopics | fastml

Collaborating Authors

fastml

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R

Korkmaz, Selcuk, Goksuluk, Dincer, Karaismailoglu, Eda

arXiv.org Machine LearningApr-14-2026

Preprocessing leakage arises when scaling, imputation, or other data-dependent transformations are estimated before resampling, inflating apparent performance while remaining hard to detect. We present fastml, an R package that provides a single-call interface for leakage-aware machine learning through guarded resampling, where preprocessing is re-estimated inside each resample and applied to the corresponding assessment data. The package supports grouped and time-ordered resampling, blocks high-risk configurations, audits recipes for external dependencies, and includes sandboxed execution and integrated model explanation. We evaluate fastml with a Monte Carlo simulation contrasting global and fold-local normalization, a usability comparison with tidymodels under matched specifications, and survival benchmarks across datasets of different sizes. The simulation demonstrates that global preprocessing substantially inflates apparent performance relative to guarded resampling. fastml matched held-out performance obtained with tidymodels while reducing workflow orchestration, and it supported consistent benchmarking of multiple survival model classes through a unified interface.

artificial intelligence, fastml, machine learning, (19 more...)

arXiv.org Machine Learning

2604.05225

Country:

Europe > Netherlands > South Holland > Rotterdam (0.04)
North America > United States > Wisconsin (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.93)

Add feedback

My 5 Favorite Data Science Portfolios · Learning With Data

#artificialintelligenceNov-25-2020, 09:45:26 GMT

At the end of the article, I posted a link to an example portfolio that I liked by Tim Dettmers. Afterward, I had a few people ask me to compile a larger list of great data science portfolios and projects. While not a portfolio, but rather a project, I think this is a great format to try and exemplify. Melissa Runfeldt did a great job defining and motivating her problem, discussing how she gathered data and explaining her methods with images of results. All in a way that would be easy for a non-technical person to follow (at least at a high level).

learning, portfolio, showcase, (12 more...)

#artificialintelligence

Country: North America > United States > California > Santa Clara County > Palo Alto (0.07)

Industry: Information Technology (0.74)

Technology:

Information Technology > Data Science (0.73)
Information Technology > Artificial Intelligence > Machine Learning (0.58)

Add feedback

Deep learning architecture diagrams - FastML

@machinelearnbotNov-7-2017, 22:40:57 GMT

As a wild stream after a wet season in African savanna diverges into many smaller streams forming lakes and puddles, so deep learning has diverged into a myriad of specialized architectures. Each architecture has a diagram. Here are some of them. Neural networks are conceptually simple, and that's their beauty. A bunch of homogenous, uniform units, arranged in layers, weighted connections between them, and that's all.

architecture, deep learning, neural network, (16 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep learning architecture diagrams - FastML

#artificialintelligenceMar-31-2017, 08:08:32 GMT

artificial intelligence, feature engineering, machine learning, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Tuning hyperparams fast with Hyperband - FastML

#artificialintelligenceMar-6-2017, 03:15:41 GMT

Hyperband is a relatively new method for tuning iterative algorithms. It performs random sampling and attempts to gain the edge by using time spent optimizing in the best way. We explain a few things that were not clear to us right away, and try the algorithm in practice. Candidates for tuning with Hyperband include all the SGD derivatives - meaning the whole deep learning - and tree ensembles: gradient boosting, and perhaps to a lesser extent, random forest and extremely randomized trees. To quantify this idea, we compare to random run at twice the speed which beats the two Bayesian Optimization methods, i.e., running random search for twice as long yields superior results.

artificial intelligence, iteration, machine learning, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.71)

Add feedback

Deep learning architecture diagrams - FastML

#artificialintelligenceSep-30-2016, 14:47:21 GMT

Like a wild stream after a wet season in African savanna diverges into many smaller streams forming lakes and puddles, deep learning has diverged into a myriad of specialized architectures. Each architecture has a diagram. Here are some of them. Neural networks are conceptually simple, and that's their beauty. A bunch of homogenous, uniform units, arranged in layers, weighted connections between them, and that's all.

artificial intelligence, feature engineering, machine learning, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Adversarial validation, part two - FastML

#artificialintelligenceSep-19-2016, 10:35:55 GMT

In this second article on adversarial validation we get to the meat of the matter: what we can do when train and test sets differ. Will we be able to make a better validation set? The problem with training examples being different from test examples is that validation won't be any good for comparing models. That's because validation examples originate in the training set. We can see this effect when using Numerai data, which comes from financial time series.

artificial intelligence, inductive learning, machine learning, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.70)

Add feedback

Loading data in Torch (is a mess) - FastML

@machinelearnbotApr-26-2016, 14:20:24 GMT

Torch 7 is a GPU accelerated deep learning framework. It had been rather obscure until recent publicity caused by adoption by Facebook and DeepMind. This entirely anecdotal article describes our experiences trying to load some data in Torch. We had great expectations about Torch. It seemed like a dream come true, especially with endorsement by DeepMind and LeCun's group at Facebook.

artificial intelligence, machine learning, tensor, (17 more...)

@machinelearnbot

Country: North America > United States (0.16)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Coming out - FastML

#artificialintelligenceApr-1-2016, 14:46:01 GMT

People often ask how we've been able to learn about and cover so many different and diverse topics in machine learning (using at least three different programming languages - Python, Matlab, and R) and generally achieve such prominence in the community, all this in a relatively short time. Today we finally give a definitive answer. There's no Zygmunt the Polish economist ever willing to relocate to San Francisco. And the "we" that we always use in the posts is not majestic plural. We are three Chinese PhD students: Ah, Hai and Wang.

artificial intelligence, machine learning, zygmunt, (3 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.26)
Asia > China (0.08)
Europe > Poland (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback