negative binomial distribution
Stochastic Predictive Analytics for Stocks in the Newsvendor Problem
The Newsvendor problem is a fundamental model in inventory management (Rossi, 2021) that accommodates both known (Dvoretzky et al., 1952a) and unknown (Dvoretzky et al., 1952b) demand distributions. Since its inception (Edgeworth, 1888), it has been widely applied in inventory control and policy-making (Arrow et al., 1951), as well as various real-world situations (Choi, 2012; Chen et al., 2016). Its simplicity stems from considering a single product for sale, for which the optimal initial stock level must be determined to satisfy forecasted demand over a given period without restocking. The interplay among purchasing cost, selling price, and stock ordered at the beginning of the period determines the inventory management policies (Whitin, 1952; Rosenblatt, 1954; Petruzzi and Dada, 1999). The model has been extensively studied for single stock-keeping units (SKUs). Electronic marketplaces introduce an extra complication to the problem, as they need to manage a large number of SKUs at distribution centers alongside highly variable demand received through electronic platforms.
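The optimal order quantity in the single-period newsvendor model is the critical-fractile quantile of the demand distribution: order $q^* = F^{-1}\big(c_u/(c_u + c_o)\big)$, where $c_u$ is the underage cost and $c_o$ the overage cost. A minimal sketch, assuming a negative binomial demand distribution and hypothetical cost and price figures:

```python
from scipy.stats import nbinom

def newsvendor_order(cost, price, demand_dist, salvage=0.0):
    """Classic newsvendor quantity: the critical-fractile quantile of demand.

    Underage cost c_u = price - cost (lost margin per unit of unmet demand),
    overage cost c_o = cost - salvage (loss per leftover unit).
    """
    c_u = price - cost
    c_o = cost - salvage
    critical_ratio = c_u / (c_u + c_o)
    return int(demand_dist.ppf(critical_ratio))

# Example: overdispersed negative binomial demand with mean 50
# (mean = n*(1-p)/p), unit cost 4, selling price 10, no salvage value.
demand = nbinom(n=5, p=5 / (5 + 50))
q = newsvendor_order(cost=4.0, price=10.0, demand_dist=demand)
```

Because the margin here exceeds the unit cost, the critical ratio is 0.6 and the model orders above the mean, deliberately accepting some leftover stock.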
Scalable Probabilistic Forecasting in Retail with Gradient Boosted Trees: A Practitioner's Approach
Xueying Long, Quang Bui, Grady Oktavian, Daniel F. Schmidt, Christoph Bergmeir, Rakshitha Godahewa, Seong Per Lee, Kaifeng Zhao, Paul Condylis
The recent M5 competition has advanced the state-of-the-art in retail forecasting. However, we notice important differences between the competition challenge and the challenges we face in a large e-commerce company. The datasets in our scenario are larger (hundreds of thousands of time series), and e-commerce can afford a larger assortment than brick-and-mortar retailers, leading to more intermittent data. To scale to larger datasets with feasible computational effort, we firstly investigate a two-layer hierarchy and propose a top-down approach: forecasting at an aggregated level with fewer, less intermittent series, and then disaggregating to obtain the decision-level forecasts. Probabilistic forecasts are generated under distributional assumptions. Secondly, direct training at the lower level with subsamples can also be an alternative way of scaling; the performance of modelling with subsets is evaluated against the main dataset. Apart from a proprietary dataset, the proposed scalable methods are evaluated on the Favorita dataset and the M5 dataset. We are able to show the differences in characteristics of the e-commerce and brick-and-mortar retail datasets. Notably, our top-down forecasting framework enters the top 50 of the original M5 competition, even with models trained at a higher level under a much simpler setting.
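The top-down approach described above can be sketched in a few lines: forecast at the aggregate level, then disaggregate with historical proportions to reach the decision level. A toy illustration on synthetic data with a naive aggregate forecast (a Poisson assumption stands in for the paper's distributional choices; all names and figures are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical history: 4 SKUs x 30 days of intermittent unit sales.
history = rng.poisson(lam=[[0.3], [1.0], [0.1], [2.0]], size=(4, 30))

# Top level: forecast the aggregate (here simply the mean of total daily sales).
total_forecast = history.sum(axis=0).mean()

# Historical sales proportions disaggregate the total back to SKU level.
proportions = history.sum(axis=1) / history.sum()
sku_point_forecasts = total_forecast * proportions

# Probabilistic decision-level forecasts via sampling under the assumed
# distribution: 1000 sample paths per SKU, then any quantile of interest.
samples = rng.poisson(lam=sku_point_forecasts, size=(1000, 4))
q90 = np.quantile(samples, 0.9, axis=0)  # e.g. the 90% quantile per SKU
```

The disaggregation step guarantees that SKU-level point forecasts add back up to the aggregate forecast, which is the coherence property the two-layer hierarchy relies on.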
Measuring Sales Performance Using Simple Statistical Models
Measuring sales performance is a crucial aspect of running a successful business. Accurately tracking and analyzing sales data helps companies understand their strengths and weaknesses, perform forecasts, identify trends, and make informed decisions that drive growth. In this article, I will illuminate how some simple statistical models can be used for measuring sales performance. Whether for a small or an enterprise sales team, simple quantitative techniques can be used to provide valuable sales insights or draw attention to areas of need. After reading this article, you will see various examples of how simple models are applied in real-life scenarios. Note: All the images in the article were generated by Artificial Intelligence using Stable Diffusion 2.x.
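Two of the simplest such models are a least-squares trend line and a moving average. A minimal sketch on hypothetical monthly sales figures (the article's own examples may differ):

```python
import numpy as np

# Hypothetical monthly sales figures for one year.
sales = np.array([110, 98, 120, 130, 125, 140, 150, 145, 160, 155, 170, 180.0])
months = np.arange(len(sales))

# Linear trend: the least-squares slope gives average growth per month.
slope, intercept = np.polyfit(months, sales, deg=1)

# 3-month moving average smooths out month-to-month noise.
window = 3
moving_avg = np.convolve(sales, np.ones(window) / window, mode="valid")

# Simple next-month forecast from the fitted trend line.
forecast = slope * len(sales) + intercept
```

Even this much is enough to answer the basic performance questions: is the team growing (sign of the slope), and is a given month above or below its smoothed baseline.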
A universal deep neural network for in-depth cleaning of single-cell RNA-Seq data - Nature Communications
Single cell RNA sequencing (scRNA-Seq) is being widely used in biomedical research and has generated an enormous volume and diversity of data. The raw data contain multiple types of noise and technical artifacts, which need thorough cleaning. Existing denoising and imputation methods largely focus on a single type of noise (i.e., dropouts) and have strong distribution assumptions which greatly limit their performance and application. Here we design and develop the AutoClass model, integrating two deep neural network components, an autoencoder and a classifier, to maximize both noise removal and signal retention. AutoClass is distribution agnostic as it makes no assumption on specific data distributions, hence it can effectively clean a wide range of noise and artifacts. AutoClass outperforms state-of-the-art methods in multiple types of scRNA-Seq data analyses, including data recovery, differential expression analysis, clustering analysis, and batch effect removal. Importantly, AutoClass is robust to key hyperparameter settings including bottleneck layer size, pre-clustering number and classifier weight. We have made AutoClass open source at: https://github.com/datapplab/AutoClass. Here the authors develop a novel AI model, AutoClass, which effectively cleans a wide range of noise and artifacts in scRNA-Seq data and improves downstream analyses.
Generalized XGBoost Method
This method has achieved excellent predictive performance in many fields and has exhibited many advantages, and is consequently considered especially suitable for the statistical analysis of big data. However, the method is limited in that its loss function must be convex. For many scenario-specific problems, such as non-life insurance pricing, the distribution of predictor variables is often heavy-tailed, so the optimal prediction performance may not be obtained by setting convex loss functions. Simultaneously, it is important to estimate the probability distribution of the predictor variables. When the assumed parametric probability distribution contains more than two parameters, it may be necessary to model multiple parameters to obtain better prediction performance. Therefore, a more generalized regularized tree boosting method is required, one that does not restrict the loss function to convex functions while modelling the tree boosting for multiple parameters, so as to accommodate the most common parametric probability distributions.
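Custom objectives in gradient boosting libraries such as XGBoost are supplied as gradient/Hessian pairs of the loss with respect to the raw score. As a minimal sketch of the single-parameter baseline that the text generalizes, here are the derivatives of the negative binomial negative log-likelihood with respect to the score eta = log(mean), with the dispersion parameter r held fixed (checked against a finite difference; parameter values are illustrative):

```python
import numpy as np
from scipy.special import gammaln

R = 2.0  # dispersion (size) parameter, held fixed in this single-parameter sketch

def nb_nll_grad_hess(y, eta, r=R):
    """Gradient and Hessian of the negative binomial NLL w.r.t. eta = log(mu).

    With p = r / (r + mu), these are exactly the per-observation quantities
    a custom objective must return to a boosting library.
    """
    mu = np.exp(eta)
    grad = mu * (y + r) / (r + mu) - y
    hess = (y + r) * mu * r / (r + mu) ** 2
    return grad, hess

def nb_nll(y, eta, r=R):
    """Negative binomial negative log-likelihood at eta = log(mu)."""
    mu = np.exp(eta)
    return -(gammaln(y + r) - gammaln(r) - gammaln(y + 1)
             + r * np.log(r / (r + mu)) + y * np.log(mu / (r + mu)))

# Sanity check: analytic gradient vs. a central finite difference.
y_obs, eta0, h = 3.0, 0.5, 1e-6
g, hess = nb_nll_grad_hess(y_obs, eta0)
fd = (nb_nll(y_obs, eta0 + h) - nb_nll(y_obs, eta0 - h)) / (2 * h)
```

Note that the Hessian above is strictly positive, i.e. this particular NLL is convex in eta; the paper's point is precisely that loss functions without this property, and with several jointly modelled parameters, fall outside the standard framework.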
M5 Competition Uncertainty: Overdispersion, distributional forecasting, GAMLSS and beyond
The M5 competition uncertainty track aims at probabilistic forecasting of sales of thousands of Walmart retail goods. We show that the M5 competition data exhibit strong overdispersion and sporadic demand, especially zero demand. We discuss the resulting modeling issues concerning adequate probabilistic forecasting of such count data processes. Unfortunately, the majority of popular prediction methods used in the M5 competition (e.g. the lightgbm and xgboost GBMs) fail to address these data characteristics due to the objective functions considered. Distributional forecasting provides a suitable modeling approach to overcome those problems. The GAMLSS framework allows flexible probabilistic forecasting using low-dimensional distributions. We illustrate how the GAMLSS approach can be applied to the M5 competition data by modeling the location and scale parameters of various distributions, e.g. the negative binomial distribution. Finally, we discuss software packages for distributional modeling and their drawbacks, such as the R package gamlss with its package extensions, and (deep) distributional forecasting libraries such as TensorFlow Probability.
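The GAMLSS idea of modeling both location and scale can be illustrated in its simplest, covariate-free form: joint maximum-likelihood estimation of the mean and dispersion of a negative binomial on overdispersed, zero-heavy count data. A sketch on synthetic data (parameter values hypothetical):

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import nbinom

rng = np.random.default_rng(1)

# Synthetic overdispersed demand: negative binomial with size r=1.5, mean 2,
# parameterized via p = r / (r + mu).
r_true, mu_true = 1.5, 2.0
y = nbinom.rvs(r_true, r_true / (r_true + mu_true), size=2000, random_state=rng)

def nll(params):
    """Joint negative log-likelihood in log(r), log(mu)."""
    log_r, log_mu = params
    r, mu = np.exp(log_r), np.exp(log_mu)
    return -nbinom.logpmf(y, r, r / (r + mu)).sum()

fit = minimize(nll, x0=[0.0, 0.0], method="Nelder-Mead")
r_hat, mu_hat = np.exp(fit.x)
```

In a full GAMLSS both log(mu) and log(r) would themselves be regression functions of covariates; optimizing them jointly rather than fixing the dispersion is what lets the model match the overdispersion in the data.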
Statistical Modelling of Level Difficulty in Puzzle Games
Jeppe Theiss Kristensen, Arturo Valdivia, Paolo Burelli
Successful and accurate modelling of level difficulty is a fundamental component of the operationalisation of player experience, as difficulty is one of the most important and commonly used signals for content design and adaptation. In games that feature intermediate milestones, such as completable areas or levels, difficulty is often defined by the probability of completion or completion rate; however, this operationalisation is limited in that it does not describe the behaviour of the player within the area. In this research work, we formalise a model of level difficulty for puzzle games that goes beyond the classical probability of success. We accomplish this by describing the distribution of actions performed within a game level using a parametric statistical model, thus creating a richer descriptor of difficulty. The model is fitted and evaluated on a dataset collected from the game Lily's Garden by Tactile Games, and the results of the evaluation show that it is able to describe and explain difficulty in the vast majority of the levels.
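The contrast between the classical completion rate and a parametric action-distribution descriptor can be sketched on hypothetical attempt data (the paper's actual model for Lily's Garden may differ); here a negative binomial is fitted to per-attempt action counts by the method of moments:

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical per-attempt data for one level.
moves = rng.negative_binomial(n=4, p=0.15, size=5000)  # actions per attempt
completed = rng.random(5000) < 0.35                    # completion flags

# Classical difficulty descriptor: the probability of completion.
completion_rate = completed.mean()

# Richer descriptor: method-of-moments fit of a negative binomial to the
# distribution of actions. From mean m and variance v of the NB,
# p = m / v and n = m * p / (1 - p).
m, v = moves.mean(), moves.var()
p_hat = m / v
n_hat = m * p_hat / (1 - p_hat)
```

Two levels can share the same completion rate yet differ sharply in (n, p), i.e. in how much effort players spend inside the level, which is exactly the behavioural information the completion rate alone discards.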
The successor representation, gamma-models, and infinite-horizon prediction
Reinforcement learning algorithms are frequently categorized by whether they predict future states at any point in their decision-making process. Those that do are called model-based, and those that do not are dubbed model-free. This classification is so common that we mostly take it for granted these days; I am guilty of using it myself. However, this distinction is not as clear-cut as it may initially seem. In this post, I will talk about an alternative view that emphasizes the mechanism of prediction instead of the content of prediction.
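The mechanism in question is the successor representation: discounted expected future state occupancies, computable in closed form for a fixed policy as M = (I - gamma * P)^(-1). A minimal numpy sketch on a toy Markov chain:

```python
import numpy as np

# A tiny 4-state Markov chain; P[i, j] is the transition probability i -> j.
P = np.array([[0.0, 1.0, 0.0, 0.0],
              [0.5, 0.0, 0.5, 0.0],
              [0.0, 0.5, 0.0, 0.5],
              [0.0, 0.0, 1.0, 0.0]])
gamma = 0.9

# Successor representation: M = sum_t gamma^t P^t = (I - gamma * P)^{-1}.
# M[i, j] is the discounted expected number of visits to j starting from i.
M = np.linalg.inv(np.eye(4) - gamma * P)

# Given per-state rewards r, state values follow immediately: V = M @ r.
r = np.array([0.0, 0.0, 0.0, 1.0])
V = M @ r
```

This is what makes the model-based/model-free dichotomy blurry: M caches predictions about future states (model-like content), yet once learned it yields values by a single matrix product, with no explicit rollout (model-free mechanism).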
A Tutorial on the Mathematical Model of Single Cell Variational Inference
As large amounts of sequencing data have accumulated over the past decades, and continue to accumulate, we need to handle ever more sequencing data. With the fast development of computing technology, we can now handle large amounts of data in reasonable time using neural-network-based models. This tutorial introduces the mathematical model of single cell variational inference (scVI), which uses a variational auto-encoder (built on neural networks) to learn the distribution of the data and gain insights. It is written for beginners in a simple and intuitive way, with many derivation details, to encourage more researchers into this field. As computer technology evolves rapidly, we can tackle more and more complex problems by finding a suitable function with millions of parameters to model the key parts of those problems.
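Two ingredients of the scVI derivation can be shown concretely: the reparameterization trick and the closed-form KL term for a diagonal Gaussian encoder. A minimal numpy sketch (values hypothetical; scVI itself pairs this latent model with richer count likelihoods such as the zero-inflated negative binomial):

```python
import numpy as np

rng = np.random.default_rng(3)

# Reparameterization trick at the heart of a variational auto-encoder:
# sample z ~ N(mu, sigma^2) as a differentiable function of (mu, sigma).
mu, log_var = np.array([0.5, -1.0]), np.array([0.0, -2.0])
eps = rng.standard_normal(2)
z = mu + np.exp(0.5 * log_var) * eps

# KL(q(z|x) || N(0, I)) for a diagonal Gaussian encoder, in closed form:
# 0.5 * sum(sigma^2 + mu^2 - 1 - log sigma^2).
kl = 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var)

# The ELBO combines the reconstruction likelihood with this KL penalty:
# ELBO = E_q[log p(x|z)] - KL; training maximizes it by gradient ascent.
```

Because z is written as a deterministic function of (mu, log_var) plus independent noise eps, gradients of the ELBO can flow through the sampling step, which is what makes the whole model trainable end to end.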
Characterizations of non-normalized discrete probability distributions and their application in statistics
Steffen Betsch, Bruno Ebner, Franz Nestmann
From the distributional characterizations that lie at the heart of Stein's method we derive explicit formulae for the mass functions of discrete probability laws that identify those distributions. These identities are applied to develop tools for the solution of statistical problems. Our characterizations, and hence the applications built on them, do not require any knowledge about normalization constants of the probability laws. We discuss several examples where the intractability of the normalization constant is a built-in feature. To demonstrate that our statistical methods are sound, we provide comparative simulation studies for the testing of fit to the Poisson distribution and for parameter estimation of the negative binomial family when both parameters are unknown. We also consider the problem of parameter estimation for discrete exponential-polynomial models which generally are non-normalized.
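One such normalization-free identity is concrete for the negative binomial: its mass function satisfies the recurrence (k + 1) P(X = k+1) = (1 - p)(r + k) P(X = k), which involves no normalizing constant, and whose empirical version can be turned into estimating equations for (r, p). A quick numerical check of the recurrence (illustrative parameter values, not the paper's estimators):

```python
import numpy as np
from scipy.stats import nbinom

r, p = 3.0, 0.4
k = np.arange(0, 40)
pmf = nbinom.pmf(k, r, p)

# Recurrence that characterizes the negative binomial law:
# (k + 1) P(X = k + 1) = (1 - p) (r + k) P(X = k).
lhs = (k[:-1] + 1) * pmf[1:]
rhs = (1 - p) * (r + k[:-1]) * pmf[:-1]
```

Replacing the pmf values by empirical frequencies and solving the resulting equations in (r, p) gives a simple estimator in the spirit of the characterization-based methods the abstract describes, without ever evaluating a normalization constant.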