r-squared value
Enhancing Retail Sales Forecasting with Optimized Machine Learning Models
Ganguly, Priyam, Mukherjee, Isha
In retail sales forecasting, accurately predicting future sales is crucial for inventory management and strategic planning. Traditional methods like LR often fall short due to the complexity of sales data, which includes seasonality and numerous product families. Recent advancements in machine learning (ML) provide more robust alternatives. This research benefits from the power of ML, particularly Random Forest (RF), Gradient Boosting (GB), Support Vector Regression (SVR), and XGBoost, to improve prediction accuracy. Despite advancements, a significant gap exists in handling complex datasets with high seasonality and multiple product families. The proposed solution involves implementing and optimizing a RF model, leveraging hyperparameter tuning through randomized search cross-validation. This approach addresses the complexities of the dataset, capturing intricate patterns that traditional methods miss. The optimized RF model achieved an R-squared value of 0.945, substantially higher than the initial RF model and traditional LR, which had an R-squared of 0.531. The model reduced the root mean squared logarithmic error (RMSLE) to 1.172, demonstrating its superior predictive capability. The optimized RF model did better than cutting-edge models like Gradient Boosting (R-squared: 0.942), SVR (R-squared: 0.940), and XGBoost (R-squared: 0.939), with more minor mean squared error (MSE) and mean absolute error (MAE) numbers. The results demonstrate that the optimized RF model excels in forecasting retail sales, handling the datasets complexity with higher accuracy and reliability. This research highlights the importance of advanced ML techniques in predictive analytics, offering a significant improvement over traditional methods and other contemporary models.
Stock Price Prediction and Traditional Models: An Approach to Achieve Short-, Medium- and Long-Term Goals
Alamu, Opeyemi Sheu, Siam, Md Kamrul
A comparative analysis of deep learning models and traditional statistical methods for stock price prediction uses data from the Nigerian stock exchange. Historical data, including daily prices and trading volumes, are employed to implement models such as Long Short Term Memory (LSTM) networks, Gated Recurrent Units (GRUs), Autoregressive Integrated Moving Average (ARIMA), and Autoregressive Moving Average (ARMA). These models are assessed over three-time horizons: short-term (1 year), medium-term (2.5 years), and long-term (5 years), with performance measured by Mean Squared Error (MSE) and Mean Absolute Error (MAE). The stability of the time series is tested using the Augmented Dickey-Fuller (ADF) test. Results reveal that deep learning models, particularly LSTM, outperform traditional methods by capturing complex, nonlinear patterns in the data, resulting in more accurate predictions. However, these models require greater computational resources and offer less interpretability than traditional approaches. The findings highlight the potential of deep learning for improving financial forecasting and investment strategies. Future research could incorporate external factors such as social media sentiment and economic indicators, refine model architectures, and explore real-time applications to enhance prediction accuracy and scalability.
CEL: A Continual Learning Model for Disease Outbreak Prediction by Leveraging Domain Adaptation via Elastic Weight Consolidation
Aslam, Saba, Rasool, Abdur, Wu, Hongyan, Li, Xiaoli
Continual learning, the ability of a model to learn over time without forgetting previous knowledge and, therefore, be adaptive to new data, is paramount in dynamic fields such as disease outbreak prediction. Deep neural networks, i.e., LSTM, are prone to error due to catastrophic forgetting. This study introduces a novel CEL model for continual learning by leveraging domain adaptation via Elastic Weight Consolidation (EWC). This model aims to mitigate the catastrophic forgetting phenomenon in a domain incremental setting. The Fisher Information Matrix (FIM) is constructed with EWC to develop a regularization term that penalizes changes to important parameters, namely, the important previous knowledge. CEL's performance is evaluated on three distinct diseases, Influenza, Mpox, and Measles, with different metrics. The high R-squared values during evaluation and reevaluation outperform the other state-of-the-art models in several contexts, indicating that CEL adapts to incremental data well. CEL's robustness and reliability are underscored by its minimal 65% forgetting rate and 18% higher memory stability compared to existing benchmark studies. This study highlights CEL's versatility in disease outbreak prediction, addressing evolving data with temporal patterns. It offers a valuable model for proactive disease control with accurate, timely predictions.
Innovations in Agricultural Forecasting: A Multivariate Regression Study on Global Crop Yield Prediction
Gupta, Ishaan, Ayalasomayajula, Samyutha, Shashidhara, Yashas, Kataria, Anish, Shashidhara, Shreyas, Kataria, Krishita, Undurti, Aditya
The prediction of crop yields internationally is a crucial objective in agricultural research. Thus, this study implements 6 regression models (Linear, Tree, Gradient Descent, Gradient Boosting, K- Nearest Neighbors, and Random Forest) to predict crop yields in 196 countries. Given 4 key training parameters, pesticides (tonnes), rainfall (mm), temperature (Celsius), and yield (hg/ha), it was found that our Random Forest Regression model achieved a determination coefficient (r^2) of 0.94, with a margin of error (ME) of .03. The models were trained and tested using the Food and Agricultural Organization of the United Nations data, along with the World Bank Climate Change Data Catalog. Furthermore, each parameter was analyzed to understand how varying factors could impact overall yield. We used unconventional models, contrary to generally used Deep Learning (DL) and Machine Learning (ML) models, combined with recently collected data to implement a unique approach in our research. Existing scholarship would benefit from understanding the most optimal model for agricultural research, specifically using the United Nations data.
Linear regression in detail. Linear regression is a statisticalโฆ
Linear regression is a statistical method for modeling the relationship between a dependent variable and one or more independent variables. It is a widely-used technique for predicting the outcome of a continuous variable, and it is especially useful when you have a large amount of data. In this blog post, we will discuss the theory behind linear regression, how to perform it in practice, and some of its applications. The basic idea behind linear regression is to find a line that best fits a set of data points. The line is represented by the equation y mx b, where y is the dependent variable, x is the independent variable, m is the slope of the line, and b is the y-intercept.
The ultimate PyTorch Hello World. In this tutorial, you will learn
You know some basic programming (array, loop etc.) Note that the choice of the equation ( y x1 * x2) is arbitrary. You can try different equations and see that the model can predict almost anything! Log in if you are not logged in already. You will get a place to write code. It will take a few seconds to run.
Good Old Iris
Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. The iris dataset must be the most used dataset ever.
Machine Learning Crystal Structure Prediction - Rebellion Research
ANNs was chosen because it is a subset of machine learning which is the heart of deep learning algorithms. Inspired by the human brain, ANNs mimic the way that biological neurons signal to one another. It is composed of a node layer, containing an input layer, one or more hidden layers, and an output layer. Each node, which is also called an artificial neuron, connects with the associated weight and threshold. The node is activated once the output of any individual node exceeds the specified threshold value and then sends data to the network's next layer. Otherwise, no data is passed to the next layer of the network.
WTH are R-squared and Adjusted R-squared?
Today I am going to explain the concept of R-squared and adjusted R-squared from the Machine Learning perspective. I'll also show you how to find the R-squared value of your ML model. It acts as an evaluation metric for regression models. To understand it better let me introduce a regression problem. Suppose I'm building a model to predict how many articles I will write in a particular month given the amount of free time I have on that month.
On the Adversarial Robustness of Subspace Learning
Li, Fuwei, Lai, Lifeng, Cui, Shuguang
In this paper, we study the adversarial robustness of subspace learning problems. Different from the assumptions made in existing work on robust subspace learning where data samples are contaminated by gross sparse outliers or small dense noises, we consider a more powerful adversary who can first observe the data matrix and then intentionally modify the whole data matrix. We first characterize the optimal rank-one attack strategy that maximizes the subspace distance between the subspace learned from the original data matrix and that learned from the modified data matrix. We then generalize the study to the scenario without the rank constraint and characterize the corresponding optimal attack strategy. Our analysis shows that the optimal strategies depend on the singular values of the original data matrix and the adversary's energy budget. Finally, we provide numerical experiments and practical applications to demonstrate the efficiency of the attack strategies.