AITopics | statistical and machine

Collaborating Authors

statistical and machine

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Review of Statistical and Machine Learning Approaches for Coral Bleaching Assessment

Sarkar, Soham, Hazra, Arnab

arXiv.org Machine LearningNov-18-2025

Coral bleaching is a major concern for marine ecosystems; more than half of the world's coral reefs have either bleached or died over the past three decades. Increasing sea surface temperatures, along with various spatiotemporal environmental factors, are considered the primary reasons behind coral bleaching. The statistical and machine learning communities have focused on multiple aspects of the environment in detail. However, the literature on various stochastic modeling approaches for assessing coral bleaching is extremely scarce. Data-driven strategies are crucial for effective reef management, and this review article provides an overview of existing statistical and machine learning methods for assessing coral bleaching. Statistical frameworks, including simple regression models, generalized linear models, generalized additive models, Bayesian regression models, spatiotemporal models, and resilience indicators, such as Fisher's Information and Variance Index, are commonly used to explore how different environmental stressors influence coral bleaching. On the other hand, machine learning methods, including random forests, decision trees, support vector machines, and spatial operators, are more popular for detecting nonlinear relationships, analyzing high-dimensional data, and allowing integration of heterogeneous data from diverse sources. In addition to summarizing these models, we also discuss potential data-driven future research directions, with a focus on constructing statistical and machine learning models in specific contexts related to coral bleaching.

artificial intelligence, bayesian inference, machine learning, (20 more...)

arXiv.org Machine Learning

2511.12234

Country:

North America > United States (0.14)
Indian Ocean > Red Sea (0.04)
Asia > Middle East > Yemen (0.04)
(11 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine (0.68)
Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.87)
(2 more...)

Add feedback

Crash Severity Prediction Using Deep Learning Approaches: A Hybrid CNN-RNN Framework

Koohfar, Sahar

arXiv.org Artificial IntelligenceOct-7-2025

Accurate and timely prediction of crash severity is crucial in mitigating the severe consequences of traffic accidents. Accurate and timely prediction of crash severity is crucial in mitigating the severe consequences of traffic accidents. In order to provide appropriate levels of medical assistance and transportation services, an intelligent transportation system relies on effective prediction methods. Deep learning models have gained popularity in this domain due to their capability to capture non-linear relationships among variables. In this research, we have implemented a hybrid CNN-RNN deep learning model for crash severity prediction and compared its performance against widely used statistical and machine learning models such as logistic regression, naïve bayes classifier, K-Nearest Neighbors (KNN), decision tree, and individual deep learning models: RNN and CNN. This study employs a methodology that considers the interconnected relationships between various features of traffic accidents. The study was conducted using a dataset of 15,870 accident records gathered over a period of seven years between 2015 and 2021 on Virginia highway I-64. The findings demonstrate that the proposed CNN-RNN hybrid model has outperformed all benchmark models in terms of predicting crash severity. This result illustrates the effectiveness of the hybrid model as it combines the advantages of both RNN and CNN models in order to achieve greater accuracy in the prediction process.

artificial intelligence, machine learning, severity, (19 more...)

arXiv.org Artificial Intelligence

2510.04316

Country: North America > United States > Virginia (0.25)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation > Ground > Road (1.00)
Government (1.00)
Transportation > Infrastructure & Services (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

A Comparative Analysis of Statistical and Machine Learning Models for Outlier Detection in Bitcoin Limit Order Books

Letteri, Ivan

arXiv.org Artificial IntelligenceJul-22-2025

The detection of outliers within cryptocurrency limit order books (LOBs) is of paramount importance for comprehending market dynamics, particularly in highly volatile and nascent regulatory environments. This study conducts a comprehensive comparative analysis of robust statistical methods and advanced machine learning techniques for real-time anomaly identification in cryptocurrency LOBs. Within a unified testing environment, named AITA Order Book Signal (AITA-OBS), we evaluate the efficacy of thirteen diverse models to identify which approaches are most suitable for detecting potentially manipulative trading behaviours. An empirical evaluation, conducted via backtesting on a dataset of 26,204 records from a major exchange, demonstrates that the top-performing model, Empirical Covariance (EC), achieves a 6.70% gain, significantly outperforming a standard Buy-and-Hold benchmark. These findings underscore the effectiveness of outlier-driven strategies and provide insights into the trade-offs between model complexity, trade frequency, and performance. This study contributes to the growing corpus of research on cryptocurrency market microstructure by furnishing a rigorous benchmark of anomaly detection models and highlighting their potential for augmenting algorithmic trading and risk management.

artificial intelligence, data mining, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2507.1496

Country: Europe > Italy (0.29)

Genre: Research Report > New Finding (0.48)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Assessment of the suitability of degradation models for the planning of CCTV inspections of sewer pipes

Morer, Fidae El, Wittek, Stefan, Rausch, Andreas

arXiv.org Artificial IntelligenceJul-12-2023

The degradation of sewer pipes poses significant economical, environmental and health concerns. The maintenance of such assets requires structured plans to perform inspections, which are more efficient when structural and environmental features are considered along with the results of previous inspection reports. The development of such plans requires degradation models that can be based on statistical and machine learning methods. This work proposes a methodology to assess their suitability to plan inspections considering three dimensions: accuracy metrics, ability to produce long-term degradation curves and explainability. Results suggest that although ensemble models yield the highest accuracy, they are unable to infer the long-term degradation of the pipes, whereas the Logistic Regression offers a slightly less accurate model that is able to produce consistent degradation curves with a high explainability. A use case is presented to demonstrate this methodology and the efficiency of model-based planning compared to the current inspection plan.

artificial intelligence, machine learning, pipe, (18 more...)

arXiv.org Artificial Intelligence

2307.06341

Country:

North America > United States (0.05)
Europe > Germany > North Rhine-Westphalia (0.04)
Europe > Switzerland (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Materials (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.48)
Water & Waste Management > Water Management (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

Add feedback

Oferta de Empleo machine learning engineering lead en Sevilla Page Personnel

#artificialintelligenceFeb-1-2023, 17:15:30 GMT

Perfil buscado (Hombre/Mujer) The successful candidate will join the company s Technology Department as a Machine Learning Engineering Lead. In partnership with multiple stakeholders, you will focus on developing and delivering leading edge analytics solutions using Google Cloud and, as a key member of our engineering practice, you will mentor a small team of data scientists and analysts as we grow and drive the data science capability of the team. He/she will assume the following responsibilities: • Define and support the research and analytical process to deliver business insights • Responsible for advanced statistical and machine learning modeling • Develop data driven analytical tools • Machine learning - build models that can be used for asset health and grid operations • Lead a small team of data scientists and data engineers • Machine Learning Engineering Lead International technology company that develops its own product. International technology company that develops its own product.

empleo machine, engineering lead, international technology company, (8 more...)

#artificialintelligence

Industry:

Information Technology (1.00)
Education > Curriculum > Subject-Specific Education (0.88)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Churn modeling of life insurance policies via statistical and machine learning methods -- Analysis of important features

Groll, Andreas, Wasserfuhr, Carsten, Zeldin, Leonid

arXiv.org Machine LearningFeb-18-2022

Life assurance companies typically possess a wealth of data covering multiple systems and databases. These data are often used for analyzing the past and for describing the present. Taking account of the past, the future is mostly forecasted by traditional statistical methods. So far, only a few attempts were undertaken to perform estimations by means of machine learning approaches. In this work, the individual contract cancellation behavior of customers within two partial stocks is modeled by the aid of various classification methods. Partial stocks of private pension and endowment policy are considered. We describe the data used for the modeling, their structured and in which way they are cleansed. The utilized models are calibrated on the basis of an extensive tuning process, then graphically evaluated regarding their goodness-of-fit and with the help of a variable relevance concept, we investigate which features notably affect the individual contract cancellation behavior.

contract, information, insurance, (15 more...)

arXiv.org Machine Learning

2202.09182

Country: Europe > Austria > Vienna (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.73)

Add feedback

MegazordNet: combining statistical and machine learning standpoints for time series forecasting

Menezes, Angelo Garangau, Mastelini, Saulo Martiello

arXiv.org Artificial IntelligenceJun-23-2021

Forecasting financial time series is considered to be a difficult task due to the chaotic feature of the series. Statistical approaches have shown solid results in some specific problems such as predicting market direction and single-price of stocks; however, with the recent advances in deep learning and big data techniques, new promising options have arises to tackle financial time series forecasting. Moreover, recent literature has shown that employing a combination of statistics and machine learning may improve accuracy in the forecasts in comparison to single solutions. Taking into consideration the mentioned aspects, in this work, we proposed the MegazordNet, a framework that explores statistical features within a financial series combined with a structured deep learning model for time series forecasting. We evaluated our approach predicting the closing price of stocks in the S&P 500 using different metrics, and we were able to beat single statistical and machine learning methods.

algorithm, forecasting, megazordnet, (14 more...)

arXiv.org Artificial Intelligence

2107.01017

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
North America > United States (0.04)
South America > Brazil > São Paulo (0.04)
(2 more...)

Genre: Research Report > New Finding (0.47)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Regression Analysis for Statistics & Machine Learning in R

#artificialintelligenceNov-12-2020, 00:25:09 GMT

It is a practical, hands-on course, i.e. we will spend some time dealing with some of the theoretical concepts related to both statistical and machine learning regression analysis. However, majority of the course will focus on implementing different techniques on real data and interpret the results. After each video you will learn a new concept or technique which you may apply to your own projects.

regression, regression analysis, statistical and machine, (10 more...)

#artificialintelligence

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Machine Learning Strategies for Time Series Forecasting

#artificialintelligenceOct-27-2019, 02:02:58 GMT

Forecasting time-series data has applications in many fields, including finance, health, etc. There are potential pitfalls when applying classic statistical and machine learning methods to time-series problems. This talk will give folks the basic toolbox to analyze time-series data and perform forecasting using statistical and machine learning models, as well as interpret and convey the outputs. PyData is an educational program of NumFOCUS, a 501 3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other.

machine learning strategy, statistical and machine, time series forecasting, (2 more...)

#artificialintelligence

Country: North America > United States (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Machine Learning for Survival Analysis: Theory, Algorithms and Applications part 1

#artificialintelligenceOct-23-2018, 00:13:12 GMT

Authors: Yan Li, University of Michigan Chandan K. Reddy, Department of Computer Science, Virginia Polytechnic Institute and State University Abstract: Due to the advancements in various data acquisition and storage technologies, different disciplines have attained the ability to not only accumulate a wide variety of data but also to monitor observations over longer time periods. In many real-world applications, the primary objective of monitoring these observations is to estimate when a particular event of interest will occur in the future. One of the major difficulties in handling such problem is the presence of censoring, i.e., the event of interests is unobservable in some instance which is either because of time limitation or losing track. Due to censoring, standard statistical and machine learning based predictive models cannot readily be applied to analyze the data. An important subfield of statistics called survival analysis provides different mechanisms to handle such censored data problems.

algorithm and application part 1, artificial intelligence, machine learning, (4 more...)

#artificialintelligence

Country:

North America > United States > Virginia (0.27)
North America > United States > Michigan (0.27)

Industry: Law > Civil Rights & Constitutional Law (0.85)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback