AITopics | training machine learning model

Collaborating Authors

training machine learning model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Training Machine Learning Models on Human Spatio-temporal Mobility Data: An Experimental Study [Experiment Paper]

Liu, Yueyang, Kennedy, Lance, Kong, Ruochen, Kim, Joon-Seok, Züfle, Andreas

arXiv.org Artificial IntelligenceAug-20-2025

Individual-level human mobility prediction has emerged as a significant topic of research with applications in infectious disease monitoring, child, and elderly care. Existing studies predominantly focus on the microscopic aspects of human trajectories: such as predicting short-term trajectories or the next location visited, while offering limited attention to macro-level mobility patterns and the corresponding life routines. In this paper, we focus on an underexplored problem in human mobility prediction: determining the best practices to train a machine learning model using historical data to forecast an individuals complete trajectory over the next days and weeks. In this experiment paper, we undertake a comprehensive experimental analysis of diverse models, parameter configurations, and training strategies, accompanied by an in-depth examination of the statistical distribution inherent in human mobility patterns. Our empirical evaluations encompass both Long Short-Term Memory and Transformer-based architectures, and further investigate how incorporating individual life patterns can enhance the effectiveness of the prediction. We show that explicitly including semantic information such as day-of-the-week and user-specific historical information can help the model better understand individual patterns of life and improve predictions. Moreover, since the absence of explicit user information is often missing due to user privacy, we show that the sampling of users may exacerbate data skewness and result in a substantial loss in predictive accuracy. To mitigate data imbalance and preserve diversity, we apply user semantic clustering with stratified sampling to ensure that the sampled dataset remains representative. Our results further show that small-batch stochastic gradient optimization improves model performance, especially when human mobility training data is limited.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3748636.3762740

2508.13135

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.16)
North America > United States > New York (0.15)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (0.48)
Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.88)

Add feedback

TelescopeML -- I. An End-to-End Python Package for Interpreting Telescope Datasets through Training Machine Learning Models, Generating Statistical Reports, and Visualizing Results

Ehsan, null, Gharib-Nezhad, null, Batalha, Natasha E., Valizadegan, Hamed, Martinho, Miguel J. S., Habibi, Mahdi, Nookula, Gopal

arXiv.org Artificial IntelligenceJul-23-2024

We are in a new era of space exploration, thanks to advancements in ground-and space-based telescopes, such as the James Webb Space Telescope [JWST2023PASP] and CRIRES. These remarkable instruments collect high-resolution, high-signal-to-noise spectra from extrasolar planets [Alderson2023Nature], and brown dwarfs [Miles2023ApJ] atmospheres. Without accurate interpretation of this data, the main objectives of space missions will not be fully accomplished. Different analytical and statistical methods, such as the chi-squared-test, Bayesian statistics, and radiative-transfer atmospheric modeling packages have been developed [batalha2019picaso, MacDonald2023] to interpret the spectra. They utilize either forwardand/or retrieval-radiative transfer modeling to analyze the spectra and extract physical information, such as atmospheric temperature, metallicity, carbon-to-oxygen ratio, and surface gravity [line2014systematic, Iyer2023Sphinx, Marley2015]. These atmospheric models rely on generating the physics and chemistry of these atmospheres for a wide range of thermal structures and compositions. In addition to Bayesian-based techniques, machine learning and deep learning methods have been developed in recent years for various astronomical problems, including confirming the classification of light curves for exoplanet validation [Valizadegan2022], recognizing molecular features [Zingales2018ExoGAN] as well as interpreting brown dwarfs spectra using Random Forest technique [Lueber2023RandomForesr_BDs].

dataset, interpreting telescope dataset, telescopeml, (11 more...)

arXiv.org Artificial Intelligence

doi: 10.21105/joss.06346

2407.16917

Country:

North America > United States > California > Riverside County > Riverside (0.14)
North America > United States > California > Santa Clara County > Mountain View (0.05)
North America > Mexico > Sonora (0.05)
Europe > Germany (0.05)

Genre: Research Report (0.50)

Industry: Education > Educational Setting > Online (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Training Machine Learning Models to Characterize Temporal Evolution of Disadvantaged Communities

Jain, Milan, Mohankumar, Narmadha Meenu, Wan, Heng, Ganguly, Sumitrra, Wilson, Kyle D, Anderson, David M

arXiv.org Artificial IntelligenceMar-7-2023

Disadvantaged communities (DAC), as defined by the Justice40 initiative of the Department of Energy (DOE), USA, identifies census tracts across the USA to determine where benefits of climate and energy investments are or are not currently accruing. The DAC status not only helps in determining the eligibility for future Justice40-related investments but is also critical for exploring ways to achieve equitable distribution of resources. However, designing inclusive and equitable strategies not just requires a good understanding of current demographics, but also a deeper analysis of the transformations that happened in those demographics over the years. In this paper, machine learning (ML) models are trained on publicly available census data from recent years to classify the DAC status at the census tracts level and then the trained model is used to classify DAC status for historical years. A detailed analysis of the feature and model selection along with the evolution of disadvantaged communities between 2013 and 2018 is presented in this study.

artificial intelligence, census tract, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2303.03677

Country:

North America > United States > Washington > Benton County > Richland (0.04)
North America > United States > Alaska (0.04)

Genre: Research Report (0.70)

Industry:

Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.35)

Add feedback

Creating Infrastructure for Training Machine Learning Models

#artificialintelligenceJun-14-2022, 16:45:36 GMT

Let's imagine the following scenario: you get a new project to work on. For this project, you need to develop a machine learning model, which will require running and training several experiments. Each experiment might take several hours or even days and needs to be tracked. You have your own laptop for the development phase, but it's not realistic to use your own laptop for training and running all of the experiments. First, your computer might not have the required hardware, for example, a GPU. Why train on one computer sequentially if you can run and train all the experiments in parallel?

experiment, pipeline, training machine learning model, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Training Machine Learning Models Using TensorFlow or PyTorch

#artificialintelligenceJun-9-2022, 14:21:30 GMT

AI and machine learning are very hot topics these days. These are only some of the applications that cannot exist without machine learning. But how can machines learn? I will show you how the magic works in this article, but I won't talk about neural networks! I will show you what is in the deepest deep of machine learning. One of the best presentations about machine learning is Fei Fei Li's TED talk.

expression, gradient, tensor, (13 more...)

#artificialintelligence

Industry: Transportation (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)

Add feedback

Training Machine Learning Models More Efficiently with Dataset Distillation

#artificialintelligenceDec-16-2021, 18:47:07 GMT

Posted by Timothy Nguyen 1 , Research Engineer and Jaehoon Lee, Senior Research Scientist, Google Research For a machine learning (ML) a...

dataset distillation, training machine learning model

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Hold-out Method for Training Machine Learning Models - Data Analytics

#artificialintelligenceOct-30-2021, 07:55:43 GMT

In this post, you will learn about the hold out method used during the process of training machine learning model. When evaluating machine learning (ML) models, the question that arises is whether the model is the best model available from the algorithm hypothesis space in terms of generalization error on the unseen / future data set. Whether the model is trained and tested using the most appropriate method. Out of available models, which model to select? These questions are taken care using what is called as hold out method.

dataset, hold-out method, test dataset, (10 more...)

#artificialintelligence

Genre: Instructional Material (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Microsoft Jericho is an Open Source Framework for Training Machine Learning Models Using…

#artificialintelligenceOct-17-2020, 16:07:10 GMT

I recently started a new newsletter focus on AI education. TheSequence is a no-BS( meaning no hype, no news etc) AI-focused newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Language is one of the hallmarks of human intelligence and one that plays a key role in our learning processes. By using language, we constantly formulate our understanding of a situation of a specific context.

jericho, machine learning, reinforcement learning, (15 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.80)

Add feedback

Training Machine Learning Models on Amazon SageMaker

#artificialintelligenceMay-11-2020, 16:13:07 GMT

You've spent hours fine-tuning your script, and you're racing to get it onto the server before your deadline tomorrow. You're building Naive Bayes, Logistic Regression, XGBoost, KNN, and any model under the sun in your massive for-loop.You've finally ironed out the kinks on your local machine, and you're ready to scale your precious script, but when it starts to run you see … what exactly? How do you know it's working? What would you do if it broke? How do you even know your models are doing what you want them to?

amazon sagemaker, artificial intelligence, training machine learning model

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.59)

Add feedback

Training Machine Learning models with ML.NET

#artificialintelligenceAug-24-2019, 13:35:19 GMT

ML.NET allows .NET developers to easily build and also consume machine learning models in their NET applications. In this episode, Bri Achtman joins Rich to show off some really interesting scenarios that ML.NET and its family of tools enables. They talk about training models, AutoML, the ML.NET CLI, and even a Visual Studio Extension for training models!

artificial intelligence, training machine learning model, training model

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback