Goto

Collaborating Authors

 machine learning method


Supervised Machine Learning Methods with Uncertainty Quantification for Exoplanet Atmospheric Retrievals from Transmission Spectroscopy

Forestano, Roy T., Matchev, Konstantin T., Matcheva, Katia, Unlu, Eyup B.

arXiv.org Artificial Intelligence

ABSTRACT Standard Bayesian retrievals for exoplanet atmospheric parameters from transmission spectroscopy, while well understood and widely used, are generally computationally expensive. In the era of the JWST and other upcoming observatories, machine learning approaches have emerged as viable alternatives that are both efficient and robust. In this paper we present a systematic study of several existing machine learning regression techniques and compare their performance for retrieving exoplanet atmospheric parameters from transmission spectra. The regression methods tested here include partial least squares (PLS), support vector machines (SVM), k nearest neighbors (KNN), decision trees (DT), random forests (RF), voting (VOTE), stacking (STACK), and extreme gradient boosting (XGB). We also investigate the impact of different preprocessing methods of the training data on the model performance. We quantify the model uncertainties across the entire dynamical range of planetary parameters. The best performing combination of ML model and preprocessing scheme is validated on a the case study of JWST observation of WASP-39b. INTRODUCTION Over the last three decades, the study of extrasolar system planets has shifted from discovery to inference with particular interest in the characterization of their chemical compositions and temperature profiles. The chemical inventory of an exoplanet atmosphere is impacted by the planet formation processes, evolutionary modifications, and its interactions with the local space environment, thus allowing us to place constraints on the existing evolutionary models from the retrieved atmospheric composition. Transit spectroscopy is currently the most widely used observational technique to study the chemical composition of transiting exoplanets (Schneider 1994; Charbonneau et al. 2000). During transit, the planet atmosphere is observed in transmitted light when a planet passes in front of its host star, i.e., the primary eclipse, and in emitted and/or reflected light when a planet travels behind its host star, referred to as the secondary eclipse.


Optimal Design of a Walking Robot: Analytical, Numerical, and Machine Learning Methods for Multicriteria Synthesis

Ibrayeva, Arman, Omarov, Batyrkhan

arXiv.org Artificial Intelligence

This paper addresses several critical stages of designing a walking robot, including optimal structural synthesis, introducing a novel 'rational' mechanical structure aimed at enhancing efficiency and simplifying control system, while addressing practical limitations observed in existing designs. The study includes development of novel multicriteria synthesis methods for achieving optimal leg design, integrating analytical and numerical methods. In addition, a method based on Non-dominated Sorting Genetic Algorithm II is presented. Turning modes are investigated, and for the first time, the isotropy criterion, typically applied to parallel manipulators, is used for optimizing walking robot parameters to ensure optimal force and motion transfer in all directions. Several physical prototypes are developed to experimentally validate the functionality of different mechanisms of the robot, including adaptation to the surface irregularities and navigation using LiDAR.

  Country:
  Genre: Research Report (0.40)
  Industry: Energy (0.35)

District Vitality Index Using Machine Learning Methods for Urban Planners

Marcoux, Sylvain, Dessureault, Jean-Sébastien

arXiv.org Artificial Intelligence

City leaders face critical decisions regarding budget allocation and investment priorities. How can they identify which city districts require revitalization? To address this challenge, a Current Vitality Index and a Long-Term Vitality Index are proposed. These indexes are based on a carefully curated set of indicators. Missing data is handled using K-Nearest Neighbors imputation, while Random Forest is employed to identify the most reliable and significant features. Additionally, k-means clustering is utilized to generate meaningful data groupings for enhanced monitoring of Long-Term Vitality. Current vitality is visualized through an interactive map, while Long-Term Vitality is tracked over 15 years with predictions made using Multilayer Perceptron or Linear Regression. The results, approved by urban planners, are already promising and helpful, with the potential for further improvement as more data becomes available. This paper proposes leveraging machine learning methods to optimize urban planning and enhance citizens' quality of life.


A Systematic Review of Machine Learning Methods for Multimodal EEG Data in Clinical Application

Zhao, Siqi, Li, Wangyang, Wang, Xiru, Foglia, Stevie, Tan, Hongzhao, Zhang, Bohan, Hamoodi, Ameer, Nelson, Aimee, Gao, Zhen

arXiv.org Artificial Intelligence

Machine learning (ML) and deep learning (DL) techniques have been widely applied to analyze electroencephalography (EEG) signals for disease diagnosis and brain-computer interfaces (BCI). The integration of multimodal data has been shown to enhance the accuracy of ML and DL models. Combining EEG with other modalities can improve clinical decision-making by addressing complex tasks in clinical populations. This systematic literature review explores the use of multimodal EEG data in ML and DL models for clinical applications. A comprehensive search was conducted across PubMed, Web of Science, and Google Scholar, yielding 16 relevant studies after three rounds of filtering. These studies demonstrate the application of multimodal EEG data in addressing clinical challenges, including neuropsychiatric disorders, neurological conditions (e.g., seizure detection), neurodevelopmental disorders (e.g., autism spectrum disorder), and sleep stage classification. Data fusion occurred at three levels: signal, feature, and decision levels. The most commonly used ML models were support vector machines (SVM) and decision trees. Notably, 11 out of the 16 studies reported improvements in model accuracy with multimodal EEG data. This review highlights the potential of multimodal EEG-based ML models in enhancing clinical diagnostics and problem-solving.


Machine Learning Gravity Compactifications on Negatively Curved Manifolds

De Luca, G. Bruno

arXiv.org Artificial Intelligence

In theories with extra dimensions, four-dimensional vacua are non-trivial even classically. In fact, even a configuration that looks like a vacuum in four dimensions can have (and often has) a very rich structure for the geometry and the matter fields in the extra dimensions. The study of these structures, which are simultaneously rich and highly constrained by the UV completion of the theory, is important to extract the fourdimensional physics, as well as for holographic approaches to quantum gravity. As we will discuss in detail below, the problem of directly solving the equations of motion for vacuum compactifications is computationally challenging, and a popular approach to bypass this challenge is to exploit supersymmetry in some form. This is the case, for example, of the famous KKLT proposal [1] for obtaining de Sitter vacua, which uses as starting point a supersymmetric Calabi-Yau compactification, on top of which supersymmetry-breaking effects are added. But this is true also for AdS: most of the explicitly known AdS compactifications of string/M-theory are either supersymmetric, obtained by starting from supersymmetric compactifications and turning on supersymmetry breaking effects, or as non-supersymmetric vacua of lower-dimensional supergravities obtained from reduction around supersymmetry vacua.


A First Step in Using Machine Learning Methods to Enhance Interaction Analysis for Embodied Learning Environments

Fonteles, Joyce, Davalos, Eduardo, S., Ashwin T., Zhang, Yike, Zhou, Mengxi, Ayalon, Efrat, Lane, Alicia, Steinberg, Selena, Anton, Gabriella, Danish, Joshua, Enyedy, Noel, Biswas, Gautam

arXiv.org Artificial Intelligence

Investigating children's embodied learning in mixed-reality environments, where they collaboratively simulate scientific processes, requires analyzing complex multimodal data to interpret their learning and coordination behaviors. Learning scientists have developed Interaction Analysis (IA) methodologies for analyzing such data, but this requires researchers to watch hours of videos to extract and interpret students' learning patterns. Our study aims to simplify researchers' tasks, using Machine Learning and Multimodal Learning Analytics to support the IA processes. Our study combines machine learning algorithms and multimodal analyses to support and streamline researcher efforts in developing a comprehensive understanding of students' scientific engagement through their movements, gaze, and affective responses in a simulated scenario. To facilitate an effective researcher-AI partnership, we present an initial case study to determine the feasibility of visually representing students' states, actions, gaze, affect, and movement on a timeline. Our case study focuses on a specific science scenario where students learn about photosynthesis. The timeline allows us to investigate the alignment of critical learning moments identified by multimodal and interaction analysis, and uncover insights into students' temporal learning progressions.


Quantifying and Predicting Residential Building Flexibility Using Machine Learning Methods

Salter, Patrick, Huang, Qiuhua, Tabares-Velasco, Paulo Cesar

arXiv.org Artificial Intelligence

Residential buildings account for a significant portion (35\%) of the total electricity consumption in the U.S. as of 2022. As more distributed energy resources are installed in buildings, their potential to provide flexibility to the grid increases. To tap into that flexibility provided by buildings, aggregators or system operators need to quantify and forecast flexibility. Previous works in this area primarily focused on commercial buildings, with little work on residential buildings. To address the gap, this paper first proposes two complementary flexibility metrics (i.e., power and energy flexibility) and then investigates several mainstream machine learning-based models for predicting the time-variant and sporadic flexibility of residential buildings at four-hour and 24-hour forecast horizons. The long-short-term-memory (LSTM) model achieves the best performance and can predict power flexibility for up to 24 hours ahead with the average error around 0.7 kW. However, for energy flexibility, the LSTM model is only successful for loads with consistent operational patterns throughout the year and faces challenges when predicting energy flexibility associated with HVAC systems.


Frost Prediction Using Machine Learning Methods in Fars Province

Barooni, Milad, Ziarati, Koorush, Barooni, Ali

arXiv.org Artificial Intelligence

One of the common hazards and issues in meteorology and agriculture is the problem of frost, chilling or freezing. This event occurs when the minimum ambient temperature falls below a certain value. This phenomenon causes a lot of damage to the country, especially Fars province. Solving this problem requires that, in addition to predicting the minimum temperature, we can provide enough time to implement the necessary measures. Empirical methods have been provided by the Food and Agriculture Organization (FAO), which can predict the minimum temperature, but not in time. In addition to this, we can use machine learning methods to model the minimum temperature. In this study, we have used three methods Gated Recurrent Unit (GRU), Temporal Convolutional Network (TCN) as deep learning methods, and Gradient Boosting (XGBoost). A customized loss function designed for methods based on deep learning, which can be effective in reducing prediction errors. With methods based on deep learning models, not only do we observe a reduction in RMSE error compared to empirical methods but also have more time to predict minimum temperature. Thus, we can model the minimum temperature for the next 24 hours by having the current 24 hours. With the gradient boosting model (XGBoost) we can keep the prediction time as deep learning and RMSE error reduced. Finally, we experimentally concluded that machine learning methods work better than empirical methods and XGBoost model can have better performance in this problem among other implemented.


MLMOD: Machine Learning Methods for Data-Driven Modeling in LAMMPS

Atzberger, Paul J.

arXiv.org Artificial Intelligence

MLMOD is a software package for incorporating machine learning approaches and models into simulations of microscale mechanics and molecular dynamics in LAMMPS. Recent machine learning approaches provide promising data-driven approaches for learning representations for system behaviors from experimental data and high fidelity simulations. The package faciliates learning and using data-driven models for (i) dynamics of the system at larger spatial-temporal scales (ii) interactions between system components, (iii) features yielding coarser degrees of freedom, and (iv) features for new quantities of interest characterizing system behaviors. MLMOD provides hooks in LAMMPS for (i) modeling dynamics and time-step integration, (ii) modeling interactions, and (iii) computing quantities of interest characterizing system states. The package allows for use of machine learning methods with general model classes including Neural Networks, Gaussian Process Regression, Kernel Models, and other approaches. Here we discuss our prototype C++/Python package, aims, and example usage. The package is integrated currently with the mesocale and molecular dynamics simulation package LAMMPS and PyTorch. For related papers, examples, updates, and additional information see https://github.com/atzberg/mlmod and http://atzberger.org/.


Machine Learning Methods for Background Potential Estimation in 2DEGs

da Cunha, Carlo, Aoki, Nobuyuki, Ferry, David, Vora, Kevin, Zhang, Yu

arXiv.org Artificial Intelligence

In the realm of quantum-effect devices and materials, two-dimensional electron gases (2DEGs) stand as fundamental structures that promise transformative technologies. However, the presence of impurities and defects in 2DEGs poses substantial challenges, impacting carrier mobility, conductivity, and quantum coherence time. To address this, we harness the power of scanning gate microscopy (SGM) and employ three distinct machine learning techniques to estimate the background potential of 2DEGs from SGM data: image-to-image translation using generative adversarial neural networks, cellular neural network, and evolutionary search. Our findings, despite data constraints, highlight the effectiveness of an evolutionary search algorithm in this context, offering a novel approach for defect analysis. This work not only advances our understanding of 2DEGs but also underscores the potential of machine learning in probing quantum materials, with implications for quantum computing and nanoelectronics.