data segment
- North America > Canada > Ontario > Toronto (0.04)
- North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
Trustworthy Prediction with Gaussian Process Knowledge Scores
Butler, Kurt, Feng, Guanchao, Chen, Tong, Djuric, Petar
--Probabilistic models are often used to make predictions in regions of the data space where no observations are available, but it is not always clear whether such predictions are well-informed by previously seen data. In this paper, we propose a knowledge score for predictions from Gaussian process regression (GPR) models that quantifies the extent to which observing data have reduced our uncertainty about a prediction. The knowledge score is interpretable and naturally bounded between 0 and 1. We demonstrate in several experiments that the knowledge score can anticipate when predictions from a GPR model are accurate, and that this anticipation improves performance in tasks such as anomaly detection, extrapolation, and missing data imputation. Index T erms --anomaly detection, Gaussian processes, regression models, trustworthy machine learning, predictive distributions. The task of prediction is of fundamental importance in many domains.
- North America > United States > New York > Suffolk County > Stony Brook (0.04)
- Europe > Italy (0.04)
- Europe > Czechia > South Moravian Region > Brno (0.04)
SCFNet:A Transferable IIIC EEG Classification Network
Epilepsy and epileptiform discharges are common harmful brain activities, and electroencephalogram (EEG) signals are widely used to monitor the onset status of patients. However, due to the lack of unified EEG signal acquisition standards, there are many obstacles in practical applications, especially the difficulty in transferring and using models trained on different numbers of channels. To address this issue, we proposes a neural network architecture with a single-channel feature extraction (Singal Channel Feature) model backend fusion (SCFNet). The feature extractor of the model is an RCNN network with single-channel input, which does not depend on other channels, thereby enabling easier migration to data with different numbers of channels. Experimental results show that on the IIIC-Seizure dataset, the accuracy of EEG-SCFNet has improved by 4% compared to the baseline model and also increased by 1.3% compared to the original RCNN neural network model. Even with only fine-tuning the classification head, its performance can still maintain a level comparable to the baseline. In addition, in terms of cross-dataset transfer, EEG-SCFNet can still maintain certain performance even if the channel leads are different.
A Scalable Approach to Covariate and Concept Drift Management via Adaptive Data Segmentation
Yarabolu, Vennela, Waghmare, Govind, Gupta, Sonia, Asthana, Siddhartha
In many real-world applications, continuous machine learning (ML) systems are crucial but prone to data drift, a phenomenon where discrepancies between historical training data and future test data lead to significant performance degradation and operational inefficiencies. Traditional drift adaptation methods typically update models using ensemble techniques, often discarding drifted historical data, and focus primarily on either covariate drift or concept drift. These methods face issues such as high resource demands, inability to manage all types of drifts effectively, and neglecting the valuable context that historical data can provide. We contend that explicitly incorporating drifted data into the model training process significantly enhances model accuracy and robustness. This paper introduces an advanced framework that integrates the strengths of data-centric approaches with adaptive management of both covariate and concept drift in a scalable and efficient manner. Our framework employs sophisticated data segmentation techniques to identify optimal data batches that accurately reflect test data patterns. These data batches are then utilized for training on test data, ensuring that the models remain relevant and accurate over time. By leveraging the advantages of both data segmentation and scalable drift management, our solution ensures robust model accuracy and operational efficiency in large-scale ML deployments. It also minimizes resource consumption and computational overhead by selecting and utilizing relevant data subsets, leading to significant cost savings. Experimental results on classification task on real-world and synthetic datasets show our approach improves model accuracy while reducing operational costs and latency. This practical solution overcomes inefficiencies in current methods, providing a robust, adaptable, and scalable approach.
- South America > Brazil > Maranhão (0.04)
- Oceania > Australia > New South Wales (0.04)
- North America > United States > Nebraska > Sarpy County > Bellevue (0.04)
- (2 more...)
Applying Fine-Tuned LLMs for Reducing Data Needs in Load Profile Analysis
Hu, Yi, Kim, Hyeonjin, Ye, Kai, Lu, Ning
This paper presents a novel method for utilizing fine-tuned Large Language Models (LLMs) to minimize data requirements in load profile analysis, demonstrated through the restoration of missing data in power system load profiles. A two-stage fine-tuning strategy is proposed to adapt a pre-trained LLMs, i.e., GPT-3.5, for missing data restoration tasks. Through empirical evaluation, we demonstrate the effectiveness of the fine-tuned model in accurately restoring missing data, achieving comparable performance to state-of-the-art specifically designed models such as BERT-PIN. Key findings include the importance of prompt engineering and the optimal utilization of fine-tuning samples, highlighting the efficiency of few-shot learning in transferring knowledge from general user cases to specific target users. Furthermore, the proposed approach demonstrates notable cost-effectiveness and time efficiency compared to training models from scratch, making it a practical solution for scenarios with limited data availability and computing resources. This research has significant potential for application to other power system load profile analysis tasks. Consequently, it advances the use of LLMs in power system analytics, offering promising implications for enhancing the resilience and efficiency of power distribution systems.
- Asia (0.04)
- North America > United States > Washington > Benton County > Richland (0.04)
- North America > United States > North Carolina > Wake County > Raleigh (0.04)
- (2 more...)
- Research Report > New Finding (0.46)
- Research Report > Promising Solution (0.34)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (1.00)
- Energy > Renewable (1.00)
- (4 more...)
A novel multi-layer modular approach for real-time fuzzy-identification of gravitational-wave signals
Barone, Francesco Pio, Dell'Aquila, Daniele, Russo, Marco
Advanced LIGO and Advanced Virgo ground-based interferometers are instruments capable to detect gravitational wave signals exploiting advanced laser interferometry techniques. The underlying data analysis task consists in identifying specific patterns in noisy timeseries, but it is made extremely complex by the incredibly small amplitude of the target signals. In this scenario, the development of effective gravitational wave detection algorithms is crucial. We propose a novel layered framework for real-time detection of gravitational waves inspired by speech processing techniques and, in the present implementation, based on a state-of-the-art machine learning approach involving a hybridization of genetic programming and neural networks. The key aspects of the newly proposed framework are: the well structured, layered approach, and the low computational complexity. The paper describes the basic concepts of the framework and the derivation of the first three layers. Even if the layers are based on models derived using a machine learning approach, the proposed layered structure has a universal nature. Compared to more complex approaches, such as convolutional neural networks, which comprise a parameter set of several tens of MB and were tested exclusively for fixed length data samples, our framework has lower accuracy (e.g., it identifies 45% of low signal-to-noise-ration gravitational wave signals, against 65% of the state-of-the-art, at a false alarm probability of $10^{-2}$), but has a much lower computational complexity and a higher degree of modularity. Furthermore, the exploitation of short-term features makes the results of the new framework virtually independent against time-position of gravitational wave signals, simplifying its future exploitation in real-time multi-layer pipelines for gravitational-wave detection with new generation interferometers.
Quilt: Robust Data Segment Selection against Concept Drifts
Kim, Minsu, Hwang, Seong-Hyeon, Whang, Steven Euijong
Continuous machine learning pipelines are common in industrial settings where models are periodically trained on data streams. Unfortunately, concept drifts may occur in data streams where the joint distribution of the data X and label y, P(X, y), changes over time and possibly degrade model accuracy. Existing concept drift adaptation approaches mostly focus on updating the model to the new data possibly using ensemble techniques of previous models and tend to discard the drifted historical data. However, we contend that explicitly utilizing the drifted data together leads to much better model accuracy and propose Quilt, a data-centric framework for identifying and selecting data segments that maximize model accuracy. To address the potential downside of efficiency, Quilt extends existing data subset selection techniques, which can be used to reduce the training data without compromising model accuracy. These techniques cannot be used as is because they only assume virtual drifts where the posterior probabilities P(y|X) are assumed not to change. In contrast, a key challenge in our setup is to also discard undesirable data segments with concept drifts. Quilt thus discards drifted data segments and selects data segment subsets holistically for accurate and efficient model training. The two operations use gradient-based scores, which have little computation overhead. In our experiments, we show that Quilt outperforms state-of-the-art drift adaptation and data selection baselines on synthetic and real datasets.
- Oceania > Australia > New South Wales (0.04)
- North America > United States > Nebraska > Sarpy County > Bellevue (0.04)
BERT-PIN: A BERT-based Framework for Recovering Missing Data Segments in Time-series Load Profiles
Hu, Yi, Ye, Kai, Kim, Hyeonjin, Lu, Ning
Inspired by the success of the Transformer model in natural language processing and computer vision, this paper introduces BERT-PIN, a Bidirectional Encoder Representations from Transformers (BERT) powered Profile Inpainting Network. BERT-PIN recovers multiple missing data segments (MDSs) using load and temperature time-series profiles as inputs. To adopt a standard Transformer model structure for profile inpainting, we segment the load and temperature profiles into line segments, treating each segment as a word and the entire profile as a sentence. We incorporate a top candidates selection process in BERT-PIN, enabling it to produce a sequence of probability distributions, based on which users can generate multiple plausible imputed data sets, each reflecting different confidence levels. We develop and evaluate BERT-PIN using real-world dataset for two applications: multiple MDSs recovery and demand response baseline estimation. Simulation results show that BERT-PIN outperforms the existing methods in accuracy while is capable of restoring multiple MDSs within a longer window. BERT-PIN, served as a pre-trained model, can be fine-tuned for conducting many downstream tasks, such as classification and super resolution.
- North America > Canada > Quebec (0.04)
- Europe > France (0.04)
- Africa > Cameroon (0.04)
- (5 more...)
- Energy > Power Industry (1.00)
- Energy > Renewable (0.68)
- Government > Regional Government > North America Government > United States Government (0.68)
Joint Microseismic Event Detection and Location with a Detection Transformer
Yang, Yuanyuan, Birnie, Claire, Alkhalifah, Tariq
During the processes of reservoir stimulation, fluids are injected into a specific area underground. The high-pressure condition created by the fluid injection causes rocks to crack to release the built-up stress, resulting in small earthquakes called microseismic events. Detecting these events in seismic recordings and locating them back to their subsurface locations are important for understanding the subsurface conditions such as fracture networks and fluid flow pathways. This knowledge is critical for applications like carbon storage, geothermal energy extraction, and oil/gas production. Traditional approaches for microseismic event detection and location often suffer from manual intervention and/or heavy computation, while current machine learning-assisted approaches typically address detection and location separately. These limitations prevent the potential for real-time microseismic monitoring, which is crucial for scientists and engineers to make instant, informed decisions, like optimization of injection strategies. Here, we proposed a machine learning-based procedure for simultaneously detecting and locating microseismic events within a single framework, using a conventional Convolutional Neural Network and an encoder-decoder Transformer. Tests on synthetically-generated and field-collected passive seismic data illustrate the accuracy, efficiency, and potential of the proposed method, which could pave the way for real-time monitoring of microseismic events in the future.
- North America > United States > Texas (0.28)
- North America > United States > Oklahoma (0.14)
- North America > United States > Illinois (0.14)
- Asia > Middle East > Israel (0.14)