Goto

Collaborating Authors

 Ju, Xiangyang


A Language Model for Particle Tracking

arXiv.org Artificial Intelligence

Particle tracking is crucial for almost all physics analysis programs at the Large Hadron Collider. Deep learning models are pervasively used in particle tracking related tasks. However, the current practice is to design and train one deep learning model for one task with supervised learning techniques. The trained models work well for tasks they are trained on but show no or little generalization capabilities. We propose to unify these models with a language model. In this paper, we present a tokenized detector representation that allows us to train a BERT model for particle tracking. The trained BERT model, namely TrackingBERT, offers latent detector module embedding that can be used for other tasks. This work represents the first step towards developing a foundational model for particle detector understanding.


Hierarchical Graph Neural Networks for Particle Track Reconstruction

arXiv.org Artificial Intelligence

We introduce a novel variant of GNN for particle tracking--called Hierarchical Graph Neural Network (HGNN). The architecture creates a set of higher-level representations which correspond to tracks and assigns spacepoints to these tracks, allowing disconnected spacepoints to be assigned to the same track, as well as multiple tracks to share the same spacepoint. We propose a novel learnable pooling algorithm called GMPool to generate these higher-level representations called "super-nodes", as well as a new loss function designed for tracking problems and HGNN specifically. On a standard tracking problem, we show that, compared with previous ML-based tracking algorithms, the HGNN has better tracking efficiency performance, better robustness against inefficient input graphs, and better convergence compared with traditional GNNs.


Hyperparameter Optimization of Generative Adversarial Network Models for High-Energy Physics Simulations

arXiv.org Artificial Intelligence

Event generation and detector simulation are essential for physics analyses at the Large Hadron Collider, but are also computationally expensive. Different Machine Learning-based generative models are exploited to reduce the computational cost. Those generative models can be classified into four categories: (1) Variational Autoencoders [1], which learn a stochastic map from the data space to a latent space and back, preserving the statistics of the latent space and data space; (2) Normalizing Flows [2] use invertible transformations so that the probability density can be computed and the generator is optimized using the log likelihood; (3) score-based generative models [3, 4], which generate samples from noise by repeatedly perturbing the data with a diffusion equation, and learning to reverse the perturbation via estimating the diffusion function; (4) Generative Adversarial Networks (GAN) [5], which optimize the generator network by means of an auxiliary network ('discriminator') that tries to classify generated examples from real examples. GAN will remain as an important generative model in High Energy Physics because of its unique features. GANs have been used in many aspects of High Energy Physics to accelerate computationally intensive physics simulations.


Performance of a Geometric Deep Learning Pipeline for HL-LHC Particle Tracking

arXiv.org Artificial Intelligence

The Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. Exa.TrkX's tracking pipeline groups detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-inspired tracking detector), has been demonstrated on other detectors, including DUNE Liquid Argon TPC and CMS High-Granularity Calorimeter. This paper documents new developments needed to study the physics and computing performance of the Exa.TrkX pipeline on the full TrackML dataset, a first step towards validating the pipeline using ATLAS and CMS data. The pipeline achieves tracking efficiency and purity similar to production tracking algorithms. Crucially for future HEP applications, the pipeline benefits significantly from GPU acceleration, and its computational requirements scale close to linearly with the number of particles in the event.


Beyond 4D Tracking: Using Cluster Shapes for Track Seeding

arXiv.org Machine Learning

Analyzing data from the Large Hadron Collider (LHC) present a hyper challenge. A given collision event may result in hundreds of outgoing particles, each with many features (momentum, electric charge, etc.). This hyper variate phase space is then observed by complex multi-channel detectors that are essentially hyperspectral cameras. The LHC detectors have millions of readout channels and dimensionality reduction is essential for data analysis. One natural and nearly lossless reduction is the reconstruction of charged particle trajectories ('tracks'). The innermost layers of the detectors at the LHC are constructed to register the passage of charged particles without significantly altering the particle energy or direction. In the ATLAS and CMS detectors, this is achieved using silicon sensors that are finely segmented in one or two directions and are called strips and pixels, respectively. We will focus on pixels, although our methodology applies more generally. Typically, the first step in a tracking algorithm is the construction of seeds, which are sets of three or more hit pixel clusters that can be used to fit charged-particle trajectories (see e.g.


Track Seeding and Labelling with Embedded-space Graph Neural Networks

arXiv.org Artificial Intelligence

To address the unprecedented scale of HL-LHC data, the Exa.TrkX project is investigating a variety of machine learning approaches to particle track reconstruction. The most promising of these solutions, graph neural networks (GNN), process the event as a graph that connects track measurements (detector hits corresponding to nodes) with candidate line segments between the hits (corresponding to edges). Detector information can be associated with nodes and edges, enabling a GNN to propagate the embedded parameters around the graph and predict node-, edge- and graph-level observables. Previously, message-passing GNNs have shown success in predicting doublet likelihood, and we here report updates on the state-of-the-art architectures for this task. In addition, the Exa.TrkX project has investigated innovations in both graph construction, and embedded representations, in an effort to achieve fully learned end-to-end track finding. Hence, we present a suite of extensions to the original model, with encouraging results for hitgraph classification. In addition, we explore increased performance by constructing graphs from learned representations which contain non-linear metric structure, allowing for efficient clustering and neighborhood queries of data points. We demonstrate how this framework fits in with both traditional clustering pipelines, and GNN approaches. The embedded graphs feed into high-accuracy doublet and triplet classifiers, or can be used as an end-to-end track classifier by clustering in an embedded space. A set of post-processing methods improve performance with knowledge of the detector physics. Finally, we present numerical results on the TrackML particle tracking challenge dataset, where our framework shows favorable results in both seeding and track finding.