Papers
Topics
Authors
Recent
Search
2000 character limit reached

Pre-training strategy using real particle collision data for event classification in collider physics

Published 12 Dec 2023 in hep-ex and physics.comp-ph | (2312.06909v1)

Abstract: This study aims to improve the performance of event classification in collider physics by introducing a pre-training strategy. Event classification is a typical problem in collider physics, where the goal is to distinguish the signal events of interest from background events as much as possible to search for new phenomena in nature. A pre-training strategy with feasibility to efficiently train the target event classification using a small amount of training data has been proposed. Real particle collision data were used in the pre-training phase as a novelty, where a self-supervised learning technique to handle the unlabeled data was employed. The ability to use real data in the pre-training phase eliminates the need to generate a large amount of training data by simulation and mitigates bias in the choice of physics processes in the training data. Our experiments using CMS open data confirmed that high event classification performance can be achieved by introducing a pre-trained model. This pre-training strategy provides a potential approach to save computational resources for future collider experiments and introduces a foundation model for event classification.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. LHC Machine. Journal of Instrumentation, Vol. 3, No. 08, pp. S08001–S08001, aug 2008.
  2. ATLAS Summary plots history. Available: https://atlaspo.cern.ch/public/summary_plots/, 2021. (Accessed: Sep. 6, 2023).
  3. ATLAS Collaboration. ATLAS Software and Computing HL-LHC Roadmap. Available: http://cds.cern.ch/record/2802918, Mar 2022. (Accessed: Sep. 6, 2023).
  4. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, Vol. 22, No. 10, pp. 1345–1359, 2010.
  5. Using cms open data in research - challenges and directions. EPJ Web Conf., Vol. 251, p. 01004, 2021.
  6. S. Chatrchyan, et al. The CMS Experiment at the CERN LHC. JINST, Vol. 3, p. S08004, 2008.
  7. Searching for Exotic Particles in High-Energy Physics with Deep Learning. Nature Commun., Vol. 5, p. 4308, 2014.
  8. Transferability of deep learning models in searches for new physics at colliders. Phys. Rev. D, Vol. 101, p. 035042, Feb 2020.
  9. Relational inductive biases, deep learning, and graph networks. Available: https://arxiv.org/abs/1806.01261, 2018.
  10. Application of transfer learning to event classification in collider physics. PoS, Vol. ISGC2022, p. 016, 2022.
  11. On the opportunities and risks of foundation models. Available: https://crfm.stanford.edu/assets/report.pdf, 2021.
  12. CMS collaboration (2021). SingleElectron primary dataset in AOD format from RunD of 2015 (/SingleElectron/Run2015D-08Jun2016-v1/AOD). Available: http://opendata.cern.ch/record/24103, 2021. (Accessed: Sep. 6, 2023).
  13. CMS collaboration (2021). SingleMuon primary dataset in AOD format from RunD of 2015 (/SingleMuon/Run2015D-16Dec2015-v1/AOD). Available: http://opendata.cern.ch/record/24102, 2021. (Accessed: Sep. 6, 2023).
  14. The ATLAS Collaboration. The ATLAS experiment at the CERN large hadron collider. Journal of Instrumentation, Vol. 3, No. 08, pp. S08003–S08003, aug 2008.
  15. The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations. JHEP, Vol. 07, p. 079, 2014.
  16. An introduction to PYTHIA 8.2. Comput. Phys. Commun., Vol. 191, pp. 159–177, 2015.
  17. Michele Selvaggi. DELPHES 3: A modular framework for fast-simulation of generic collider experiments. Journal of Physics: Conference Series, Vol. 523, p. 012033, jun 2014.
  18. Theory and phenomenology of two-higgs-doublet models. Physics Reports, Vol. 516, No. 1, pp. 1–102, 2012. Theory and phenomenology of two-Higgs-doublet models.
  19. PyTorch TransformerEncoderLayer. Available: https://pytorch.org/docs/stable/generated/torch.nn.TransformerEncoderLayer.html, 2021. (Accessed: Sep. 6, 2023).
  20. Pytorch: An imperative style, high-performance deep learning library. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems 32, pp. 8024–8035. Curran Associates, Inc., 2019.
  21. hepfoundation. Available: https://github.com/ktomoe/hepfoundation/, 2023. (Accessed: Oct. 30, 2023).
  22. Sebastian Ruder. An overview of gradient descent optimization algorithms. Available: https://arxiv.org/abs/1609.04747, 2016.
  23. SGDR: Stochastic Gradient Descent with Warm Restarts. Available: https://arxiv.org/abs/1608.03983, 2016.
Citations (4)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.