Skip to main content

Showing 1–17 of 17 results for author: Kagan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.07066  [pdf, other

    hep-ph cs.LG hep-ex

    Re-Simulation-based Self-Supervised Learning for Pre-Training Foundation Models

    Authors: Philip Harris, Michael Kagan, Jeffrey Krupa, Benedikt Maier, Nathaniel Woodward

    Abstract: Self-Supervised Learning (SSL) is at the core of training modern large machine learning models, providing a scheme for learning powerful representations that can be used in a variety of downstream tasks. However, SSL strategies must be adapted to the type of training data and downstream tasks required. We propose RS3L, a novel simulation-based SSL strategy that employs a method of re-simulation to… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 24 pages, 9 figures

  2. arXiv:2401.13537  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Masked Particle Modeling on Sets: Towards Self-Supervised High Energy Physics Foundation Models

    Authors: Tobias Golling, Lukas Heinrich, Michael Kagan, Samuel Klein, Matthew Leigh, Margarita Osadchy, John Andrew Raine

    Abstract: We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs for use in high energy physics (HEP) scientific data. This work provides a novel scheme to perform masked modeling based pre-training to learn permutation invariant functions on sets. More generally, this work provides a step towards bui… ▽ More

    Submitted 11 July, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  3. arXiv:2310.12804  [pdf, other

    hep-ex cs.LG hep-ph physics.data-an

    Differentiable Vertex Fitting for Jet Flavour Tagging

    Authors: Rachel E. C. Smith, Inês Ochoa, Rúben Inácio, Jonathan Shoemaker, Michael Kagan

    Abstract: We propose a differentiable vertex fitting algorithm that can be used for secondary vertex fitting, and that can be seamlessly integrated into neural networks for jet flavour tagging. Vertex fitting is formulated as an optimization problem where gradients of the optimized solution vertex are defined through implicit differentiation and can be passed to upstream or downstream neural network compone… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 11 pages

  4. arXiv:2308.16680  [pdf, other

    stat.ML cs.LG hep-ex hep-ph physics.data-an

    Branches of a Tree: Taking Derivatives of Programs with Discrete and Branching Randomness in High Energy Physics

    Authors: Michael Kagan, Lukas Heinrich

    Abstract: We propose to apply several gradient estimation techniques to enable the differentiation of programs with discrete randomness in High Energy Physics. Such programs are common in High Energy Physics due to the presence of branching processes and clustering-based analysis. Thus differentiating such programs can open the way for gradient based optimization in the context of detector design optimizati… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 8 pages

  5. arXiv:2208.03284  [pdf, ps, other

    hep-ex cs.LG hep-ph stat.ML

    Interpretable Uncertainty Quantification in AI for HEP

    Authors: Thomas Y. Chen, Biprateep Dey, Aishik Ghosh, Michael Kagan, Brian Nord, Nesar Ramachandra

    Abstract: Estimating uncertainty is at the core of performing scientific measurements in HEP: a measurement is not useful without an estimate of its uncertainty. The goal of uncertainty quantification (UQ) is inextricably linked to the question, "how do we physically and statistically interpret these uncertainties?" The answer to this question depends not only on the computational task we aim to undertake,… ▽ More

    Submitted 6 September, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: Submitted to the Proceedings of the US Community Study on the Future of Particle Physics (Snowmass 2021)

    Report number: FERMILAB-FN-1179-SCD; arXiv:2208.03284 oai:inspirehep.net:2132723

  6. arXiv:2207.00559  [pdf, other

    cs.LG hep-ex physics.ins-det stat.ML

    Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml

    Authors: Elham E Khoda, Dylan Rankin, Rafael Teixeira de Lima, Philip Harris, Scott Hauck, Shih-Chieh Hsu, Michael Kagan, Vladimir Loncar, Chaitanya Paikara, Richa Rao, Sioni Summers, Caterina Vernieri, Aaron Wang

    Abstract: Recurrent neural networks have been shown to be effective architectures for many tasks in high energy physics, and thus have been widely adopted. Their use in low-latency environments has, however, been limited as a result of the difficulties of implementing recurrent architectures on field-programmable gate arrays (FPGAs). In this paper we present an implementation of two types of recurrent neura… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: 12 pages, 6 figures, 5 tables

  7. arXiv:2205.11480  [pdf, other

    physics.ins-det cs.CV physics.atom-ph physics.optics quant-ph

    Novel Light Field Imaging Device with Enhanced Light Collection for Cold Atom Clouds

    Authors: Sanha Cheong, Josef C. Frisch, Sean Gasiorowski, Jason M. Hogan, Michael Kagan, Murtaza Safdari, Ariel Schwartzman, Maxime Vandegar

    Abstract: We present a light field imaging system that captures multiple views of an object with a single shot. The system is designed to maximize the total light collection by accepting a larger solid angle of light than a conventional lens with equivalent depth of field. This is achieved by populating a plane of virtual objects using mirrors and fully utilizing the available field of view and depth of fie… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Journal ref: 2022 JINST 17 P08021

  8. arXiv:2203.12852  [pdf, other

    hep-ex cs.LG hep-ph

    Graph Neural Networks in Particle Physics: Implementations, Innovations, and Challenges

    Authors: Savannah Thais, Paolo Calafiura, Grigorios Chachamis, Gage DeZoort, Javier Duarte, Sanmay Ganguly, Michael Kagan, Daniel Murnane, Mark S. Neubauer, Kazuhiro Terao

    Abstract: Many physical systems can be best understood as sets of discrete data with associated relationships. Where previously these sets of data have been formulated as series or image data to match the available machine learning architectures, with the advent of graph neural networks (GNNs), these systems can be learned natively as graphs. This allows a wide variety of high- and low-level physical featur… ▽ More

    Submitted 25 March, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: contribution to Snowmass 2021

  9. arXiv:2203.08806  [pdf, other

    hep-ph cs.LG hep-ex physics.comp-ph physics.ins-det

    New directions for surrogate models and differentiable programming for High Energy Physics detector simulation

    Authors: Andreas Adelmann, Walter Hopkins, Evangelos Kourlitis, Michael Kagan, Gregor Kasieczka, Claudius Krause, David Shih, Vinicius Mikuni, Benjamin Nachman, Kevin Pedro, Daniel Winklehner

    Abstract: The computational cost for high energy physics detector simulation in future experimental facilities is going to exceed the current available resources. To overcome this challenge, new ideas on surrogate models using machine learning methods are being explored to replace computationally expensive components. Additionally, differentiable programming has been proposed as a complementary approach, pr… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: contribution to Snowmass 2021

    Report number: FERMILAB-CONF-22-199-SCD

  10. arXiv:2203.00057  [pdf, other

    hep-ph cs.LG physics.comp-ph physics.data-an

    Differentiable Matrix Elements with MadJax

    Authors: Lukas Heinrich, Michael Kagan

    Abstract: MadJax is a tool for generating and evaluating differentiable matrix elements of high energy scattering processes. As such, it is a step towards a differentiable programming paradigm in high energy physics that facilitates the incorporation of high energy physics domain knowledge, encoded in simulation software, into gradient based learning and optimization pipelines. MadJax comprises two componen… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: 6 pages, Proceedings of the 20th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2021)

  11. arXiv:2107.02958  [pdf, other

    eess.IV cs.CV q-bio.QM

    End-to-End Simultaneous Learning of Single-particle Orientation and 3D Map Reconstruction from Cryo-electron Microscopy Data

    Authors: Youssef S. G. Nashed, Frederic Poitevin, Harshit Gupta, Geoffrey Woollard, Michael Kagan, Chuck Yoon, Daniel Ratner

    Abstract: Cryogenic electron microscopy (cryo-EM) provides images from different copies of the same biomolecule in arbitrary orientations. Here, we present an end-to-end unsupervised approach that learns individual particle orientations from cryo-EM data while reconstructing the average 3D map of the biomolecule, starting from a random initialization. The approach relies on an auto-encoder architecture wher… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 13 pages, 4 figures

  12. arXiv:2012.09719  [pdf, other

    physics.data-an cs.CV cs.LG hep-ex hep-ph

    Image-Based Jet Analysis

    Authors: Michael Kagan

    Abstract: Image-based jet analysis is built upon the jet image representation of jets that enables a direct connection between high energy physics and the fields of computer vision and deep learning. Through this connection, a wide array of new jet analysis techniques have emerged. In this text, we survey jet image based classification models, built primarily on the use of convolutional neural networks, exa… ▽ More

    Submitted 18 December, 2020; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: To appear in Artificial Intelligence for High Energy Physics, World Scientific Publishing

  13. arXiv:2011.05836  [pdf, other

    stat.ML cs.LG hep-ex hep-ph physics.data-an

    Neural Empirical Bayes: Source Distribution Estimation and its Applications to Simulation-Based Inference

    Authors: Maxime Vandegar, Michael Kagan, Antoine Wehenkel, Gilles Louppe

    Abstract: We revisit empirical Bayes in the absence of a tractable likelihood function, as is typical in scientific domains relying on computer simulations. We investigate how the empirical Bayesian can make use of neural density estimators first to use all noise-corrupted observations to estimate a prior or source distribution over uncorrupted samples, and then to perform single-observation posterior infer… ▽ More

    Submitted 26 February, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: Camera-ready version presented at AISTATS 2021

  14. arXiv:2002.04632  [pdf, other

    cs.LG hep-ex physics.data-an stat.ML

    Black-Box Optimization with Local Generative Surrogates

    Authors: Sergey Shirobokov, Vladislav Belavin, Michael Kagan, Andrey Ustyuzhanin, Atılım Güneş Baydin

    Abstract: We propose a novel method for gradient-based optimization of black-box simulators using differentiable local surrogate models. In fields such as physics and engineering, many processes are modeled with non-differentiable simulators with intractable likelihoods. Optimization of these forward models is particularly challenging, especially when the simulator is stochastic. To address such cases, we i… ▽ More

    Submitted 15 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Journal ref: In Advances in Neural Information Processing Systems 34 (NeurIPS), 2020

  15. arXiv:1903.04476  [pdf, other

    cs.LG cs.NE q-bio.NC stat.ML

    Continual Learning via Neural Pruning

    Authors: Siavash Golkar, Michael Kagan, Kyunghyun Cho

    Abstract: We introduce Continual Learning via Neural Pruning (CLNP), a new method aimed at lifelong learning in fixed capacity models based on neuronal model sparsification. In this method, subsequent tasks are trained using the inactive neurons and filters of the sparsified network and cause zero deterioration to the performance of previous tasks. In order to deal with the possible compromise between model… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

    Comments: 12 pages, 5 figures, 3 tables

  16. arXiv:1807.02876  [pdf, other

    physics.comp-ph cs.LG hep-ex stat.ML

    Machine Learning in High Energy Physics Community White Paper

    Authors: Kim Albertsson, Piero Altoe, Dustin Anderson, John Anderson, Michael Andrews, Juan Pedro Araque Espinosa, Adam Aurisano, Laurent Basara, Adrian Bevan, Wahid Bhimji, Daniele Bonacorsi, Bjorn Burkle, Paolo Calafiura, Mario Campanelli, Louis Capps, Federico Carminati, Stefano Carrazza, Yi-fan Chen, Taylor Childers, Yann Coadou, Elias Coniavitis, Kyle Cranmer, Claire David, Douglas Davis, Andrea De Simone , et al. (103 additional authors not shown)

    Abstract: Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We d… ▽ More

    Submitted 16 May, 2019; v1 submitted 8 July, 2018; originally announced July 2018.

    Comments: Editors: Sergei Gleyzer, Paul Seyfert and Steven Schramm

  17. arXiv:1611.01046  [pdf, other

    stat.ML cs.LG cs.NE physics.data-an stat.ME

    Learning to Pivot with Adversarial Networks

    Authors: Gilles Louppe, Michael Kagan, Kyle Cranmer

    Abstract: Several techniques for domain adaptation have been proposed to account for differences in the distribution of the data used for training and testing. The majority of this work focuses on a binary domain label. Similar problems occur in a scientific context where there may be a continuous family of plausible data generation processes associated to the presence of systematic uncertainties. Robust in… ▽ More

    Submitted 1 June, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

    Comments: v1: Original submission. v2: Fixed references. v3: version submitted to NIPS'2017. Code available at https://github.com/glouppe/paper-learning-to-pivot

    Journal ref: Advances in Neural Information Processing Systems 30, pages 981-990, 2017