We propose to apply several gradient estimation techniques to enable the differentiation of programs with discrete randomness in High Energy Physics. Such programs are common in High Energy Physics due to the presence of branching processes and clustering-based analysis. Thus differentiating such programs can open the way for gradient based optimization in the context of detector design optimization, simulator tuning, or data analysis and reconstruction optimization. We discuss several possible gradient estimation strategies, including the recent Stochastic AD method, and compare them in simplified detector design experiments. In doing so we develop, to the best of our knowledge, the first fully differentiable branching program.
翻译:我们提议应用多种梯度估计技术,以实现高能物理中具有离散随机性程序的可微分性。这类程序在高能物理中很常见,因为存在分支过程和基于聚类的分析。因此,对此类程序求导可为基于梯度的优化开辟道路,应用于探测器设计优化、模拟器调优或数据分析与重建优化。我们讨论了多种可能的梯度估计策略,包括最新的随机自动微分方法,并在简化的探测器设计实验中进行了比较。通过此研究,据我们所知,我们开发了第一个完全可微分的分支程序。