JACoW logo

Journals of Accelerator Conferences Website (JACoW)

JACoW is a publisher in Geneva, Switzerland that publishes the proceedings of accelerator conferences held around the world by an international collaboration of editors.


BiBTeX citation export for FR2AO04: A Physics-Based Simulator to Facilitate Reinforcement Learning in the RHIC Accelerator Complex

@inproceedings{nguyen:icalepcs2023-fr2ao04,
  author       = {L.K. Nguyen and K.A. Brown and M.R. Costanzo and Y. Gao and M. Harvey and J.P. Jamilkowski and J. Morris and V. Schoefer},
% author       = {L.K. Nguyen and K.A. Brown and M.R. Costanzo and Y. Gao and M. Harvey and J.P. Jamilkowski and others},
% author       = {L.K. Nguyen and others},
  title        = {{A Physics-Based Simulator to Facilitate Reinforcement Learning in the RHIC Accelerator Complex}},
% booktitle    = {Proc. ICALEPCS'23},
  booktitle    = {Proc. 19th Int. Conf. Accel. Large Exp. Phys. Control Syst. (ICALEPCS'23)},
  eventdate    = {2023-10-09/2023-10-13},
  pages        = {1630--1636},
  paper        = {FR2AO04},
  language     = {english},
  keywords     = {cavity, controls, booster, simulation, diagnostics},
  venue        = {Cape Town, South Africa},
  series       = {International Conference on Accelerator and Large Experimental Physics Control Systems},
  number       = {19},
  publisher    = {JACoW Publishing, Geneva, Switzerland},
  month        = {02},
  year         = {2024},
  issn         = {2226-0358},
  isbn         = {978-3-95450-238-7},
  doi          = {10.18429/JACoW-ICALEPCS2023-FR2AO04},
  url          = {https://jacow.org/icalepcs2023/papers/fr2ao04.pdf},
  abstract     = {{The successful use of machine learning (ML) in particle accelerators has greatly expanded in recent years; however, the realities of operations often mean very limited machine availability for ML development, impeding its progress in many cases. This paper presents a framework for exploiting physics-based simulations, coupled with real machine data structure, to facilitate the investigation and implementation of reinforcement learning (RL) algorithms, using the longitudinal bunch-merge process in the Booster and Alternating Gradient Synchrotron (AGS) at Brookhaven National Laboratory (BNL) as examples. Here, an initial fake wall current monitor (WCM) signal is fed through a noisy physics-based model simulating the behavior of bunches in the accelerator under given RF parameters and external perturbations between WCM samples; the resulting output becomes the input for the RL algorithm and subsequent pass through the simulated ring, whose RF parameters have been modified by the RL algorithm. This process continues until an optimal policy for the RF bunch merge gymnastics has been learned for injecting bunches with the required intensity and emittance into the Relativistic Heavy Ion Collider (RHIC), according to the physics model. Robustness of the RL algorithm can be evaluated by introducing other drifts and noisy scenarios before the algorithm is deployed and final optimization occurs in the field. }},
}