Author: Sulc, A.
Paper Title Page
TU2AO02 Textual Analysis of ICALEPCS and IPAC Conference Proceedings: Revealing Research Trends, Topics, and Collaborations for Future Insights and Advanced Search 309
 
  • A. Sulc, A. Eichler, T. Wilksen
    DESY, Hamburg, Germany
 
  Funding: This work was supported by HamburgX grant LFF-HHX-03 to the Center for Data and Computing in Natural Sciences (CDCS) from the Hamburg Ministry of Science, Research, Equalities and Districts.
In this paper, we show a textual analysis of past ICALEPCS and IPAC conference proceedings to gain insights into the research trends and topics discussed in the field. We use natural language processing techniques to extract meaningful information from the abstracts and papers of past conference proceedings. We extract topics to visualize and identify trends, analyze their evolution to identify emerging research directions and highlight interesting publications based solely on their content with an analysis of their network. Additionally, we will provide an advanced search tool to better search in the existing papers to prevent duplication and easier reference findings. Our analysis provides a comprehensive overview of the research landscape in the field and helps researchers and practitioners to better understand the state-of-the-art and identify areas for future research.
 
slides icon Slides TU2AO02 [12.762 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-TU2AO02  
About • Received ※ 30 September 2023 — Revised ※ 11 October 2023 — Accepted ※ 18 November 2023 — Issued ※ 29 November 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
TH2AO01 Log Anomaly Detection on EuXFEL Nodes 1126
 
  • A. Sulc, A. Eichler, T. Wilksen
    DESY, Hamburg, Germany
 
  Funding: This work was supported by HamburgX grant LFF-HHX-03 to the Center for Data and Computing in Natural Sciences (CDCS) from the Hamburg Ministry of Science, Research, Equalities and Districts.
This article introduces a method to detect anomalies in the log data generated by control system nodes at the European XFEL accelerator. The primary aim of this proposed method is to offer operators a comprehensive understanding of the availability, status, and problems specific to each node. This information is vital for ensuring the smooth operation. The sequential nature of logs and the absence of a rich text corpus that is specific to our nodes pose a significant limitation for traditional and learning-based approaches for anomaly detection. To overcome this limitation, we propose a method that uses word embedding and models individual nodes as a sequence of these vectors that commonly co-occur, using a Hidden Markov Model (HMM). We score individual log entries by computing a probability ratio between the probability of the full log sequence including the new entry and the probability of just the previous log entries, without the new entry. This ratio indicates how probable the sequence becomes when the new entry is added. The proposed approach can detect anomalies by scoring and ranking log entries from EuXFEL nodes where entries that receive high scores are potential anomalies that do not fit the routine of the node. This method provides a warning system to alert operators about these irregular log events that may indicate issues.
 
slides icon Slides TH2AO01 [1.420 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-TH2AO01  
About • Received ※ 30 September 2023 — Accepted ※ 08 December 2023 — Issued ※ 13 December 2023  
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
THPDP017 A Data Acquisition Middle Layer Server with Python Support for Linac Operation and Experiments Monitoring and Control 1330
 
  • V. Rybnikov, A. Sulc
    DESY, Hamburg, Germany
 
  This paper presents online anomaly detection on low-level radio frequency (LLRF) cavities running on FLASH/XFEL DAQ system*. The code is run by a DAQ Middle Layer (ML) server, which has on-line access to all collected data. The ML server executes a Python script that runs a pre-trained machine learning model on every shot in the FLASH/XFEL machine. We discuss the challenges associated with real-time anomaly detection due to high data rates generated by RF cavities, and introduce a DAQ system pipeline and algorithms used for online detection on arbitrary channels in our control system. The system’s performance is evaluated using real data from operational RF cavities. We also focus on the DAQ monitor server’s features and its implementation.
*A. Aghababyan et al., ’Multi-Processor Based Fast Data Acquisition for a Free Electron Laser and Experiments’, in IEEE Transactions on Nuclear Science, vol. 55, No. 1, pp. 256-260, February 2008
 
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-THPDP017  
About • Received ※ 02 October 2023 — Revised ※ 25 October 2023 — Accepted ※ 13 December 2023 — Issued ※ 20 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)