Keyword: distributed
Paper Title Other Keywords Page
MO4BCO01 Using BDD Testing in SKAO: Challenges and Opportunities software, TANGO, controls, interface 183
 
  • V.L. Allan
    University of Cambridge, Cambridge, United Kingdom
  • G. Brajnik
    IDS, Udine, Italy
  • L.R. Brederode
    SKAO, Macclesfield, United Kingdom
 
  Defining what a system should do is one of the hardest parts of system design. Using Behaviour Driven Design (BDD) techniques can help, and also help define the tests needed to check that the desired behaviour is implemented. We describe the challenges and opportunities that arise when adopting these techniques, including both technical and social issues, and especially why in our case BDD techniques provide significant value. We present our pathway towards using BDD and the lessons learned. By trying to use BDD testing to run integration tests, it enabled the identification of gaps in the testing infrastructure, particularly the TANGO testing infrastructure, and gaps in developers’ understanding of the system design. This allowed SKAO to take steps to improve the tests, the infrastructure, and the design, by integrating BDD techniques into the full product development lifecycle and using them also for monitoring the development process and the quality of software products.  
slides icon Slides MO4BCO01 [1.496 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-MO4BCO01  
About • Received ※ 06 October 2023 — Revised ※ 10 October 2023 — Accepted ※ 14 November 2023 — Issued ※ 09 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
TU2AO06 Accelerator Control Class for Graduate Students in SOKENDAI, KEK controls, EPICS, GUI, factory 335
 
  • N. Kamikubota, K. Furukawa, M. Satoh, S. Yamada, N. Yamamoto
    KEK, Ibaraki, Japan
 
  The Graduate University for Advanced Studies, known as SOKENDAI, provides educational opportunities for graduate students in collaboration with national research institutions in Japan. KEK is one of the institutes, and has a program "Accelerator Science". Since 2019, we started two classes: lectures "Introduction to accelerator control system" for one semester, and a two-day seminar "Control of distributed devices for large systems". The former consists of 12 lectures on various topics of accelerator controls by teachers, followed by a presentation day by students. The latter consists of lecture and hands-on, which enables students to practice EPICS with Raspberry-pi based devices. In the paper, status of accelerator control classes are reported.
1) SOKENDAI, https://www.soken.ac.jp/en/
 
slides icon Slides TU2AO06 [2.813 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-TU2AO06  
About • Received ※ 02 October 2023 — Revised ※ 13 October 2023 — Accepted ※ 29 November 2023 — Issued ※ 13 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
TUPDP045 Monitoring the SKA Infrastructure for CICD monitoring, target, database, TANGO 622
 
  • M. Di Carlo, M. Dolci
    INAF - OAAB, Teramo, Italy
  • P. Harding, U.Y. Yilmaz
    SKAO, Macclesfield, United Kingdom
  • J.B. Morgado
    Universidade do Porto, Faculdade de Ciências, Porto, Portugal
  • P. Osorio
    Atlar Innovation, Pampilhosa da Serra, Portugal
 
  Funding: INAF
The Square Kilometre Array (SKA) is an international effort to build two radio interferometers in South Africa and Australia, forming one Observatory monitored and controlled from global headquarters (GHQ) based in the United Kingdom at Jodrell Bank. The selected solution for monitoring the SKA CICD (continuous integration and continuous deployment) Infrastructure is Prometheus with the help of Thanos. Thanos is used for high availability, resilience, and long term storage retention for monitoring data. For data visualisation, the Grafana project emerged as an important tool for displaying data in order to make specific reasoning and debugging of particular aspect of the infrastructure in place. In this paper, the monitoring platform is presented while considering quality aspect such as performance, scalability, and data preservation.
 
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-TUPDP045  
About • Received ※ 27 September 2023 — Revised ※ 18 October 2023 — Accepted ※ 14 December 2023 — Issued ※ 19 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
WE3BCO04 Improving Observability of the SCADA Systems Using Elastic APM, Reactive Streams and Asynchronous Communication SCADA, controls, monitoring, real-time 1016
 
  • I. Khokhriakov
    University of California, San Diego (UCSD), La Jolla, California, USA
  • V. Mazalova
    CFEL, Hamburg, Germany
  • O. Merkulova
    IK, Moscow, Russia
 
  As modern control systems grow in complexity, ensuring observability and traceability becomes more challenging. To meet this challenge, we present a novel solution that seamlessly integrates with multiple SCADA frameworks to provide end-to-end visibility into complex system interactions. Our solution utilizes Elastic APM to monitor and trace the performance of system components, allowing for real-time analysis and diagnosis of issues. In addition, our solution is built using reactive design principles and asynchronous communication, enabling it to scale to meet the demands of large, distributed systems. This presentation will describe our approach and discuss how it can be applied to various use cases, including particle accelerators and other scientific facilities. We will also discuss the benefits of our solution, such as improved system observability and traceability, reduced downtime, and better resource allocation. We believe that our approach represents a significant step forward in the development of modern control systems, and we look forward to sharing our work with the community at ICALEPCS 2023.
* Igor Khokhriakov et al,
A novel solution for controlling hardware components of accelerators and beamlines
JOURNAL OF SYNCHROTRON RADIATION · Apr 5, 2022
 
slides icon Slides WE3BCO04 [3.377 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-WE3BCO04  
About • Received ※ 29 September 2023 — Revised ※ 14 November 2023 — Accepted ※ 19 December 2023 — Issued ※ 22 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
THPDP036 Research on HALF Historical Data Archiver Technology database, EPICS, controls, experiment 1394
 
  • X.K. Sun, D.D. Zhang
    USTC/NSRL, Hefei, Anhui, People’s Republic of China
  • H. Chen
    USTC, SNST, Anhui, People’s Republic of China
 
  The Hefei Advanced Light Facility (HALF) is a 2.2-GeV 4th synchrotron radiation light source, which is scheduled to start construction in Hefei, China in 2023. The HALF contains an injector and a 480-m diffraction limited storage ring, and 10 beamlines for phase one. The HALF historical data archiver system is responsible to store operation data for the entire facility including accelerator and beamlines. It is necessary to choose a high-performance database for the massive structured data generated by HALF. A fair test platform is designed and built to test the performance of six commonly used databases in the accelerator field. The test metrics include reading and writing performance, availability, scalability, and software ecosystem. This paper introduces the design of the database test scheme, the construction of the test platform and the future test plan in detail.  
poster icon Poster THPDP036 [0.933 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-THPDP036  
About • Received ※ 28 September 2023 — Revised ※ 26 October 2023 — Accepted ※ 11 December 2023 — Issued ※ 12 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
THPDP037 The Alarm System at HLS-II monitoring, controls, EPICS, status 1399
 
  • S. Xu, X.K. Sun
    USTC/NSRL, Hefei, Anhui, People’s Republic of China
 
  The control system of the Hefei Light Source II (HLS-II) is a distributed system based on Experimental Physics and Industrial Control System. The alarm system of HLS-II is responsible for monitoring the alarm state of the facility and distributing the alarm message in time. The monitoring range of the alarm system covers the devices of HLS-II technical group and the server platform. Zabbix is an open-source software tool to monitor the server platform. Custom metrics collection is achieved by implementing external scripts written in Python and automated agent deployment discovers the monitored servers running with Zabbix agents. The alarm distribution strategy of the front end devices is designed to overcome alarm floods. The alarm system of HLS-II provides multiple messaging channels to notify the responsible staff, including WeChat, SMS and web-based GUI. The alarm system of HLS-II has been deployed since December 2022. The result shows the alarm system facilitates the operator to troubleshoot problem efficiently to improve the availability of HLS-II.  
poster icon Poster THPDP037 [0.653 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-THPDP037  
About • Received ※ 30 September 2023 — Accepted ※ 08 December 2023 — Issued ※ 13 December 2023  
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
THPDP047 ELK Stack Deployment with Ansible operation, software, controls, GUI 1411
 
  • T. Gatsi, X.P. Baloyi, J.L. Lekganyane, R.L. Schwartz
    SARAO, Cape Town, South Africa
 
  The 64-dish MeerKAT radio telescope, constructed in South Africa, became the largest and most sensitive radio telescope in the Southern Hemisphere until integrated with the Square Kilometer Array (SKA). Our Control and Monitoring system for Radio Astronomy Project such as MeerKAT produces a lot of data and logs that require proper handling. Viewing and analysis to trace and track system issues and as well as investigate technical software issues require one to go back in time to look for event occurrence. We therefore deployed an ELK software stack ( Elasticsearch, Kibana, Logstash) using Ansible in order to have the capability to aggregate system process logs. We deploy the stack as a cluster comprising lxc containers running inside a Proxmox Virtual Environment using Ansible as a software deployment tool. Each container in the cluster performs cluster duties such as deciding where to place index shards and when to move them. Each container is a data node that makes up the heart of the cluster. We deploy the stack as a cluster for load balancing purposes. Logstash ingests ,transforms and sends the data to the Kibana Graphical User Interface for visualization. Elasticsearch indexes, analyzes, and searches the ingested data into Kibana and our Operations Team and other system users can visualize and analyze these logs on the Kibana GUI frontend.  
poster icon Poster THPDP047 [0.503 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-THPDP047  
About • Received ※ 03 October 2023 — Revised ※ 12 October 2023 — Accepted ※ 13 December 2023 — Issued ※ 19 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
THPDP085 LANSCE’s Timing System Status and Future Plans timing, controls, hardware, operation 1547
 
  • L.E. Walker, B.C. Atencio, S.A. Baily, D. Fratantonio, C.D. Hatch, M. Pieck, T. Ramakrishnan
    LANL, Los Alamos, New Mexico, USA
 
  Funding: This work was supported by the U.S. DOE through the Los Alamos National Laboratory (LANL). LANL is operated by Triad National Security, LLC, for the NNSA of U.S. DOE - Contract No. 89233218CNA000001
The Los Alamos Neutron Science Center (LANSCE) operates at a maximum repetition rate of 120 Hz. Timing gates are required for synchronization of the accelerator to provide beam acceleration along the LINAC and beam distribution to the five experimental areas. They are also provided to other devices with sensitive operating points relative to the machine cycle. Over the last 50 years of operations many new time sensitive pieces of equipment have been added. This has changed the demand on, and complexity of, the timing system. Further driven by equipment obsolescence issues, the timing system un-derwent many upgrades and revitalization efforts, with the most significant deployment starting in 2016. Due to these upgrade efforts, the timing system architecture design changed from a purely centralized system, to a distributed event-based one. The purpose of this paper is to detail the current state of the timing system, as a hy-brid system with the gate events being generated from a new timing master system, while still utilizing legacy distribution and fanout systems. Upgrades to the distribu-tion system are planned, but due to the required beam delivery schedule, they can only be deployed in sections during four-month annual maintenance cycles. The paper will also cover the off-the-shelf solutions that have been found for standardization, and the efforts towards a life cycle management process.
LA-UR-23-31123
 
poster icon Poster THPDP085 [3.311 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-THPDP085  
About • Received ※ 29 September 2023 — Revised ※ 11 October 2023 — Accepted ※ 06 December 2023 — Issued ※ 13 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
THSDSC01 Sector Focused Cyclotron Power Supply Control System Upgrade controls, power-supply, hardware, cyclotron 1578
 
  • X.J. Liu, S. An, Y. Chen, L. Ge, M. Li, J.Q. Wu, W. Zhang
    IMP/CAS, Lanzhou, People’s Republic of China
 
  The old power supply control system of SFC (Sector Focused Cyclotron) has been in operation for more than a decade. Control system architecture is centralized, and equipment failure rate is getting higher and higher. The new control system uses the EPICS architecture, and the hardware uses Advantech’s APAX modules. The IOC runs on the APAX host and interacts with the module through API functions. The system has been running very stable for several months without failure.  
slides icon Slides THSDSC01 [0.510 MB]  
poster icon Poster THSDSC01 [2.136 MB]  
DOI • reference for this paper ※ doi:10.18429/JACoW-ICALEPCS2023-THSDSC01  
About • Received ※ 30 September 2023 — Revised ※ 11 October 2023 — Accepted ※ 06 December 2023 — Issued ※ 09 December 2023
Cite • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)