11–12 Jun 2024
Plesso Berti Pichat
Europe/Rome timezone

Hyperparameter Optimization for Deep Learning Models Using High Performance Computing

12 Jun 2024, 11:25
25m
Room BP-2B (Plesso Berti Pichat)

Room BP-2B

Plesso Berti Pichat

Viale Carlo Berti Pichat 6, 40127, Bologna

Speaker

Muhammad Numan Anwar (Politecnico di Bari and Istituto Nazionale di Fisica Nucleare Bari, Italy)

Description

Clusters counting in a drift chamber represents the most promising breakthrough in particle identification (PID) techniques in particle physics experiments. In this paper, neural network models, such as the Long Short-Term Memory (LSTM) Model and Convolutional Neural Network (CNN) Model, are trained using various hyperparameters like loss functions, activation functions, different numbers of neurons, batch sizes, and varying numbers of epochs etc. These models are trained for a two-step reconstruction algorithm, which involves peak finding and clusterization. For the peak finding algorithm, a trained Long Short-Term Memory (LSTM) model is used to discriminate between ionization signals (primary and secondary peaks) and noise in the waveform, addressing a classification problem. Concurrently, a Convolutional Neural Network model is utilized to determine the number of primary ionization clusters based on the detected peaks, dealing with a regression problem. The trained models (LSTM and CNN) are applied to the simulations of particles traversing a gas mixture made of 90% Helium (He) and 10% Isobutane (C4H10) filling drift tubes with the same geometry as the ones used for the beam test at CERN in 2023 of the prototype of the IDEA detector for FCC. The simulation parameters included a cell size of 1.5 cm, a sampling rate of 1.2 GHz, a time window of 2000 ns, 5000 events, a number of ionization clusters with a mean value of 25.25, a number of electrons per cluster with a mean value of 1.468, and momentum of pi- meson particles ranging from 4 to 180GeV/c.

Primary authors

Domenico Diacono (Politecnico di Bari and Istituto Nazionale di Fisica Nucleare Bari, Italy) Francesco Grancagnolo (Istituto Nazionle di Fisica Nucleare Lecce, Italy) Guang Zhao (Institute of High Energy Physics, 19B Yuquan Road, Beijing, 100049, Beijing, China) Linghui Wu (Institute of High Energy Physics, 19B Yuquan Road, Beijing, 100049, Beijing, China) Marcello Abbrescia (Politecnico di Bari and Istituto Nazionale di Fisica Nucleare Bari, Italy) Mingyi Dong (Institute of High Energy Physics, 19B Yuquan Road, Beijing, 100049, Beijing, China) Muhammad Numan Anwar (Politecnico di Bari and Istituto Nazionale di Fisica Nucleare Bari, Italy) Nicola De Filippis (Politecnico di Bari and Istituto Nazionale di Fisica Nucleare Bari, Italy) Shengsen Sun (Institute of High Energy Physics, 19B Yuquan Road, Beijing, 100049, Beijing, China)

Presentation materials