16–20 Jun 2025
THotel, Cagliari, Sardinia, Italy
Europe/Rome timezone

🎙️ Inference optimization with Memory Management and GPU Acceleration in TMVA SOFIE

17 Jun 2025, 12:06
20m
T3a

Parallel talk · Real-Time Data Processing

Speaker

Sanjiban Sengupta (CERN)

Description

Within ROOT/TMVA, we have developed SOFIE - System for Optimized Fast Inference code Emit - an engine designed to convert externally trained deep learning models—such as those in ONNX, Keras, or PyTorch formats—into optimized C++ code for fast inference. The generated code features minimal dependencies, ensuring seamless integration into the data processing and analysis workflows of high-energy physics experiments.
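As a minimal sketch of this conversion step (the class and method names follow ROOT's TMVA::Experimental::SOFIE parser API; "model.onnx" and the output file name are placeholders for an actual trained model):

    // Parse an ONNX model and emit a self-contained C++ inference header.
    #include "TMVA/RModel.hxx"
    #include "TMVA/RModelParser_ONNX.hxx"

    using namespace TMVA::Experimental;

    void convert() {
       SOFIE::RModelParser_ONNX parser;
       SOFIE::RModel model = parser.Parse("model.onnx"); // read the trained network
       model.Generate();                                 // build the optimized inference code
       model.OutputGenerated("model.hxx");               // write the generated C++ header
    }

The emitted header exposes a Session class, named after the model, whose infer method runs the network on a plain input buffer, which is what makes integration into an experiment's C++ framework straightforward.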
SOFIE now covers a comprehensive range of machine learning operators as defined by the ONNX standard, and also supports the translation and inference of Graph Neural Networks trained with DeepMind's Graph Nets library.
Recent advancements in SOFIE include memory optimizations that enable efficient reuse of intermediate tensor data during inference, significantly reducing memory overhead. Additionally, SOFIE now incorporates enhanced GPU acceleration, supporting stacks such as SYCL, which provide abstractions over platforms like CUDA and ROCm. These improvements result in a runtime-efficient and user-friendly machine learning inference engine, competitive with other state-of-the-art solutions.
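The buffer-reuse idea can be illustrated with a small self-contained sketch. This is not SOFIE's actual implementation, only a greedy lifetime-based plan over one shared arena; the names (Tensor, planArena) and the numbers in main are hypothetical:

    // Recycle an intermediate tensor's memory once the last operator
    // that reads it has run, instead of allocating every tensor separately.
    #include <cstddef>
    #include <cstdio>
    #include <vector>

    struct Tensor { std::size_t bytes; int firstUse, lastUse; };

    // Greedily assign each tensor an offset in one shared arena,
    // reusing regions whose tensors are no longer live.
    std::size_t planArena(const std::vector<Tensor>& tensors,
                          std::vector<std::size_t>& offsets) {
       struct Region { std::size_t offset, bytes; int freeAfter; };
       std::vector<Region> regions;
       std::size_t arenaSize = 0;
       offsets.resize(tensors.size());
       for (std::size_t i = 0; i < tensors.size(); ++i) {
          const Tensor& t = tensors[i];
          bool reused = false;
          for (Region& r : regions) {           // try to recycle a dead region
             if (r.freeAfter < t.firstUse && r.bytes >= t.bytes) {
                offsets[i] = r.offset;
                r.freeAfter = t.lastUse;
                reused = true;
                break;
             }
          }
          if (!reused) {                        // otherwise grow the arena
             offsets[i] = arenaSize;
             regions.push_back({arenaSize, t.bytes, t.lastUse});
             arenaSize += t.bytes;
          }
       }
       return arenaSize;                        // total bytes needed at peak
    }

    int main() {
       // Three intermediate tensors: (size, first use, last use).
       // The third can reuse the first's region, so the arena needs
       // 3072 bytes rather than 4096.
       std::vector<Tensor> ts = {{1024, 0, 1}, {2048, 1, 2}, {1024, 2, 3}};
       std::vector<std::size_t> off;
       std::printf("arena: %zu bytes\n", planArena(ts, off));
       return 0;
    }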
This work highlights the latest developments in SOFIE, focusing on its memory optimization capabilities and GPU acceleration enhancements, which collectively deliver efficient inference performance for HEP applications.

AI keywords: Fast ML Inference; ML Software; Next Generation Trigger Project; GPU
