AI_INFN Technical Meeting


Date: 2024-06-17

Tracked developments:

:arrow_forward: Automation of RKE2 deployments in INFN Cloud

  • NTR

:arrow_forward: Develop monitoring and accounting infrastructure (R. Petrini)

  • [Open issue for the past two weeks] The migration to the database developed by Nadir is still failing: when writing “TYPES”, the SQL console hangs indefinitely. We rolled back to the previous database and will try again as soon as possible.

:arrow_forward: Environment setup (M. Barbetti, S. Giagu, S. Bordoni, L. Cappelli)

  • Quantum environments: until last week there were two such environments.
    • One on CUDA 11.8, in which TF, Jax and Torch coexisted but Pennylane did not work.
    • Jax changed its releases in a way that prevents Jax and PyTorch from coexisting (they removed the release that uses cuDNN).
    • We are moving to CUDA 12: we now have two environments based on it, in which Jupyter seems to work.
    • The Pennylane issue is resolved: PyTorch+Jax (but without TensorFlow).
  • Numpy 2.0.0 broke everything…
    • This breaks most conda environments in which numpy<2.00 was not pinned in advance.
  • We agree that we should move forward with two different environments: one with Torch and quantum, and one with TensorFlow and quantum.
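
A minimal sketch of a smoke test that could be run inside each environment to catch the two problems noted above: a framework that is missing, and a NumPy release that escaped the `numpy<2` pin. The list of module names and all helper functions are illustrative assumptions, not part of the actual environment setup.

```python
from importlib import metadata, util

# Import names of the frameworks mentioned in the minutes (assumed list;
# adjust it to whatever a given environment is expected to provide).
FRAMEWORKS = ["torch", "jax", "tensorflow", "pennylane"]


def major_version(version: str) -> int:
    """Return the leading integer of a version string such as '1.26.4'."""
    return int(version.split(".")[0])


def satisfies_numpy_pin(version: str) -> bool:
    """True if the given NumPy version respects a 'numpy<2' pin."""
    return major_version(version) < 2


def missing_modules(names):
    """List the modules in `names` that cannot be found by this interpreter."""
    return [name for name in names if util.find_spec(name) is None]


def installed_numpy_version():
    """Return the installed NumPy version string, or None if it is absent."""
    try:
        return metadata.version("numpy")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Report frameworks that are not installed in the current environment.
    print("missing frameworks:", missing_modules(FRAMEWORKS))

    # Report whether the installed NumPy (if any) is still below 2.
    version = installed_numpy_version()
    if version is None:
        print("numpy is not installed")
    else:
        print("numpy", version, "respects numpy<2:", satisfies_numpy_pin(version))
```

The check only looks modules up without importing them, so it stays fast and does not touch the GPU. It confirms that a framework is installed, not that it actually works with the installed CUDA libraries.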

:arrow_forward: Offloading tests with virtual kubelets (G. Bianchini, D. Ciangottini)

  • CloudVeneto is in downtime this week.

:arrow_forward: FPGA purchase

  • NTR

Status legend

:arrow_forward: Active
:fast_forward: Priority
:bangbang: Problems
:parking: Postponed or Blocked by others
:white_check_mark: Completed

    • 4:00 PM - 4:15 PM
      News and setup (15m)
      Speaker: Lucio Anderlini (Istituto Nazionale di Fisica Nucleare)
    • 4:15 PM - 4:35 PM
      Discussion: Take-home messages from the first user's forum (20m)
    • 4:35 PM - 4:50 PM
      Discussion on tasks and priorities (15m)
      Speaker: All
    • 4:50 PM - 5:00 PM
      Any other business (10m)