AI_INFN Technical Meeting

Europe/Rome
Descrizione

Virtual meeting room (zoom): https://l.infn.it/ai-infn-meeting

Date: 2024-06-24

Last week we had an incident with the platform due to failure in unmounting fuse in test jobs.
The nodes needed reboot. Rebooting broke calico.
The nodes needed to be re-built. Rebuilding broke CUDA.
GPU operator has been updated.

Tracked developments:

:arrow_forward: Automation of RKE2 deployments in INFN Cloud

  • NTR

:arrow_forward: Develop monitoring and accounting infrastructure (R. Petrini)

  • We need an automated backup of the postgres database used for the accounting

:arrow_forward: Environment setup (M. Barbetti, S. Giagu, S. Bordoni, L. Cappelli)

  • conda lascia sporco l’environment; la soluzione è usare unset e fare pulizia.
  • una volta fatti i bottoni, ri-pinghiamo il WP4.
  • scriveremo da qualche parte nella documentazione come ripulire gli environment per i prossimi utenti.

:arrow_forward: Offloading tests with virtual kubelets (G. Bianchini, D. Ciangottini)

  • Docker-plugin has been integrated in the interlink-CE.

:arrow_forward: Acquisto FPGA

  • NTR

:arrow_forward: Advanced Hackathon

  • Andrea Paccagnella proposes Padova.
  • Possible week: 25 November 2024
  • Asked the secretariat for the rooms. Waiting for reply.

Status legend

:arrow_forward: Active
:fast_forward: Priority
:bangbang: Problems
:parking: Postponed or Blocked by others
:white_check_mark: Completed

Ci sono verbali allegati a questo evento. Mostrali.
    • 16:00 16:15
      News and setup 15m
      Relatore: Lucio Anderlini (Istituto Nazionale di Fisica Nucleare)
    • 16:15 16:50
      Discussion on tasks and priorities 35m
      Relatore: All
    • 16:50 17:00
      Any other business 10m