Description
The transformer model, introduced by Google researchers in 2017, has become the reference architecture in natural language processing (NLP). It represents a significant advance, departing entirely from the recurrence and convolution mechanisms of Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs).
The features that contribute to the superior performance of transformers on NLP tasks include self-attention, multi-head attention, and positional encoding. The attention mechanism allows the model to capture dependency relationships between words, while positional encoding injects information about each word's position in the sequence.
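As a minimal sketch of the two mechanisms mentioned above, the following Python/NumPy code computes scaled dot-product self-attention and sinusoidal positional encodings; the matrix names and dimensions are illustrative and not taken from the talk.

import numpy as np

def self_attention(X, Wq, Wk, Wv):
    # X: (seq_len, d_model); Wq/Wk/Wv project tokens to queries, keys, values.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # pairwise word-to-word dependencies
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over positions
    return weights @ V                               # context-aware representations

def positional_encoding(seq_len, d_model):
    # Sinusoidal encoding: each position gets a unique pattern of sines and cosines.
    pos = np.arange(seq_len)[:, None]
    i = np.arange(d_model)[None, :]
    angles = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])
    pe[:, 1::2] = np.cos(angles[:, 1::2])
    return pe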
Systematic literature reviews (SLRs) help scientists identify, select, assess, and synthesize studies relevant to a specific topic. A crucial phase in conducting an SLR is screening the large number of articles retrieved from various databases. This screening is particularly challenging given the number of papers to be reviewed, which makes it essential to automate the activity with text classification.
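As an illustration of what such automated screening could look like, the sketch below runs abstracts through a fine-tuned transformer classifier via the Hugging Face pipeline API; the checkpoint path, label names, example abstracts, and threshold are placeholders for illustration, not the models or settings used in this work.

from transformers import pipeline

# Hypothetical fine-tuned screening classifier (placeholder checkpoint path).
screener = pipeline("text-classification", model="path/to/fine-tuned-screening-model")

abstracts = [
    "Effects of telerehabilitation on post-stroke motor recovery...",
    "A survey of quantum error correction codes...",
]

for abstract in abstracts:
    result = screener(abstract, truncation=True)[0]
    # Keep papers the classifier labels as relevant to the review topic.
    if result["label"] == "RELEVANT" and result["score"] > 0.5:
        print("include:", abstract[:60])
    else:
        print("exclude:", abstract[:60])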
In this context, we have employed transformer-based models to identify the primary topics covered in papers relevant to specific research areas and to group those papers into clusters. The primary aim is to develop an automated procedure that supports the scientific community's search efforts by extracting pertinent information from large collections of documents.
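A minimal sketch of this kind of grouping is given below, assuming a generic sentence encoder and k-means clustering; the checkpoint, example titles, and number of clusters are assumptions made for the example and do not reflect the actual pipeline described here.

from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

titles = [
    "BERT-based triage of COVID-19 clinical case reports",
    "Robot-assisted gait training after spinal cord injury",
    "Vaccine efficacy against SARS-CoV-2 variants",
    "Wearable sensors for upper-limb rehabilitation",
]

encoder = SentenceTransformer("all-MiniLM-L6-v2")   # generic sentence encoder, stand-in choice
embeddings = encoder.encode(titles)                  # one vector per paper

kmeans = KMeans(n_clusters=2, random_state=0, n_init=10).fit(embeddings)
for title, cluster in zip(titles, kmeans.labels_):
    print(cluster, title)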
During the study, we have used a range of transformer implementations, predominantly deriving from BERT, tailored to specific use cases including COVID-19, rehabilitation, and physics. Within the context of COVID-19, we have deployed models like BioBERT and PubMedELECTRA, achieving remarkable performance, particularly with PubMedBERT-large achieving a 0.9021 F1-score using 5-fold cross-validation.
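The evaluation protocol can be sketched as follows; fine_tune_and_predict and load_labelled_corpus are hypothetical placeholders standing in for fine-tuning one of the BERT-derived checkpoints on each training fold and for loading the labelled corpus, and the macro averaging of the F1-score is an assumption rather than the metric variant reported above.

import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import f1_score

def fine_tune_and_predict(train_texts, train_labels, test_texts):
    # Placeholder: fine-tune a domain-specific BERT variant on the training
    # fold and return predicted labels for the held-out fold.
    raise NotImplementedError

texts, labels = load_labelled_corpus()   # hypothetical loader: titles/abstracts and their labels
texts, labels = np.array(texts), np.array(labels)

scores = []
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for train_idx, test_idx in cv.split(texts, labels):
    preds = fine_tune_and_predict(texts[train_idx], labels[train_idx], texts[test_idx])
    scores.append(f1_score(labels[test_idx], preds, average="macro"))  # averaging choice is an assumption

print("mean 5-fold F1:", np.mean(scores))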