PhD Days

Name: PhD Days
Start: 2025-10-08T13:55:00+02:00
End: 2025-10-15T18:00:00+02:00
Location: Università degli Studi di Roma Tor Vergata

Europe/Rome timezone

Exploring Corporate Financial Statements with Machine Learning: Sentiment, Structure, and Retrieval

15 Oct 2025, 16:00

20m

Aula Grassano

Daniele Picano

Corporate financial statements provide a comprehensive summary of a company’s annual performance, but they also reflect writing biases, shaped both by the author and by the historical moment in which they are produced. Sentiment analysis can help uncover these biases by classifying the tone of the text on a scale from positive to negative. Neural networks make this possible, from LSTMs to more advanced Transformer-based models.
Despite being standardized, financial statements also present structural biases. Information of interest is not always easy to locate, and even keyword searches often fall short, because these documents cover several related topics, which makes the text inherently complex. The goal of this research is to fine-tune an open-source language model (LM) on a database of financial documents built from scratch, in order to build a Retrieval-Augmented Generation (RAG) pipeline. By posing questions (queries) to the model, it is possible to identify specific topics within the documents and generate coherent, context-aware answers.
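As a rough illustration of the retrieval step in a RAG pipeline, the sketch below ranks passages against a query and assembles a prompt for a language model. It is not the method presented in the talk: it swaps in plain term-frequency cosine similarity for learned embeddings, makes no call to any model, and the passages and the function names (`retrieve`, `build_prompt`) are hypothetical and invented for this example.

```python
import math
import re
from collections import Counter

# Made-up passages for illustration only; not taken from any real filing.
PASSAGES = [
    "Net revenue increased compared with the previous year.",
    "The board approved a dividend for shareholders.",
    "Liquidity risk is managed through committed credit lines.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, passages, k=2):
    """Return up to k passages that share terms with the query, best first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(p))), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [p for score, p in scored[:k] if score > 0]


def build_prompt(query, passages, k=2):
    """Assemble the text that would be handed to a language model."""
    context = "\n".join(f"- {p}" for p in retrieve(query, passages, k))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


if __name__ == "__main__":
    print(build_prompt("How is liquidity risk managed?", PASSAGES, k=1))
```

In a full pipeline, the term-frequency vectors would typically be replaced by dense embeddings, and the prompt would be passed to the fine-tuned model to generate the answer.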

PhDdays_DanielePicano.pdf
