Artificial intelligence and modern physics: a two-way connection

Name: Artificial intelligence and modern physics: a two-way connection
Start: 2024-09-29T08:00:00+02:00
End: 2024-10-05T19:00:00+02:00
Location: Monopoli (BA)

29 September 2024 to 5 October 2024

Monopoli (BA)

Europe/Rome timezone

Contact

aiphy@unimib.it

Integrating Spliced and Unspliced Gene Expression Data for Improved Cell Type Annotation in Single-Cell RNA Sequencing

4 Oct 2024, 10:40

20m

Monopoli (BA)

Resort Porto Giardino

Hackathon project proposal Hackathon

Cell type annotation is one of the primary tasks in single-cell RNA sequencing analysis and it is of a significance importance and difficulty in computational biology. This difficulty arises from two primary factors. First, gene expression levels typically exist on a continuum rather than being discrete, making it harder to draw clear boundaries between different cell types. Second, variations in gene expression don't always correlate with functional differences at the cellular level, adding complexity to the classification process.
Given these challenges, a more precise and reliable method for cell type annotation is always welcomed. We propose an approach that integrates additional layers of biological data, mainly spliced and unspliced expression, to look at gene expression estimates from a different perspective. By combining these pseudo-multi-modal layers of information, we might achieve more accurate cell type classification and better capture the functional nuances of different cell populations.

Machine learning methods

We will use community detection algorithm such as Louvain or Leiden algorithm to do unsupervised clustering for cells. Then we run statistical tests to retrieve genes associated to each cluster and annotate the cell accordingly.

Project proposal: general context

This project proposal focuses on computational biology, particularly the primary analysis of single-cell RNA sequencing (scRNA-seq) data and cell type annotation. scRNA-seq is essential for exploring gene expression at the single-cell level, enabling the identification of distinct cellular populations and their functional roles.

Goal and FOM

We will use an already annotated dataset and compare the accuracy of annotation to that

Input dataset

We will provide dataset for the hackathon. The data type is gene expression matrices and metadata, all stored and encoded in a well optimized data structure.

Project proposal: description of the problem

The primary challenges in cell type annotation using single-cell RNA sequencing (scRNA-seq) arise from two main factors. First, gene expression levels exist on a continuum rather than discrete categories, making it difficult to delineate clear boundaries between different cell types. Second, variations in gene expression do not always correlate with functional differences at the cellular level, adding complexity to the classification process.

These challenges lead to potential inaccuracies in identifying cell types, which can hinder our understanding of cellular functions and the underlying biological processes. Developing a more reliable method that integrates diverse data types, such as spliced and unspliced gene expression, is essential to address these issues effectively.

Guido Sanguinetti (SISSA) Idris Kouadri Boudjelthia

Dataset and introductory notebook

Artificial intelligence and modern physics: a two-way connection

Contact

Integrating Spliced and Unspliced Gene Expression Data for Improved Cell Type Annotation in Single-Cell RNA Sequencing

Monopoli (BA)

Description

Machine learning methods

Project proposal: general context

Goal and FOM

Input dataset

Project proposal: description of the problem

Primary authors

Presentation materials

Choose timezone

Artificial intelligence and modern physics: a two-way connection

Contact

Description

Machine learning methods

Project proposal: general context

Goal and FOM

Input dataset

Project proposal: description of the problem

Primary authors

Presentation materials