Skip to content

Latest commit

 

History

History
157 lines (120 loc) · 8.68 KB

index.md

File metadata and controls

157 lines (120 loc) · 8.68 KB

Introduction to single-cell RNA-seq data analysis

16th, 17th and 20th May 2024 || 09:30 - 17:30

In-person at the Craik Marshall training room (map)

Instructors

  • Invited speaker: Katarzyna Kania - CoSyne Therapeutics

  • Adam Reid - Bioinformatics Core, Gurdon Institute

  • Ash Sawle - Bioinformatics Core, Cancer Research UK Cambridge Institute

  • Betty Wang - Department Clinical Neurosciences

  • Chandra Chilamakuri - Bioinformatics Core, Cancer Research UK Cambridge Institute

  • Hugo Tavares - Bioinformatics Training Facility, Cambridge

  • Jiawei Wang - EBI

Outline

This workshop is aimed at biologists interested in learning how to perform standard single-cell RNA-seq analyses.

This will focus on the droplet-based assay by 10X genomics and include running the accompanying cellranger pipeline to align reads to a genome reference and count the number of read per gene, reading the count data into R, quality control, normalisation, data set integration, clustering and identification of cluster marker genes, as well as differential expression and abundance analyses. You will also learn how to generate common plots for analysis and visualisation of gene expression data, such as TSNE, UMAP and violin plots.

Prerequisites

Some basic experience of using a UNIX/LINUX command line is assumed

Some R knowledge is assumed and essential. Without it, you will struggle on this course. If you are not familiar with the R statistical programming language we strongly encourage you to work through an introductory R course before attempting these materials. We recommend our Introduction to R course.

Data

  • The course data is based on 'CaronBourque2020' relating to pediatric leukemia, with four sample types, including:
    • pediatric Bone Marrow Mononuclear Cells (PBMMCs)
    • three tumour types: ETV6-RUNX1, HHD, PRE-T
  • The data used in the course can be downloaded from Dropbox. Please note that:
    • these data have been processed for teaching purposes and are therefore not suitable for research use;
    • all the data is provided on our training machines, you don't need to download it to attend the course.

Schedule

PDF of materials: if you want a PDF version of the materials go to the "Print" option on your browser and select "Print to PDF" (all major browsers have this functionality).

Day 1

Trainers: Adam, Chandra, Jiawei

  • 09:30 - 09:40 Welcome - Hugo
  • 09:40 - 10:25 Introduction to Single Cell Technologies - Katarzyna Kania
  • 10:25 - 10:30 - Break
  • 10:30 - 10:40 Preamble: data set and workflow - Chandra
  • 10:40 - 12:30 Library structure, cellranger for alignment and cell calling - Chandra
  • 12:30 - 13:30 Lunch break
  • 13:30 - 17:00 QC and exploratory analysis - Adam

Day 2

Trainers: Adam, Chandra, Hugo

Day 3

Trainers: Ash, Betty, Hugo, Jiawei

Extended Materials

Software Installation

You can make use of the computers in the Training Room, which are ready for use and have the necessary data & software installed. However, if you want to run the analysis on your own computer, you can follow these instructions.

  • Download and install R: https://cloud.r-project.org/
  • Download and install RStudio: https://www.rstudio.com/products/rstudio/download/#download
  • Open RStudio and run the following commands from the console:
    install.packages("BiocManager")
    BiocManager::install(c("AnnotationHub", "BiocParallel", "BiocSingular", "DropletUtils", 
                           "PCAtools", "batchelor", "bluster", "cluster", "clustree", 
                           "dynamicTreeCut", "edgeR", "ensembldb", "ggplot2", "igraph", 
                           "patchwork", "pheatmap", "scater", "scran", "miloR", "tidyverse"))
    # due to a bug, reinstall this package after all the above
    install.packages("irlba", type = "source", force = TRUE)

For Cellranger, you will need to use a Linux machine. See the installation instructions from 10x Genomics.

Acknowledgments:

Much of the material in this course has been derived from the demonstrations found in OSCA book and the Hemberg Group course materials. Additional material concerning miloR has been based on the demonstration from the Marioni Lab.

The materials have been contributed to by many individuals over the last 2 years, including:

Abigail Edwards, Ashley D Sawle, Chandra Chilamakuri, Kamal Kishore, Stephane Ballereau, Zeynep Kalendar Atak, Hugo Tavares, Jon Price, Katarzyna Kania, Roderik Kortlever, Adam Reid, Tom Smith

Apologies if we have missed anyone!