Skip to content

This workshop is designed to give beginners a solid foundation in Python programming for data analysis and visualization, with a specific focus on applications in cancer biology

License

Notifications You must be signed in to change notification settings

PMacDaSci/IntroPython-pandas

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

This workshop is designed to give beginners a solid foundation in Python programming, with a specific focus on applications in cancer biology. Participants will gain a thorough understanding of essential programming concepts through a blend of theoretical lessons, hands-on coding exercises, and practical applications.

The workshop will cover essential programming concepts and gradually introduce more advanced topics, with a focus on using the Pandas library for efficient data handling and analysis and matplotlib library for data visualization. By the end of the workshop, attendees will be equipped with the skills to enhance the reproducibility and efficiency of scientific research through powerful data analysis tools and effective visualization techniques.

Instructor

Sanduni Rajapaksa

Learning Objectives

Participants will gain the following skills:

  • Proficiency in using Python for data analysis.
  • Basic Python programming skills.
  • Reading, tidying, and joining datasets using pandas library.
  • Data manipulation and transformation using pandas library.
  • Creating various types of plots using matplotlib library.

Data

The Metabric study characterized the genomic mutations and gene expression profiles for 2509 primary breast tumours. In addition to the gene expression data generated using microarrays, genome-wide copy number profiles were obtained using SNP microarrays. Targeted sequencing was performed for 2509 primary breast tumours, along with 548 matched normals, using a panel of 173 of the most frequently mutated breast cancer genes as part of the Metabric study.

References:

Both the clinical data and the gene expression values were downloaded from cBioPortal.

We excluded observations for patient tumor samples lacking expression data, resulting in a data set with fewer rows.

Credits and Acknowledgement

These content were adapted from the following course materials:


About

This workshop is designed to give beginners a solid foundation in Python programming for data analysis and visualization, with a specific focus on applications in cancer biology

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • SCSS 52.8%
  • CSS 47.2%