Improved version of Next Generation Sequencing data Capture Assessment Tool (ngsCAT)

babelomics/ngscat2


ngsCAT2: a tool to assess the efficiency of targeted enrichment sequencing

Requirements

These instructions are for Debian-based Linux distributions such as Ubuntu, Linux Mint and Lubuntu.

Python Packages Requirements

All required standard Python packages are specified in the requirements.txt and setup.py files.

Requirements

Python 3.5 or later is required to run ngsCAT2.

In Ubuntu, Mint and Debian you can install Python 3 and samtools like this:

$ sudo apt-get install python3 python3-pip
$ sudo apt-get install samtools=1.7-1
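
Before installing, it can help to confirm that the prerequisites above are in place. The following is a minimal sketch and not part of ngsCAT2: `check_prerequisites` is a hypothetical helper that compares the running interpreter against the Python 3.5 minimum stated above and looks for a `samtools` executable on the PATH. It does not verify the samtools version.

```python
import shutil
import sys

MIN_PYTHON = (3, 5)  # minimum version stated in this README


def check_prerequisites(version_info=None, which=shutil.which):
    """Hypothetical helper: report whether the README prerequisites look satisfied."""
    if version_info is None:
        version_info = sys.version_info
    return {
        # Compare only (major, minor) against the documented minimum.
        "python_ok": tuple(version_info[:2]) >= MIN_PYTHON,
        # shutil.which returns None when the executable is not on the PATH.
        "samtools_found": which("samtools") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print("{}: {}".format(name, "ok" if ok else "MISSING"))
```

The `version_info` and `which` parameters exist only so the check can be exercised without changing the local environment.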

Installation

To install the tool, run:

pip3 install git+https://github.com/alegarsan2/ngsCAT2@master

Usage instructions

Usage: 	
       	****************************************************************************************************************
       	Task: Assesses capture performance in terms of sensibility, specificity and uniformity of the coverage.
       	Output: An html report will be created at the path indicated with the --out option.
       	*****************************************************************************************************************
       	usage: ngscat2 --bams <filename> --bed <filename> --out <path> --annotation <filename> --reference <filename>  --tmp <path> --threads <integer>

Options:
  -h, --help            show this help message and exit
  --bams=BAMS           Required. Comma separated list of bam files (2
                        maximum). E.g.: --bams
                        /home/user/bam1.sorted.bam,/home/user/bam2.sorted.bam
  --bed=BED             Required. Full path to the bed file containing the
                        target regions.
  --out=OUTDIR          Required. Full path to the directory where results
                        will be saved.
  --reference=REFERENCE
                        Optional. String indicating the path to a .fasta file
                        containing the reference chromosomes. Default=None.
  --annotation=ANNOTATION
                        Optional. String indicating the path to a .bed file
                        containing annotated regions . Default=None.
  --coveragethrs=COVERAGETHRESHOLDS
                        Optional. Comma separated list of real numbers (do not
                        leave spaces between) indicating coverage thresholds
                        to be used when calculating percentages of covered
                        bases (first graph in the report).
                        Default=1,5,10,20,30.
  --tmp=TMP             Optional. String indicating the full path to a
                        temporary directory where temporary files will be
                        created. Default=/tmp/.
  --threads=NTHREADS    Optional. Integer indicating the number of concurrent
                        threads to launch. Default=cpu_count() - 1.
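
When ngsCAT2 is called from a script, the options above can be assembled programmatically. The sketch below is an illustration only: `build_ngscat2_command` is a hypothetical helper, not part of ngsCAT2. It assumes the `ngscat2` executable is on the PATH after installation and uses only the options listed in the help text, enforcing the two-file limit on `--bams` and joining thresholds with commas and no spaces.

```python
def build_ngscat2_command(bams, bed, out, reference=None, annotation=None,
                          coveragethrs=None, tmp=None, threads=None):
    """Hypothetical helper: assemble an ngscat2 argument list from the documented options."""
    bams = list(bams)
    if not 1 <= len(bams) <= 2:
        raise ValueError("--bams accepts one or two BAM files")

    # Required options.
    cmd = ["ngscat2", "--bams", ",".join(bams), "--bed", bed, "--out", out]

    # Optional options are appended only when a value is given,
    # so ngscat2 falls back to its own defaults otherwise.
    if reference is not None:
        cmd += ["--reference", reference]
    if annotation is not None:
        cmd += ["--annotation", annotation]
    if coveragethrs is not None:
        # Comma-separated, with no spaces between values.
        cmd += ["--coveragethrs", ",".join(str(t) for t in coveragethrs)]
    if tmp is not None:
        cmd += ["--tmp", tmp]
    if threads is not None:
        cmd += ["--threads", str(threads)]
    return cmd


if __name__ == "__main__":
    # The BED and output paths below are placeholders.
    example = build_ngscat2_command(
        ["/home/user/bam1.sorted.bam", "/home/user/bam2.sorted.bam"],
        bed="/home/user/targets.bed",
        out="/home/user/ngscat2_report",
        coveragethrs=[1, 5, 10, 20, 30],
    )
    print(" ".join(example))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` to launch the analysis. Building a list rather than a single string avoids shell quoting problems with paths that contain spaces.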

Input data

An example of input data can be downloaded here.

Output example

Exome example: https://mega.nz/#!eywTGCBL!6HTDl7J9eLY4VhX2aFfe57Cp9mvwLw4JTcDcHr-zu1A
