-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Refactoring, arg list handling, add two parameters (#11)
* (1) Fix handling of ks_peaks.tsv; (2) shorten 19_pan_aug_leftover_merged_prot to 19_palm to reduce length of arglist; (3) move small families to the side in alignment step; (4) correct printing of header for counts per accession * MANY changes in the course of moving alignment & modeling steps to -common.sh. Likely breakages. Incomplete. * Various fixes in alignment and modeling steps, after moving this code to -common * Handle argument list too long errors around augment_cluster_sets.awk * Add Wm82.gnm6 annotation to Glycine pangene config & make file * Remove 04_dag at start of filter, to avoid problems of leftover files from previous runs * Minor cosmetic change in stats/ks_histplots.tsv * Add two params, min_align_count min_annots_in_align, to the reporting in the summarize step * Fix listing of pandagma_conf_params reported in summarize step, and other minor cleanup * Add two alignment parameters to config files; and minor changes to pandagma-fsup.sh to work with the pandagma driver script and pandagma-common.sh * Changes in step xfr_aligns_trees to handle output from fsup * Remove optional steps align_cds, align_protein, model_and_trim, calc_trees, and xfr_aligns_trees from commandlist in -pan and -fam workflows * Bug fix & documentation tweaks in pandagma-fsup.sh * Minor change to handling of directory/file transfer in pandagma-fsup.sh * Minor change in stats reporting * Restructure ks_filter step so that ks_block_wgd_cutoff and max_pair_ks are used if stats/ks_histplots.tsv isn't provided * Handle transfer of 19_pan_aug_leftover_merged_prot in -common rather than in -fam * Tweak output of counts-per-accession header in stats.txt file
- Loading branch information
1 parent
80556c0
commit 030b8ad
Showing
30 changed files
with
796 additions
and
246 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,75 @@ | ||
#!/bin/bash | ||
#SBATCH -A m4440 | ||
#SBATCH -q regular | ||
#SBATCH -N 1 | ||
#SBATCH -n 30 # number of cores/tasks in this job | ||
#SBATCH -t 23:00:00 | ||
#SBATCH -C cpu | ||
#SBATCH -J pand-fam2 | ||
#SBATCH -o %x_%j.out | ||
#SBATCH -e %x_%j.err | ||
|
||
set -o errexit | ||
set -o nounset | ||
set -o xtrace | ||
|
||
date # print timestamp | ||
|
||
# If using conda environment for dependencies: | ||
module load conda | ||
conda activate pandagma | ||
|
||
PDGPATH=$PWD | ||
CONFIG=$PWD/config/family3_22_3.conf | ||
|
||
echo "Config: $CONFIG" | ||
|
||
export PATH=$PATH:$PDGPATH/bin | ||
echo "PATH: $PATH" | ||
|
||
########## | ||
# Test PATH | ||
which pandagma | ||
which calc_ks_from_dag.pl | ||
|
||
########## | ||
## Fetch relevant data files; e.g. | ||
#mkdir -p data | ||
#make -C data -f $PWD/get_data/family3_22_3.mk | ||
|
||
########## | ||
## Filter transposable elements | ||
#pandagma TEfilter -c $CONFIG | ||
|
||
########## | ||
## Run all main steps, assuming input data files exist in ./data | ||
## Work directory will be ./work_pandagma | ||
## Output will go to ./out_pandagma | ||
#pandagma fam -c $CONFIG -d data_TEfilter | ||
|
||
########## | ||
## Run individual steps | ||
#pandagma fam -c $CONFIG -s ingest -d data_TEfilter | ||
#pandagma fam -c $CONFIG -s mmseqs | ||
#pandagma fam -c $CONFIG -s filter | ||
#pandagma fam -c $CONFIG -s dagchainer | ||
#pandagma fam -c $CONFIG -s ks_calc | ||
#pandagma fam -c $CONFIG -s ks_filter | ||
#pandagma fam -c $CONFIG -s mcl | ||
#pandagma fam -c $CONFIG -s consense | ||
#pandagma fam -c $CONFIG -s cluster_rest | ||
#pandagma fam -c $CONFIG -s add_extra | ||
#pandagma fam -c $CONFIG -s tabularize | ||
#pandagma fam -c $CONFIG -s align_protein | ||
#pandagma fam -c $CONFIG -s model_and_trim | ||
#pandagma fam -c $CONFIG -s calc_trees | ||
pandagma fam -c $CONFIG -s xfr_aligns_trees | ||
pandagma fam -c $CONFIG -s summarize | ||
|
||
########## | ||
## Optional work-directory cleanup steps | ||
#pandagma fam -c $CONFIG -s clean | ||
#rm -rf ./work_pandagma | ||
|
||
date # print timestamp | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.