1. Include Trimmomatic as an Additional Preprocessing Tool
Currently, the pipeline supports quality trimming only with fastp. While fastp is very efficient, Trimmomatic may offer advantages for specific datasets, such as more robust handling of paired-end reads or finer control over trimming parameters.
Adding Trimmomatic as an option would let users choose the tool that best fits their dataset's requirements.
An optional Trimmomatic step may also attract users who are accustomed to, or prefer, this tool for preprocessing.
2. Contigs parameter for binning
Scaffolds may introduce errors when contigs are misjoined during linking.
Since contigs are the assembler's output before any scaffolding, they may provide a more realistic representation for binning based on both sequence composition and coverage.
Scaffolding relies on assumptions that may not hold for complex metagenomes, potentially biasing binning results.
3. New CONCOCT default parameters
Currently, CONCOCT's cut_up_fasta.py script is set to chunk contigs with the parameters -c 1999 -o 1900.
These settings yield small chunks of 1,999 bases that overlap by 1,900 bases, so each new chunk advances only 99 bases along the contig. This greatly increases the number of fragments and, in turn, the runtime.
A potential improvement is to use -c 10000 -o 0, which creates larger chunks (10,000 bases) with no overlap.
This approach generates far fewer fragments, so it should run faster, and it uses the same settings as the example in the official CONCOCT GitHub repository.
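To make the difference concrete, here is a rough sketch that counts how many chunks each setting would produce for a single contig. It is a simplified sliding-window model (one chunk starts every `chunk_size - overlap` bases), not CONCOCT's actual code: `count_chunks` and the 100,000-base contig are hypothetical, and details such as how the final fragment is handled may differ in cut_up_fasta.py.

```python
def count_chunks(contig_length: int, chunk_size: int, overlap: int) -> int:
    """Count sliding-window chunks for one contig (simplified model)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap  # how far each new chunk advances
    # One chunk starts at every multiple of `step` below the contig length.
    return len(range(0, contig_length, step))


contig_length = 100_000  # hypothetical contig, chosen only for illustration
overlapping = count_chunks(contig_length, chunk_size=1999, overlap=1900)
non_overlapping = count_chunks(contig_length, chunk_size=10000, overlap=0)
print(overlapping, non_overlapping)
```

Under this model, the 100,000-base contig is cut into 1,011 chunks with -c 1999 -o 1900 but only 10 chunks with -c 10000 -o 0, roughly a hundredfold difference in the number of fragments passed to later steps.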
I am also interested in point 2 above. It would be great to have a pipeline parameter that tells it to use contigs instead of scaffolds for all subsequent stages of the analysis (binning, taxonomy, etc.) when assembling with SPAdes.
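As a sketch of what such a switch could look like, the helper below picks which SPAdes output file to pass downstream. SPAdes writes both contigs.fasta and scaffolds.fasta to its output directory; the function name `select_assembly` and the `use_contigs` flag are hypothetical and are not existing pipeline parameters.

```python
from pathlib import Path


def select_assembly(spades_dir: str, use_contigs: bool = False) -> Path:
    """Return the SPAdes FASTA file to use for downstream steps.

    `use_contigs` is a hypothetical flag: when True, contigs.fasta is
    chosen; otherwise scaffolds.fasta is chosen.
    """
    filename = "contigs.fasta" if use_contigs else "scaffolds.fasta"
    return Path(spades_dir) / filename


# Hypothetical output directory name, used only for illustration.
assembly = select_assembly("spades_out", use_contigs=True)
print(assembly.name)
```

Routing every later stage (binning, taxonomy, and so on) through a single selection point like this would keep the choice consistent across the analysis.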