Skip to content

Commit

Permalink
Updates to version 3.1.0
Browse files Browse the repository at this point in the history
  • Loading branch information
abmiguez committed Jul 25, 2022
1 parent 4e8f573 commit d73c056
Show file tree
Hide file tree
Showing 14 changed files with 38 additions and 36 deletions.
16 changes: 6 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,20 +1,16 @@
# MetaPhlAn: Metagenomic Phylogenetic Analysis
[![install with bioconda](https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg?style=flat)](http://bioconda.github.io/recipes/metaphlan/README.html) [![PyPI - Downloads](https://img.shields.io/pypi/dm/metaphlan?label=MetaPhlAn%20on%20PyPi)](https://pypi.org/project/MetaPhlAn/) [![MetaPhlAn on DockerHub](https://img.shields.io/docker/pulls/biobakery/metaphlan?label=MetaPhlAn%20on%20DockerHub)](https://hub.docker.com/r/biobakery/metaphlan) [![Build MetaPhlAn package](https://github.com/biobakery/MetaPhlAn/workflows/Build%20MetaPhlAn%20package/badge.svg?branch=3.0)](https://github.com/biobakery/MetaPhlAn/actions?query=workflow%3A%22Build+MetaPhlAn+package%22)
## What's new in version 3
* New MetaPhlAn marker genes extracted with a newer version of ChocoPhlAn based on UniRef
* Estimation of metagenome composed by unknown microbes with parameter `--unknown_estimation`
* Automatic retrieval and installation of the latest MetaPhlAn database with parameter `--index latest`
* Virus profiling with `--add_viruses`
* Calculation of metagenome size for improved estimation of reads mapped to a given clade
* Inclusion of NCBI taxonomy ID in the ouput file
* CAMI (Taxonomic) Profiling Output Format included
* Removal of reads with low MAPQ values
## What's new in version 3.1
* 433 low-quality species were removed from the MetaPhlAn 3.1 marker database and 2,680 species were added (for a new total of 15,766; a 17% increase).
* Marker genes for a subset of existing bioBakery 3 species were also revised.
* Most existing bioBakery 3 species pangenomes were updated with revised or expanded gene content.
* MetaPhlAn 3.1 software has been updated to work with revised marker database.
-------------

## Description
MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea and Eukaryotes) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling.

MetaPhlAn relies on ~1.1M unique clade-specific marker genes (the latest marker information file `mpa_v30_CHOCOPhlAn_201901_marker_info.txt.bz2` can be found [here](https://www.dropbox.com/sh/7qze7m7g9fe2xjg/AAAlyQITZuUCtBUJxpxhIroIa/mpa_v30_CHOCOPhlAn_201901_marker_info.txt.bz2?dl=1)) identified from ~100,000 reference genomes (~99,500 bacterial and archaeal and ~500 eukaryotic), allowing:
MetaPhlAn relies on ~1.1M unique clade-specific marker genes (the latest marker information file `mpa_v31_CHOCOPhlAn_201901_marker_info.txt.bz2` can be found [here](http://cmprod1.cibio.unitn.it/biobakery3/metaphlan_databases/mpa_v31_CHOCOPhlAn_201901_marker_info.txt.bz2)) identified from ~100,000 reference genomes (~99,500 bacterial and archaeal and ~500 eukaryotic), allowing:

* unambiguous taxonomic assignments;
* accurate estimation of organismal relative abundance;
Expand Down
2 changes: 1 addition & 1 deletion bioconda_recipe/meta.yaml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
{% set name = "metaphlan" %}
{% set version = "4.0" %}
{% set version = "3.1" %}

package:
name: {{ name }}
Expand Down
6 changes: 6 additions & 0 deletions changeset.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,9 @@
=== Version 3.1
* 433 low-quality species were removed from the MetaPhlAn 3.1 marker database and 2,680 species were added (for a new total of 15,766; a 17% increase).
* Marker genes for a subset of existing bioBakery 3 species were also revised.
* Most existing bioBakery 3 species pangenomes were updated with revised or expanded gene content.
* MetaPhlAn 3.1 software has been updated to work with revised marker database.

=== Version 3.0
* New MetaPhlAn marker genes extracted with a newer version of ChocoPhlAn based on UniRef
* Estimation of metagenome composed by unknown microbes with parameter `--unknown_estimation`
Expand Down
4 changes: 2 additions & 2 deletions metaphlan/metaphlan.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,8 @@
'Duy Tin Truong, '
'Francesco Asnicar ([email protected]), '
'Aitor Blanco Miguez ([email protected])')
__version__ = '3.0.14'
__date__ = '19 Jan 2022'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

import sys
try:
Expand Down
6 changes: 3 additions & 3 deletions metaphlan/strainphlan.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,8 @@
'Francesco Asnicar ([email protected]), '
'Moreno Zolfo ([email protected]), '
'Francesco Beghini ([email protected])')
__version__ = '3.0.14'
__date__ = '19 Jan 2022'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'


import sys
Expand All @@ -29,7 +29,7 @@
metaphlan_script_install_folder = os.path.dirname(os.path.abspath(__file__))
DEFAULT_DB_FOLDER = os.path.join(metaphlan_script_install_folder, "metaphlan_databases")
DEFAULT_DB_FOLDER = os.environ.get('METAPHLAN_DB_DIR', DEFAULT_DB_FOLDER)
DEFAULT_DB_NAME = "mpa_v30_CHOCOPhlAn_201901.pkl"
DEFAULT_DB_NAME = "mpa_v31_CHOCOPhlAn_201901.pkl"
DEFAULT_DATABASE = os.path.join(DEFAULT_DB_FOLDER, DEFAULT_DB_NAME)
PHYLOPHLAN_MODES = ['accurate', 'fast']

Expand Down
6 changes: 3 additions & 3 deletions metaphlan/utils/add_metadata_tree.py
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
#!/usr/bin/env python
__author__ = ('Duy Tin Truong ([email protected]), '
'Aitor Blanco Miguez ([email protected])')
__version__ = '3.0'
__date__ = '21 Feb 2020'
__version__ = '3.1.0'
__date__ = '25 Jul 2021'

import argparse as ap
import pandas
Expand Down Expand Up @@ -102,4 +102,4 @@ def main():
ofile.write(line)

if __name__ == '__main__':
main()
main()
4 changes: 2 additions & 2 deletions metaphlan/utils/external_exec.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,8 @@
'Francesco Asnicar ([email protected]), '
'Moreno Zolfo ([email protected]), '
'Francesco Beghini ([email protected])')
__version__ = '3.0.8'
__date__ = '7 May 2021'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

import os, sys, re, shutil, tempfile
import subprocess as sb
Expand Down
6 changes: 3 additions & 3 deletions metaphlan/utils/extract_markers.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,8 @@
'Francesco Asnicar ([email protected]), '
'Moreno Zolfo ([email protected]), '
'Francesco Beghini ([email protected])')
__version__ = '3.0.14'
__date__ = '19 Jan 2022'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

import sys
try:
Expand All @@ -31,7 +31,7 @@
metaphlan_script_install_folder = os.path.dirname(os.path.abspath(__file__))
DEFAULT_DB_FOLDER = os.path.join(metaphlan_script_install_folder, "../metaphlan_databases")
DEFAULT_DB_FOLDER = os.environ.get('METAPHLAN_DB_DIR', DEFAULT_DB_FOLDER)
DEFAULT_DB_NAME = "mpa_v30_CHOCOPhlAn_201901.pkl"
DEFAULT_DB_NAME = "mpa_v31_CHOCOPhlAn_201901.pkl"
DEFAULT_DATABASE = os.path.join(DEFAULT_DB_FOLDER, DEFAULT_DB_NAME)

"""
Expand Down
4 changes: 2 additions & 2 deletions metaphlan/utils/parallelisation.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,8 @@
'Francesco Asnicar ([email protected]), '
'Moreno Zolfo ([email protected]), '
'Francesco Beghini ([email protected])')
__version__ = '3.0'
__date__ = '21 Feb 2020'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

try:
from .util_fun import error
Expand Down
4 changes: 2 additions & 2 deletions metaphlan/utils/plot_tree_graphlan.py
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
#!/usr/bin/env python
__author__ = ('Duy Tin Truong ([email protected]), '
'Aitor Blanco Miguez ([email protected])')
__version__ = '3.0'
__date__ = '21 Feb 2020'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

import argparse as ap
import dendropy
Expand Down
4 changes: 2 additions & 2 deletions metaphlan/utils/sample2markers.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,8 @@
'Francesco Asnicar ([email protected]), '
'Moreno Zolfo ([email protected]), '
'Francesco Beghini ([email protected])')
__version__ = '3.0.14'
__date__ = '19 Jan 2022'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

import sys
try:
Expand Down
4 changes: 2 additions & 2 deletions metaphlan/utils/strain_transmission.py
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
__author__ = ('Aitor Blanco ([email protected]), '
'Mireia Valles-Colomer ([email protected])')
__version__ = '3.0.14'
__date__ = '19 Jan 2022'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'

import os, time, sys
import argparse as ap
Expand Down
6 changes: 3 additions & 3 deletions metaphlan/utils/util_fun.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,8 @@
'Francesco Asnicar ([email protected]), '
'Moreno Zolfo ([email protected]), '
'Francesco Beghini ([email protected])')
__version__ = '3.0'
__date__ = '21 Feb 2020'
__version__ = '3.1.0'
__date__ = '25 Jul 2022'


import os, sys, re, pickletools, pickle, time, bz2, gzip
Expand Down Expand Up @@ -134,4 +134,4 @@ def is_number(s):
int(s)
return True
except ValueError:
return False
return False
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@

setuptools.setup(
name='MetaPhlAn',
version='3.0.14',
version='3.1.0',
author='Francesco Beghini',
author_email='[email protected]',
url='http://github.com/biobakery/MetaPhlAn/',
Expand Down

0 comments on commit d73c056

Please sign in to comment.