Changelog¶
v3.1.0 bactopia/bactopia "Dance Powder" 2024/09/22¶
"Dance Powder" is related to the Alabasta arc in One Piece which my kids currently on
Added
¶
- Named Workflows
clean-yer-reads
- Use Bactopia's read QC steps to Clean-Yer-Reads- Use
clean-yer-reads
,cyr
orbactopia --wf cleanyerreads
- Use
teton
- Host removal and taxon classification with estimated abundances- Use
teton
orbactopia --wf teton
- Use
- Bactopia Tools (
bactopia --wf <NAME>
)clermontyping
- Clermont's Escherichia phylogroupingdefensefinder
- Systematic search of all known anti-phage systemssccmec
- SCCmec typing of Staphylococcus aureus genomesscrubber
- Remove human reads from FASTQ files- screen human reads with
kraken2
(against human pangenome) orsra-human-scrubber
- screen human reads with
shigapass
- Predict Shigella serotypes and differentiate Shigella, EIEC and non-Shigella/EIEC
- full support of config files from nf-core/configs
- no longer prints efficiency for
standard
,docker
, andsingularity
profiles - now required non-integer values for
--max_time
(e.g.4.h
) and--max_memory
(e.g.8.GB
) - always import
base.config
- no longer prints efficiency for
amrfinderplus
by combining results from genes, proteins and gff (coordinates)--amrfinder_db
and--mlst_db
to specify custom databases- mlst and amrfinder databases can now be either a tarball or directory
pangenome
tool now uses panaroo as the default pangenome tool- file with reference name and samples included for
snippy
run snippy
can now use--accession
to download a genbank file from NCBI- replaced
staphopiasccmec
withsccmec
instaphtyper
- bump program versions in modules
abritamr
: 1.0.17 -> 1.0.19bakta
: 1.9.3 -> 1.9.4blast
: 2.15.0 -> 2.16.0busco
: 5.7.0 -> 5.7.1checkm
: 1.2.2 -> 1.2.3defensefinder
: 1.2.2 -> 1.3.0gtdbtk
: 2.3.2 -> 2.4.0hicap
: 1.0.3 -> 1.0.4 (@MartinVad)kleborate
: 2.3.2 -> 3.0.9mobsuite
: 3.1.8 -> 3.1.9panaroo
: 1.4.2 -> 1.5.0pasty
: 1.0.3 -> 2.2.1pbptyper
: 1.0.4 -> 2.0.0seqsero2
: 1.2.1 -> 1.3.1stecfinder
: 1.1.1 -> 1.21.2tbprofiler
: 6.1.0 -> 6.3.0
Fixed
¶
- missing schema for clean-yer-reads and teton
- pinning of bioperl in prokka module with
strict
channel priority - use
--infile-list
withcsvtk concat
to support 10k+ inputs pangenome
when Bakta GFF (*.gff3
) files are provided- missing file export in
gubbins
- writing to non-default values for
--bactopia
--include
file on cloud storage (@rwilliams)- import of ONT reads over illumina reads in certain bactopia tools (e.g. snippy) (Thanks D2)
- bactopia/datasets envs not being build by
bactopia download
snippy
puts files in subdirectory based on reference name- removed pneumocat from Merlin as it doesn't gracefully fail on negative results
- all the tests
- typo in
bactopia-tools.nf
(thank you! @pvanheus)
Enhancements to OSS
¶
- pinn macsyfinder version in defense-finder bioconda/bioconda-recipes#46824
- add recipe for
camlhmp
bioconda/bioconda-recipes#47453 - add recipe for
sccmec
bioconda/bioconda-recipes#47600 - update recipe for genotyphi bioconda/bioconda-recipes#47664
- disable phone home snpeff bioconda/bioconda-recipes#48007
- clermontyping use share folder bioconda/bioconda-recipes#49372
- add recipe for scrubby bioconda/bioconda-recipes#49373
- rebuild pasty bioconda/bioconda-recipes#50068
v3.0.1 bactopia/bactopia "That's My Girl" - 2024/03/25¶
"That's My Girl" is a phrase my youngest says quite often when playing with her friends
Added
¶
- Updated
bactopia-assembler
to include updated Medaka models - profile for
arcc_hawk
- TB Profiler results are now merged with
collate
command - sample name to plasmidfinder results
- support config files from nf-core/configs
- updated AMRFinder+ database for v3.12.8
- bump program versions in modules
abritamr
: 1.0.14 -> 1.0.17bactopia-assembler
: 1.0.3 -> 1.0.4bactopia-teton
: 1.0.1 -> 1.0.2bactopia-variants
: 1.0.1 -> 1.0.2bakta
: 1.8.2 -> 1.9.3blast
: 2.14.1 -> 2.15.0busco
: 5.5.0 -> 5.7.0gubbins
: 3.3.0 -> 3.3.4multiqc
: 1.15 -> 1.21mashtree
: 1.4.5 -> 1.4.6mykrobe
: 0.12.2 -> 0.13.0ncbi-amrfinderplus
: 3.11.18 -> 3.12.8panaroo
: 1.3.3 -> 1.4.2rgi
: 6.0.2 -> 6.0.3sistr
: 1.1.1 -> 1.1.2stecfinder
: 1.1.0 -> 1.1.1tbprofiler
: 5.0.0 -> 6.1.0
Fixed
¶
- variable name in sketcher module
--para-off
not correctly implemented in PIRATE module- extra space in RGI container image name
- sourmash version STDERR not parsed out
- FASTQC writing to /tmp dir on HPC when non-readable
- abricate and ariba not using output subdirectory
- nextflow tower typos @iferres
- phispy not working with Bakta genbank files
- missed check of
--download_bakta
param in main workflow - quast not working when estimated genome size is 0
abricate
now gets database subdirectories
Enhancements to OSS
¶
- fix amrfinderplus pinning in abritamr bioconda/bioconda-recipes#46714
- adjust python pinning in gubbins bioconda/bioconda-recipes#46713
- fix issue with sistr container bioconda/bioconda-recipes#46712
- update rgi pinning for pyrodigal bioconda/bioconda-recipes#46669
- pin tabix version in snippy bioconda/bioconda-recipes#46458
v3.0.0 bactopia/bactopia "Black Cat and Brown Dog" - 2023/09/11¶
"Black Cat and Brown Dog" are in memory of Stinky ("Black Cat") and Twotee ("Brown Dog")
Added
¶
- Bactopia Tools (
bactopia --wf <NAME>
)abritamr
- A NATA accredited tool for AMR detectionblastn
- Search against nucleotide BLAST databases using nucleotide queriesblastp
- Search against protein BLAST databases using protein queriesblastx
- Search against protein BLAST databases using translated nucleotide queriesbracken
- Taxonomic classification and species abundance estimation of sequence readsbtyper3
- Taxonomic classification of Bacillus cereus group isolatesmidas
- Estimate bacterial species abundances from FASTQ filesquast
- Assess the quality of assembled contigsphispy
- Predict prophages in bacterial genomespneumocat
- Assign capsular type to Streptococcus pneumoniae from sequence readssra-human-scrubber
- Scrub human reads from FASTQ filesstecfinder
- Serotyping Shigella toxin producing Escherichia coli genomestblastn
- Search against translated nucleotide BLAST databases using protein queriestblastx
- Search against translated nucleotide BLAST databases using translated nucleotide queries
- per-release databases for
amrfinderplus
andmlst
- new directory structure for outputs
- Renamed
--R1
,--R2
, and--SE
to--r1
,--r2
, and--se
--ont
now accepts a FASTQ file- GitHub Action to build environments
fastp
is default read cleaner, can use previous methods with--use_bbmap
- ability to use a BED file to mask regions in
snippy-core
--save_sketches
to save Mash sketches created duringmashtree
run5- Porechop is now optional (
--use_porechop
) - unified
publishDir
across modules - download datasets using
storeDir
BACTOPIA:DATASETS
ariba
sra-human-scrubber
- bump program versions in modules
bakta
: 1.6.0 -> 1.8.2blast
: 2.11.0 -> 2.14.1busco
: 5.4.3 -> 5.5.0csktk
: 0.25.0 -> 0.27.2eggnog-mapper
: 2.1.9 -> 2.1.12genotyphi
: 1.9.1 -> 2.0gtdbtk
: 2.1.1 -> 2.3.2gubbins
: 3.2.1 -> 3.3iqtree
: 2.2.0.3 -> 2.2.2.7fastani
: 1.33 -> 1.34mashtree
: 1.2.0 -> 1.4.5mob_suite
: 3.1.0 -> 3.1.7multiqc
: 1.11 -> 1.15mykrobe
: 0.12.0 -> 0.12.2ncbi-amrfinderplus
: 3.10.45 -> 3.11.18ncbi-genome-download
: 0.3.1 -> 0.3.3panaroo
: 1.3.0 -> 1.3.3pasty
: 1.0.0 -> 1.0.3phyloflash
: 3.4 -> 3.4.2rgi
: 6.0.1 -> 6.0.2shigatyper
: 2.0.3 -> 2.0.5shigeifinder
: 1.3.2 -> 1.3.5tbprofiler
: 4.4.0 -> 5.0.0
Fixed
¶
- All modules correctly initiate
params.options
- OpenJDK java cpuset warning message (@idolawoye)
- Broken badges in README.md
- Pinned GSL to v2.6
- symlink in amrfinder+ update (
bactopia datasets
) - hardcoded
--plus
inamrfinderplus_run
- tests for new directory structure
--nanohq
not being properly passed todragonflye
Removed
¶
- Bactopia Steps
ANNOTATE_GENOME
- now handled byprokka
orbakta
ANTIMICROBIAL_RESISTANCE
- now handled byamrfinderplus
CALL_VARIANTS
- now handled bysnippy
SEQUENCE_TYPE
- now handled bymlst
bactopia datasets
is now incorporated intobactopia
- Conda/Containers for all bactopia-main steps
- custom process labels, for generic
nf-core
process labels
Enhancements to OSS
¶
- add module for midas nf-core/modules#2696
- add modules for sra-human-scrubber nf-core/modules#2694
- add module for stecfinder nf-core/modules#2702
- update midas pinnings to match docs bioconda/bioconda-recipes#38566
- add missing scripts to MLST bioconda/bioconda-recipes#38826
- fix GTDB-Tk container bioconda/bioconda-recipes#40333
- patch ncbi-genome-download bioconda-recipes#41640
- add module for pneumocat nf-core/modules#3592
- add module for abritamr/run nf-core/modules#3725
- fix hostrange() missing 1 required positional argument: 'database_directory' phac-nml/mob-suite#149
- add module for btyper3 nf-core/modules#3817
v2.2.0 bactopia/bactopia "Tornado Tempo" - 2022/11/28¶
Added
¶
--dragonflye_opts
to pass additional params to dragonflye--use_fastp
to use fastp for QCing readsbactopia datasets
no longer depends onariba
--skip_spell_check
inbactopia datasets
- updated organisms available from PubMLST
- custom profile for
wsvl
- updated citations
- use
mambaforge
for docker builds - Support tarball inputs for large databases
- Bactopia Tools (
bactopia --wf <NAME>
)pasty
- in silico serogrouping of Pseudomonas aeruginosa isolatespbptyper
- In silico Penicillin Binding Protein (PBP) typer for Streptococcus pneumoniae assembliesshigeifinder
- Serotyping Shigella and EIEC assemblies
- bump program versions in modules
bakta
-> 1.5.1bbmap
-> 39.01checkm-genome
-> 1.2.2csvtk
-> 0.25.0dragonflye
-> 1.0.13eggnog-mapper
-> 2.1.9fastani
-> 1.33fastq-scan
-> 1.0.1gamma
-> 2.2gtdbtk
-> 2.1.1iqtree
-> 2.2.0.3kleborate
-> 2.2.0mlst
-> 2.23.0mob_suite
-> 3.1.0mykrobe
-> 0.12.0nanoplot
-> 1.40.2ncbi-amrfinderplus
-> 3.10.45panaroo
-> 1.3.0pirate
-> 1.0.5rasusa
-> 0.7.0rgi
-> 6.0.1shigatyper
-> 2.0.3snpeff
-> 5.1sourmash
-> 4.5.0sra-tools
-> 3.0.0tb-profiler
-> 4.4.0
Fixed
¶
- respect short polish hybrid assembly requests
- missing variable in
bakta_download
module - restructure to match
nf-core/modules
- compatibility with Nextflow >=22.10
- separate versions for Linux and OSX in
assemble_genomes
bactopia datasets
error messages switched to warnings
Enhancements to OSS
¶
- bump Bakta to 1.5.0 nf-core/modules#2000
- add recipe for pasty bioconda/bioconda-recipes#35930
- add recipe for pbptyper bioconda/bioconda-recipes#36222
- add module for pasty nf-core/modules#2003
- add module for pbptyper nf-core/modules#2005
- add module for shigeifinder nf-core/modules#2523
v2.1.1 bactopia/bactopia "Dragon Twister" - 2022/07/09¶
Added
¶
- Add Panaroo to the
pangenome
Bactopia Tools - Add
genotyphi
andseroba
to Merlin - bump
fastq-dl
to v1.1.1 - Bactopia Tools (
bactopia --wf <NAME>
)busco
- Assembly completeness based on evolutionarily informed expectationsgenotyphi
- Salmonella Typhi genotyping with Mykrobe outputsgubbins
- Rapid phylogenetic analysis of recombinant bacterial sequencesmcroni
- Sequence variation in mobilized colistin resistance (mcr-1) genesmykrobe
- Antimicrobial resistance detection for specific speciespanaroo
- Pipeline for pangenome investigationsplasmidfinder
- Plasmid identification from assembliesseroba
- Serotyping of Streptococcus pneumoniae from sequence readssnippy
- Rapid variant calling from Illumina sequence reads with optional core-SNP phylogeny
Fixed
¶
- improve GTDB db handling
- Properly capture assemblies with 0
- user provided adapters and phix fastas
--available_datasets
in thebactopia datasets
- name collisions in the Ariba Bactopia Tool
- bare double-quotes in Ariba reports
- updated tests for new version
Enhancements to OSS
¶
- Add database to Seroba recipe bioconda/bioconda-recipes#35378
- Update ariba dependencies bioconda/bioconda-recipes#35383
- patch pymummer recipe bioconda/bioconda-recipes#35379
- Update PlasmidFinder for better container support bioconda/bioconda-recipes#35314
- add module for ariba nf-core/modules#1731
- add module for mcroni nf-core/modules#1750
- add module for Mykrobe and GenoTyphi nf-core/modules#1731
- add module for plasmidfinder nf-core/modules#1773
- add module for Seroba nf-core/modules#1816
- add module for snippy/core nf-core/modules#1855
- let tempfile determine temp dir location sanger-pathogens/seroba#68
- allow the user to specify temp dir or use the system default sanger-pathogens/pymummer#36
v2.1.0 bactopia/bactopia "Silver Mist" - 2022/06/08¶
Added
¶
--use_bakta
to replace Prokka with Bakta in main workflow- Split Bakta into two processes
download
andrun
- added
shigatyper
to Merlin - custom profile for
arcc
bactopia prepare
error message when nothing found, and--examples
for example use cases- renamed
--fastqs
to--samples
to better reflect its usage (fastqs and assemblies) - add
--check_samples
to validate user generated FOFNs --short_polish
for short read polishing for long-read assemblies- Moved Ariba to Bactopia Tools
- bump versions in Bactopia envs
- bump Bactopia Tool versions
amrfinder+
version to 3.10.30gtdb
version to 2.1.0mlst
version to 2.22.0
- Bactopia Tools (
bactopia --wf <NAME>
)ariba
- Gene identification by local assembliesgamma
- Identification, classification, and annotation of translated gene matchesshigatyper
: Shigella serotype from Illumina or Oxford Nanopore reads
Fixed
¶
- @nickjhathaway Fixed assembly_level in
bactopia datasets
and autocall_variants
bactopia search
now supports ONT readsvcf-annotator
unable to write to/dev/stdout
- typo in
bactopia download
when using singularity bactopia download
will not rebuild envs its already built in a run--include_genus
withbactopia datasets
now works with multiple samples- Parameters to copy optional species-specific files in
bactopia datasets
- Don't use
AUTO
for IQ-TREE cpus gtdb
download working again- empty FASTQs after adapter and phix removal
- low read count error for ONT samples
- Removed
sequence_type
environment
v2.0.3 bactopia/bactopia "King Cobra" - 2022/03/23¶
Added
¶
- logo and new metro style workflow
- added
test
profile for easy testing mashdist
now outputs a merged TSV- print missing required parameters in Bactopia Tools
- bump bakta to v1.4.0
Fixed
¶
- typo in
bakta
subworkflow ismapper
not available inbactopia-tools.nf
kraken2
not available inbactopia-tools.nf
- Traits file being required for
pangenome
mashtree
module not collecting all FASTA- tests for ISMapper
v2.0.2 bactopia/bactopia "Black Mamba" - 2022/02/22¶
Added
¶
bactopia download
which merges thebactopia build
andbactopia pull
commands- modules can now use prebuilt envs
--available_species
to print the species in a bactopia dataset--bbduk_opts
to fine tune bbduk- set
quay
as default docker registry - Let Nextflow handle stdout and stderr
- added
hpsuissero
,legsta
,sistr
,ssuissero
to Merlin - Bactopia Tools (
bactopia --wf <NAME>
)amrfinderplus
: Identify antimicrobial resistance in genes or proteinsabricate
: Mass screening of contigs for antimicrobial and virulence genescheckm
: Assess the assembly quality of your sampleshpsuissero
: Serotype prediction of Haemophilus parasuis assemblieskraken2
: Taxonomic classification of sequence readslegsta
: Typing of Legionella pneumophila assembliesmlst
: Scan contig files against PubMLST typing schemesmobsuite
: Reconstruct and annotate plasmids in bacterial assembliesrgi
: Predict antibiotic resistance from assembliessistr
: Serovar prediction of Salmonella assembliesssuissero
: Serotype prediction of Streptococcus suis assemblies
Fixed
¶
toInteger
onnull
genome size- Index out of bounds in mlst-blast.py (caused by non allele columns)
- FOFN will fail if sample name is
null
- improve available workflow logic
- PIRATE not exporting gene_presence_absence.csv correctly
- channel imports for
scoary
workflow argument type mismatch
when using--use_roary
- missing channel when
agrvate
not ran asstaphtyper
- float required options to the top when missing
- broken tests due to changes
bactopia datasets
including hidden directories- Bioperl pinnings for prokka and pirate
v2.0.1 bactopia/bactopia "Red Roc" - 2022/01/06¶
Added
¶
- Merlin for auto selection of species-specific tools based
--ask_merlin
to execute the Merlin subworkflow in Bactopia- Schema compatible with NF-Tower
--skip_qc_plots
to skip FastQC and Nanoplot- Disable CheckM by default, must use
--run_checkm
--cluster_opts
for passing additional options to SLURM- inclusion of local files (assembly, gff) in some subworkflows
- workflow for dev conda builds
- all subworkflows can be imported on other workflows
- Bactopia Tools (
bactopia --wf <NAME>
)gtdb
: Identify marker genes and assign taxonomic classificationsmashdist
: Calculate Mash distances between sequencesmerlin
: MinmER assisted species-specific bactopia tool seLectIoN
Fixed
¶
- Resource over allocation for java related programs (-Xmx)
- ordering of skipped steps when
--datasets
not used - logic in container selection for nf-core modules
- channel manipulation in
ncbigenomedownload
- missing logic for
fastani
- version captures in subworkflows
- spell check for species with
[
or]
in their names - version outputs for Staphtyper subworkflow
- use process names in version.yml
CSVTK_CONCAT
empty channelsoptions.suffix
now used asprefix
in modules
v2.0.0 bactopia/bactopia "Red Hawk" - 2021/12/05¶
With Bactopia v2 comes a lot of changes! I would like to extend a huge thanks to Davi Marcon and Abhinav Sharma for their work initially converting Bactopia to DSL2. Your efforts were the momentum I needed to get the ball rolling on Bactopia v2. Thank you very much for taking your time to make such a siginificant contribution!
Added
¶
- support for Nanopore reads
staphopia
as a named pipeline (alias forbactopia --wf staphopia
) for S. aureus genomesbactopia/bactopia-tests
repo with test data- walkthrough for testing
bactopia-datasets/staphylococcus_aureus
repo with curatated S. aureus datasets- per-module testing via
pytest
(100+ tests and 7000+ outputs tested) - per-module
meta.yml
andparams.json
for auto-building docs site - framefork for adding new Bactopia Tools
- 19 total Bactopia Tools (
bactopia --wf <NAME>
)- Subworkflows (3)
eggnog
: Functional annotation of proteins using orthologous groups and phylogeniespangenome
: Pangenome analysis with optional core-genome phylogenystaphtyper
: Determine the agr, spa and SCCmec types for Staphylococcus aureus genomes
- Modules (16):
agrvate
: Rapid identification of Staphylococcus aureus agr locus type and agr operon variants.bakta
: Rapid annotation of bacterial genomes and plasmidsectyper
: In-silico prediction of Escherichia coli serotypeemmtyper
: emm-typing of Streptococcus pyogenes assembliesfastani
: fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI)hicap
: Identify cap locus serotype and structure in your Haemophilus influenzae assembliesismapper
: Identify insertion sites positions in bacterial genomeskleborate
: Screening Klebsiella genome assemblies for MLST, sub-species, and other related genes of interestlissero
: Serogroup typing prediction for Listeria monocytogenesmashtree
: Quickly create a tree using Mash distancesmeningotype
: Serotyping of Neisseria meningitidisngmaster
: Multi-antigen sequence typing for Neisseria gonorrhoeaeseqsero2
: Salmonella serotype prediction from reads or assembliesspatyper
: Computational method for finding spa types in Staphylococcus aureusstaphopiasccmec
: Primer based SCCmec typing of Staphylococcus aureus genomestbprofiler
: Detect resistance and lineages of Mycobacterium tuberculosis genomes
- Subworkflows (3)
- Use
mamba
instead of conda for env building - Reduced total Conda envs/Docker containers down to 7 (previously 12 not including bactopia tools)
- default to compressed outputs (
--skip_compression
to output uncompressed outputs) - Tutorial outputs made available
- update github actions
Fixed
¶
- Cache issue causing
-resume
to fail - amrfinder+ database not compatible error
- incorrectly parsed system memory
Adapted from nf-core
¶
- nf-core pytest setup
- nf-core/modules for bactopia tools
- Bactopia v2 release contributed 20+ modules to nf-core/modules
- nf-core/tools arg parser
- adapted to import params and usage based on config file
Process Consolidation¶
makeblastdb
->assemble_genome
call_variants
,download_reference
->call_variants
fastq_status
,estiamte_genome_size
->gather_samples
count_31mers
->minmer_sketch
Removed
¶
bactopia tools
-> Handled by Nextflow now (bactopia --wf <NAME>
)bactopia versions
-> Program versions are output ever run now.
v1.7.1 bactopia/bactopia "Shellbuster" - 2021/06/04¶
Added
¶
- bumped GTDB to v1.5.0
- added soft ulimit for
staph-typer
Fixed
¶
- Matched PIRATE's parameter syntax for the tools
staph-typer
now uses GetBaseName
'Removed'¶
- PLSDB references in
bactopia datasets
v1.7.0 bactopia/bactopia "Chocobo Wand" - 2021/04/27¶
Added
¶
- Bactopia Tool
staph_typer
for agr, spa, and sccmec typing --min_coverage
parameter to filter based on min coverage
'Removed'¶
plasmid_blast
no longer apart of main workflow
v1.6.5 bactopia/bactopia "Z's Trident" - 2021/03/30¶
Added
¶
- version pins to process envs
Fixed
¶
- syntax for sourmash 4.0
v1.6.4 bactopia/bactopia "Trident +1" - 2021/03/26¶
Added
¶
- added Python3.6+ to all environments
v1.6.3 bactopia/bactopia "Trident" - 2021/03/25¶
Added
¶
- extra fields to
mlst-blast.py
outputs - added Python3 to
qc_reads
environment
Fixed
¶
- rstrip on empty extra fields in mlst profile
- different BLAST+ software versions mismatch
- tbb pinnings
--help
and--version
forbactopia tools
v1.6.2 bactopia/bactopia "Fuscina" - 2021/03/19¶
Added
¶
- inputs are checked to be gzipped (this does not include FOFN)
--skip_amr
to skip AMRFinder+ analysis- new
bactopia tool
forhicap
unicycler
can be used for Illumina reads only (--assembler unicycler
)
Fixed
¶
- AMRFinder+ software and database version mismatch
- check-fastqs.py syntax errors with prints
ismapper
tool processing of include/exclude files
v1.6.1 bactopia/bactopia "Obelisk" - 2021/02/22¶
Fixed
¶
- sample names with "." in them breaking auto variant calling
- contig naming incompatible with GenBank
v1.6.0 bactopia/bactopia "Harpoon" - 2021/01/22¶
Added
¶
bactopia pull
to pre-build Singularity images--singularity_cache
parameter to set location of image downloads--registry
to choose Docker registry to use (DockerHub, GitHub, Quay)--max_downloads
sets maximum number of downloads (FASTQ/assembly) allowed at once--min_time
sets the minimum amount of time a job should be givenbactopia search
now uses POST requests, and groups accessions into single query- strip HTML from FASTA headers used in BLAST
- Dockerfiles now have conda.md5 label to determine if rebuild is necessary
- MD5 is updated in Dockerfile when env is updated
- AMRFinder+ database is now provided by
bactopia datasets
- Parameterized profile (slurm, awsbatch, etc...) variables
bactopia build
- will retry in case of HTTP connection issues
--include_tool
will build Bactopia Tool environments
- GitHub Actions
- build Docker containers on new release (or manual trigger)
- test that the Conda environment yamls are still valid
- test bactopia with conda on Linux and OSX
- test bactopia on Linux with Docker and Singularity
Fixed
¶
- redundant environment version files
- failed FASTQ/Assembly downloads no longer stop whole run
--max_retry
is honored now- antimicrobial_resistance process honors
amrdir
variable - change directory
antimicrobial_resistance
toantimicrobial-resistance
- rename
check_staging.py
tocheck-staging.py
for consistency - Bactopia not producing valid exit code on failure
Removed
¶
--containerPath
variable is replaced by--singularity_cache
- Native Singularity recipes, will now convert Docker to Singularity
- docs are now on bactopia.github.io repo
v1.5.6 bactopia/bactopia "Metal Slime Earring" - 2021/01/13¶
Added
¶
- tweaks to the CI (via GitHub Actions)
- docker containers use quay.io now
- docker containers now use conda-pack
--nfconfig
will skip the conda environment build step- input accessions are checked to be Assembly or Experiment accessions
- improved version increment script
- executor profiles can be configured by parameters
Fixed
¶
- phyloflash and download_reference environment errors
- environment path in Bactopia Tools Dockerfile and Singularity recipes
- moved version from conda yaml to conda version file
- streamlined Docker recipes
- undefined
--ftp_only
message - typo in singularity profile
- stderr logged to file is also printed to screen
qc_reads
memory used now determined by base config
v1.5.5 bactopia/bactopia "Avenger's Earring" - 2021/01/04¶
Added
¶
--prefix
option forbactopia prepare
- date is included in
bactopia summary
output
Fixed
¶
- removed usage personal (rpetit3) conda channel
- aspera connect no longer supported
- shovill-se is now used from Bioconda
- updated conda environments (phyloflash broken)
v1.5.4 bactopia/bactopia "Nemesis Earring" - 2020/12/17¶
Fixed
¶
- quoted arguments being broken up (e.g.
--species "Staphylococcus aureus
->--species Staphylococcus
) mashtree
tool failure to download with--accessions
- remove ncbi-genome-download header when using
--dry-run
- undefined
name
variable in plasmid_blast - custom work dir causing two
-w
parameters bactopia search
results now contains all (illumina and non-illumina)
v1.5.3 bactopia/bactopia "Morion Earring" - 2020/12/04¶
Added
¶
- Changelog moved to docs
- recursive search for
bactopia prepare
- allow multiple FASTQs per sample (FASTQs are merged)
Fixed
¶
- unable to run
bactopia datasets
without parameters - PLSDB blast results in invalid JSON format
- Error message for unaccepted
run_type
v1.5.2 bactopia/bactopia "Physical Earring" - 2020/11/18¶
Fixed
¶
--skip_qc
causing "file not found"qc_reads
not honoring FINAL_BP and FINAL_READS checks
v1.5.1 bactopia/bactopia "Astral Earring" - 2020/11/17¶
Fixed
¶
bactopia tools
not a valid project namebactopia tools
--cleanup_workdir
unrecognized variable
v1.5.0 bactopia/bactopia "Cassie Earring" - 2020/11/12¶
Added
¶
- Conda environments will check if in sync with latest version now
- md5sums of all conda envs
- Verify species-specific datasets exist
- separate work dir for bactopia and bactopia tools
--cleanup_workdir
to delete work directory after successful run- default values for
bactopia datasets
summary.json - Fallback to NCBI Assembly when eUtils is down
- Additional pre-process QC checks
- OSX/Linux conda envs for Bactopia Tools
- Documentation edits and updates
- Bactopia and Nextflow versions are now output for logging purposes
- option to skip QC step (
--skip_qc
) bactopia datasets
can now specify assembly levelbactopia tools
now use reusable conda envsbactopia tools
for Roary and PIRATE can now include local assemblies
Fixed
¶
- Warn user if no completed genomes are available
- use of
--genera
for ncbi-genome-download - improved genome_size handling
- explicit file passing for AWS Batch
- Memory estimates now floored
- PLSDB blast not being executed
v1.4.11 bactopia/bactopia "Metamorph Ring" - 2020/09/19¶
Added
¶
bactopia build
checks if each environment is built before building- Can specify
bactopia build
to build a specific environment - Removed build numbers in Conda environment yamls
- Created separate Conda yamls for Linux and Mac
- NCBI assembly accessions will retrieve the most current version (e.g. .1, .2, .3, etc...)
Fixed
¶
bactopia datasets
trailing whitespace in species namesbactopia datasets
random subsample missing specified species when--limit
and--include_genus
used- GitLab CI OSX compatibility
- Adaptive resource allocations
- Datasets are checked for existence
- Variant calls against references with multiple chromosomes
v1.4.10 bactopia/bactopia "Jelly Ring" - 2020/08/25¶
Added
¶
card
is back as a default Ariba dataset- Added timestamps to
versions
files
Fixed
¶
bactopia search
not creating--outdir
gtdb
tool not using prefix in outdir namingpirate
tool using pangenome alignment instead of core- Use of
scratch
causing logs to fail
v1.4.9 bactopia/bactopia "Toreador's Ring" - 2020/08/23¶
Added
¶
- Support for multiple accession
bactopia search
(SRA)- Bactopia Tools:
fastani
,mashtree
,pirate
,roary
(Assembly)
Fixed
¶
- Undefined variable in
mapping_query.sh
- ENA API endpoint for
bactopia search
- Updated GTDB-TK to 1.3.0 to support latest downloads
- FastANI tool merge_results in no longer a separate step
- ANI is now one-to-many calculations
--reassemble
misapplied
v1.4.8 bactopia/bactopia "Shikaree Ring" - 2020/08/20¶
Added
¶
- Versions are logged for Bactopia
- STDOUT/STDERR logs are kept for each sample
- can be skipped using
--skip_logs
- can be skipped using
Fixed
¶
- Long sample names breaking Prokka annotation
- Syntax errors in Bactopia tools
- null values being tested as integers
- Ariba card and mlst downloads not working
- missing parameter in GTDB Bactopia tool
v1.4.7 bactopia/bactopia "Serket Ring" - 2020/08/17¶
Added
¶
--no_cache
to skip caching ncbi assembly info
v1.4.6 bactopia/bactopia "Astral Ring" - 2020/08/17¶
Added
¶
- Option to rebuild conda envs to default location
- Updated fastq-dl for sra-toolkit forced interaction workaround
v1.4.5 bactopia/bactopia "Bomb Queen Ring" - 2020/08/13¶
Fixed
¶
--min_basepairs
and--min_reads
not being honored
v1.4.4 bactopia/bactopia "Vilma's Ring" - 2020/08/13¶
Fixed
¶
annotate_genome
name clashes- Updated
fastq-dl
version to support new ENA API endpoint
v1.4.3 bactopia/bactopia "Sattva Ring" - 2020/08/13¶
Added
¶
--skip_ariba
option inbactopia datasets
Fixed
¶
bactopia versions
andbactopia citations
improper execution- Convert spaces to tabs in citation doc
- Corrected CheckM name in program version info file
- CARD no longer default Ariba dataset download for
bactopia datasets
v1.4.2 bactopia/bactopia "Tamas Ring" - 2020/08/10¶
Added
¶
- added requirement checks of
--accessions
inbactopia datasets
- improved ENA spell check for species name
Fixed
¶
- file of accessions not working with
bactopia datasets
- Dockerfile and Singularity being missed by
update-version.sh
v1.4.1 bactopia/bactopia "Rajas Ring" - 2020/08/06¶
Added
¶
- Links to publication (woohoo!)
- Can pass a Prodigal training file to
bactopia datasets
Fixed
¶
- Typos in the Docs
- blast_primers now uses
blastn
and blast_genes usesmegablast
- validExitStatus deprecation warning
v1.4.0 bactopia/bactopia "Archer's Ring" - 2020/07/01¶
Added
¶
- New Bactopia Tools
eggnog
for functional annotation using eggNOG-mappermashtree
to create a tree using Mash distancespirate
to create pangenome using PIRATEismapper
for insertion site discovery- Documentation for new tools and tweaks to existing
- BTs
roary
andpirate
can now be run on just completed genomes - Can limit number of completed genomes downloaded where applicable
bactopia datasets
can provide list of RefSeq accessions to downloadbactopia search
can now use BioSample and Run accessionsbactopia search
can select a subset of Experiments associated with a BioSample- Support for organisms with multiple MLST schemas
- Assembly QC via QUAST and CheckM
- Assemblies (local or NCBI Assembly accession) as inputs for Bactopia
- Long reads as supplementary to paired end reads for hybrid assembly
- Tools versions are locked to the minor version, not the patch
Fixed
¶
summary
will now determine absolute path of inputsfastani
improved user reference import- went back a version on
call_variants
openjdk version - all bactopia tools now put nextflow info in the same folder as outputs
- Typos in docs
- Bactopia Tools now check existence of include and exclude files
- Lots more documentation
- Updated citations/tools used by Bactopia
- Did I mention typos?
Removed
¶
- ISMapper as part of the main pipeline (its now a tool)
insertion-sequences
in bactopia datasets
v1.3.1 bactopia/bactopia "Emperor Hairpin" - 2020/04/20¶
Added
¶
summary
tool now gives reason for ranksummary
tools now splitsfailed
intoexclude
andqc-failure
- Better documentation on how
--cpus
works in Nextflow - Efficiency info when executed on standard profile
- split
blast_query
intoblast_genes
,blast_primers
andblast_proteins
mapping_query
now creates multifasta of fastas at maps at once then splits per-base coverage into separate files--nfconfig
users can provide their own Nextflow config filefastani
users can provide their own reference nowbactopia versions
will print versions for tools used by Bactopiabactopia citations
will print citations for tools and datasets used by Bactopiabactopia search
can filter based on total bases, mean read length, and missing FASTQs- blast queries results are only JSON format for easy parsing later
- added
--compliant
option for Prokka annotation
Fixed
¶
- build-containers.sh not working with Bactopia Tools
- Bactopia Tools container tools missing environment.yml
- Typo in
fastani
usage - Samples with multiple QC errors counted for each error
- Incorrect ISMapper version
- typo in
summary
SLURM profile gtdb
Singularity container not mounting path to GTDB databaseroary
missingrename
in containersblast_query
results overwriting one anotherbuild-containers.sh
now creates a "latest" tagbactopia tool roary
outputs results based on the given prefix- renamed
--addgenes
to--nogenes
- updated ASA³P citation
- Typos in Bactopia Tools docs
- Link in README.md
v1.3.0 bactopia/bactopia "Leaping Boots" - 2020/02/19¶
Added
¶
- bactopia tools (BT) framework
- docs for each tool
- subcommand to execute tools
bactopia tools
fastani
- pairwise average nucleotide identitygtdb
- assigning objective taxonomic classificationsphyloflash
- 16s assembly, alignment and treeroary
- pan-genome and core genome treesummary
- summary of results
--include
and--exclude
to modify which samples used in BT analysisupdate-version.sh
improvements- can now set how Nextflow publishes outputs (copy, symlink, etc...) via
--publish_mode
- Warning if output directory already exists and require
--force
to overwrite - option (
--rfam
) to turn on ncRNA annotation in Prokka - reduced "README.md" contents, instead point to documentation
- Updated acknowledgements and bibtex for citations
Fixed
¶
- nextflow.config version out of sync
--available_datasets
accessing not existent variable--available_datasets
is tested before requiring inputs- Make use of tbl2asn-forever
- adjusted how Bactopia is executed,
nextflow run
no longer pulls from github
v1.2.4 bactopia/bactopia "Rabbit Charm" - 2019/12/20¶
Added
¶
--conda_help
to be used for conda build test--skip_fastq_check
to skip check that input FASTQs meet minimum requirements- Undocumented parameters to the usage
Fixed
¶
- snippy not working with samtools 1.10
- NXF_HOME variable is exported to the conda env location
- speed of checking if conda environments are built
v1.2.3 bactopia/bactopia "Tropical Punches +1" - 2019/12/19¶
Added
¶
- select-references selects latest assembly accession version (BioPython/Entrez)
- select-references skips assemblies that have been excluded from RefSeq
- test to for paired-end related errors (e.g. different read counts)
--min_genome_size
and--max_genome_size
parameter for estimated genome sizes- Check is also made after assembly
update-version.sh
improvements- Better genome size estimates using Mash for high and low coverage sequences
- Script to update conda environments
- added
--conda_help
to be used for conda build test
Fixed
¶
--random_tie_break
always true- not using latest assembly accession for ncbi-genome-download
- usage of assemblies that have been excluded from RefSeq
- allowing PE reads with different read counts to be processed (hint... they fail pretty quickly)
- failure to stop analysis of sample with low read counts
- coverage reported as 'inf'
- references to cgmlst in the setup datasets
- non-explicit patterns in publishDir
- low coverage/read errors after QC were not put in root dir
- snippy not working with samtools 1.10
v1.2.2 bactopia/bactopia "Tropical Punches" - 2019/10/22¶
Added
¶
- Size of "work" directory to the execution summary
- User controlled overwrites of existing output files
- Check for unknown parameters at runtime
- FASTQ downloads from SRA (via fastq-dl and fasterq-dump)
- Documentation updates
- Script for building containers
Fixed
¶
bactopia
command now explicitly states which tag to use for Nextflow run- Version info not updated in Dockerfile and Singularity
- Duplicated QC'd FASTQs
- nextflow: docker "Memory limited without swap" error
Removed
¶
- cgmlst support in
bactopia datasets
- setup.py left over from pre-conda config
v1.2.1 bactopia/bactopia "Fruit Punches" - 2019/10/17¶
Added
¶
bactopia build
to build Conda environments- Version info pulled from nextflow.config
- Set default values resource allocations
- Documentation on new changes
- Automatic building of Conda environments, if none exist
--nfdir
to determine where bactopia is being run from
Fixed
¶
- Never ending typos
--datasets
now, not--dataset
<-(Typo)- path for outputting Nextflow reports
- Typo in antimicrobial_resistance.sh (task.cpus not cpus)
--species
is now consistent betweenbactopia
andbactopia datasets
- Bug when checking if specific species dataset exists, but no species datasets exist
- Cleaned up version update script
- Cleaned up usage
Removed
¶
--max_cpus
ability to limit total cores used, access to config is being deprecated in Nextflow--max_cpus
since it is redudant to--cpus
now
v1.2.0 bactopia/bactopia "Beestinger" - 2019/10/16¶
Added
¶
--compress
to compress certain outputs, default uncompressed- Species name check in
bactopia datasets
- Use requests package instead of urllib3
- Added
bactopia search
to query ENA for list of Illumina accessions - Documentation
- Feedback edits
- Output overview
- Additional program acknowledgements
- bibtex of citations
- missing parameters to usage
- info for
--genome_size
parameter bactopia search
usage- Workflow overview
- blastdbcmd compatible seqid to assembly fasta
- allows search for entries with sample name
- Mask low coverage regions in consensus (subs only) fasta
- Added --dry_run to build conda envs one at a time (prevent parallel issues)
- Added Singularity recipes
- Added SLURM config
Fixed
¶
- Never ending typos
bactopia datasets
lowercase species names not found in MLST schemasbactopia version
no longer calls nextflow- SEQUENCE_TYPE channel groups FASTQ and assembly
- MINMER_QUERY channel groups FASTQ and signature
- Ariba MLST always running with
--noclean
- Bugs related
--compress
- Reduced size of per-base coverage outputs
- Removed
-parse_seqids
from makeblastdb command, caused blast queries to fail - genomeCoverageBed failing on empty BAM files
Removed
¶
--clean_cache
function
v1.1.0 bactopia/bactopia "Wooden Sword +1" - 2019/09/19¶
Added
¶
- NCBI's amrfinder
- Dockerfile for main bactopia install
- Completed documentation!
Fixed
¶
- insertion_sequences inputs are not now grouped into single channel
- Unintended FASTQ duplication via poor publishDir pattern
v1.0.1 bactopia/bactopia "Wooden Sword" - 2019/09/12¶
Added
¶
- README.md documentation
Fixed
¶
- call_variants_auto bug fixed
- documentation
- version numbers
v1.0.0 bactopia/bactopia "Wooden Sword" - 2019/09/04¶
- Initial release of bactopia/bactopia