Changelog¶
v3.1.0 bactopia/bactopia "Dance Powder" 2024/09/22¶
"Dance Powder" is related to the Alabasta arc in One Piece which my kids currently on
Added¶
- Named Workflows
clean-yer-reads- Use Bactopia's read QC steps to Clean-Yer-Reads- Use
clean-yer-reads,cyrorbactopia --wf cleanyerreads
- Use
teton- Host removal and taxon classification with estimated abundances- Use
tetonorbactopia --wf teton
- Use
- Bactopia Tools (
bactopia --wf <NAME>)clermontyping- Clermont's Escherichia phylogroupingdefensefinder- Systematic search of all known anti-phage systemssccmec- SCCmec typing of Staphylococcus aureus genomesscrubber- Remove human reads from FASTQ files- screen human reads with
kraken2(against human pangenome) orsra-human-scrubber
- screen human reads with
shigapass- Predict Shigella serotypes and differentiate Shigella, EIEC and non-Shigella/EIEC
- full support of config files from nf-core/configs
- no longer prints efficiency for
standard,docker, andsingularityprofiles - now required non-integer values for
--max_time(e.g.4.h) and--max_memory(e.g.8.GB) - always import
base.config
- no longer prints efficiency for
amrfinderplusby combining results from genes, proteins and gff (coordinates)--amrfinder_dband--mlst_dbto specify custom databases- mlst and amrfinder databases can now be either a tarball or directory
pangenometool now uses panaroo as the default pangenome tool- file with reference name and samples included for
snippyrun snippycan now use--accessionto download a genbank file from NCBI- replaced
staphopiasccmecwithsccmecinstaphtyper - bump program versions in modules
abritamr: 1.0.17 -> 1.0.19bakta: 1.9.3 -> 1.9.4blast: 2.15.0 -> 2.16.0busco: 5.7.0 -> 5.7.1checkm: 1.2.2 -> 1.2.3defensefinder: 1.2.2 -> 1.3.0gtdbtk: 2.3.2 -> 2.4.0hicap: 1.0.3 -> 1.0.4 (@MartinVad)kleborate: 2.3.2 -> 3.0.9mobsuite: 3.1.8 -> 3.1.9panaroo: 1.4.2 -> 1.5.0pasty: 1.0.3 -> 2.2.1pbptyper: 1.0.4 -> 2.0.0seqsero2: 1.2.1 -> 1.3.1stecfinder: 1.1.1 -> 1.21.2tbprofiler: 6.1.0 -> 6.3.0
Fixed¶
- missing schema for clean-yer-reads and teton
- pinning of bioperl in prokka module with
strictchannel priority - use
--infile-listwithcsvtk concatto support 10k+ inputs pangenomewhen Bakta GFF (*.gff3) files are provided- missing file export in
gubbins - writing to non-default values for
--bactopia --includefile on cloud storage (@rwilliams)- import of ONT reads over illumina reads in certain bactopia tools (e.g. snippy) (Thanks D2)
- bactopia/datasets envs not being build by
bactopia download snippyputs files in subdirectory based on reference name- removed pneumocat from Merlin as it doesn't gracefully fail on negative results
- all the tests
- typo in
bactopia-tools.nf(thank you! @pvanheus)
Enhancements to OSS¶
- pinn macsyfinder version in defense-finder bioconda/bioconda-recipes#46824
- add recipe for
camlhmpbioconda/bioconda-recipes#47453 - add recipe for
sccmecbioconda/bioconda-recipes#47600 - update recipe for genotyphi bioconda/bioconda-recipes#47664
- disable phone home snpeff bioconda/bioconda-recipes#48007
- clermontyping use share folder bioconda/bioconda-recipes#49372
- add recipe for scrubby bioconda/bioconda-recipes#49373
- rebuild pasty bioconda/bioconda-recipes#50068
v3.0.1 bactopia/bactopia "That's My Girl" - 2024/03/25¶
"That's My Girl" is a phrase my youngest says quite often when playing with her friends
Added¶
- Updated
bactopia-assemblerto include updated Medaka models - profile for
arcc_hawk - TB Profiler results are now merged with
collatecommand - sample name to plasmidfinder results
- support config files from nf-core/configs
- updated AMRFinder+ database for v3.12.8
- bump program versions in modules
abritamr: 1.0.14 -> 1.0.17bactopia-assembler: 1.0.3 -> 1.0.4bactopia-teton: 1.0.1 -> 1.0.2bactopia-variants: 1.0.1 -> 1.0.2bakta: 1.8.2 -> 1.9.3blast: 2.14.1 -> 2.15.0busco: 5.5.0 -> 5.7.0gubbins: 3.3.0 -> 3.3.4multiqc: 1.15 -> 1.21mashtree: 1.4.5 -> 1.4.6mykrobe: 0.12.2 -> 0.13.0ncbi-amrfinderplus: 3.11.18 -> 3.12.8panaroo: 1.3.3 -> 1.4.2rgi: 6.0.2 -> 6.0.3sistr: 1.1.1 -> 1.1.2stecfinder: 1.1.0 -> 1.1.1tbprofiler: 5.0.0 -> 6.1.0
Fixed¶
- variable name in sketcher module
--para-offnot correctly implemented in PIRATE module- extra space in RGI container image name
- sourmash version STDERR not parsed out
- FASTQC writing to /tmp dir on HPC when non-readable
- abricate and ariba not using output subdirectory
- nextflow tower typos @iferres
- phispy not working with Bakta genbank files
- missed check of
--download_baktaparam in main workflow - quast not working when estimated genome size is 0
abricatenow gets database subdirectories
Enhancements to OSS¶
- fix amrfinderplus pinning in abritamr bioconda/bioconda-recipes#46714
- adjust python pinning in gubbins bioconda/bioconda-recipes#46713
- fix issue with sistr container bioconda/bioconda-recipes#46712
- update rgi pinning for pyrodigal bioconda/bioconda-recipes#46669
- pin tabix version in snippy bioconda/bioconda-recipes#46458
v3.0.0 bactopia/bactopia "Black Cat and Brown Dog" - 2023/09/11¶
"Black Cat and Brown Dog" are in memory of Stinky ("Black Cat") and Twotee ("Brown Dog")
Added¶
- Bactopia Tools (
bactopia --wf <NAME>)abritamr- A NATA accredited tool for AMR detectionblastn- Search against nucleotide BLAST databases using nucleotide queriesblastp- Search against protein BLAST databases using protein queriesblastx- Search against protein BLAST databases using translated nucleotide queriesbracken- Taxonomic classification and species abundance estimation of sequence readsbtyper3- Taxonomic classification of Bacillus cereus group isolatesmidas- Estimate bacterial species abundances from FASTQ filesquast- Assess the quality of assembled contigsphispy- Predict prophages in bacterial genomespneumocat- Assign capsular type to Streptococcus pneumoniae from sequence readssra-human-scrubber- Scrub human reads from FASTQ filesstecfinder- Serotyping Shigella toxin producing Escherichia coli genomestblastn- Search against translated nucleotide BLAST databases using protein queriestblastx- Search against translated nucleotide BLAST databases using translated nucleotide queries
- per-release databases for
amrfinderplusandmlst - new directory structure for outputs
- Renamed
--R1,--R2, and--SEto--r1,--r2, and--se --ontnow accepts a FASTQ file- GitHub Action to build environments
fastpis default read cleaner, can use previous methods with--use_bbmap- ability to use a BED file to mask regions in
snippy-core --save_sketchesto save Mash sketches created duringmashtreerun5- Porechop is now optional (
--use_porechop) - unified
publishDiracross modules - download datasets using
storeDirBACTOPIA:DATASETSaribasra-human-scrubber
- bump program versions in modules
bakta: 1.6.0 -> 1.8.2blast: 2.11.0 -> 2.14.1busco: 5.4.3 -> 5.5.0csktk: 0.25.0 -> 0.27.2eggnog-mapper: 2.1.9 -> 2.1.12genotyphi: 1.9.1 -> 2.0gtdbtk: 2.1.1 -> 2.3.2gubbins: 3.2.1 -> 3.3iqtree: 2.2.0.3 -> 2.2.2.7fastani: 1.33 -> 1.34mashtree: 1.2.0 -> 1.4.5mob_suite: 3.1.0 -> 3.1.7multiqc: 1.11 -> 1.15mykrobe: 0.12.0 -> 0.12.2ncbi-amrfinderplus: 3.10.45 -> 3.11.18ncbi-genome-download: 0.3.1 -> 0.3.3panaroo: 1.3.0 -> 1.3.3pasty: 1.0.0 -> 1.0.3phyloflash: 3.4 -> 3.4.2rgi: 6.0.1 -> 6.0.2shigatyper: 2.0.3 -> 2.0.5shigeifinder: 1.3.2 -> 1.3.5tbprofiler: 4.4.0 -> 5.0.0
Fixed¶
- All modules correctly initiate
params.options - OpenJDK java cpuset warning message (@idolawoye)
- Broken badges in README.md
- Pinned GSL to v2.6
- symlink in amrfinder+ update (
bactopia datasets) - hardcoded
--plusinamrfinderplus_run - tests for new directory structure
--nanohqnot being properly passed todragonflye
Removed¶
- Bactopia Steps
ANNOTATE_GENOME- now handled byprokkaorbaktaANTIMICROBIAL_RESISTANCE- now handled byamrfinderplusCALL_VARIANTS- now handled bysnippySEQUENCE_TYPE- now handled bymlst
bactopia datasetsis now incorporated intobactopia- Conda/Containers for all bactopia-main steps
- custom process labels, for generic
nf-coreprocess labels
Enhancements to OSS¶
- add module for midas nf-core/modules#2696
- add modules for sra-human-scrubber nf-core/modules#2694
- add module for stecfinder nf-core/modules#2702
- update midas pinnings to match docs bioconda/bioconda-recipes#38566
- add missing scripts to MLST bioconda/bioconda-recipes#38826
- fix GTDB-Tk container bioconda/bioconda-recipes#40333
- patch ncbi-genome-download bioconda-recipes#41640
- add module for pneumocat nf-core/modules#3592
- add module for abritamr/run nf-core/modules#3725
- fix hostrange() missing 1 required positional argument: 'database_directory' phac-nml/mob-suite#149
- add module for btyper3 nf-core/modules#3817
v2.2.0 bactopia/bactopia "Tornado Tempo" - 2022/11/28¶
Added¶
--dragonflye_optsto pass additional params to dragonflye--use_fastpto use fastp for QCing readsbactopia datasetsno longer depends onariba--skip_spell_checkinbactopia datasets- updated organisms available from PubMLST
- custom profile for
wsvl - updated citations
- use
mambaforgefor docker builds - Support tarball inputs for large databases
- Bactopia Tools (
bactopia --wf <NAME>)pasty- in silico serogrouping of Pseudomonas aeruginosa isolatespbptyper- In silico Penicillin Binding Protein (PBP) typer for Streptococcus pneumoniae assembliesshigeifinder- Serotyping Shigella and EIEC assemblies
- bump program versions in modules
bakta-> 1.5.1bbmap-> 39.01checkm-genome-> 1.2.2csvtk-> 0.25.0dragonflye-> 1.0.13eggnog-mapper-> 2.1.9fastani-> 1.33fastq-scan-> 1.0.1gamma-> 2.2gtdbtk-> 2.1.1iqtree-> 2.2.0.3kleborate-> 2.2.0mlst-> 2.23.0mob_suite-> 3.1.0mykrobe-> 0.12.0nanoplot-> 1.40.2ncbi-amrfinderplus-> 3.10.45panaroo-> 1.3.0pirate-> 1.0.5rasusa-> 0.7.0rgi-> 6.0.1shigatyper-> 2.0.3snpeff-> 5.1sourmash-> 4.5.0sra-tools-> 3.0.0tb-profiler-> 4.4.0
Fixed¶
- respect short polish hybrid assembly requests
- missing variable in
bakta_downloadmodule - restructure to match
nf-core/modules - compatibility with Nextflow >=22.10
- separate versions for Linux and OSX in
assemble_genomes bactopia datasetserror messages switched to warnings
Enhancements to OSS¶
- bump Bakta to 1.5.0 nf-core/modules#2000
- add recipe for pasty bioconda/bioconda-recipes#35930
- add recipe for pbptyper bioconda/bioconda-recipes#36222
- add module for pasty nf-core/modules#2003
- add module for pbptyper nf-core/modules#2005
- add module for shigeifinder nf-core/modules#2523
v2.1.1 bactopia/bactopia "Dragon Twister" - 2022/07/09¶
Added¶
- Add Panaroo to the
pangenomeBactopia Tools - Add
genotyphiandserobato Merlin - bump
fastq-dlto v1.1.1 - Bactopia Tools (
bactopia --wf <NAME>)busco- Assembly completeness based on evolutionarily informed expectationsgenotyphi- Salmonella Typhi genotyping with Mykrobe outputsgubbins- Rapid phylogenetic analysis of recombinant bacterial sequencesmcroni- Sequence variation in mobilized colistin resistance (mcr-1) genesmykrobe- Antimicrobial resistance detection for specific speciespanaroo- Pipeline for pangenome investigationsplasmidfinder- Plasmid identification from assembliesseroba- Serotyping of Streptococcus pneumoniae from sequence readssnippy- Rapid variant calling from Illumina sequence reads with optional core-SNP phylogeny
Fixed¶
- improve GTDB db handling
- Properly capture assemblies with 0
- user provided adapters and phix fastas
--available_datasetsin thebactopia datasets- name collisions in the Ariba Bactopia Tool
- bare double-quotes in Ariba reports
- updated tests for new version
Enhancements to OSS¶
- Add database to Seroba recipe bioconda/bioconda-recipes#35378
- Update ariba dependencies bioconda/bioconda-recipes#35383
- patch pymummer recipe bioconda/bioconda-recipes#35379
- Update PlasmidFinder for better container support bioconda/bioconda-recipes#35314
- add module for ariba nf-core/modules#1731
- add module for mcroni nf-core/modules#1750
- add module for Mykrobe and GenoTyphi nf-core/modules#1731
- add module for plasmidfinder nf-core/modules#1773
- add module for Seroba nf-core/modules#1816
- add module for snippy/core nf-core/modules#1855
- let tempfile determine temp dir location sanger-pathogens/seroba#68
- allow the user to specify temp dir or use the system default sanger-pathogens/pymummer#36
v2.1.0 bactopia/bactopia "Silver Mist" - 2022/06/08¶
Added¶
--use_baktato replace Prokka with Bakta in main workflow- Split Bakta into two processes
downloadandrun - added
shigatyperto Merlin - custom profile for
arcc bactopia prepareerror message when nothing found, and--examplesfor example use cases- renamed
--fastqsto--samplesto better reflect its usage (fastqs and assemblies) - add
--check_samplesto validate user generated FOFNs --short_polishfor short read polishing for long-read assemblies- Moved Ariba to Bactopia Tools
- bump versions in Bactopia envs
- bump Bactopia Tool versions
amrfinder+version to 3.10.30gtdbversion to 2.1.0mlstversion to 2.22.0
- Bactopia Tools (
bactopia --wf <NAME>)ariba- Gene identification by local assembliesgamma- Identification, classification, and annotation of translated gene matchesshigatyper: Shigella serotype from Illumina or Oxford Nanopore reads
Fixed¶
- @nickjhathaway Fixed assembly_level in
bactopia datasetsand autocall_variants bactopia searchnow supports ONT readsvcf-annotatorunable to write to/dev/stdout- typo in
bactopia downloadwhen using singularity bactopia downloadwill not rebuild envs its already built in a run--include_genuswithbactopia datasetsnow works with multiple samples- Parameters to copy optional species-specific files in
bactopia datasets - Don't use
AUTOfor IQ-TREE cpus gtdbdownload working again- empty FASTQs after adapter and phix removal
- low read count error for ONT samples
- Removed
sequence_typeenvironment
v2.0.3 bactopia/bactopia "King Cobra" - 2022/03/23¶
Added¶
- logo and new metro style workflow
- added
testprofile for easy testing mashdistnow outputs a merged TSV- print missing required parameters in Bactopia Tools
- bump bakta to v1.4.0
Fixed¶
- typo in
baktasubworkflow ismappernot available inbactopia-tools.nfkraken2not available inbactopia-tools.nf- Traits file being required for
pangenome mashtreemodule not collecting all FASTA- tests for ISMapper
v2.0.2 bactopia/bactopia "Black Mamba" - 2022/02/22¶
Added¶
bactopia downloadwhich merges thebactopia buildandbactopia pullcommands- modules can now use prebuilt envs
--available_speciesto print the species in a bactopia dataset--bbduk_optsto fine tune bbduk- set
quayas default docker registry - Let Nextflow handle stdout and stderr
- added
hpsuissero,legsta,sistr,ssuisseroto Merlin - Bactopia Tools (
bactopia --wf <NAME>)amrfinderplus: Identify antimicrobial resistance in genes or proteinsabricate: Mass screening of contigs for antimicrobial and virulence genescheckm: Assess the assembly quality of your sampleshpsuissero: Serotype prediction of Haemophilus parasuis assemblieskraken2: Taxonomic classification of sequence readslegsta: Typing of Legionella pneumophila assembliesmlst: Scan contig files against PubMLST typing schemesmobsuite: Reconstruct and annotate plasmids in bacterial assembliesrgi: Predict antibiotic resistance from assembliessistr: Serovar prediction of Salmonella assembliesssuissero: Serotype prediction of Streptococcus suis assemblies
Fixed¶
toIntegeronnullgenome size- Index out of bounds in mlst-blast.py (caused by non allele columns)
- FOFN will fail if sample name is
null - improve available workflow logic
- PIRATE not exporting gene_presence_absence.csv correctly
- channel imports for
scoaryworkflow argument type mismatchwhen using--use_roary- missing channel when
agrvatenot ran asstaphtyper - float required options to the top when missing
- broken tests due to changes
bactopia datasetsincluding hidden directories- Bioperl pinnings for prokka and pirate
v2.0.1 bactopia/bactopia "Red Roc" - 2022/01/06¶
Added¶
- Merlin for auto selection of species-specific tools based
--ask_merlinto execute the Merlin subworkflow in Bactopia- Schema compatible with NF-Tower
--skip_qc_plotsto skip FastQC and Nanoplot- Disable CheckM by default, must use
--run_checkm --cluster_optsfor passing additional options to SLURM- inclusion of local files (assembly, gff) in some subworkflows
- workflow for dev conda builds
- all subworkflows can be imported on other workflows
- Bactopia Tools (
bactopia --wf <NAME>)gtdb: Identify marker genes and assign taxonomic classificationsmashdist: Calculate Mash distances between sequencesmerlin: MinmER assisted species-specific bactopia tool seLectIoN
Fixed¶
- Resource over allocation for java related programs (-Xmx)
- ordering of skipped steps when
--datasetsnot used - logic in container selection for nf-core modules
- channel manipulation in
ncbigenomedownload - missing logic for
fastani - version captures in subworkflows
- spell check for species with
[or]in their names - version outputs for Staphtyper subworkflow
- use process names in version.yml
CSVTK_CONCATempty channelsoptions.suffixnow used asprefixin modules
v2.0.0 bactopia/bactopia "Red Hawk" - 2021/12/05¶
With Bactopia v2 comes a lot of changes! I would like to extend a huge thanks to Davi Marcon and Abhinav Sharma for their work initially converting Bactopia to DSL2. Your efforts were the momentum I needed to get the ball rolling on Bactopia v2. Thank you very much for taking your time to make such a siginificant contribution!
Added¶
- support for Nanopore reads
staphopiaas a named pipeline (alias forbactopia --wf staphopia) for S. aureus genomesbactopia/bactopia-testsrepo with test data- walkthrough for testing
bactopia-datasets/staphylococcus_aureusrepo with curatated S. aureus datasets- per-module testing via
pytest(100+ tests and 7000+ outputs tested) - per-module
meta.ymlandparams.jsonfor auto-building docs site - framefork for adding new Bactopia Tools
- 19 total Bactopia Tools (
bactopia --wf <NAME>)- Subworkflows (3)
eggnog: Functional annotation of proteins using orthologous groups and phylogeniespangenome: Pangenome analysis with optional core-genome phylogenystaphtyper: Determine the agr, spa and SCCmec types for Staphylococcus aureus genomes
- Modules (16):
agrvate: Rapid identification of Staphylococcus aureus agr locus type and agr operon variants.bakta: Rapid annotation of bacterial genomes and plasmidsectyper: In-silico prediction of Escherichia coli serotypeemmtyper: emm-typing of Streptococcus pyogenes assembliesfastani: fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI)hicap: Identify cap locus serotype and structure in your Haemophilus influenzae assembliesismapper: Identify insertion sites positions in bacterial genomeskleborate: Screening Klebsiella genome assemblies for MLST, sub-species, and other related genes of interestlissero: Serogroup typing prediction for Listeria monocytogenesmashtree: Quickly create a tree using Mash distancesmeningotype: Serotyping of Neisseria meningitidisngmaster: Multi-antigen sequence typing for Neisseria gonorrhoeaeseqsero2: Salmonella serotype prediction from reads or assembliesspatyper: Computational method for finding spa types in Staphylococcus aureusstaphopiasccmec: Primer based SCCmec typing of Staphylococcus aureus genomestbprofiler: Detect resistance and lineages of Mycobacterium tuberculosis genomes
- Subworkflows (3)
- Use
mambainstead of conda for env building - Reduced total Conda envs/Docker containers down to 7 (previously 12 not including bactopia tools)
- default to compressed outputs (
--skip_compressionto output uncompressed outputs) - Tutorial outputs made available
- update github actions
Fixed¶
- Cache issue causing
-resumeto fail - amrfinder+ database not compatible error
- incorrectly parsed system memory
Adapted from nf-core¶
- nf-core pytest setup
- nf-core/modules for bactopia tools
- Bactopia v2 release contributed 20+ modules to nf-core/modules
- nf-core/tools arg parser
- adapted to import params and usage based on config file
Process Consolidation¶
makeblastdb->assemble_genomecall_variants,download_reference->call_variantsfastq_status,estiamte_genome_size->gather_samplescount_31mers->minmer_sketch
Removed¶
bactopia tools-> Handled by Nextflow now (bactopia --wf <NAME>)bactopia versions-> Program versions are output ever run now.
v1.7.1 bactopia/bactopia "Shellbuster" - 2021/06/04¶
Added¶
- bumped GTDB to v1.5.0
- added soft ulimit for
staph-typer
Fixed¶
- Matched PIRATE's parameter syntax for the tools
staph-typernow uses GetBaseName
'Removed'¶
- PLSDB references in
bactopia datasets
v1.7.0 bactopia/bactopia "Chocobo Wand" - 2021/04/27¶
Added¶
- Bactopia Tool
staph_typerfor agr, spa, and sccmec typing --min_coverageparameter to filter based on min coverage
'Removed'¶
plasmid_blastno longer apart of main workflow
v1.6.5 bactopia/bactopia "Z's Trident" - 2021/03/30¶
Added¶
- version pins to process envs
Fixed¶
- syntax for sourmash 4.0
v1.6.4 bactopia/bactopia "Trident +1" - 2021/03/26¶
Added¶
- added Python3.6+ to all environments
v1.6.3 bactopia/bactopia "Trident" - 2021/03/25¶
Added¶
- extra fields to
mlst-blast.pyoutputs - added Python3 to
qc_readsenvironment
Fixed¶
- rstrip on empty extra fields in mlst profile
- different BLAST+ software versions mismatch
- tbb pinnings
--helpand--versionforbactopia tools
v1.6.2 bactopia/bactopia "Fuscina" - 2021/03/19¶
Added¶
- inputs are checked to be gzipped (this does not include FOFN)
--skip_amrto skip AMRFinder+ analysis- new
bactopia toolforhicap unicyclercan be used for Illumina reads only (--assembler unicycler)
Fixed¶
- AMRFinder+ software and database version mismatch
- check-fastqs.py syntax errors with prints
ismappertool processing of include/exclude files
v1.6.1 bactopia/bactopia "Obelisk" - 2021/02/22¶
Fixed¶
- sample names with "." in them breaking auto variant calling
- contig naming incompatible with GenBank
v1.6.0 bactopia/bactopia "Harpoon" - 2021/01/22¶
Added¶
bactopia pullto pre-build Singularity images--singularity_cacheparameter to set location of image downloads--registryto choose Docker registry to use (DockerHub, GitHub, Quay)--max_downloadssets maximum number of downloads (FASTQ/assembly) allowed at once--min_timesets the minimum amount of time a job should be givenbactopia searchnow uses POST requests, and groups accessions into single query- strip HTML from FASTA headers used in BLAST
- Dockerfiles now have conda.md5 label to determine if rebuild is necessary
- MD5 is updated in Dockerfile when env is updated
- AMRFinder+ database is now provided by
bactopia datasets - Parameterized profile (slurm, awsbatch, etc...) variables
bactopia build- will retry in case of HTTP connection issues
--include_toolwill build Bactopia Tool environments
- GitHub Actions
- build Docker containers on new release (or manual trigger)
- test that the Conda environment yamls are still valid
- test bactopia with conda on Linux and OSX
- test bactopia on Linux with Docker and Singularity
Fixed¶
- redundant environment version files
- failed FASTQ/Assembly downloads no longer stop whole run
--max_retryis honored now- antimicrobial_resistance process honors
amrdirvariable - change directory
antimicrobial_resistancetoantimicrobial-resistance - rename
check_staging.pytocheck-staging.pyfor consistency - Bactopia not producing valid exit code on failure
Removed¶
--containerPathvariable is replaced by--singularity_cache- Native Singularity recipes, will now convert Docker to Singularity
- docs are now on bactopia.github.io repo
v1.5.6 bactopia/bactopia "Metal Slime Earring" - 2021/01/13¶
Added¶
- tweaks to the CI (via GitHub Actions)
- docker containers use quay.io now
- docker containers now use conda-pack
--nfconfigwill skip the conda environment build step- input accessions are checked to be Assembly or Experiment accessions
- improved version increment script
- executor profiles can be configured by parameters
Fixed¶
- phyloflash and download_reference environment errors
- environment path in Bactopia Tools Dockerfile and Singularity recipes
- moved version from conda yaml to conda version file
- streamlined Docker recipes
- undefined
--ftp_onlymessage - typo in singularity profile
- stderr logged to file is also printed to screen
qc_readsmemory used now determined by base config
v1.5.5 bactopia/bactopia "Avenger's Earring" - 2021/01/04¶
Added¶
--prefixoption forbactopia prepare- date is included in
bactopia summaryoutput
Fixed¶
- removed usage personal (rpetit3) conda channel
- aspera connect no longer supported
- shovill-se is now used from Bioconda
- updated conda environments (phyloflash broken)
v1.5.4 bactopia/bactopia "Nemesis Earring" - 2020/12/17¶
Fixed¶
- quoted arguments being broken up (e.g.
--species "Staphylococcus aureus->--species Staphylococcus) mashtreetool failure to download with--accessions- remove ncbi-genome-download header when using
--dry-run - undefined
namevariable in plasmid_blast - custom work dir causing two
-wparameters bactopia searchresults now contains all (illumina and non-illumina)
v1.5.3 bactopia/bactopia "Morion Earring" - 2020/12/04¶
Added¶
- Changelog moved to docs
- recursive search for
bactopia prepare - allow multiple FASTQs per sample (FASTQs are merged)
Fixed¶
- unable to run
bactopia datasetswithout parameters - PLSDB blast results in invalid JSON format
- Error message for unaccepted
run_type
v1.5.2 bactopia/bactopia "Physical Earring" - 2020/11/18¶
Fixed¶
--skip_qccausing "file not found"qc_readsnot honoring FINAL_BP and FINAL_READS checks
v1.5.1 bactopia/bactopia "Astral Earring" - 2020/11/17¶
Fixed¶
bactopia toolsnot a valid project namebactopia tools--cleanup_workdirunrecognized variable
v1.5.0 bactopia/bactopia "Cassie Earring" - 2020/11/12¶
Added¶
- Conda environments will check if in sync with latest version now
- md5sums of all conda envs
- Verify species-specific datasets exist
- separate work dir for bactopia and bactopia tools
--cleanup_workdirto delete work directory after successful run- default values for
bactopia datasetssummary.json - Fallback to NCBI Assembly when eUtils is down
- Additional pre-process QC checks
- OSX/Linux conda envs for Bactopia Tools
- Documentation edits and updates
- Bactopia and Nextflow versions are now output for logging purposes
- option to skip QC step (
--skip_qc) bactopia datasetscan now specify assembly levelbactopia toolsnow use reusable conda envsbactopia toolsfor Roary and PIRATE can now include local assemblies
Fixed¶
- Warn user if no completed genomes are available
- use of
--generafor ncbi-genome-download - improved genome_size handling
- explicit file passing for AWS Batch
- Memory estimates now floored
- PLSDB blast not being executed
v1.4.11 bactopia/bactopia "Metamorph Ring" - 2020/09/19¶
Added¶
bactopia buildchecks if each environment is built before building- Can specify
bactopia buildto build a specific environment - Removed build numbers in Conda environment yamls
- Created separate Conda yamls for Linux and Mac
- NCBI assembly accessions will retrieve the most current version (e.g. .1, .2, .3, etc...)
Fixed¶
bactopia datasetstrailing whitespace in species namesbactopia datasetsrandom subsample missing specified species when--limitand--include_genusused- GitLab CI OSX compatibility
- Adaptive resource allocations
- Datasets are checked for existence
- Variant calls against references with multiple chromosomes
v1.4.10 bactopia/bactopia "Jelly Ring" - 2020/08/25¶
Added¶
cardis back as a default Ariba dataset- Added timestamps to
versionsfiles
Fixed¶
bactopia searchnot creating--outdirgtdbtool not using prefix in outdir namingpiratetool using pangenome alignment instead of core- Use of
scratchcausing logs to fail
v1.4.9 bactopia/bactopia "Toreador's Ring" - 2020/08/23¶
Added¶
- Support for multiple accession
bactopia search(SRA)- Bactopia Tools:
fastani,mashtree,pirate,roary(Assembly)
Fixed¶
- Undefined variable in
mapping_query.sh - ENA API endpoint for
bactopia search - Updated GTDB-TK to 1.3.0 to support latest downloads
- FastANI tool merge_results in no longer a separate step
- ANI is now one-to-many calculations
--reassemblemisapplied
v1.4.8 bactopia/bactopia "Shikaree Ring" - 2020/08/20¶
Added¶
- Versions are logged for Bactopia
- STDOUT/STDERR logs are kept for each sample
- can be skipped using
--skip_logs
- can be skipped using
Fixed¶
- Long sample names breaking Prokka annotation
- Syntax errors in Bactopia tools
- null values being tested as integers
- Ariba card and mlst downloads not working
- missing parameter in GTDB Bactopia tool
v1.4.7 bactopia/bactopia "Serket Ring" - 2020/08/17¶
Added¶
--no_cacheto skip caching ncbi assembly info
v1.4.6 bactopia/bactopia "Astral Ring" - 2020/08/17¶
Added¶
- Option to rebuild conda envs to default location
- Updated fastq-dl for sra-toolkit forced interaction workaround
v1.4.5 bactopia/bactopia "Bomb Queen Ring" - 2020/08/13¶
Fixed¶
--min_basepairsand--min_readsnot being honored
v1.4.4 bactopia/bactopia "Vilma's Ring" - 2020/08/13¶
Fixed¶
annotate_genomename clashes- Updated
fastq-dlversion to support new ENA API endpoint
v1.4.3 bactopia/bactopia "Sattva Ring" - 2020/08/13¶
Added¶
--skip_aribaoption inbactopia datasets
Fixed¶
bactopia versionsandbactopia citationsimproper execution- Convert spaces to tabs in citation doc
- Corrected CheckM name in program version info file
- CARD no longer default Ariba dataset download for
bactopia datasets
v1.4.2 bactopia/bactopia "Tamas Ring" - 2020/08/10¶
Added¶
- added requirement checks of
--accessionsinbactopia datasets - improved ENA spell check for species name
Fixed¶
- file of accessions not working with
bactopia datasets - Dockerfile and Singularity being missed by
update-version.sh
v1.4.1 bactopia/bactopia "Rajas Ring" - 2020/08/06¶
Added¶
- Links to publication (woohoo!)
- Can pass a Prodigal training file to
bactopia datasets
Fixed¶
- Typos in the Docs
- blast_primers now uses
blastnand blast_genes usesmegablast - validExitStatus deprecation warning
v1.4.0 bactopia/bactopia "Archer's Ring" - 2020/07/01¶
Added¶
- New Bactopia Tools
eggnogfor functional annotation using eggNOG-mappermashtreeto create a tree using Mash distancespirateto create pangenome using PIRATEismapperfor insertion site discovery- Documentation for new tools and tweaks to existing
- BTs
roaryandpiratecan now be run on just completed genomes - Can limit number of completed genomes downloaded where applicable
bactopia datasetscan provide list of RefSeq accessions to downloadbactopia searchcan now use BioSample and Run accessionsbactopia searchcan select a subset of Experiments associated with a BioSample- Support for organisms with multiple MLST schemas
- Assembly QC via QUAST and CheckM
- Assemblies (local or NCBI Assembly accession) as inputs for Bactopia
- Long reads as supplementary to paired end reads for hybrid assembly
- Tools versions are locked to the minor version, not the patch
Fixed¶
summarywill now determine absolute path of inputsfastaniimproved user reference import- went back a version on
call_variantsopenjdk version - all bactopia tools now put nextflow info in the same folder as outputs
- Typos in docs
- Bactopia Tools now check existence of include and exclude files
- Lots more documentation
- Updated citations/tools used by Bactopia
- Did I mention typos?
Removed¶
- ISMapper as part of the main pipeline (its now a tool)
insertion-sequencesin bactopia datasets
v1.3.1 bactopia/bactopia "Emperor Hairpin" - 2020/04/20¶
Added¶
summarytool now gives reason for ranksummarytools now splitsfailedintoexcludeandqc-failure- Better documentation on how
--cpusworks in Nextflow - Efficiency info when executed on standard profile
- split
blast_queryintoblast_genes,blast_primersandblast_proteins mapping_querynow creates multifasta of fastas at maps at once then splits per-base coverage into separate files--nfconfigusers can provide their own Nextflow config filefastaniusers can provide their own reference nowbactopia versionswill print versions for tools used by Bactopiabactopia citationswill print citations for tools and datasets used by Bactopiabactopia searchcan filter based on total bases, mean read length, and missing FASTQs- blast queries results are only JSON format for easy parsing later
- added
--compliantoption for Prokka annotation
Fixed¶
- build-containers.sh not working with Bactopia Tools
- Bactopia Tools container tools missing environment.yml
- Typo in
fastaniusage - Samples with multiple QC errors counted for each error
- Incorrect ISMapper version
- typo in
summarySLURM profile gtdbSingularity container not mounting path to GTDB databaseroarymissingrenamein containersblast_queryresults overwriting one anotherbuild-containers.shnow creates a "latest" tagbactopia tool roaryoutputs results based on the given prefix- renamed
--addgenesto--nogenes - updated ASA³P citation
- Typos in Bactopia Tools docs
- Link in README.md
v1.3.0 bactopia/bactopia "Leaping Boots" - 2020/02/19¶
Added¶
- bactopia tools (BT) framework
- docs for each tool
- subcommand to execute tools
bactopia tools fastani- pairwise average nucleotide identitygtdb- assigning objective taxonomic classificationsphyloflash- 16s assembly, alignment and treeroary- pan-genome and core genome treesummary- summary of results
--includeand--excludeto modify which samples used in BT analysisupdate-version.shimprovements- can now set how Nextflow publishes outputs (copy, symlink, etc...) via
--publish_mode - Warning if output directory already exists and require
--forceto overwrite - option (
--rfam) to turn on ncRNA annotation in Prokka - reduced "README.md" contents, instead point to documentation
- Updated acknowledgements and bibtex for citations
Fixed¶
- nextflow.config version out of sync
--available_datasetsaccessing not existent variable--available_datasetsis tested before requiring inputs- Make use of tbl2asn-forever
- adjusted how Bactopia is executed,
nextflow runno longer pulls from github
v1.2.4 bactopia/bactopia "Rabbit Charm" - 2019/12/20¶
Added¶
--conda_helpto be used for conda build test--skip_fastq_checkto skip check that input FASTQs meet minimum requirements- Undocumented parameters to the usage
Fixed¶
- snippy not working with samtools 1.10
- NXF_HOME variable is exported to the conda env location
- speed of checking if conda environments are built
v1.2.3 bactopia/bactopia "Tropical Punches +1" - 2019/12/19¶
Added¶
- select-references selects latest assembly accession version (BioPython/Entrez)
- select-references skips assemblies that have been excluded from RefSeq
- test to for paired-end related errors (e.g. different read counts)
--min_genome_sizeand--max_genome_sizeparameter for estimated genome sizes- Check is also made after assembly
update-version.shimprovements- Better genome size estimates using Mash for high and low coverage sequences
- Script to update conda environments
- added
--conda_helpto be used for conda build test
Fixed¶
--random_tie_breakalways true- not using latest assembly accession for ncbi-genome-download
- usage of assemblies that have been excluded from RefSeq
- allowing PE reads with different read counts to be processed (hint... they fail pretty quickly)
- failure to stop analysis of sample with low read counts
- coverage reported as 'inf'
- references to cgmlst in the setup datasets
- non-explicit patterns in publishDir
- low coverage/read errors after QC were not put in root dir
- snippy not working with samtools 1.10
v1.2.2 bactopia/bactopia "Tropical Punches" - 2019/10/22¶
Added¶
- Size of "work" directory to the execution summary
- User controlled overwrites of existing output files
- Check for unknown parameters at runtime
- FASTQ downloads from SRA (via fastq-dl and fasterq-dump)
- Documentation updates
- Script for building containers
Fixed¶
bactopiacommand now explicitly states which tag to use for Nextflow run- Version info not updated in Dockerfile and Singularity
- Duplicated QC'd FASTQs
- nextflow: docker "Memory limited without swap" error
Removed¶
- cgmlst support in
bactopia datasets - setup.py left over from pre-conda config
v1.2.1 bactopia/bactopia "Fruit Punches" - 2019/10/17¶
Added¶
bactopia buildto build Conda environments- Version info pulled from nextflow.config
- Set default values resource allocations
- Documentation on new changes
- Automatic building of Conda environments, if none exist
--nfdirto determine where bactopia is being run from
Fixed¶
- Never ending typos
--datasetsnow, not--dataset<-(Typo)- path for outputting Nextflow reports
- Typo in antimicrobial_resistance.sh (task.cpus not cpus)
--speciesis now consistent betweenbactopiaandbactopia datasets- Bug when checking if specific species dataset exists, but no species datasets exist
- Cleaned up version update script
- Cleaned up usage
Removed¶
--max_cpusability to limit total cores used, access to config is being deprecated in Nextflow--max_cpussince it is redudant to--cpusnow
v1.2.0 bactopia/bactopia "Beestinger" - 2019/10/16¶
Added¶
--compressto compress certain outputs, default uncompressed- Species name check in
bactopia datasets - Use requests package instead of urllib3
- Added
bactopia searchto query ENA for list of Illumina accessions - Documentation
- Feedback edits
- Output overview
- Additional program acknowledgements
- bibtex of citations
- missing parameters to usage
- info for
--genome_sizeparameter bactopia searchusage- Workflow overview
- blastdbcmd compatible seqid to assembly fasta
- allows search for entries with sample name
- Mask low coverage regions in consensus (subs only) fasta
- Added --dry_run to build conda envs one at a time (prevent parallel issues)
- Added Singularity recipes
- Added SLURM config
Fixed¶
- Never ending typos
bactopia datasetslowercase species names not found in MLST schemasbactopia versionno longer calls nextflow- SEQUENCE_TYPE channel groups FASTQ and assembly
- MINMER_QUERY channel groups FASTQ and signature
- Ariba MLST always running with
--noclean - Bugs related
--compress - Reduced size of per-base coverage outputs
- Removed
-parse_seqidsfrom makeblastdb command, caused blast queries to fail - genomeCoverageBed failing on empty BAM files
Removed¶
--clean_cachefunction
v1.1.0 bactopia/bactopia "Wooden Sword +1" - 2019/09/19¶
Added¶
- NCBI's amrfinder
- Dockerfile for main bactopia install
- Completed documentation!
Fixed¶
- insertion_sequences inputs are not now grouped into single channel
- Unintended FASTQ duplication via poor publishDir pattern
v1.0.1 bactopia/bactopia "Wooden Sword" - 2019/09/12¶
Added¶
- README.md documentation
Fixed¶
- call_variants_auto bug fixed
- documentation
- version numbers
v1.0.0 bactopia/bactopia "Wooden Sword" - 2019/09/04¶
- Initial release of bactopia/bactopia