Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update SPAdes module to official nf-core and version bump #666

Merged
merged 26 commits into from
Oct 22, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
7d3650e
Replace local spades modules with official nf-core version (missing f…
jfy133 Sep 9, 2024
391ebd4
Update CHANGELOG.md
jfy133 Sep 9, 2024
e6301c8
Fix output docs
jfy133 Sep 10, 2024
debd080
Make sure warnings log also gets a prefix
jfy133 Sep 10, 2024
89f98e2
Merge branch 'dev' into bump-metaspades
jfy133 Sep 10, 2024
4d19b8f
Fix a syntax error (Ben made me do it)
jfy133 Sep 12, 2024
5437376
Switch local MEGAHIT to official nf-core one
jfy133 Sep 20, 2024
12b13d3
Fix wrongly set branch
jfy133 Sep 20, 2024
830a66e
Merge branch 'dev' into bump-metaspades
jfy133 Sep 20, 2024
7d7ad71
Fix linting
jfy133 Sep 20, 2024
4c9c5df
Merge branch 'dev' into bump-metaspades
jfy133 Oct 3, 2024
9d7ce62
Replace assembler single-use fix-cpu functions with single line defin…
jfy133 Oct 3, 2024
ce449f2
Ensure checkmax is still used
jfy133 Oct 3, 2024
efabee7
Specify also module changes
jfy133 Oct 3, 2024
d7e51ef
Lets see if SPAdes now works with fusion with the new container...
jfy133 Oct 4, 2024
9a2e958
Fix GTDBTk Url in schema
jfy133 Oct 22, 2024
5c00892
Merge branch 'dev' into bump-metaspades
jfy133 Oct 22, 2024
a77b7c2
Update CHANGELOG
jfy133 Oct 22, 2024
01065ae
Merge branch 'dev' into bump-metaspades
jfy133 Oct 22, 2024
717d433
Make sure meta mode is executed with the new module
jfy133 Oct 22, 2024
ebd18a1
Fix linting
jfy133 Oct 22, 2024
48cad97
Remove a TODO
jfy133 Oct 22, 2024
860926a
Add MEGAHIT prefix to ensure MEGAHIT at the beginning of the input fi…
jfy133 Oct 22, 2024
639addb
Re-add memory explicit parameter to MEGAHIT module (inbuilt into SPAd…
jfy133 Oct 22, 2024
ec318c7
Re-add MEGAHIT memory
jfy133 Oct 22, 2024
338a552
Use HTTPS URL for megahit container
jfy133 Oct 22, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,8 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
### `Changed`

- [#674](https://github.com/nf-core/mag/pull/674) - Changed to porechop-abi as default adapter trimming tool for long reads. User can still use porechop if preferred (added by @muabnezor)
- [#666](https://github.com/nf-core/mag/pull/666) - Update SPAdes to version 4.0.0, replace both METASPADES and MEGAHIT with official nf-core modules (requested by @elsherbini, fix by @jfy133)
- [#666](https://github.com/nf-core/mag/pull/666) - Update URLs to GTDB database downloads due to server move (reported by @Jokendo-collab, fix by @jfy133)
- [#695](https://github.com/nf-core/mag/pull/695) - Updated to nf-core 3.0.2 `TEMPLATE` (by @jfy133)
- [#695](https://github.com/nf-core/mag/pull/695) - Switch more stable Zenodo link for CheckM data (by @jfy133)

Expand All @@ -29,6 +31,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
| ------------ | ---------------- | ----------- |
| Porechop_ABI | | 0.5.0 |
| Filtlong | 0.2.0 | 0.2.1 |
| SPAdes | 3.15.3 | 4.0.0 |

### `Deprecated`

Expand Down
3 changes: 1 addition & 2 deletions conf/base.config
Original file line number Diff line number Diff line change
Expand Up @@ -10,9 +10,8 @@

process {

// TODO nf-core: Check the defaults for all processes
cpus = { 1 * task.attempt }
memory = { 6.GB * task.attempt }
memory = { 7.GB * task.attempt }
time = { 4.h * task.attempt }

errorStrategy = { task.exitStatus in ((130..145) + 104) ? 'retry' : 'finish' }
Expand Down
251 changes: 41 additions & 210 deletions conf/modules.config

Large diffs are not rendered by default.

24 changes: 12 additions & 12 deletions conf/test_full.config
Original file line number Diff line number Diff line change
Expand Up @@ -16,30 +16,30 @@ params {

// Input data for full size test
// hg19 reference with highly conserved and low-complexity regions masked by Brian Bushnell
host_fasta = "s3://ngi-igenomes/test-data/mag/hg19_main_mask_ribo_animal_allplant_allfungus.fa.gz"
input = "s3://ngi-igenomes/test-data/mag/samplesheets/samplesheet.full.csv"
host_fasta = "s3://ngi-igenomes/test-data/mag/hg19_main_mask_ribo_animal_allplant_allfungus.fa.gz"
input = "s3://ngi-igenomes/test-data/mag/samplesheets/samplesheet.full.csv"

//centrifuge_db = "s3://ngi-igenomes/test-data/mag/p_compressed+h+v.tar.gz"
kraken2_db = "s3://ngi-igenomes/test-data/mag/minikraken_8GB_202003.tgz"
cat_db = "s3://ngi-igenomes/test-data/mag/CAT_prepare_20210107.tar.gz"
kraken2_db = "s3://ngi-igenomes/test-data/mag/minikraken_8GB_202003.tgz"
cat_db = "s3://ngi-igenomes/test-data/mag/CAT_prepare_20210107.tar.gz"
// gtdb_db = "s3://ngi-igenomes/test-data/mag/gtdbtk_r214_data.tar.gz" ## This should be updated to release 220, once we get GTDB-Tk working again
skip_gtdbtk = true
skip_gtdbtk = true

// TODO TEMPORARY: deactivate SPAdes due to incompatibility of container with fusion file system
skip_spades = true
skip_spadeshybrid = true
skip_spades = false
skip_spadeshybrid = false

// reproducibility options for assembly
spades_fix_cpus = 10
spadeshybrid_fix_cpus = 10
megahit_fix_cpu_1 = true
spades_fix_cpus = 10
spadeshybrid_fix_cpus = 10
megahit_fix_cpu_1 = true

// available options to enable reproducibility for BUSCO (--busco_db) not used here
// to allow detection of possible problems in automated lineage selection mode using public databases

// test CAT with official taxonomic ranks only
cat_official_taxonomy = true
cat_official_taxonomy = true

// Skip CONCOCT due to timeout issues
skip_concoct = true
skip_concoct = true
}
16 changes: 8 additions & 8 deletions docs/output.md
Original file line number Diff line number Diff line change
Expand Up @@ -219,10 +219,10 @@ Trimmed (short) reads are assembled with both megahit and SPAdes. Hybrid assembl
<summary>Output files</summary>

- `Assembly/SPAdes/`
- `[sample/group]_scaffolds.fasta.gz`: Compressed assembled scaffolds in fasta format
- `[sample/group]_graph.gfa.gz`: Compressed assembly graph in gfa format
- `[sample/group]_contigs.fasta.gz`: Compressed assembled contigs in fasta format
- `[sample/group].log`: Log file
- `[sample/group].scaffolds.fa.gz`: Compressed assembled scaffolds in fasta format
- `[sample/group].assembly.gfa.gz`: Compressed assembly graph in gfa format
- `[sample/group].contigs.fa.gz`: Compressed assembled contigs in fasta format
- `[sample/group].spades.log`: Log file
- `QC/[sample/group]/`: Directory containing QUAST files and Bowtie2 mapping logs
- `SPAdes-[sample].bowtie2.log`: Bowtie2 log file indicating how many reads have been mapped from the sample that the metagenome was assembled from, only present if `--coassemble_group` is not set.
- `SPAdes-[sample/group]-[sampleToMap].bowtie2.log`: Bowtie2 log file indicating how many reads have been mapped from the respective sample ("sampleToMap").
Expand All @@ -238,10 +238,10 @@ SPAdesHybrid is a part of the [SPAdes](http://cab.spbu.ru/software/spades/) soft
<summary>Output files</summary>

- `Assembly/SPAdesHybrid/`
- `[sample/group]_scaffolds.fasta.gz`: Compressed assembled scaffolds in fasta format
- `[sample/group]_graph.gfa.gz`: Compressed assembly graph in gfa format
- `[sample/group]_contigs.fasta.gz`: Compressed assembled contigs in fasta format
- `[sample/group].log`: Log file
- `[sample/group].scaffolds.fa.gz`: Compressed assembled scaffolds in fasta format
- `[sample/group].assembly.gfa.gz`: Compressed assembly graph in gfa format
- `[sample/group].contigs.fa.gz`: Compressed assembled contigs in fasta format
- `[sample/group].spades.log`: Log file
- `QC/[sample/group]/`: Directory containing QUAST files and Bowtie2 mapping logs
- `SPAdesHybrid-[sample].bowtie2.log`: Bowtie2 log file indicating how many reads have been mapped from the sample that the metagenome was assembled from, only present if `--coassemble_group` is not set.
- `SPAdesHybrid-[sample/group]-[sampleToMap].bowtie2.log`: Bowtie2 log file indicating how many reads have been mapped from the respective sample ("sampleToMap").
Expand Down
10 changes: 10 additions & 0 deletions modules.json
Original file line number Diff line number Diff line change
Expand Up @@ -172,6 +172,11 @@
"git_sha": "283613159e079152f1336cef0db1c836086206e0",
"installed_by": ["modules"]
},
"megahit": {
"branch": "master",
"git_sha": "7755db15e36b30da564cd67fffdfe18a255092aa",
"installed_by": ["modules"]
},
"metabat2/jgisummarizebamcontigdepths": {
"branch": "master",
"git_sha": "911696ea0b62df80e900ef244d7867d177971f73",
Expand Down Expand Up @@ -247,6 +252,11 @@
"git_sha": "911696ea0b62df80e900ef244d7867d177971f73",
"installed_by": ["modules"]
},
"spades": {
"branch": "master",
"git_sha": "cfebb244d8c83ae533bf2db399f9af361927d504",
"installed_by": ["modules"]
},
"tiara/tiara": {
"branch": "master",
"git_sha": "911696ea0b62df80e900ef244d7867d177971f73",
Expand Down
40 changes: 0 additions & 40 deletions modules/local/megahit.nf

This file was deleted.

51 changes: 0 additions & 51 deletions modules/local/spades.nf

This file was deleted.

49 changes: 0 additions & 49 deletions modules/local/spadeshybrid.nf

This file was deleted.

6 changes: 6 additions & 0 deletions modules/nf-core/megahit/environment.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

70 changes: 70 additions & 0 deletions modules/nf-core/megahit/main.nf

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading
Loading