Add fqtk as a demultiplexer #99

sam-white04 · 2023-03-08T16:19:38Z

Hello,

I have added fqtk as an optional demultiplexer. The tool fqtk requires an additional input to be provided as a path in the 5th column of --input samplesheet.csv.

Thank you,
Samantha White

PR checklist

This comment contains a description of changes (with reason).
If you've fixed a bug or added code that should be tested, add tests!
If you've added a new tool - have you followed the pipeline conventions in the contribution docs- [x] If necessary, also make a PR on the nf-core/demultiplex branch on the nf-core/test-datasets repository.
Make sure your code lints (nf-core lint).
Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
Usage Documentation in docs/usage.md is updated.
Output Documentation in docs/output.md is updated.
CHANGELOG.md is updated. (@sam-white04 Will update as soon as PR is posted)
README.md is updated (including new tool citations and authors/contributors).

github-actions · 2023-03-08T16:57:29Z

`nf-core lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 9b611b2

+| ✅ 155 tests passed       |+
#| ❔   1 tests were ignored |#
!| ❗   4 tests had warnings |!

❗ Test warnings:

pipeline_todos - TODO string in README.md: Add full-sized test dataset and amend the paragraph below if applicable
pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your prefered methods description, e.g. add publication citation for this pipeline
pipeline_todos - TODO string in awsfulltest.yml: You can customise AWS full pipeline tests as required
schema_description - No description provided in schema for parameter: skip_tools

❔ Tests ignored:

actions_ci - actions_ci

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-demultiplex_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-demultiplex_logo_light.png
files_exist - File found: docs/images/nf-core-demultiplex_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: lib/nfcore_external_java_deps.jar
files_exist - File found: lib/NfcoreSchema.groovy
files_exist - File found: lib/NfcoreTemplate.groovy
files_exist - File found: lib/Utils.groovy
files_exist - File found: lib/WorkflowMain.groovy
files_exist - File found: main.nf
files_exist - File found: assets/multiqc_config.yml
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: lib/WorkflowDemultiplex.groovy
files_exist - File found: modules.json
files_exist - File found: pyproject.toml
files_exist - File not found check: Singularity
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: docs/images/nf-core-demultiplex_logo.png
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: .travis.yml
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: params.show_hidden_params
nextflow_config - Config variable found: params.schema_ignore_params
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: '1.2.0dev'
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-demultiplex_logo_light.png matches the template
files_unchanged - docs/images/nf-core-demultiplex_logo_light.png matches the template
files_unchanged - docs/images/nf-core-demultiplex_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - lib/nfcore_external_java_deps.jar matches the template
files_unchanged - lib/NfcoreSchema.groovy matches the template
files_unchanged - lib/NfcoreTemplate.groovy matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
files_unchanged - pyproject.toml matches the template
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 22.10.1, Config: 22.10.1
readme - README Nextflow minimum version in Quick Start section matched config. README: 22.10.1, Config: 22.10.1
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (130 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
actions_schema_validation - Workflow validation passed: linting_comment.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
multiqc_config - 'assets/multiqc_config.yml' follows the ordering scheme of the minimally required plugins.
multiqc_config - 'assets/multiqc_config.yml' contains a matching 'report_comment'.
multiqc_config - 'assets/multiqc_config.yml' contains 'export_plots: true'.
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'

Run details

nf-core/tools version 2.7.2
Run at 2023-03-15 15:25:00

edmundmiller · 2023-03-08T17:04:38Z

I don't mind them being squashed but that commit message 1. Has nothing to do with fqtk 2. Is from another PR and I think we from a rebase that didn't work out as planned.

Co-authored-by: ewels <[email protected]> This is a combination of 3 commits. Fqtk off of dev Remove todo's from fqtk_demultiplex.nf Update demultiplex.nf Update README.md Update input file paths

Update CHANGELOG.md

edmundmiller

Few small comments and style changes, but looks pretty good!

README.md

subworkflows/local/fqtk_demultiplex/main.nf

edmundmiller · 2023-03-15T00:21:09Z

subworkflows/local/fqtk_demultiplex/main.nf

+
+    rg.ID = [fcid,lane].join(".")
+    rg.PU = [fcid, lane, index].findAll().join(".")
+    rg.PL = "SINGULAR"


Should this be hard-coded?

@emiller88 Good point, it probably should not be hardcoded. How else could it be filled in?

There might be a way to read in the read group. We can just throw a TODO on it though for now.

workflows/demultiplex.nf

edmundmiller · 2023-03-15T00:25:18Z

workflows/demultiplex.nf

+def extract_csv_fqtk(input_csv) {
+
+    // Flowcell Sheet schema
+    // Possible values for the "content" column: [meta, path, number, string, bool]
+    def input_schema = [
+        'columns': [
+            'id': [
+                'content': 'meta',
+                'meta_name': 'id',
+                'pattern': '',
+            ],
+            'samplesheet': [
+                'content': 'path',
+                'pattern': '^.*.csv$',
+            ],
+            'lane': [
+                'content': 'meta',
+                'meta_name': 'lane',
+                'pattern': '',
+            ],
+            'flowcell': [
+                'content': 'path',
+                'pattern': '',
+            ],
+            'per_flowcell_manifest': [
+                'content': 'path',
+                'pattern': '',
+            ]
+        ],
+        required: ['id','flowcell', 'samplesheet', 'per_flowcell_manifest'],
+    ]
+
+    return extract_csv(input_csv, input_schema)
+}


These are starting to get excessive. @matthdsm what happens if we put them in lib/? Would they still work the same?

@emiller88 Moving the prep for ch_flowcells into subworkflows would remove the need for these functions to be in demultiplex.nf, right?

Right, that's a great idea!

I've no qualms putting all the functions in /lib. They're only in the main file for convenience!

Co-authored-by: Edmund Miller <[email protected]>

sam-white04

@emiller88 Thank you for your comments. If everyone is on the same page, Im happy to pull the generation of ch_flowcells and ch_flowcells_tar into subworkflows for fqtk vs the other demultiplexers.

sam-white04 · 2023-03-15T14:58:35Z

subworkflows/local/fqtk_demultiplex/main.nf

+
+    rg.ID = [fcid,lane].join(".")
+    rg.PU = [fcid, lane, index].findAll().join(".")
+    rg.PL = "SINGULAR"


@emiller88 Good point, it probably should not be hardcoded. How else could it be filled in?

workflows/demultiplex.nf

sam-white04 · 2023-03-15T15:01:52Z

workflows/demultiplex.nf

+def extract_csv_fqtk(input_csv) {
+
+    // Flowcell Sheet schema
+    // Possible values for the "content" column: [meta, path, number, string, bool]
+    def input_schema = [
+        'columns': [
+            'id': [
+                'content': 'meta',
+                'meta_name': 'id',
+                'pattern': '',
+            ],
+            'samplesheet': [
+                'content': 'path',
+                'pattern': '^.*.csv$',
+            ],
+            'lane': [
+                'content': 'meta',
+                'meta_name': 'lane',
+                'pattern': '',
+            ],
+            'flowcell': [
+                'content': 'path',
+                'pattern': '',
+            ],
+            'per_flowcell_manifest': [
+                'content': 'path',
+                'pattern': '',
+            ]
+        ],
+        required: ['id','flowcell', 'samplesheet', 'per_flowcell_manifest'],
+    ]
+
+    return extract_csv(input_csv, input_schema)
+}


@emiller88 Moving the prep for ch_flowcells into subworkflows would remove the need for these functions to be in demultiplex.nf, right?

edmundmiller

I think this looks good to me. #102 For follow up to clean up some things that needed to be introduced here.

edmundmiller · 2023-03-16T17:04:10Z

If you rerun the CI enough, it works 🙃

sam-white04 requested review from matthdsm and edmundmiller as code owners March 8, 2023 16:19

edmundmiller assigned sam-white04 Mar 8, 2023

edmundmiller added this to the 1.2.0 milestone Mar 8, 2023

edmundmiller linked an issue Mar 8, 2023 that may be closed by this pull request

For sample demultiplexing from FASTQs, add fqtk. #87

Closed

sam-white04 force-pushed the fqtk branch from b63a6ee to 3249f91 Compare March 8, 2023 17:36

edmundmiller changed the title ~~fqtk additional tool~~ Add fqtk as a demultiplexer Mar 8, 2023

sam-white04 requested a review from nh13 March 13, 2023 17:58

sam-white04 force-pushed the fqtk branch from 53c1452 to 0d843a8 Compare March 14, 2023 21:57

edmundmiller and others added 3 commits March 14, 2023 16:49

Add fqtk to nf-core/demultiplex

a814de3

Co-authored-by: ewels <[email protected]> This is a combination of 3 commits. Fqtk off of dev Remove todo's from fqtk_demultiplex.nf Update demultiplex.nf Update README.md Update input file paths

Update fqtk

cb44e91

Update CHANGELOG.md

Fix whitespace in conf/modules.config

c07c941

sam-white04 force-pushed the fqtk branch from 0d843a8 to c07c941 Compare March 14, 2023 22:50

edmundmiller requested changes Mar 15, 2023

View reviewed changes

fqtk: Apply suggestions from Edmund Miller

9b611b2

Co-authored-by: Edmund Miller <[email protected]>

sam-white04 commented Mar 15, 2023

View reviewed changes

edmundmiller mentioned this pull request Mar 15, 2023

Clean up samplesheet reading #102

Open

edmundmiller approved these changes Mar 15, 2023

View reviewed changes

matthdsm approved these changes Mar 16, 2023

View reviewed changes

edmundmiller merged commit f5ae6bf into nf-core:dev Mar 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fqtk as a demultiplexer #99

Add fqtk as a demultiplexer #99

sam-white04 commented Mar 8, 2023 •

edited

Loading

github-actions bot commented Mar 8, 2023 •

edited

Loading

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

edmundmiller commented Mar 8, 2023 •

edited

Loading

edmundmiller left a comment

edmundmiller Mar 15, 2023

sam-white04 Mar 15, 2023

edmundmiller Mar 15, 2023

edmundmiller Mar 15, 2023

sam-white04 Mar 15, 2023

edmundmiller Mar 15, 2023

matthdsm Mar 16, 2023

sam-white04 left a comment

sam-white04 Mar 15, 2023

sam-white04 Mar 15, 2023

edmundmiller left a comment

edmundmiller commented Mar 16, 2023

Add fqtk as a demultiplexer #99

Add fqtk as a demultiplexer #99

Conversation

sam-white04 commented Mar 8, 2023 • edited Loading

PR checklist

github-actions bot commented Mar 8, 2023 • edited Loading

nf-core lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

edmundmiller commented Mar 8, 2023 • edited Loading

edmundmiller left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sam-white04 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

edmundmiller left a comment

Choose a reason for hiding this comment

edmundmiller commented Mar 16, 2023

sam-white04 commented Mar 8, 2023 •

edited

Loading

github-actions bot commented Mar 8, 2023 •

edited

Loading

`nf-core lint` overall result: Passed ✅ ⚠️

edmundmiller commented Mar 8, 2023 •

edited

Loading