[dev-ratio] add deseq2 and gprofiler2 into the experimental branch #306

suzannejin · 2024-10-22T13:13:21Z

PR checklist

github-actions · 2024-10-22T13:15:36Z

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 4343c6b

+| ✅ 320 tests passed       |+
#| ❔   7 tests were ignored |#
!| ❗   4 tests had warnings |!

❗ Test warnings:

pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
pipeline_todos - TODO string in base.config: Check the defaults for all processes

❔ Tests ignored:

files_exist - File is ignored: assets/multiqc_config.yml
nextflow_config - Config default ignored: params.tools
nextflow_config - Config default ignored: params.report_file
nextflow_config - Config default ignored: params.logo_file
nextflow_config - Config default ignored: params.css_file
nextflow_config - Config default ignored: params.citations_file
multiqc_config - multiqc_config

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-differentialabundance_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-differentialabundance_logo_light.png
files_exist - File found: docs/images/nf-core-differentialabundance_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: conf/igenomes_ignored.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-differentialabundance_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/NfcoreTemplate.groovy
files_exist - File not found check: lib/Utils.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/WorkflowMain.groovy
files_exist - File not found check: lib/WorkflowDifferentialabundance.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Found nf-schema plugin
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: validation.help.enabled
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable found: validation.help.beforeText
nextflow_config - Config variable found: validation.help.afterText
nextflow_config - Config variable found: validation.help.command
nextflow_config - Config variable found: validation.summary.beforeText
nextflow_config - Config variable found: validation.summary.afterText
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config variable (correctly) not found: params.max_cpus
nextflow_config - Config variable (correctly) not found: params.max_memory
nextflow_config - Config variable (correctly) not found: params.max_time
nextflow_config - Config variable (correctly) not found: params.validationFailUnrecognisedParams
nextflow_config - Config variable (correctly) not found: params.validationLenientMode
nextflow_config - Config variable (correctly) not found: params.validationSchemaIgnoreParams
nextflow_config - Config variable (correctly) not found: params.validationShowHiddenParams
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 1.6.0dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.study_name= study
nextflow_config - Config default value correct: params.study_type= rnaseq
nextflow_config - Config default value correct: params.study_abundance_type= counts
nextflow_config - Config default value correct: params.observations_id_col= sample
nextflow_config - Config default value correct: params.observations_type= sample
nextflow_config - Config default value correct: params.features_id_col= gene_id
nextflow_config - Config default value correct: params.features_name_col= gene_name
nextflow_config - Config default value correct: params.features_type= gene
nextflow_config - Config default value correct: params.features_metadata_cols= gene_id,gene_name,gene_biotype
nextflow_config - Config default value correct: params.features_gtf_feature_type= transcript
nextflow_config - Config default value correct: params.features_gtf_table_first_field= gene_id
nextflow_config - Config default value correct: params.affy_file_name_col= file
nextflow_config - Config default value correct: params.affy_background= true
nextflow_config - Config default value correct: params.affy_bgversion= 2
nextflow_config - Config default value correct: params.affy_cdfname= null
nextflow_config - Config default value correct: params.affy_build_annotation= true
nextflow_config - Config default value correct: params.proteus_measurecol_prefix= LFQ intensity
nextflow_config - Config default value correct: params.proteus_norm_function= normalizeMedian
nextflow_config - Config default value correct: params.proteus_plotsd_method= violin
nextflow_config - Config default value correct: params.proteus_plotmv_loess= true
nextflow_config - Config default value correct: params.proteus_palette_name= Set1
nextflow_config - Config default value correct: params.filtering_min_abundance= 1.0
nextflow_config - Config default value correct: params.filtering_min_samples= 1.0
nextflow_config - Config default value correct: params.filtering_min_proportion_not_na= 0.5
nextflow_config - Config default value correct: params.exploratory_clustering_method= ward.D2
nextflow_config - Config default value correct: params.exploratory_cor_method= spearman
nextflow_config - Config default value correct: params.exploratory_n_features= 500
nextflow_config - Config default value correct: params.exploratory_whisker_distance= 1.5
nextflow_config - Config default value correct: params.exploratory_mad_threshold= -5
nextflow_config - Config default value correct: params.exploratory_main_variable= auto_pca
nextflow_config - Config default value correct: params.exploratory_assay_names= raw,normalised,variance_stabilised
nextflow_config - Config default value correct: params.exploratory_final_assay= variance_stabilised
nextflow_config - Config default value correct: params.exploratory_palette_name= Set1
nextflow_config - Config default value correct: params.differential_feature_id_column= gene_id
nextflow_config - Config default value correct: params.differential_fc_column= log2FoldChange
nextflow_config - Config default value correct: params.differential_pval_column= pvalue
nextflow_config - Config default value correct: params.differential_qval_column= padj
nextflow_config - Config default value correct: params.differential_min_fold_change= 2.0
nextflow_config - Config default value correct: params.differential_max_pval= 1.0
nextflow_config - Config default value correct: params.differential_max_qval= 0.05
nextflow_config - Config default value correct: params.differential_feature_name_column= gene_name
nextflow_config - Config default value correct: params.differential_foldchanges_logged= true
nextflow_config - Config default value correct: params.differential_palette_name= Set1
nextflow_config - Config default value correct: params.deseq2_test= Wald
nextflow_config - Config default value correct: params.deseq2_fit_type= parametric
nextflow_config - Config default value correct: params.deseq2_sf_type= ratio
nextflow_config - Config default value correct: params.deseq2_min_replicates_for_replace= 7
nextflow_config - Config default value correct: params.deseq2_independent_filtering= true
nextflow_config - Config default value correct: params.deseq2_lfc_threshold= 0
nextflow_config - Config default value correct: params.deseq2_alt_hypothesis= greaterAbs
nextflow_config - Config default value correct: params.deseq2_p_adjust_method= BH
nextflow_config - Config default value correct: params.deseq2_alpha= 0.1
nextflow_config - Config default value correct: params.deseq2_minmu= 0.5
nextflow_config - Config default value correct: params.deseq2_vs_method= vst
nextflow_config - Config default value correct: params.deseq2_shrink_lfc= true
nextflow_config - Config default value correct: params.deseq2_cores= 1
nextflow_config - Config default value correct: params.deseq2_vs_blind= true
nextflow_config - Config default value correct: params.deseq2_vst_nsub= 1000
nextflow_config - Config default value correct: params.limma_spacing= null
nextflow_config - Config default value correct: params.limma_block= null
nextflow_config - Config default value correct: params.limma_correlation= null
nextflow_config - Config default value correct: params.limma_method= ls
nextflow_config - Config default value correct: params.limma_proportion= 0.01
nextflow_config - Config default value correct: params.limma_stdev_coef_lim= 0.1,4
nextflow_config - Config default value correct: params.limma_winsor_tail_p= 0.05,0.1
nextflow_config - Config default value correct: params.limma_lfc= 0
nextflow_config - Config default value correct: params.limma_adjust_method= BH
nextflow_config - Config default value correct: params.limma_p_value= 1.0
nextflow_config - Config default value correct: params.propd_alpha= null
nextflow_config - Config default value correct: params.propd_moderated= true
nextflow_config - Config default value correct: params.propd_fdr= 0.05
nextflow_config - Config default value correct: params.propd_permutation= 0
nextflow_config - Config default value correct: params.propd_ncutoffs= 100
nextflow_config - Config default value correct: params.propd_weighted_degree= false
nextflow_config - Config default value correct: params.propr_metric= rho
nextflow_config - Config default value correct: params.propr_ivar= clr
nextflow_config - Config default value correct: params.propr_alpha= null
nextflow_config - Config default value correct: params.propr_fdr= 0.05
nextflow_config - Config default value correct: params.propr_permutation= 100
nextflow_config - Config default value correct: params.propr_ncutoffs= 100
nextflow_config - Config default value correct: params.propr_tails= right
nextflow_config - Config default value correct: params.gsea_permute= phenotype
nextflow_config - Config default value correct: params.gsea_nperm= 1000
nextflow_config - Config default value correct: params.gsea_scoring_scheme= weighted
nextflow_config - Config default value correct: params.gsea_metric= Signal2Noise
nextflow_config - Config default value correct: params.gsea_sort= real
nextflow_config - Config default value correct: params.gsea_order= descending
nextflow_config - Config default value correct: params.gsea_set_max= 500
nextflow_config - Config default value correct: params.gsea_set_min= 15
nextflow_config - Config default value correct: params.gsea_norm= meandiv
nextflow_config - Config default value correct: params.gsea_rnd_type= no_balance
nextflow_config - Config default value correct: params.gsea_make_sets= true
nextflow_config - Config default value correct: params.gsea_num= 100
nextflow_config - Config default value correct: params.gsea_plot_top_x= 20
nextflow_config - Config default value correct: params.gsea_rnd_seed= timestamp
nextflow_config - Config default value correct: params.gprofiler2_significant= true
nextflow_config - Config default value correct: params.gprofiler2_measure_underrepresentation= false
nextflow_config - Config default value correct: params.gprofiler2_evcodes= false
nextflow_config - Config default value correct: params.gprofiler2_max_qval= 0.05
nextflow_config - Config default value correct: params.gprofiler2_domain_scope= annotated
nextflow_config - Config default value correct: params.gprofiler2_min_diff= 1
nextflow_config - Config default value correct: params.gprofiler2_palette_name= Blues
nextflow_config - Config default value correct: params.grea_set_min= 15
nextflow_config - Config default value correct: params.grea_set_max= 500
nextflow_config - Config default value correct: params.grea_permutation= 100
nextflow_config - Config default value correct: params.shinyngs_build_app= true
nextflow_config - Config default value correct: params.shinyngs_shinyapps_account= null
nextflow_config - Config default value correct: params.shinyngs_shinyapps_app_name= null
nextflow_config - Config default value correct: params.gene_sets_files= null
nextflow_config - Config default value correct: params.report_title= null
nextflow_config - Config default value correct: params.report_author= null
nextflow_config - Config default value correct: params.report_description= null
nextflow_config - Config default value correct: params.report_scree= true
nextflow_config - Config default value correct: params.report_round_digits= 4
nextflow_config - Config default value correct: params.igenomes_base= s3://ngi-igenomes/igenomes/
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.validate_params= true
nextflow_config - Config default value correct: params.pipelines_testdata_base_path= https://raw.githubusercontent.com/nf-core/test-datasets/
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-differentialabundance_logo_light.png matches the template
files_unchanged - docs/images/nf-core-differentialabundance_logo_light.png matches the template
files_unchanged - docs/images/nf-core-differentialabundance_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_ci - '.github/workflows/ci.yml' is triggered on expected events
actions_ci - '.github/workflows/ci.yml' checks minimum NF version
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 24.04.2, Config: 24.04.2
readme - README Zenodo placeholder was replaced with DOI.
plugin_includes - No wrong validation plugin imports have been found
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (0 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
actions_schema_validation - Workflow validation passed: template_version_comment.yml
actions_schema_validation - Workflow validation passed: download_pipeline.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: clean-up.yml
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
modules_config - conf/modules.config found and not ignored.
modules_config - GUNZIP_GTF found in conf/modules.config and Nextflow scripts.
modules_config - GTF_TO_TABLE found in conf/modules.config and Nextflow scripts.
modules_config - VALIDATOR found in conf/modules.config and Nextflow scripts.
modules_config - AFFY_JUSTRMA_RAW found in conf/modules.config and Nextflow scripts.
modules_config - AFFY_JUSTRMA_NORM found in conf/modules.config and Nextflow scripts.
modules_config - PROTEUS found in conf/modules.config and Nextflow scripts.
modules_config - GEOQUERY_GETGEO found in conf/modules.config and Nextflow scripts.
modules_config - DESEQ2_NORM found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_DIFFERENTIALABUNDANCE found in conf/modules.config and Nextflow scripts.
modules_config - LIMMA_DIFFERENTIAL found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_DIFFERENTIALABUNDANCE found in conf/modules.config and Nextflow scripts.
modules_config - GSEA_GSEA found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_DIFFERENTIALABUNDANCE found in conf/modules.config and Nextflow scripts.
modules_config - PLOT_EXPLORATORY found in conf/modules.config and Nextflow scripts.
modules_config - PLOT_DIFFERENTIAL found in conf/modules.config and Nextflow scripts.
modules_config - SHINYNGS_APP found in conf/modules.config and Nextflow scripts.
modules_config - RMARKDOWNNOTEBOOK found in conf/modules.config and Nextflow scripts.
modules_config - MAKE_REPORT_BUNDLE found in conf/modules.config and Nextflow scripts.
modules_config - CUSTOM_MATRIXFILTER found in conf/modules.config and Nextflow scripts.
modules_config - CUSTOM_TABULARTOGSEACLS found in conf/modules.config and Nextflow scripts.
modules_config - CUSTOM_TABULARTOGSEAGCT found in conf/modules.config and Nextflow scripts.
modules_config - TABULAR_TO_GSEA_CHIP found in conf/modules.config and Nextflow scripts.
modules_config - PROPR found in conf/modules.config and Nextflow scripts.
modules_config - PROPD found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_DIFFERENTIALABUNDANCE found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_DIFFERENTIALABUNDANCE found in conf/modules.config and Nextflow scripts.
modules_config - MYGENE found in conf/modules.config and Nextflow scripts.
modules_config - GREA found in conf/modules.config and Nextflow scripts.
modules_config - NFCORE_DIFFERENTIALABUNDANCE found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline
nfcore_yml - nf-core version in .nf-core.yml is set to the latest version: 3.0.2

Run details

nf-core/tools version 3.0.2
Run at 2024-10-23 17:14:38

…hways/contrast/etc into different subfolders

bjlang · 2024-10-23T09:17:26Z

conf/modules.config

@@ -490,7 +490,7 @@ process {
    // However, later we need to better handle this, maybe by a bit of groovy scripting to
    // overwrite the repeated parameters (?)

-    withName: "PROPR"{
+    withName: PROPR {


Suggested change

withName: PROPR {

withName: 'PROPR' {

For consistency I'd use ' throughout the config

bjlang · 2024-10-23T09:32:49Z

conf/modules.config

+                (meta.diff_method ? "-from-${meta.diff_method}" : ''),
+                (meta.args_diff ? "-${meta.args_diff.replace('--','').replace(' ', '_')}" : ''),
+                (meta.cor_method ? "-from-${meta.cor_method}" : ''),
+                (meta.args_cor ? "-${meta.args_cor.replace('--','').replace(' ', '_')}" : ''),


With splitting the toolsheet information into only the subworkflow relevant chunks, the data accessed here wouldn't be existing anymore. I'm not sure such an elaborate file naming is necessary in this case - could it not be reduced to the pathway name and args_enr? (I dont see a reason for 2 different pathways with identical args_diff, args_cor and args_enr to exist)

but if you have two pathways:

deseq2 + gprofiler2

deseq2 + gsea

Would you save same deseq2 output twice, but into two different pathway directories?

I'm talking only about the enrichment methods like the GREA process here, where you would access not only enr arguments but also need to pass over all other args from previous steps to build the filename in the way you suggest. So, what I mean is to use only args_enr and if this is not specific enough maybe additionally the pathway_name.
For deseq2 in the differential analysis step using args_diff to build the name is fine.

but now that we don't have the pathway name info inside the meta anymore, we don't have any way to distinguish between them and store them into different places (?)

bjlang · 2024-10-23T09:40:35Z

conf/test_experimental.config

-    pathway = "propd,propd_grea,propr,cor"
+
+    // analysis
+    pathway = "propd,propd_grea,propd_ora,propr,cor,deseq2_ora"


You think propd_fdr and pcorbshrink can be safely omitted from the test?

propd_fdr is currently a bit useless for this test, because it is not giving any significant results for the test dataset, so it can be ignored.
But I would have to check other datasets for it to run and test

Then instead of pcorbshrink maybe I would ignore cor

bjlang · 2024-10-23T09:45:57Z

subworkflows/local/differential/main.nf

+include { DESEQ2_DIFFERENTIAL as DESEQ2 } from '../../../modules/nf-core/deseq2/differential/main'
+include { FILTER_DIFFTABLE as FILTER_DESEQ2 } from '../../../modules/local/filter_difftable'


I personally prefer to use renaming only when necessary, i.e. when a module is included multiple times. That way you always directly know for nf-core modules in which folder the module code is located, without needing to check the include statement.

bjlang · 2024-10-23T10:18:12Z

subworkflows/local/enrichment/main.nf

+    ch_results_genewise_filtered
+        .filter { it[0]["enr_method"] == "gprofiler2" }
+        .combine(ch_gene_sets)
+        .combine(ch_counts)
+        .multiMap { meta_results, results, meta_gene_sets, gene_sets, meta_counts, counts ->
+            de : [meta_results, results]
+            gmt : [gene_sets]
+            background : [counts]
+        }
+        .set{ ch_enrichment_gprofiler2 }
+
+    GPROFILER2_GOST(
+        ch_enrichment_gprofiler2.de,
+        ch_enrichment_gprofiler2.gmt,
+        ch_enrichment_gprofiler2.background
+    )


I don't see a reason for the multiMap here.

Why not something like

Suggested change

ch_results_genewise_filtered

.filter { it[0]["enr_method"] == "gprofiler2" }

.combine(ch_gene_sets)

.combine(ch_counts)

.multiMap { meta_results, results, meta_gene_sets, gene_sets, meta_counts, counts ->

de : [meta_results, results]

gmt : [gene_sets]

background : [counts]

}

.set{ ch_enrichment_gprofiler2 }

GPROFILER2_GOST(

ch_enrichment_gprofiler2.de,

ch_enrichment_gprofiler2.gmt,

ch_enrichment_gprofiler2.background

)

GPROFILER2_GOST(

ch_results_genewise_filtered.filter { it[0]["enr_method"] == "gprofiler2" },

ch_gene_sets.first()[1],

ch_counts.first()[1]

)

bjlang · 2024-10-23T10:20:29Z

subworkflows/local/experimental/main.nf

+    // parse optional input files that affect the normalization
+    // TODO we should consider to put this kind of stuff in a separate data handling / preprocessing / normalization block
+    if (params.control_features) {
+        ch_control_features = Channel.of([ [ "id": params.study_name  ], file(params.control_features, checkIfExists: true)]).first()
+    } else {
+        ch_control_features = [[],[]]
+    }
+    if (params.transcript_length_matrix) {
+        ch_transcript_lengths = Channel.of([ [ "id": params.study_name  ], file(params.transcript_length_matrix, checkIfExists: true)]).first()
+    } else {
+        ch_transcript_lengths = [[],[]]
+    }
+


As this is Deseq2 specific, wouldn't it be better to move it into the differential_analysis subworkflow?

it is good to call params within the subworkflows? I thought that should be avoided for a better encapsulation of the subworflow?
Also, not now, but in the future, I think it make sense to have two separated blocks with DESEQ2_NORM dealing the normalization of the data (so in the data processing block) that will use these control/transcript length features, and DESEQ2_DIFFERENTIAL running the differential analysis on the already normalized data

Yes I see your point, but I would argue that we have a better encapsulation, when a subworkflow can be so transparently changed (i.e. by adding a new module) that there is no need for making any changes to the calling workflow.

suzannejin added 3 commits October 21, 2024 12:40

experimental subworkflow can take gene_sets_file as optional input

7e47988

add gprofiler2 to experimental branch

7560a55

specified subworkflow:process name inside modules.config

f08aa0b

suzannejin requested review from pinin4fjords, WackerO and bjlang October 22, 2024 14:36

suzannejin added 2 commits October 22, 2024 17:17

add deseq2 to experimental branch

fd98508

modify modules.config to save the different output from different pat…

9111891

…hways/contrast/etc into different subfolders

suzannejin changed the title ~~[dev-ratio] add gprofiler2 into the experimental branch~~ [dev-ratio] add deseq2 and gprofiler2 into the experimental branch Oct 22, 2024

bjlang reviewed Oct 23, 2024

View reviewed changes

Merge branch 'dev-ratio' into dev-ratio-genesets

4343c6b

suzannejin closed this Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[dev-ratio] add deseq2 and gprofiler2 into the experimental branch #306

[dev-ratio] add deseq2 and gprofiler2 into the experimental branch #306

suzannejin commented Oct 22, 2024 •

edited

Loading

github-actions bot commented Oct 22, 2024 •

edited

Loading

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

bjlang Oct 23, 2024

bjlang Oct 23, 2024

bjlang Oct 23, 2024

suzannejin Oct 23, 2024 •

edited

Loading

bjlang Oct 23, 2024

suzannejin Oct 24, 2024

bjlang Oct 23, 2024

suzannejin Oct 23, 2024

bjlang Oct 23, 2024

bjlang Oct 23, 2024

bjlang Oct 23, 2024

suzannejin Oct 23, 2024 •

edited

Loading

bjlang Oct 23, 2024

		include { DESEQ2_DIFFERENTIAL as DESEQ2 } from '../../../modules/nf-core/deseq2/differential/main'
		include { FILTER_DIFFTABLE as FILTER_DESEQ2 } from '../../../modules/local/filter_difftable'

[dev-ratio] add deseq2 and gprofiler2 into the experimental branch #306

[dev-ratio] add deseq2 and gprofiler2 into the experimental branch #306

Conversation

suzannejin commented Oct 22, 2024 • edited Loading

PR checklist

github-actions bot commented Oct 22, 2024 • edited Loading

nf-core pipelines lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

suzannejin Oct 23, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

suzannejin Oct 23, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

suzannejin commented Oct 22, 2024 •

edited

Loading

github-actions bot commented Oct 22, 2024 •

edited

Loading

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

suzannejin Oct 23, 2024 •

edited

Loading

suzannejin Oct 23, 2024 •

edited

Loading