Skip to content

A collection of amplicon sequencing data sets intended for use with the mbio-airflow-dags repository

Notifications You must be signed in to change notification settings

microbiomeDB/amplicon_sequencing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

In this directory are subdirectories for amplicon sequencing studies. 
Each study/ subdirectory needs:	
	1. a samplesheet.csv file w 2-4 columns `sampleID`,`forwardReads`,`reverseReads`,`run`
	2. a nf-params.json file w configuration for how airflow + nf-core should run
	3. fastq.gz files

There is also a file `amplicon_studies.csv` with two columns `studyName` and `studyPath`.
When you have made your subdirectory for your study and are happy w it, you should add
a row to this csv file. Eventually, modifications to this file trigger an Airflow run for the
associated nf-core/ampliseq DAG. For now, trigger it manually.

IMPORTANT NOTES:

  - It is best practice to include full (as opposed to relative) paths where they are needed
  - An example directory exists called test_study
  - There is a file called processed_studies_provenance.csv. Mostly you should NOT touch this.*

* Possible exception being to remove a study to force it to re-run next time amplicon_studies.csv
were touched.

About

A collection of amplicon sequencing data sets intended for use with the mbio-airflow-dags repository

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages