All test samples and genome data are shared on our public DNANexus project. You don't have to download any data for testing our pipeline on DNANexus platform.
There are two methods to run our pipeline on DNANexus.
- Building your own DX workflow from
chip.wdl
with dxWDL (CLI) - Using a pre-built DX workflow on our public DX project (Web UI)
This document describes instruction for the item 1).
-
Sign up for a DNANexus account.
-
Create a new DX project with name
[YOUR_PROJECT_NAME]
by clicking on "+New Project" on the top left. -
Git clone this pipeline.
$ git clone https://github.com/ENCODE-DCC/chip-seq-pipeline2
-
Move to pipeline's directory.
$ cd chip-seq-pipeline2
-
Download dxWDL.
$ wget https://github.com/dnanexus/dxWDL/releases/download/0.77/dxWDL-0.77.jar $ chmod +rx dxWDL-0.77.jar
-
Choose an appropriate input for your project (AWS or Azure):
- AWS
$ INPUT=examples/dx/ENCSR936XTK_subsampled_dx.json
- Azure
$ INPUT=examples/dx_azure/ENCSR936XTK_subsampled_dx_azure.json
- AWS
-
Compile
chip.wdl
with an input JSON for the SUBSAMPLED paired-end sample of ENCSR936XTK.$ PROJECT=[YOUR_PROJECT_NAME] $ OUT_FOLDER=/test_sample_chip_ENCSR936XTK_subsampled $ java -jar dxWDL-0.77.jar compile chip.wdl -project ${PROJECT} -f -folder ${OUT_FOLDER} -defaults ${INPUT} -extras workflow_opts/docker.json
-
Go to DNANexus project page and click on your project.
-
Move to the directory
/test_sample_chip_ENCSR936XTK_subsampled
. -
You will find a DX workflow
chip
with all parameters pre-defined. Click on it. -
Specify an output directory by clicking "Workflow Actions" on the top right. Click on "Set output folder" and choose an output folder.
-
Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.
-
It will take about an hour. You will be able to find all outputs on your output folder. Final QC report (
qc.html
)/JSON (qc.json
) will be found on it. -
See full specification for input JSON file.