Description of genome_statistics subworkflow
ksenia-krasheninnikova authored Nov 14, 2023
1 parent 3cbd4c1 commit 4bab6a4
Showing 1 changed file with 15 additions and 1 deletion.
16 changes: 15 additions & 1 deletion docs/output.md
@@ -129,7 +129,21 @@ The subworkflow performs scaffolding of the primary contigs using HiC mapping ge

### GENOME_STATISTICS

Is used at various stages of the main workflow to evaluate the quality of contigs at the intermediate steps</p>
<details markdown="1">
<summary>Output files</summary>

- <code>*.assembly_summary</code>
  - numeric statistics for the primary (pri) and alternate (alt) sequences
- <code>*ccs.merquryk</code>
  - folder with Merqury plots and k-mer statistics
- <code>*busco</code>
  - folder with BUSCO results

</details>

This subworkflow evaluates the quality of sequences. It runs after the intermediate steps, such as raw assembly generation, purging and polishing, and again at the end of the pipeline once scaffolds are produced.</p>

![Genome statistics subworkflow](https://raw.githubusercontent.com/sanger-tol/genomeassembly/documentation/docs/images/v1/genome_statistics.png)
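
The output patterns listed above can be checked after a run with a short script. The following is a minimal sketch, not part of the pipeline: the function names and the idea of pointing it at one stage's results directory are assumptions made for illustration, and only the three glob patterns come from the list above.

```python
from pathlib import Path

# Glob patterns copied from the "Output files" list above.
# The dictionary keys are hypothetical labels used only in this sketch.
PATTERNS = {
    "assembly_summary": "*.assembly_summary",
    "merqury": "*ccs.merquryk",
    "busco": "*busco",
}


def find_genome_statistics_outputs(stage_dir):
    """Return {label: sorted list of matching paths} found under stage_dir."""
    root = Path(stage_dir)
    # rglob searches stage_dir recursively, so nested layouts are covered.
    return {
        label: sorted(root.rglob(pattern))
        for label, pattern in PATTERNS.items()
    }


def missing_outputs(stage_dir):
    """Return the sorted labels for which no file or folder was found."""
    found = find_genome_statistics_outputs(stage_dir)
    return sorted(label for label, paths in found.items() if not paths)
```

Calling `missing_outputs` on the directory of a given stage (the path is whatever your run produced) returns an empty list when all three kinds of output are present.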

### ORGANELLES
Implements steps to assemble the mitogenome</p>