cRegulon is an optimization model to identify combinatorial regulon from single cell expression and chromatin accessibility data.
This is cRegulon software: an optimization model to identify combinatorial regulon from single cell expression and chromatin accessibility data.
- Python >=3.0 with packages: numpy, sklearn, and scipy
- matlab >= 2021
- Homer
wget https://github.com/fengzhanying/cRegulon/archive/master.zip
unzip master.zip
cd cRegulon-master
wget https://www.dropbox.com/s/0h1wxlu7iqheajo/cRegulon.tar.gz
tar -xzvf cRegulon.tar.gz
The typic input file (RAd4_scRNA.txt) of scRNA-seq data is a gene by cell count matrix:
scRNA | Cell1 | Cell2 | Cell3 |
Gene1 | 5 | 0 | 3 |
Gene2 | 0 | 2 | 0 |
Gene3 | 1 | 0 | 0 |
scATAC | Cell1 | Cell2 | Cell3 | Cell4 |
Peak1 | 1 | 0 | 1 | 0 |
Peak2 | 0 | 1 | 0 | 1 |
Peak3 | 1 | 0 | 0 | 0 |
We run the following script to make the gene expression matrix and gene activity matrix:
source Preprocessing.sh RAd4
This process will produce gene expression file (RAd4_GE.txt) and gene activity file (RAd4_GA.txt)
With the input files are (RAd4_scRNA.txt) and (RAd4_scATAC.txt), we run the following script:
source PS_PECA.sh RAd4 mm10
This process will produce the TF-REs-TG triplets files (RAd4_network.txt) and TF-TG regulatory strength file (RAd4_TRS.txt).
With the input TF-TG regulatory strength file (RAd4_TRS.txt), we run the following script:
source runCSI.sh RAd4
This will generate normalized TF-TG regulatory strength file (RAd4_TRS.txt) and TF-TF combinatorial network (RAd4_CSI.txt).
With the input of TF-TF combinatorial network (C.txt), normalized TF-TG regulatory strength matrix (R.txt), gene expression matrix (GE.txt), and gene activity matrix (GA.txt), we run the following cRegulon model:
source cRegulon.sh RAd4
This will output:
- TF combinatorial effects in all cRegulons: X.txt
- cRegulon combination coefficients for scRNA-seq: H1.txt
- cRegulon combination coefficients for scATAC-seq: H2.txt
- TF modules of cRegulons: TFs (*TF.txt) and TF pairs (*TFPair.txt).
- Regulatory sub-network of each cRegulon: *SubNet.txt
If you use cRegulon software or cRegulon associated concepts, please cite
Zhanying Feng, et al. Modeling combinatorial regulon from single cell gene expression and chromatin accessibility data. 2023.