Human Neural Organoid Cell Atlas Toolbox

🛠️ The Swiss Army Knive of the Single Cell Cartographer

This package provides a set of tools we used to generate and analyze the Human Neural Organoid Cell Atlas. Among other things, it provides functions to:

Rapidly annotate cell types based on marker genes
Map query data to the reference atlas
Transfer annotations between datasets
Compute 'presence scores' for query data based on the reference atlas
Perform differential expression analysis

Installation

The latest release of HNOCA-tools can be installed with pip

pip install hnoca

Quick start

🖋️ Annotation

We developed snapseed to rapidly annotate the HNOCA. It annotates cells based on manually defined sets of marker genes for individual cell types or cell type hierarchies. It is fast (i.e. GPU-accelerated) and simple to enable annotation of very large datasets.

import hnoca.snapseed as snap
from hnoca.snapseed.utils import read_yaml

# Read in the marker genes
marker_genes = read_yaml("marker_genes.yaml")

# Annotate anndata objects
snap.annotate(
    adata,
    marker_genes,
    group_name="clusters",
    layer="lognorm",
)

# Or for more complex hierarchies
snap.annotate_hierarchy(
    adata,
    marker_genes,
    group_name="clusters",
    layer="lognorm",
)

🗺️ Mapping

For reference mapping, we mostly rely on scPoli and scANVI. Based on pretrained models, we here provide a simple interface to map query data to the reference atlas.

import scvi
import hnoca.map as mapping

# Load the reference model
ref_model = scvi.model.SCANVI.load(
    os.path.join("model.pt"),
    adata=ref_adata,
)

# Map query data
mapper = mapping.AtlasMapper(ref_model)
mapper.map_query(query_adata, retrain="partial", max_epochs=100, batch_size=1024)

Now that the query dataset is mapped, we can perform kNN-based label transfer and presence score calculation.

# Compute the weighted kNN
mapper.compute_wknn(k=100)

# Transfer labels
celltype_transfer = mapper.transfer_labels(label_key="cell_type")
presence_scores = mapper.get_presence_scores(split_by="batch")

📊 Differential expression

We have used ANOVA for DE analysis between the HNOCA and the reference atlas. Here, this is implemented as the test_de() function.

import hnoca.stats as stats

# Perform DE analysis
de_df = stats.test_de(
    joint_adata,
    group_key="origin",
    return_coef_group="organoid",
    adjust_method="holm",
)

In addition to DE testing on the atlas itself, we found it useful to treat the atlas as a universal "control" and test for DE w.r.t query datasets. For this, we first compute the matched expression profile for each cell in the query dataset and then test for DE using an F-test.

# Compute matched expression profiles based on mapped data
matched_adata = mapper.get_matched_expression()

# Perform DE analysis
de_df = stats.test_de_paired(
    query_adata,
    matched_adata,
    adjust_method="holm",
)

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
.github/workflows		.github/workflows
docs		docs
hnoca		hnoca
site		site
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mkdocs.yml		mkdocs.yml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human Neural Organoid Cell Atlas Toolbox

🛠️ The Swiss Army Knive of the Single Cell Cartographer

Installation

Quick start

🖋️ Annotation

🗺️ Mapping

📊 Differential expression

About

Releases 2

Packages

Contributors 3

Languages

License

devsystemslab/HNOCA-tools

Folders and files

Latest commit

History

Repository files navigation

Human Neural Organoid Cell Atlas Toolbox

🛠️ The Swiss Army Knive of the Single Cell Cartographer

Installation

Quick start

🖋️ Annotation

🗺️ Mapping

📊 Differential expression

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 3

Languages

Packages