High resolution model & distance function prior [draft] #337

vitkl · 2023-11-26T13:43:21Z

…tial KNN

…d distance prior, Gamma affinity prior, cell-type-independent global effects model (one receptor one effect not cell-type-specific receptor effect)) + updates to Visium HD model (normalisation)

…, per cell type normalisation, minor changes

…hoe prior, more likelihood options, zero diag & upper tri pathways, sqrt normalisation

…hanges

…age distance function, heatmap with vcenter

vitkl · 2024-10-18T13:18:12Z

Using total cell abundance estimates from histology images requires the following changes (use_proportion_factorisation_prior_on_w_sf = True):

Changing the parameterization of the factorisation prior to produce % of total cell abundance.
Forcing the model to match the provided total cell abundance estimates by using them as a prior with a very narrow distribution around the provided values (N_cells_per_location_alpha_prior=1000.0, use_n_s_cells_per_location_limit = True).
Changing detection_alpha=200.0 back to a narrow distribution.
Changing other priors.
Code modifications to support N_cells_per_location of shape=(n_obs, 1).
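
To get a feel for how narrow these priors are, here is a minimal sketch. It assumes the prior on a positive quantity is a Gamma distribution with shape `alpha` and rate `alpha / mean`, so the mean stays fixed and the coefficient of variation is `1 / sqrt(alpha)`. That parameterisation is an assumption made for illustration and may not match the model code exactly; the helper names are hypothetical.

```python
import numpy as np


def gamma_prior_cv(alpha: float) -> float:
    """Coefficient of variation of Gamma(shape=alpha, rate=alpha / mean).

    For this parameterisation the CV does not depend on the mean.
    """
    return float(1.0 / np.sqrt(alpha))


def gamma_prior_interval(mean: float, alpha: float, n_samples: int = 200_000, seed: int = 0):
    """Monte Carlo estimate of the central 95% interval of the assumed prior."""
    rng = np.random.default_rng(seed)
    # numpy uses shape/scale, so scale = 1 / rate = mean / alpha
    samples = rng.gamma(shape=alpha, scale=mean / alpha, size=n_samples)
    return np.quantile(samples, [0.025, 0.975])


if __name__ == "__main__":
    for name, alpha in [
        ("N_cells_per_location_alpha_prior", 1000.0),
        ("detection_alpha", 200.0),
    ]:
        print(f"{name}={alpha}: CV = {gamma_prior_cv(alpha):.3f}")
    # illustrative value only: a location with 10 cells provided as input
    print(gamma_prior_interval(mean=10.0, alpha=1000.0))
```

Under this assumption, alpha=1000.0 corresponds to roughly 3% relative spread around the provided value and alpha=200.0 to roughly 7%, which is why the provided abundance estimates act almost as fixed values.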

This branch can be installed as follows (I have not tested this particular recipe so please let me know if it doesn't work):

export PYTHONNOUSERSITE="True"
conda create -y -n c2l_v015 python=3.10
conda activate c2l_v015
pip install git+https://github.com/vitkl/scvi-tools.git@pyro_fixes
pip install "cell2location[tutorials,dev] @ git+https://github.com/BayraktarLab/cell2location.git@hires_sliding_window"
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124
pip install jupyter ipykernel
conda activate c2l_v015
python -m ipykernel install --user --name=c2l_v015 --display-name='Environment (c2l_v015)'
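
Since the recipe above is untested, a quick check that the key packages resolved in the new environment may save time. This is a small sketch using only the standard library; `check_environment` is a hypothetical helper, and the package list is an assumption based on the imports used in the usage instructions.

```python
from importlib.util import find_spec


def check_environment(packages=("cell2location", "scvi", "torch", "pyro")):
    """Return {import_name: bool} showing whether each package can be found.

    find_spec only locates the package; it does not import it,
    so this stays cheap and does not fail on missing packages.
    """
    return {name: find_spec(name) is not None for name in packages}


if __name__ == "__main__":
    for name, found in check_environment().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Once torch is found, `torch.cuda.is_available()` reports whether the installed CUDA build can see a GPU.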

Temporary usage instructions. These options will eventually be switched on together by mode='exact_total_cell_abundance':

detection_alpha = 200.0
N_cells_per_location_alpha_prior = 1000.0
use_per_cell_type_normalisation = False
# ideally this is not a count of cells but
# (% of the spot occupied by cells) * (0.9999 quantile of N cells across the data)
N_cells_per_location = adata_vis.obs[['n_cell_occupancy']].values.astype('float32')

A_B_per_location_alpha_prior = None
A_factors_per_location = 40.0
B_groups_per_location = 5.0

use_proportion_factorisation_prior_on_w_sf = True
use_n_s_cells_per_location_limit = True

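import numpy as np
import scvi
import torch
import cell2location

# 'high' permits faster, reduced-precision float32 matrix multiplication
torch.set_float32_matmul_precision('high')

# fix random seeds for reproducibility
seed = 0
scvi.settings.seed = seed
np.random.seed(seed)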

# assumes adata_vis, inf_aver, training (bool) and scvi_run_name (save path) are defined earlier

# prepare anndata for scVI model
cell2location.models.Cell2location.setup_anndata(
    adata=adata_vis, batch_key="sample"
)

if training:
    import pyro
    mod = cell2location.models.Cell2location(
        adata_vis, cell_state_df=inf_aver,
        amortised=False,
        N_cells_per_location=N_cells_per_location, # np.array shape (n_obs, 1)
        detection_alpha=detection_alpha,
        use_per_cell_type_normalisation=use_per_cell_type_normalisation,
        N_cells_per_location_alpha_prior=N_cells_per_location_alpha_prior,
        N_cells_mean_var_ratio=None,
        detection_hyp_prior={"mean_alpha": float(1.0)},
        detection_cell_type_prior_alpha=float(100.0),
        A_B_per_location_alpha_prior=A_B_per_location_alpha_prior,
        A_factors_per_location=A_factors_per_location,
        B_groups_per_location=B_groups_per_location,
        use_proportion_factorisation_prior_on_w_sf=use_proportion_factorisation_prior_on_w_sf,
        use_n_s_cells_per_location_limit=use_n_s_cells_per_location_limit,
        n_groups=50,
    )

    mod.view_anndata_setup()

    mod.train(max_epochs=80000,
              # train using full data (batch_size=None)
              batch_size=None,
              plan_kwargs={'optim': pyro.optim.Adam(optim_args={'lr': 0.002})},
              # use all data points in training because
              # we need to estimate cell abundance at all locations
              train_size=1,
              scale_elbo=1 / (adata_vis.n_obs * adata_vis.n_vars),
              accelerator='gpu')

    # Save model
    mod.save(f"{scvi_run_name}", overwrite=True)
else:
    # can be loaded later like this:
    mod = cell2location.models.Cell2location.load(f"{scvi_run_name}", adata_vis)

Note that this N_cells_per_location code doesn't support amortised=True.
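
The comment in the usage instructions suggests that `N_cells_per_location` should ideally be the spot occupancy multiplied by the 0.9999 quantile of N cells across the data, rather than a raw cell count. A minimal sketch of that calculation is below. It assumes occupancy is given as a fraction between 0 and 1 and that per-location cell counts from segmentation are available; `build_n_cells_per_location` and both input names are hypothetical and not part of cell2location.

```python
import numpy as np


def build_n_cells_per_location(occupancy_fraction, n_cells_segmented, q=0.9999):
    """Scale spot occupancy by a high quantile of the segmented cell counts.

    Returns a float32 array of shape (n_obs, 1).
    """
    occupancy = np.asarray(occupancy_fraction, dtype="float64").reshape(-1)
    counts = np.asarray(n_cells_segmented, dtype="float64").reshape(-1)
    if occupancy.shape != counts.shape:
        raise ValueError("occupancy and counts must have one value per location")
    if occupancy.min() < 0.0 or occupancy.max() > 1.0:
        raise ValueError("occupancy is expected as a fraction in [0, 1]")
    # upper bound on cells per location, robust to a few extreme counts
    upper = np.quantile(counts, q)
    return (occupancy * upper).reshape(-1, 1).astype("float32")


if __name__ == "__main__":
    # toy values for illustration only
    n_cells = build_n_cells_per_location([0.0, 0.5, 1.0], [0, 4, 8])
    print(n_cells.shape, n_cells.dtype)
```

The resulting array has the `(n_obs, 1)` shape expected by the constructor call above and, per the note, should be used with `amortised=False`.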

vitkl added 30 commits November 26, 2023 13:42

draft for hires model & distance function prior

6b987e6

missing dependency

f1c3127

bug fix

047f160

bug fix

58bffdd

getting rid of data_transform argument

adba6c2

working conv2d pooling + distance function effect

a99ca5e

add tests and bump version

444cbd4

more universal quantile method

d13de3c

minor bug fixes

f769044

more bug fixes

973a6c4

minor changes

2eccd8c

defined grid dataloader

3ac9b97

enabling complex variable shapes (right dimensions)

f940f6b

handling Dirichlet vars in obs plate using custom dim lists

e2e5deb

handling custom dataloader in model.train()

0f49fcc

model for cell compartments & conv2d redistribution & bug fixes

9db7e68

compatibility of tile and normal dataloader, registering tiles

527c36e

multi-resolution likelihood, loading many tiles, independent c2l prior

4667909

mask likelihood for non-tissue locations

a42699f

removing learnable weights for overdispersion

87713ce

add overdispersion scaling back

0fd4b05

minor bug fixes

aa2a3e2

add pyro_guide(*args, **kwargs) setup to model.train()

31e7885

guide updates & draft cell comm model & tests & defaults

286c1fe

dealing with multi-subunit receptors + bug fixes + batch disjoint spa…

c0747ae

…tial KNN

expanding tiles & overlapping tiles dataloader

f9d6478

filtering expanded tiles + minor changes

c462b45

temporary ordering bug fix

5429f1f

test indexing bug + minor changes + dense distance

60aaefb

minor changes

0efe255

minor changes

85484ff

vitkl mentioned this pull request Apr 21, 2024

Compatibility with VisiumHD #358

Closed

vitkl added 18 commits June 19, 2024 02:25

cell comm downstream model draft (incl input preprocessing, simplifie…

a418997

…d distance prior, Gamma affinity prior, cell-type-independent global effects model (one receptor one effect not cell-type-specific receptor effect)) + updates to Visium HD model (normalisation)

normal likelihood + minor changes

41e05b1

enable providing N_cells_per_location as array

4dc20c0

alternative N cells model (more flexible, allows N cells array input)…

ce403de

…, per cell type normalisation, minor changes

normalisation options

7ddcae6

scaling by hierarchical annotations, correct unexpanded tiles, horses…

c3eec10

…hoe prior, more likelihood options, zero diag & upper tri pathways, sqrt normalisation

bug fix

4569d79

proportion factorisation for w_sf prior, prior option for A & B inputs

5c12379

use spatial receptor distribution (baseline), LR affinity with 1

490a2cc

bug fix for non-negative model, tracking total S-R occupancy, minor c…

c725e54

…hanges

options for properly utilising segmented cell input

482258f

cell abundance proportional normalisation by N

fc60db2

diffusion domain function

fd2babc

changing distance prior, option to cap distance effect to 10x of aver…

92ed022

…age distance function, heatmap with vcenter

minor bug fixes and changes

77666b9

adding optional extra categorical + N_cells_per_location dtype bug fix

77f28d7

remove cell comm model from normal c2l

b1ab786

make dask optional

d0f5018

vitkl mentioned this pull request Oct 6, 2024

Compatibility issues with scvi-tools #385

Closed

vitkl mentioned this pull request Oct 18, 2024

Inputting nuclei counts into cell2location #344

Open
