Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A question on the paper on differentiation expression #40

Open
Chengwei94 opened this issue Jun 5, 2023 · 3 comments
Open

A question on the paper on differentiation expression #40

Chengwei94 opened this issue Jun 5, 2023 · 3 comments

Comments

@Chengwei94
Copy link

Hi there, thanks for the very nice package and paper. I think its an interesting and appealing way to think of single cells than the usual clustering method, and am quite excited for this.

I do have a question on the de part. In the paper, there is this part on calculating pjk instead of using the fjk results by the topic models because the DE analysis is a gene-by-gene analysis, whereas the topic model considers all genes at once. Is there any disadvantages in using the fjk because I assume that the fjk will be more accurate than pjk since that they take account the uncertainty in the topic proportions as well. Thanks!

@pcarbo
Copy link
Member

pcarbo commented Jun 5, 2023

@Chengwei94 You are right that a key difference is that the topic model considers all genes at once, whereas the GoM DE analysis is a gene-by-gene analysis, which conditions on the topic proportions. And therefore, indeed, the GoM DE analysis does not account for uncertainty in the topic proportions. However, you did state one thing incorrectly: the topic model (at least as it is implemented in fastTopics, as well as most software) does not account for uncertainty in any of the model parameters (the topic-specific expression levels and the topic proportions). In practice, the fjk's and the pjk's will usually be very similiar, but the GoM DE analysis also takes the extra step of quantifying uncertainty in the pjk's, and therefore allows for calculating measures of support (e.g., p-values, lfsr). Hope that helps!

@Chengwei94
Copy link
Author

Thanks. This has been helpful.

I do have another practical concern on the de analysis. I think one of the most important uses for de analysis is to compare between two similar population between two conditions. Are there any approaches you would recommend here?

@pcarbo
Copy link
Member

pcarbo commented Jun 6, 2023

You could consider poisson mash; the R package is here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants