-
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
It depends on where the probability is retrieved from, namely the underlying cluster model. However, the probabilities are generally more dispersed across topics which results in lower probabilities. That, however, is from an absolute perspective and you generally want to compare relatively.
As mentioned above, it depends on the underlying cluster model. The probabilities with HDBSCAN, for example, are created after the actual assignment of clusters and therefore does not necessarily represent the training process. It is merely an approximation.
You can use
As mentioned above, this is a result of HDBSCAN that calculates the probabilities after assigning topics. So the probabilities are merely an approximation and may not match the inherent training process.
Also related to HDBSCAN calculates its probabilities. I would advise reading through HDBSCAN's documentation here and here. |
Beta Was this translation helpful? Give feedback.
It depends on where the probability is retrieved from, namely the underlying cluster model. However, the probabilities are generally more dispersed across topics which results in lower probabilities. That, however, is from an absolute perspective and you generally want to compare relatively.
As mentioned above, it depends on the underlying cluster model. The probabilities w…