Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test setup #2

Open
sindhura97 opened this issue Jan 29, 2024 · 2 comments
Open

Test setup #2

sindhura97 opened this issue Jan 29, 2024 · 2 comments

Comments

@sindhura97
Copy link

Hi, For inference, do you get embeddings of all texts for all intents in the test set and cluster them? Or do you construct batches like in the training phase?

@Anton87
Copy link
Contributor

Anton87 commented Mar 19, 2024

Hi @sindhura97 ,
this depends on the size of your problem, i.e how many utterances you need to cluster. If their corresponding embedding fit in memory, you can cluster all the utterances together during inference, otherwise you need to split by batch.

However, for inference it does not make a big difference between the two approaches. :-)

@sindhura97
Copy link
Author

Hi, my intention to ask this question was to know about the evaluation setup. Were the evaluation metrics obtained for one final clustering per dataset or are the metrics computed for each batch and averaged?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants