-
I'd like to experiment with sending multiple lists of keywords created by c-TF-IDF, CountVectorizer, or even keywords made by MaximalMarginalRelevance into the LLM call with a prompt as such:
I don't see anything specifically within the docs referencing something like this, but is it possible? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
This is unfortunately not possible at the moment and only the main representations are given. Do note though that I actually haven't seen use cases where this is needed since LLMs tend to derive the topic labels more from the representative documents rather than the keywords. Moreover, although the keywords are likely to be different between c-TF-IDF and for for instance MMR, they will still be quite similar and contain overlap in keywords. Having said that, it would indeed be interesting to test whether there would actually be an effect of having multiple representations. I should note that I am a bit hesitant to implement this until there is clear proof it actually has a positive (and perhaps significant) effect on the resulting labels. |
Beta Was this translation helpful? Give feedback.
This is unfortunately not possible at the moment and only the main representations are given. Do note though that I actually haven't seen use cases where this is needed since LLMs tend to derive the topic labels more from the representative documents rather than the keywords. Moreover, although the keywords are likely to be different between c-TF-IDF and for for instance MMR, they will still be quite similar and contain overlap in keywords.
Having said that, it would indeed be interesting to test whether there would actually be an effect of having multiple representations. I should note that I am a bit hesitant to implement this until there is clear proof it actually has a positive (and p…