Skip to content

Working with *too big* datasets #1432

Closed Answered by MaartenGr
huycke asked this question in Q&A
Discussion options

You must be logged in to vote

It would return an error like the following:

Are you sure that that is the exact error log you get when following the exact code as mentioned in the notebook "Topic Modeling on Large Data"? The reason why I am asking is that the error log you shared shows that you initialized the topic model as follows:

topic_model = BERTopic(language="english")

# Step 4: Fit the model to your data
topics, probabilities = topic_model.fit_transform(df['text'])

which is not according to the instructions of the notebook. Please share the error log that you get when you follow along with the notebook without changing any parameters.

If however, that is the exact error log that you get regardless of how you…

Replies: 3 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@huycke
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by huycke
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants