Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Concatenating two corpora forgets text feature #1054

Open
ajdapretnar opened this issue Apr 22, 2024 · 0 comments
Open

Concatenating two corpora forgets text feature #1054

ajdapretnar opened this issue Apr 22, 2024 · 0 comments

Comments

@ajdapretnar
Copy link
Collaborator

Describe the bug
When concatenating two corpora with the same column set as text feature (say, "content"), Concatenate forgets the variable was set as the text variable.

To Reproduce
Steps to reproduce the behavior:

  1. Take two corpora (i.e. Parlamint), set non-first string variable as text feature for both.
  2. Concatenate.
  3. Observe the new text feature. It is back to the default.

Proposed solution
Text features is currently only a property of Corpus (if I am not mistaken). Have the feature be encoded in data.attributes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant