Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Save Wiktionary categories #4

Open
rominf opened this issue Aug 28, 2020 · 3 comments
Open

Save Wiktionary categories #4

rominf opened this issue Aug 28, 2020 · 3 comments
Labels
not-in-dbnary The required data is not available in dbnary, yet.

Comments

@rominf
Copy link

rominf commented Aug 28, 2020

Thank you for the nice dictionaries, I really appreciate your work!

I'm building a Telegram bot for learning languages and I find wikdict dictionaries very helpful.

However, I think that it would be better to preserve categories in the databases, to allow user to select only required words from a certain categories.

@karlb
Copy link
Owner

karlb commented Aug 28, 2020

I'm totally willing to preserve the categories. But for that to work, I need to get them out of the wiki text markup into RDF, which has to be done for every language separately. This step is performed by http://kaiko.getalp.org/about-dbnary/ and I haven't seen the categories in there last time I looked. Categories for which language are most interesting for you? Could you give a specific example of a word an the expected category from Wiktionary (this avoids misunderstanding, there are many subtly different items in Wiktionary)?

@rominf
Copy link
Author

rominf commented Aug 28, 2020

I'm planning to build a universal bot, but for now I'm interested in three languages: English, Finnish, and Danish.
For example, it would be great if the user could request all computing words (https://en.wiktionary.org/wiki/Category:en:Computing), including https://en.wiktionary.org/wiki/program

@karlb
Copy link
Owner

karlb commented Aug 30, 2020

OK, I clearly know what you mean, now. I had a look at the RDF data from dnary again and was not able to find any category information. Until that changes, I don't see a way to add that data with an acceptable amount of effort.

@karlb karlb added the not-in-dbnary The required data is not available in dbnary, yet. label Aug 30, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
not-in-dbnary The required data is not available in dbnary, yet.
Projects
None yet
Development

No branches or pull requests

2 participants