Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

languages #2

Open
gsiolas opened this issue Feb 10, 2016 · 3 comments
Open

languages #2

gsiolas opened this issue Feb 10, 2016 · 3 comments

Comments

@gsiolas
Copy link

gsiolas commented Feb 10, 2016

is there a way for users to provide or help you provide new languages to the ntlk mashape api? thx!

@japerk
Copy link
Owner

japerk commented Feb 16, 2016

The best thing that could help would be to send me a link to a good training corpus. The API is generally based on the NLTK corpora, except for sentiment, which uses other movie review corpora. So if you know of a good training corpus for your language, let me know, and I'll see what I can do.

@gsiolas
Copy link
Author

gsiolas commented Feb 16, 2016

Hi Jacob,
do you need training corpora or something like that
https://github.com/MKLab-ITI/greek-sentiment-lexicon (link 2
https://github.com/MKLab-ITI/greek-sentiment-lexicon/blob/master/greek_sentiment_lexicon.tsv)
could be of use? an already sentiment-rated lexicon...
Giorgos

On Wed, Feb 17, 2016 at 12:04 AM, Jacob Perkins [email protected]
wrote:

The best thing that could help would be to send me a link to a good
training corpus. The API is generally based on the NLTK corpora, except for
sentiment, which uses other movie review corpora. So if you know of a good
training corpus for your language, let me know, and I'll see what I can do.


Reply to this email directly or view it on GitHub
#2 (comment)
.

@japerk
Copy link
Owner

japerk commented Feb 17, 2016

Thanks for that. Unfortunately, I don't have a setup yet for keyword based sentiment analysis. What I need is something like the polarity dataset here: https://www.cs.cornell.edu/people/pabo/movie-review-data/. The ideal structure is sentences or paragraphs, each classified as pos or neg.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants