Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How about doing top domains by h-index? #3

Open
danluu opened this issue Apr 26, 2016 · 4 comments
Open

How about doing top domains by h-index? #3

danluu opened this issue Apr 26, 2016 · 4 comments

Comments

@danluu
Copy link

danluu commented Apr 26, 2016

I see you have domains by median, and submitters by h-index. Why not domains by h-index? :-).

I imagine the list will be dominated by news sites like techcrunch and the NYC, shared hosting platforms like medium and tumblr, and community sites like wikipedia and github, but I'd still be curious to see what the results are if it's easy for you to run the query.

@antontarasenko
Copy link
Owner

antontarasenko commented Apr 26, 2016

Sure. In hackernews/top-authors-h-index.sql, I replace [by] author with domain(url) author. And here's the top 25 (verbatim):

github.com  279  
null    276  
google.com  259  
nytimes.com 238  
blogspot.com    234  
techcrunch.com  208  
github.io   194  
arstechnica.com 185  
medium.com  183  
wired.com   180  
tumblr.com  172  
wordpress.com   170  
washingtonpost.com  167  
mozilla.org 163  
eff.org 163  
theguardian.com 147  
theatlantic.com 143  
bbc.co.uk   143  
wsj.com 142  
wikipedia.org   132  
youtube.com 130  
amazon.com  127  
apple.com   127  
reddit.com  126  
bloomberg.com   123  

@danluu
Copy link
Author

danluu commented Apr 26, 2016

Am I misunderstanding how you're defining h-index? I thought that, for example, an h-index of 20033 would imply 20033 stories of score >= 20033.

@antontarasenko
Copy link
Owner

@danluu You're right. I should have replaced [by] twice. I fixed the previous post. Thanks!

@danluu
Copy link
Author

danluu commented Apr 26, 2016

Cool. Thanks! I was wondering if pg was so prolific that he'd "beat" some of these sites that have hundred of authors.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants