vegaExamples flights #9

Open
Hypercubed opened this issue Jul 16, 2016 · 2 comments

Comments

@Hypercubed
Contributor

The flight JSON files in the vegaExamples directory are, I believe, subsets of the crossfilter data here: https://github.com/square/crossfilter/tree/gh-pages/ which is 230k rows. That data is itself a subset of the ASA Data Expo dataset.

The ASA dataset is very big, but you might consider adding your own large subset from the original source.

As for the vegaExamples JSON files, they are poorly formatted. Would you consider a PR that pretty-prints the JSON files?
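For illustration, the pretty-printing could be done with something like the sketch below. It is not taken from any actual PR: the `flights*.json` glob pattern and the two-space indent are assumptions, and the function names are hypothetical.

```python
import json
from pathlib import Path


def pretty_print_json(path: Path, indent: int = 2) -> None:
    """Rewrite one JSON file in place with consistent indentation."""
    data = json.loads(path.read_text(encoding="utf-8"))
    path.write_text(json.dumps(data, indent=indent) + "\n", encoding="utf-8")


def pretty_print_directory(directory: Path, pattern: str = "flights*.json") -> int:
    """Pretty-print every file matching `pattern` (an assumed naming scheme).

    Returns the number of files rewritten.
    """
    count = 0
    for path in sorted(directory.glob(pattern)):
        pretty_print_json(path)
        count += 1
    return count


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    print(pretty_print_directory(Path("vegaExamples")))
```

Only whitespace changes, so the parsed data stays identical, though the files do get larger on disk.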

@curran
Owner

curran commented Jul 16, 2016

Sure, that sounds great! Maybe the flights data could be moved into a directory of its own, holding the various subsets, with the lineage you mentioned added to the README there.

@Hypercubed
Contributor Author

The original source data has much more detail (http://stat-computing.org/dataexpo/2009/the-data.html) and is 12 GB uncompressed. I'd like to use the vegaExamples data because the date is well formatted, unlike in the crossfilter example data or the ASA source, which makes it easy to define a schema.
