Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HW11-Proposal_HyoSungKim #17

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

angelicakim28
Copy link

No description provided.

@angelicakim28 angelicakim28 changed the title Proposal-HyoSungKim HW11-Proposal_HyoSungKim Apr 12, 2019

## Anticipated difficulties

The biggest challenge for this is the amount of data points to display on the map. My current data has over 300,000 points considered possible non-owner occupied rentals and I think that the processing for this will be very slow. A possible work around this issue is to narrow down on what I plan to display, for example, filtering out and displaying rentals with risk scores greater than x. I am still considering whether or not I will by default, filter the dataset or allow the user to input a range of risk scores to filter.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're exactly right that this is the central difficulty for an application of this kind. 300,000 points may well be too many, but you could perhaps aggregate at lower zoom levels and only display "full data" when zoomed in enough to naturally display fewer elements. This might be a candidate for using vector tiles (which we won't be covering in any detail I'm afraid, but which I'd be happy to give you pointers on if it comes to that)


## Missing pieces

It would be useful to know if there was a way to continuously update a data file with inputs and although these wouldn't be used for the visual aspects of the application, it can serve to add data into the primary dataset and as next steps, use this information to improve the model. By updating the model every now and then, risk score predictions may improve.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The only option for something like this would be a bit of back-end code that (nightly? hourly?) churns through inputs to update the model. The good news is that you can avoid that problem entirely for the sake of the final. If you need help pursuing next steps after that, let me know and I'll point you in the right direction

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants