Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

demo commit pushed loader and model #1

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

Emerald-Z
Copy link

No description provided.

@Emerald-Z Emerald-Z closed this Mar 28, 2023
@Emerald-Z Emerald-Z reopened this Mar 28, 2023
@Emerald-Z
Copy link
Author

Emerald-Z commented Apr 1, 2023

Description

  • How this works: this takes in an image, reads and preprocesses one frame by isolating the box containing the digits and then isolating each digit. Then it feeds it into a forward NN to predict each digit and sums the values according to their weights(tens, ones, tenths)

Limitations

- Bounding box padding sometimes inaccurate with cutoff before the decimal place
  • Possible improvements:
    • Most important:
      • Using a 4 point transform so that the numbers can be viewed from an angle
      • Model not very accurate
    • Making bounding box detection faster? Currently I’m looping over all pixels and that could be potentially really slow
    • Replacing the feed-forward net with a CNN

Images:
Screen Shot 2023-03-28 at 10 59 53 AM

Screen Shot 2023-03-28 at 11 00 11 AM

Screen Shot 2023-03-28 at 11 00 23 AM

Screen Shot 2023-03-28 at 11 00 33 AM

Testcases: WIP

  • I wrote a simple test case with unittest that tests its capability to detect the right numbers - currently failing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant