feat(data): Refactor SQL files for dumping data into openchallenges-db #2142

Merged (12 commits) on Sep 30, 2023

@vpchung vpchung commented Sep 20, 2023

Fixes #2047; goes hand-in-hand with #2114.

Because the OC data is now downloaded as CSVs to generate the flat files, this also fixes #2169 and fixes #2176.

Changelog

  • Enable the local_infile feature in the MariaDB server
  • Refactor *data.sql files to load data directly from flat files
  • Update database with snapshot date of 2023-09-28
    • Added challenges from CAGI, CodaBench, and CodaLab
    • Added new organizations, incl. Research to the People
    • General updates to challenges and organizations
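
For readers unfamiliar with the approach in the first two items, here is a minimal sketch of the kind of statement a refactored data.sql file might contain. It is rendered from Python only so it can be checked; the file name `challenges.csv`, the table name `challenge`, and the CSV layout (comma-separated, double-quoted fields, one header row) are assumptions, not taken from this PR. `LOAD DATA LOCAL INFILE` only works when `local_infile` is enabled on the server and local loading is allowed by the client.

```python
def build_load_statement(csv_path: str, table: str) -> str:
    """Render a MariaDB LOAD DATA LOCAL INFILE statement for one CSV flat file."""
    # Assumed layout: comma-separated, optionally double-quoted fields,
    # newline-terminated rows, and a single header row to skip.
    return (
        f"LOAD DATA LOCAL INFILE '{csv_path}'\n"
        f"INTO TABLE {table}\n"
        "FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"'\n"
        "LINES TERMINATED BY '\\n'\n"
        "IGNORE 1 LINES;"
    )


# Hypothetical file and table names, for illustration only.
statement = build_load_statement("challenges.csv", "challenge")
print(statement)
```

Compared with long lists of INSERT statements, a statement like this keeps the data in the flat file and leaves the SQL file as a thin loader.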

Preview

[Image: Screenshot 2023-09-29 at 5 14 21 AM]

@vpchung vpchung marked this pull request as ready for review September 29, 2023 17:37

vpchung commented Sep 29, 2023

Hmm, looks like I'm also not able to generate the cards on the home page:

[Image: Screenshot 2023-09-29 at 10 41 09 AM]

Not sure if this is related to the "Other" facet mentioned earlier... Reverting this to a draft.

@vpchung vpchung marked this pull request as draft September 29, 2023 17:42

vpchung commented Sep 29, 2023

> everything worked as expected except the Other option for the Platform facet.

Whoops, the error also reproduces when selecting Upcoming from Status, Starting Soon from Category, and 2023, 2022, or 2017 - 2021 from Challenge Year:

[Image: Screenshot 2023-09-29 at 2 10 47 PM]


vpchung commented Sep 29, 2023

After playing around with the web app, I think the issue stems from one challenge:

Jan2024: Rare Disease AI Hackathon

I noticed that searching for any term related to this challenge, e.g. rare or disease, also triggers the error, which is why I think this particular challenge is the issue. It also has status = upcoming and platform = other, which matches the patterns mentioned above.


vpchung commented Sep 29, 2023

FOUND THE 🐛 !! It was actually not just the hackathon that caused the issue, but any challenge that had docker as its submission type (instead of container_image). Updating these values has fixed the issue:

[Image: bug-fixed]

Data contributors have been notified to help prevent this in the future.
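
One possible safeguard, sketched here rather than implemented in this PR, is to check the flat files for unexpected submission types before loading them. The column name `submission_type`, the sample rows, and the allowed set are assumptions; only container_image is confirmed as a valid value in this thread.

```python
import csv
import io

# Assumed allowed values; only "container_image" is confirmed in this thread.
ALLOWED_SUBMISSION_TYPES = {"container_image", "prediction_file", "other"}


def find_invalid_rows(csv_text, column="submission_type",
                      allowed=ALLOWED_SUBMISSION_TYPES):
    """Return (line_number, value) pairs whose value is not in the allowed set."""
    reader = csv.DictReader(io.StringIO(csv_text))
    invalid = []
    # Data rows start on line 2 because line 1 is the header
    # (assumes no field spans multiple lines).
    for line_number, row in enumerate(reader, start=2):
        value = (row.get(column) or "").strip()
        if value not in allowed:
            invalid.append((line_number, value))
    return invalid


# Made-up sample data, for illustration only.
sample = (
    "challenge_id,submission_type\n"
    "1,container_image\n"
    "2,docker\n"
    "3,prediction_file\n"
)
invalid_rows = find_invalid_rows(sample)
print(invalid_rows)  # prints [(3, 'docker')]
```

Running a check like this when the CSVs are downloaded would surface a bad value before it reaches the database and the web app.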

This PR is now ready for review ✅

@vpchung vpchung marked this pull request as ready for review September 29, 2023 22:51
tschaffter approved these changes Sep 30, 2023
@tschaffter tschaffter merged commit 8395cb1 into Sage-Bionetworks:main Sep 30, 2023
@vpchung vpchung deleted the feat-2047 branch October 16, 2023 20:52