An webapp which searches through the most popular manga, anime, novel and webtoon readers in PT-BR/EN-US to find the most updated one for a specific series.
Made by Eduardo Henrique (BACKEND) and Rosialdo Vidinho (FRONTEND). Documentation and project management support by Guilherme Bernardo;
✅ means done.
🚧 means doing.
❌ means won't do.
- 🚧 Migrating from JS to TS.
- 🚧 Adopt micro commits strategy.
- Commit for atomic changes so that errors and bugs can be resolved faster.
- 🚧 Integrate backend and frontend.
- 🚧 Separate app in 3 Docker containers: one for the DB, one for the API and one for the crawlers.
- 🚧 Creating an RESTful API using Express so the user can read from DB without having access to it.
- ✅ Use proxies.
- ✅ Creating a proxy pool with auto renew.
- 🚧 Create crawlers for the following sites:
- ✅ Send multiple requests at once.
- ✅ Using Puppeteer for JS Rendering.
- ✅ Created list of relevant sites to use.
- 🚧 Change crawler structure to adopt crawlee.
- 🚧 Creating crawlers for a lot of sites.
- ✅ Created DB (MongoDB).
- 🚧 Create backup DB.
- 🚧 Update DB on demand.
- 🚧 Create search bar to take user input.
- 🚧 Creating frontend for the site using React.
- 🚧 Create responsive dropdown menu for series type selection.
- ❌ Crawl websites with Cloudflare anti-bot features.
- Sites such as mangalivre won't be crawled for the time being.
- ❌ Read series through site.
- It will only redirect the user to a site with said series.
- ❌ Update DB daily.
- We are deciding if this is viable right now, but as it is, it will hinder our progress, so we will postpone this feature.
- Maybe this will be possible since we discovered a way to request multiple pages at once.
- Updating DB daily;
- Updating DB without reconstructing it from scratch;
Possible Solutions:
- Make the update process faster and less resource-heavy;
- Only update series that changed values;
- Only update DB after going through every series in a site;
- Create proxy pool with auto renew;
Solutions:
- Update for every visited page in the site;
- Use Webshare builtin proxy renewing tool;
WIP 😎