Web Search Engine

Advanced Computing Concepts Group Project - COMP8547-4-R-2019F - University of Windsor

The project is structured in three modules:

  1. react-searchkit: Frontend UI powered by React.js

  2. search-engine-backend: Backend REST API, built with the Spring Boot framework, that returns the search results

  3. WebCrawler: Web crawler to crawl through pages

Techniques used:

  • Web crawler with Jsoup
  • Inverted index with Trie
  • Regex
  • Edit distance
  • HashMap
  • Quick Sort
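
To illustrate one of the techniques listed above, here is a minimal sketch of edit distance (Levenshtein distance), which search engines commonly use to suggest corrections for misspelled query words. The class name `EditDistance` and method `distance` are hypothetical and not taken from this repository; how the project actually applies edit distance is an assumption.

```java
// Hypothetical helper, not the project's actual class.
class EditDistance {

    // Levenshtein distance computed with dynamic programming,
    // keeping only two rows of the table at a time.
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Turning the empty prefix of a into b[0..j) takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            // Turning a[0..i) into the empty string takes i deletions.
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insert = curr[j - 1] + 1;
                int delete = prev[j] + 1;
                int substitute = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insert, delete), substitute);
            }
            // Reuse the arrays: the current row becomes the previous row.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }
}
```

A word suggester could call `EditDistance.distance(query, candidate)` for each indexed word and offer the candidates with the smallest distance.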

Working Flow:

Working flow diagram

Trie Structure

trie structure example
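
A minimal sketch of how a trie can serve as an inverted index: each path from the root spells a word, and the node that ends the word stores the pages containing it. The names `TrieIndex`, `add`, and `search`, and the choice to store per-page occurrence counts, are assumptions for illustration and may differ from the project's implementation.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

// Hypothetical trie-backed inverted index, not the project's actual class.
class TrieIndex {

    private static final class Node {
        final Map<Character, Node> children = new HashMap<>();
        // Page URL -> number of occurrences; filled only where a word ends.
        final Map<String, Integer> postings = new HashMap<>();
    }

    private final Node root = new Node();

    // Records one occurrence of the word on the given page.
    void add(String word, String url) {
        Node node = root;
        for (char c : word.toLowerCase().toCharArray()) {
            node = node.children.computeIfAbsent(c, k -> new Node());
        }
        node.postings.merge(url, 1, Integer::sum);
    }

    // Returns page URL -> occurrence count for an exact word match,
    // or an empty map if the word was never indexed.
    Map<String, Integer> search(String word) {
        Node node = root;
        for (char c : word.toLowerCase().toCharArray()) {
            node = node.children.get(c);
            if (node == null) {
                return Collections.emptyMap();
            }
        }
        return new HashMap<>(node.postings);
    }
}
```

A lookup costs time proportional to the length of the query word, regardless of how many words are indexed.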

Search HashMap Structure - Example

example
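
One plausible shape for such a map, sketched under assumptions: keys are page URLs, values are how often the query word occurs on each page, and the entries are ranked with quick sort so the most relevant page comes first. The class `ResultRanker`, the method `rank`, and the map layout are hypothetical; the project's actual structure may differ.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;

// Hypothetical ranking helper, not the project's actual class.
class ResultRanker {

    // Returns the page URLs ordered by descending occurrence count.
    static List<String> rank(Map<String, Integer> hits) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(hits.entrySet());
        quickSort(entries, 0, entries.size() - 1);
        List<String> urls = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : entries) {
            urls.add(entry.getKey());
        }
        return urls;
    }

    // In-place quick sort with Lomuto partitioning, highest count first.
    private static void quickSort(List<Map.Entry<String, Integer>> a, int lo, int hi) {
        if (lo >= hi) {
            return;
        }
        int pivot = a.get(hi).getValue();
        int i = lo;
        for (int j = lo; j < hi; j++) {
            // Move entries with a larger count than the pivot to the front.
            if (a.get(j).getValue() > pivot) {
                Collections.swap(a, i, j);
                i++;
            }
        }
        Collections.swap(a, i, hi);
        quickSort(a, lo, i - 1);
        quickSort(a, i + 1, hi);
    }
}
```

Pages with equal counts may come back in any relative order, since quick sort is not a stable sort.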