OSSE Search Engine

Overly Simple Search Engine - Making search engines simple??
Pronunciation: "oh-see"

🚩 Table of Contents

Why?
Usage
Features
How it works
Roadmap
Contributing
License

🐂 Why?

Just for fun! I really wanted to learn Rust and at the time I was really interested in how search engines worked, so there wasn't any better way of achieving both goals than with this very project!

🤖 Usage

This repository is a monorepo formed by the independent components that form the OSSE search engine.

Installing Dependencies

With Nix:

$ nix develop

Otherwise:

Install cargo and trunk with your preferred method (such as your favorite package manager).

Running

Crawler

$ cargo run --bin crawler

Indexer

$ cargo run --bin indexer

Frontend

$ trunk serve frontend/index.html --open

Once all the components are running, you can navigate to 127.0.0.1:8080 on your favorite web browser and start using OSSE!

🎨 Features

Completely Self-Hosted : OSSE does not use any external services, all you need is its three components (indexer, crawler & frontend) to have a "complete" search engine.
Custom Indexing and Ranking Algorithms : OSSE uses its own open-source indexing and ranking algorithm, meaning that its code is reviewable and improvable by third parties, ensuring its technically and morally correct functionality.
Hackable : OSSE is built with extensibility & modularity in mind, so it is entirely feasible to replace or customize its various components.
Privacy Respecting : As a result of OSSE being completely independent, it does not send any metadata to any services.

⚙️ How it works

The OSSE search engine is separated into three independent components:

Indexer

This component provides both the actual search engine indexer's implementation and the REST API used to search and add indexed resources. It uses Actix Web for the REST API (running on port 4444). For the implementation of the actual indexer data structure, we currently use a very simple reverse index implemented with a hashmap, so all the indexed resources are currently lost each time the indexer is restarted.

Crawler

This component is a simple recursive crawler that forwards the crawled raw HTML to the indexer. It uses reqwest for fetching a predefined list of root websites and parses them with scraper, sending the website contents to the indexer and extracting all its links, adding them to a queue of websites to be crawled. This process is "recursively" repeated indefinitely.

Frontend

This component is a simple web interface to the indexer. It allows users to search and visualize results in a user friendly way. It is currently built using Yew, which allows us to write the frontend in rust and produce a "blazingly fast" Wasm based web-ui.

🐾 Roadmap

Add frontend
Change indexer to use a ngram index instead of a reverse index
Improve frontend
Improve responsiveness of searching when the indexer is recieving info from crawlers
Rust cleanup
Improve page ranking algorithm

💬 Contributing

"If you have any ideas or patches, please do not hesitate to contribute to OSSE!"

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
crawler		crawler
docs		docs
frontend		frontend
indexer		indexer
lib		lib
.envrc		.envrc
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
flake.lock		flake.lock
flake.nix		flake.nix

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OSSE Search Engine

🚩 Table of Contents

🐂 Why?

🤖 Usage

Installing Dependencies

With Nix:

Otherwise:

Running

🎨 Features

⚙️ How it works

Indexer

Crawler

Frontend

🐾 Roadmap

💬 Contributing

📜 License

About

Releases

Packages

Languages

License

Baitinq/OSSE

Folders and files

Latest commit

History

Repository files navigation

OSSE Search Engine

🚩 Table of Contents

🐂 Why?

🤖 Usage

Installing Dependencies

With Nix:

Otherwise:

Running

🎨 Features

⚙️ How it works

Indexer

Crawler

Frontend

🐾 Roadmap

💬 Contributing

📜 License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages