Home
Ishaan Lagwankar edited this page Dec 3, 2019
This project is a basic implementation of an SQL engine built on top of Apache Hadoop's MapReduce framework. It takes the form of a Hive-like terminal: a CLI in which users enter queries in a specified format, with syntax similar to that of a standard SQL engine. Each query is converted into a series of MapReduce jobs that run on the Hadoop cluster, and the results are displayed in the terminal.
The SQL engine's workflow is broadly divided into three parts:
- Query handling
- Code Generation
- MapReduce Job Scheduling and Processing
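
To make the three stages concrete, here is a minimal Python sketch of how a query might flow through them. It is an illustration only, not this project's code: the function names (`parse_query`, `generate_job`, `run_job`), the tiny `SELECT ... FROM ... WHERE col = value` grammar, and the comma-separated row format are all assumptions, and the "job" runs in-process instead of being submitted to a Hadoop cluster.

```python
import re
from collections import defaultdict

# Hypothetical grammar: SELECT cols FROM table [WHERE col = value]
QUERY_RE = re.compile(
    r"^\s*SELECT\s+(?P<cols>.+?)\s+FROM\s+(?P<table>\w+)"
    r"(?:\s+WHERE\s+(?P<col>\w+)\s*=\s*(?P<val>\w+))?\s*;?\s*$",
    re.IGNORECASE,
)

def parse_query(query):
    """Stage 1, query handling: turn the query text into a small plan dict."""
    match = QUERY_RE.match(query)
    if match is None:
        raise ValueError(f"unsupported query: {query!r}")
    where = None
    if match.group("col"):
        where = (match.group("col"), match.group("val"))
    columns = [c.strip() for c in match.group("cols").split(",")]
    return {"columns": columns, "table": match.group("table"), "where": where}

def generate_job(plan, schema):
    """Stage 2, code generation: build mapper and reducer functions from the plan."""
    index = {name: i for i, name in enumerate(schema)}
    selected = [index[c] for c in plan["columns"]]
    where = plan["where"]

    def mapper(line):
        fields = line.split(",")
        # Emit the projected columns only for rows that pass the filter.
        if where is None or fields[index[where[0]]] == where[1]:
            yield tuple(fields[i] for i in selected), 1

    def reducer(key, values):
        # Emit the row once per occurrence so duplicates are preserved.
        for _ in values:
            yield key

    return mapper, reducer

def run_job(mapper, reducer, lines):
    """Stage 3, job processing: simulate map, shuffle, and reduce locally."""
    groups = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            groups[key].append(value)
    results = []
    for key in sorted(groups):  # the shuffle phase delivers keys in sorted order
        results.extend(reducer(key, groups[key]))
    return results

if __name__ == "__main__":
    schema = ["name", "city", "age"]  # assumed table layout
    rows = ["asha,Pune,31", "ravi,Delhi,27", "meera,Pune,45"]
    plan = parse_query("SELECT name, age FROM users WHERE city = Pune;")
    mapper, reducer = generate_job(plan, schema)
    print(run_job(mapper, reducer, rows))
```

In a real deployment the generated mapper and reducer would be written out as job code and scheduled on the cluster; the local `run_job` stands in for that step only to keep the sketch self-contained.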