MiniSQL

This project is a basic implementation of Apache Hive running on top of Apache Hadoop MapReduce. The project deals mainly with basic queries such as load, select and delete.

Usage

This project mainly deals with the MapReduce Python API, and this can be found in MiniSQL/PyImpl. The syntax for running queries is given below. Please ensure that Hadoop is running on localhost:9000 and ensure node write permissions are enabled for the folder from where the code is being run.

Load

load bigdata/test.csv as [column1:int,column2:float,column2:string];

Select

select * from bigdata/test.csv where column1 > 30;

Delete

delete bigdata/test.csv;

Please refer to the wiki for more details and the functioning of the engine.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

MiniSQL

Usage

Load

Select

Delete

Files

README.md

Latest commit

History

README.md

File metadata and controls

MiniSQL

Usage

Load

Select

Delete