This project is a minimal, Hive-like SQL engine that runs on top of Apache Hadoop MapReduce. It supports a small set of basic queries: load, select and delete.
The engine is written mainly against the MapReduce Python API, and that code can be found in MiniSQL/PyImpl. Before running queries, please ensure that Hadoop is running on localhost:9000 and that node write permissions are enabled for the folder from which the code is run. The syntax for running queries is given below.
load bigdata/test.csv as [column1:int,column2:float,column3:string];
select * from bigdata/test.csv where column1 > 30;
delete bigdata/test.csv;
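
As a rough illustration of how the schema given in a load query could drive a select with a where clause, here is a minimal sketch of a map-side filter in plain Python. It is not the code in MiniSQL/PyImpl: the names parse_schema and map_select, the set of supported operators, and the assumption that the CSV has no header row are all hypothetical and chosen only for this example.

```python
import csv
import io
import operator

# Hypothetical helpers for illustration; not the names used in MiniSQL/PyImpl.
CASTS = {"int": int, "float": float, "string": str}
OPS = {">": operator.gt, "<": operator.lt, "=": operator.eq}


def parse_schema(spec):
    """Turn '[a:int,b:float]' into [('a', int), ('b', float)]."""
    fields = spec.strip().strip("[]").split(",")
    schema = []
    for field in fields:
        name, type_name = field.strip().split(":")
        schema.append((name.strip(), CASTS[type_name.strip()]))
    return schema


def map_select(lines, schema, column, op, value):
    """Yield typed rows for which `column op value` holds (the map step)."""
    index = [name for name, _ in schema].index(column)
    compare = OPS[op]
    for row in csv.reader(lines):
        if len(row) != len(schema):
            continue  # skip rows that do not match the declared schema
        typed = [cast(cell) for (_, cast), cell in zip(schema, row)]
        if compare(typed[index], value):
            yield typed


# Assumed sample data: three columns, no header row.
schema = parse_schema("[column1:int,column2:float,column3:string]")
sample = io.StringIO("10,1.5,a\n42,2.5,b\n31,0.0,c\n")
for row in map_select(sample, schema, "column1", ">", 30):
    print(row)
```

In a Hadoop Streaming job, a filter like this would typically read its lines from standard input and write the matching rows to standard output; for a plain select no reduce step is strictly needed.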
Please refer to the wiki for more details on how the engine works.