In this Project we have choosen Dataset of San Francisco to anlayze the different attributes of cpmpensation and to find which department is best in that area and which has shown the most growth in 4 year and to best suited for analysing in hive we have converted our data to avro format.
Dataset
https://www.kaggle.com/kaggle/sf-salaries
Follow the instructions in the tutorial and execute the queries