Tutorial on 'Data Analysis using Python', as a part of International Conference on Emerging Trends & Innovations in ICT (ICEI-2017). Held at Pune Institute of Computer Technology (PICT).
Resource persons:
- Jeril Kuriakose
- Godson D'silva
This tutorial focuses on Big Data and how to analyse the Big Data. This tutorial is divided in two two parts, in the first part we will learning how to use Hadoop for performing the map-reduce operation on the large dataset. The software installation steps for the first part of the tutorial can be found here.
In the second part we will be focussing on Data Analysis using Python. We will be using the pandas library for performing the analysis. To perform classification and clustering we will be using the sklearn library. We will using Jupyter notebook as our IDE. The software installation steps of part 2 can be found here.