📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
-
Updated
Mar 20, 2017 - Scala
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
MapReduce in Nodejs
Lightweight and extensible library to execute MapReduce-like jobs in Python
pagerank hadoop
MapReduce Framework based on Storm that is flexible for any MapReduce work. Built with a number of workers and a single master.Used BerkeleyDB as temporary data storage in case of big data processing
Map-Reduce jobs in python to get insightful information from NYC Taxi data
Recommends movies to the users based on the users profiles and the ratings of other users.
Mapreduce concepts- Secondary sort, counters, mutiple mapreduce jobs
MapReduce Job Development, RDDs Programming, Medical Data Management, Sales Analysis, And Efficient Data Integration For Big Data Analysis. Spark: Big Data Processing, SQOOP Integration, And Spark Structured Streaming For Real-Time Data.
Performed business operations using Big data technologies: AWS EMR, AWS RDS (MySQL), Hadoop, Apache Scoop, Apache HBase, MapReduce
Beta versions/student projects
Big data technologies that I have experimented with
Cloud and big data 2017/2018: Programming Assignments
Hadoop jobs written using GoLang, and run using Hadoop on Docker Containers
Design and implementation of different MapReduce jobs used to analyze a dataset on Covid-19 disease created by Our World In Data
Count the number of times a word occurs in 1GB (Big Data) Dataset of books using hadoop map-reduce
A cloud computing coursework on bigdata etc
Big Data, Hadoop, and MapReduce in Python. MapReduce Jobs using the MRJob library & Amazon Elastic MapReduce service.
Add a description, image, and links to the mapreduce-jobs topic page so that developers can more easily learn about it.
To associate your repository with the mapreduce-jobs topic, visit your repo's landing page and select "manage topics."