- Chennai, India.
RetailRhythm Public
A dynamic data pipeline for real-time sales metric simulation and analysis for big box retailers. It integrates Kafka, Flink, and DuckDB in a Dockerized environment, enhanced by Metabase for action…
MIT License Updated May 4, 2024
SQLServerMetadata Public
Forked from keif888/SQLServerMetadata
SQL Server Metadata Toolkit
C# Microsoft Public License Updated Jul 3, 2020
pyspark-example-project Public
Forked from AlexIoannides/pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
Python Updated Jun 26, 2020
cp-all-in-one Public
Forked from confluentinc/cp-all-in-one
docker-compose.yml files for cp-all-in-one, cp-all-in-one-community, cp-all-in-one-cloud
Shell Updated May 12, 2020
flick Public
flick is a CLI tool written in Go for generating boilerplate code for web scraping projects. I'm beginning with Python boilerplates and would love to extend it to Go and Rust in a while with idio…
awesome-crawler Public
Forked from BruceDone/awesome-crawler
A collection of awesome web crawlers and spiders in different languages
MIT License Updated Feb 18, 2020
go-github Public
Forked from google/go-github
Go library for accessing the GitHub API
Go BSD 3-Clause "New" or "Revised" License Updated Feb 13, 2020
PRML algorithms implemented in Python
Jupyter Notebook MIT License Updated Jan 24, 2020
githubv4 Public
Forked from shurcooL/githubv4
Package githubv4 is a client library for accessing GitHub GraphQL API v4 (https://developer.github.com/v4/).
Go MIT License Updated Dec 16, 2019
Cookbook Public
Forked from andkret/Cookbook
The Data Engineering Cookbook
Apache License 2.0 Updated Oct 18, 2019
PySpark-Boilerplate Public
Forked from ekampf/PySpark-Boilerplate
A boilerplate for writing PySpark Jobs
Python Updated Oct 15, 2019
ml Public
Forked from cloudxlab/ml
Machine Learning Projects and Learning Content
Jupyter Notebook Updated Sep 13, 2019
spacy-course Public
Forked from explosion/spacy-course
👩‍🏫 Advanced NLP with spaCy: A free online course
Python MIT License Updated Sep 2, 2019
ssis-queries Public
Forked from yorek/ssis-queries
A set of queries useful to easily extract monitoring and package performance data from the SSISDB database
PLpgSQL Updated Apr 13, 2019
news-graph Public
Forked from BrambleXu/news-graph
Key information extraction from text and graph visualization
Python Updated Feb 11, 2019
PySpark-Cookbook Public
Forked from PacktPublishing/PySpark-Cookbook
PySpark Cookbook, published by Packt
HTML MIT License Updated Jul 4, 2018
airflow-pyspark-reddit Public
Forked from danielblazevski/airflow-pyspark-reddit
Example of using Airflow to schedule downloading data from S3 and launching Spark jobs
Python Updated Oct 17, 2016