Stars
In this work, the PyTorch Module for linear regression is used on the US Housing Index mined from kaggle.com to predict US House Price.
Graph Neural Network Library for PyTorch
Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO
Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand
This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging the data, filling the data warehouse, and running checks on…
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
Gathers machine learning and deep learning models for Stock forecasting including trading bots and simulations
More than 2000+ Data engineer interview questions.
Example of integrating Poetry with Docker leveraging multi-stage builds.
Solutions of challenges of Hackerrank Python domain
Hackerrank Problem solving solutions in Python
Solutions to the practice exercises, coding challenges, and other problems on Hackerrank! www.Hackerrank.com
HackerRank solutions in Java/JS/Python/C++/C#
170+ solutions to Hackerrank.com practice problems using Python 3, С++ and Oracle SQL
HackerRank Python solutions and challenges.
Apache Spark - A unified analytics engine for large-scale data processing
Scenic: A Jax Library for Computer Vision Research and Beyond
The course site for the Data Processing in Python from IES