Skip to content
View arpit-sc's full-sized avatar

Block or report arpit-sc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics

Java 112 64 Updated Jul 19, 2023

Resource scheduling and cluster management for AI

JavaScript 2,649 549 Updated Jun 6, 2024

Best Practices on Recommendation Systems

Python 19,872 3,167 Updated Feb 12, 2025

An open source ML system for the end-to-end data science lifecycle

Java 1,041 481 Updated Mar 3, 2025

Find secrets with Gitleaks 🔑

Go 19,094 1,553 Updated Mar 3, 2025

Parallel computing with task scheduling

Python 12,990 1,749 Updated Mar 5, 2025

version your SQL schemas with git + automatically migrate them

Python 334 4 Updated Dec 4, 2021

Hide-My-Windows Laser Tripwire

C 3,701 173 Updated Oct 26, 2023

AI Code Completions

Shell 10,738 506 Updated Jul 3, 2024

A Ruby Gem to detect under what license a project is distributed.

Ruby 821 284 Updated Mar 3, 2025

Apache Geode

Java 2,300 685 Updated Jan 20, 2025

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…

Python 27,967 7,940 Updated Mar 20, 2024

A cluster computing framework for processing large-scale geospatial data

Java 2,009 697 Updated Mar 6, 2025

The Data Engineering Cookbook

Python 14,087 2,568 Updated Mar 5, 2025

This is my site. There are many like it, but this one is mine.

HTML 38 7 Updated Nov 19, 2024

Data-Centric Pipelines and Data Versioning

Go 6,206 569 Updated Feb 3, 2025

YugabyteDB - the cloud native distributed SQL database for mission-critical applications.

C 9,309 1,113 Updated Mar 6, 2025

A curated list of data engineering tools for software developers

7,138 1,283 Updated Feb 17, 2025

A command-line tool to generate, analyze, convert and manipulate colors

Rust 5,249 105 Updated Mar 2, 2025

Interactive and Reactive Data Science using Scala and Spark.

JavaScript 3,149 654 Updated May 16, 2023

R configurations for Docker

Shell 1,471 271 Updated Feb 28, 2025

bamboolib - a GUI for pandas DataFrames

Jupyter Notebook 945 94 Updated Feb 20, 2024

📝 An awesome Data Science repository to learn and apply for real world problems.

25,879 6,011 Updated Mar 4, 2025

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架

Go 11,625 1,814 Updated Mar 4, 2025

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Python 1,495 234 Updated Dec 2, 2024

OpenFaaS - Serverless Functions Made Simple

Go 25,472 1,953 Updated Feb 26, 2025

bootOS is a monolithic operating system in 512 bytes of x86 machine code.

Assembly 1,798 91 Updated Jan 4, 2024

Knack - A Python command line interface framework

Python 351 96 Updated Jul 16, 2024

Create *beautiful* command-line interfaces with Python

Python 7,965 561 Updated May 22, 2024

Library for building powerful interactive command line applications in Python

Python 9,574 725 Updated Jan 21, 2025
Next