Skip to content
View hitdxh's full-sized avatar

Block or report hitdxh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,986 3,435 Updated Feb 14, 2025

Apache Hive

Java 5,627 4,708 Updated Feb 14, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,606 203 Updated Feb 14, 2025

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,544 1,074 Updated Nov 1, 2024

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,442 1,264 Updated Aug 14, 2024

Python 实现23种设计模式

Python 233 80 Updated Nov 9, 2018

Distributed SQL Query Engine in Python using Ray

Rust 243 16 Updated Oct 2, 2024

RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.

Python 325 74 Updated Dec 26, 2024

This is the official repository for M2UGen

Jupyter Notebook 473 37 Updated Jan 2, 2025

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

3,008 668 Updated Aug 5, 2024

Curated list of project-based tutorials

217,591 28,351 Updated Aug 15, 2024

Tools to download and cleanup Common Crawl data

Python 982 144 Updated Apr 25, 2023

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,643 351 Updated Dec 7, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,412 6,009 Updated Feb 15, 2025

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 44,566 18,214 Updated Feb 14, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,521 28,504 Updated Feb 15, 2025

ClickHouse® is a real-time analytics database management system

C++ 38,992 7,088 Updated Feb 15, 2025

Go HTTP framework with high-performance and strong-extensibility for building micro-services.

Go 5,879 557 Updated Feb 11, 2025

The Go programming language

Go 125,836 17,876 Updated Feb 15, 2025

Apache Flink

Java 24,514 13,508 Updated Feb 14, 2025

beego is an open-source, high-performance web framework for the Go programming language.

Go 732 183 Updated Apr 27, 2022

A golang ebook intro how to build a web with golang

Go 43,435 10,627 Updated May 12, 2024

公众号「宫水三叶的刷题日记」刷穿 LeetCode 系列文章源码

7,403 956 Updated Nov 21, 2024

Leetcode algorithm solutions together with self-made teaching videos

Java 72 21 Updated Aug 10, 2019
Jupyter Notebook 874 324 Updated Jan 21, 2020

Mining synonyms from unstructured and semi-structured data

Python 246 60 Updated Dec 3, 2024

一位酷爱做饭的程序员,立志用动画将算法说的通俗易懂。我的面试网站 www.chengxuchu.com

10,631 1,564 Updated May 11, 2023

Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.

6,170 737 Updated Dec 29, 2024

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

178,864 51,189 Updated Aug 21, 2024
Next