Skip to content
View shjwudp's full-sized avatar

Organizations

@BaguaSys

Block or report shjwudp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. c4-dataset-script Public

    Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.

    Python 120 14

  2. megabyte Public

    A PyTorch implementation of MEGABYTE. This multi-scale transformer architecture has the excellent features of tokenization-free and sub-quadratic attention. The paper link: https://arxiv.org/abs/23…

    Python 4 3

  3. BaguaSys/bagua Public

    Bagua Speeds up PyTorch

    Python 879 82

  4. BaguaSys/bagua-net Public archive

    High performance NCCL plugin for Bagua.

    Rust 15 4

  5. shu Public

    中文书籍收录整理, Collection of Chinese Books

    Python 177 34

  6. blueprint-trainer Public

    Scaffolding for sequence model training research.

    Python 1

34 contributions in the last year

Contribution Graph
Day of Week March April May June July August September October November December January February
Sunday
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More

Contribution activity

February 2025

Created 3 commits in 1 repository
Loading

Seeing something unexpected? Take a look at the GitHub profile guide.