🎨
Focusing

Kyriection/README.md

Pinned

  1. FMInference/H2O (Public)

    [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

    Python, 413 stars, 51 forks

  2. jiaweizzhao/GaLore (Public)

    GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

    Python, 1.5k stars, 154 forks

  3. VITA-Group/Q-GaLore (Public)

    Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

    Python, 182 stars, 14 forks

  4. zhuhanqing/APOLLO (Public)

    APOLLO: SGD-like Memory, AdamW-level Performance

    74 stars, 1 fork

  5. VITA-Group/BNN_NoBN (Public)

    [CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

    Python, 57 stars, 10 forks

  6. meta-llama/llama-recipes (Public)

    Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

    Jupyter Notebook, 15.8k stars, 2.3k forks