Skip to content
View MaeChd's full-sized avatar

Block or report MaeChd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 4,490 586 Updated Dec 26, 2024

LLM training code for Databricks foundation models

Python 4,113 541 Updated Jan 31, 2025

When it comes to optimizers, it's always better to be safe than sorry

Python 169 4 Updated Jan 23, 2025

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 759 59 Updated Oct 8, 2024

A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643

Python 74 5 Updated Sep 4, 2023

functorch is JAX-like composable function transforms for PyTorch.

Jupyter Notebook 1,405 99 Updated Feb 1, 2025
Jupyter Notebook 171 8 Updated Oct 21, 2024

LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters

Python 27 7 Updated Dec 5, 2024

中国科学院大学2019-2020课程(秋季,春季,夏季)

HTML 1,219 271 Updated Aug 22, 2023

Code for NeurIPS 2019 paper: "Tensor Programs I: Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian Processes"

Jupyter Notebook 241 22 Updated Aug 26, 2020
Python 5 1 Updated Dec 3, 2024

[NeurIPS 2024 Spotlight] Official repository of the CycleNet paper: "CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns". This work is developed by the Lab of Professor …

Jupyter Notebook 125 14 Updated Dec 18, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 13,477 1,516 Updated Jan 15, 2025

LaTeX Thesis Template for the University of Chinese Academy of Sciences

TeX 3,532 940 Updated Feb 29, 2024

Chinese GPT2: pre-training and fine-tuning framework for text generation

Python 187 41 Updated May 24, 2021

Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.

Python 188 43 Updated Jul 14, 2023

Understanding Training Dynamics of Deep ReLU Networks

Python 284 32 Updated Jan 29, 2025

A Library for Advanced Deep Time Series Models.

Python 7,824 1,251 Updated Jan 10, 2025

The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"

Python 503 71 Updated Jan 8, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,117 4,802 Updated Feb 1, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,769 4,057 Updated Jul 17, 2024

LDAdam - Adaptive Optimization from Low-Dimensional Gradient Statistics

Python 6 Updated Nov 6, 2024

Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "

Python 97 5 Updated Oct 24, 2024

Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

Lua 11,701 2,596 Updated Oct 24, 2023

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)

Jupyter Notebook 312 14 Updated Jan 22, 2025

Video+code lecture on building nanoGPT from scratch

Python 3,828 550 Updated Aug 13, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,871 5,569 Updated Aug 14, 2024

EasyLiterature is an open-sourced, Python-based command line tool for automatic literature management.

Python 249 16 Updated Aug 24, 2024

Deep Residual Learning in Spiking Neural Networks

Python 158 22 Updated Aug 9, 2022
Next