LLM training in simple, raw C/CUDA

Cuda 25,102 2,866 Updated Oct 2, 2024
Python 1 Updated Feb 4, 2024

Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark

Perl 444 105 Updated Apr 12, 2016

Inference Llama 2 in one file of pure C

C 17,884 2,170 Updated Aug 6, 2024

Llama from scratch, or How to implement a paper without crying

Jupyter Notebook 540 51 Updated May 29, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,620 349 Updated Dec 7, 2024

Fast and memory-efficient exact attention

Python 15,140 1,432 Updated Jan 18, 2025

Inference code for Llama models

Python 57,283 9,664 Updated Aug 18, 2024

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 24,646 4,451 Updated Aug 18, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,756 4,055 Updated Jul 17, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,769 2,224 Updated Jul 29, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,644 6,263 Updated Dec 9, 2024

Markdown usage basics

2 Updated Apr 14, 2020

End-to-End Speech Recognition Using Tensorflow

Python 42 11 Updated Mar 24, 2023

CS224S / LINGUIST285 - Spoken Language Processing

HTML 26 10 Updated Feb 13, 2020

This project extracts speech recognition training data from videos, for use in training automatic subtitle generation.

Python 53 21 Updated Aug 5, 2018

Implementing Recurrent Neural Network from Scratch

Python 484 151 Updated May 28, 2018

A build-it-yourself, 6-wheel rover based on the rovers on Mars!

Prolog 8,675 1,373 Updated Jan 11, 2025

End-to-end ASR/LM implementation with PyTorch

Python 596 140 Updated Aug 30, 2021

An attempt at tracking the state of the art and recent results (bibliography) in speech recognition.

1,862 226 Updated Jun 27, 2022

high-performance graph database for real-time use cases

Go 20,593 1,503 Updated Jan 22, 2025
Jupyter Notebook 313 75 Updated Apr 25, 2022

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,760 3,618 Updated Jul 28, 2024

A Deep-Learning-Based Chinese Speech Recognition System

Python 7,948 1,903 Updated Sep 26, 2024

Chinese translation of The Elements of Statistical Learning (ESL), with code implementations and exercise solutions.

Jupyter Notebook 2,497 601 Updated Sep 12, 2024

YSDA course in Natural Language Processing

Jupyter Notebook 9,907 2,616 Updated Dec 25, 2024

Python notes in Chinese

Jupyter Notebook 6,976 2,906 Updated Oct 1, 2020

Source Code for my blog

JavaScript 6 1 Updated Apr 12, 2023