Skip to content
View Antlera's full-sized avatar

Highlights

  • Pro

Block or report Antlera

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,271 242 Updated Feb 7, 2025

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 234 21 Updated Jan 15, 2025

LLM training in simple, raw C/CUDA

Cuda 25,462 2,928 Updated Oct 2, 2024

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,491 141 Updated Feb 11, 2025

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 8,115 376 Updated Feb 11, 2025

A Java implemented Texas holdem and short deck Solver

Java 833 191 Updated Jul 24, 2023

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 124,633 10,045 Updated Feb 12, 2025

Apply deep reinforcement learning methods including DQN, DDPG for traffic light control in simulation (discrete environment), to prove the 'Green Wave' phenomenon in intelligent traffic system.

Python 82 27 Updated Sep 9, 2019

X.-Y. Liu, Z. Ding, S. Borst, A. Walid. Deep reinforcement learning for intelligent transportation systems. NeurIPS Workshop on Machine Learning for Intelligent Transportation Systems, 2018.

Python 7 1 Updated Jan 12, 2019

Official inference framework for 1-bit LLMs

C++ 12,723 890 Updated Dec 20, 2024

Assembler for NVIDIA Maxwell architecture

Sass 966 164 Updated Jan 3, 2023

The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".

Python 158 17 Updated Dec 3, 2024

"OpenCity: Open Spatio-Temporal Foundation Models for Traffic Prediction"

Python 94 12 Updated Sep 1, 2024

Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data preparation techniques, and how to effectively train and eva…

Jupyter Notebook 49 35 Updated Oct 20, 2023

LLM inference in C/C++

C++ 73,906 10,664 Updated Feb 11, 2025

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,662 450 Updated Nov 24, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 23,274 2,299 Updated Jan 22, 2025

本仓库收集AI科技领域高质量信息源。 可以起到一个同步信息源的作用,避免信息差和信息茧房。

TypeScript 1,469 88 Updated Jul 10, 2024

A PalWorld Server API like minecraft bukkit, not finish yet

C++ 273 17 Updated Feb 28, 2024

This is an unofficial palworld server binary distribution project that fixes some problems with the original server.

Batchfile 898 30 Updated Jan 28, 2024

a curated list of high-quality papers on resource-efficient LLMs 🌱

103 7 Updated Feb 1, 2025

Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab

Jupyter Notebook 2,567 448 Updated Feb 11, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 46,104 4,897 Updated Jan 22, 2025

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,179 7,412 Updated Nov 13, 2024

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).

Python 291 27 Updated Jun 1, 2023

😋 A curated reading list about database systems

466 31 Updated May 3, 2022

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 916 52 Updated Dec 6, 2024

[SIGIR'24] The official implementation code of MOELoRA.

Python 143 19 Updated Jul 22, 2024

High-speed Large Language Model Serving for Local Deployment

C++ 8,087 421 Updated Jan 28, 2025
Next