Antlera

Tingfeng Lan Antlera

AI Infra @antgroup | OSPP'23 | Code for Fun!

24 followers · 39 following

Achievements

Highlights

Stars

DefTruth / CUDA-Learn-Notes

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,271 242 Updated Feb 7, 2025

andrewkchan / yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 234 21 Updated Jan 15, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 25,462 2,928 Updated Oct 2, 2024

tensorzero / tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,491 141 Updated Feb 11, 2025

windingwind / zotero-pdf-translate

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 8,115 376 Updated Feb 11, 2025

bupticybee / TexasHoldemSolverJava

A Java implemented Texas holdem and short deck Solver

Java 833 191 Updated Jul 24, 2023

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 124,633 10,045 Updated Feb 12, 2025

quantumiracle / Reinforcement_Learning_for_Traffic_Light_Control

Apply deep reinforcement learning methods including DQN, DDPG for traffic light control in simulation (discrete environment), to prove the 'Green Wave' phenomenon in intelligent traffic system.

Python 82 27 Updated Sep 9, 2019

YangletLiu / DQN_traffic_light_control

Forked from quantumiracle/Reinforcement_Learning_for_Traffic_Light_Control

X.-Y. Liu, Z. Ding, S. Borst, A. Walid. Deep reinforcement learning for intelligent transportation systems. NeurIPS Workshop on Machine Learning for Intelligent Transportation Systems, 2018.

Python 7 1 Updated Jan 12, 2019

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,723 890 Updated Dec 20, 2024

NervanaSystems / maxas

Assembler for NVIDIA Maxwell architecture

Sass 966 164 Updated Jan 3, 2023

CASE-Lab-UMD / LLM-Drop

The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".

Python 158 17 Updated Dec 3, 2024

HKUDS / OpenCity

"OpenCity: Open Spatio-Temporal Foundation Models for Traffic Prediction"

Python 94 12 Updated Sep 1, 2024

ksm26 / Finetuning-Large-Language-Models

Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data preparation techniques, and how to effectively train and eva…

Jupyter Notebook 49 35 Updated Oct 20, 2023