-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedAug 27, 2024 -
EETQ Public
Forked from NetEase-FuXi/EETQEasy and Efficient Quantization for Transformers
C++ UpdatedFeb 22, 2024 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedMay 12, 2023 -
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 12, 2023 -
flash-attention-v1 Public template
Forked from dptech-corp/flash-attentionFast and memory-efficient exact attention
C++ BSD 3-Clause "New" or "Revised" License UpdatedJan 29, 2023 -
cuda-samples Public
Forked from NVIDIA/cuda-samplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
C Other UpdatedOct 21, 2021 -
DevOpsCurriculum Public
Forked from Knowre-Dev/DevOpsCurriculumKnowre 데브옵스 커리큘럼
MIT License UpdatedSep 8, 2021 -
models Public
Forked from tensorflow/modelsModels and examples built with TensorFlow
Python Apache License 2.0 UpdatedSep 2, 2021 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedSep 2, 2021 -
bert Public
Forked from google-research/bertTensorFlow code and pre-trained models for BERT
Python Apache License 2.0 UpdatedSep 1, 2021 -
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesDeep Learning Examples
Jupyter Notebook UpdatedAug 31, 2021 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedJul 20, 2020