- Vultureprime
- Bangkok, Thailand
- @KMatiDev1
Stars
Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.
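For illustration, a minimal sketch of compiling a PyTorch module with Thunder via its `thunder.jit` entry point; the actual speedup depends on the model and on which executors are available.

```python
# Minimal sketch: compile a small PyTorch module with Thunder (lightning-thunder).
import torch
import thunder

model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.GELU(),
    torch.nn.Linear(4096, 1024),
)
x = torch.randn(8, 1024)

jitted = thunder.jit(model)  # source-to-source compilation of the module
y = jitted(x)                # executed through Thunder's selected executors
```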
An Aspiring Drop-In Replacement for NumPy at Scale
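The "drop-in" idea is that existing NumPy code keeps working after swapping the import. A hedged sketch follows; the module name `cunumeric` is an assumption about which project this entry refers to.

```python
# Sketch of the drop-in pattern: change only the import, keep the NumPy code.
import cunumeric as np  # instead of: import numpy as np  (module name assumed)

a = np.arange(1_000_000, dtype=np.float64).reshape(1000, 1000)
b = np.ones((1000, 1000))
c = a @ b          # same NumPy API, executed by the scaled-out backend
print(c.sum())
```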
Code I wrote for my AI & LLM workshops
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by the Qwen team at Alibaba Cloud.
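A minimal sketch of running a Qwen2.5-Coder instruct checkpoint with Hugging Face transformers; the checkpoint name is one of several released sizes.

```python
# Generate code with a Qwen2.5-Coder instruct model via transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-Coder-7B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "Write a Python function that reverses a string."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```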
🌈 React for interactive command-line apps
🦄 Record your terminal and generate animated gif images or share a web player
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment.
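A hedged sketch of post-training quantization with Model Optimizer; the module path `modelopt.torch.quantization`, the `INT8_DEFAULT_CFG` config name, and the calibration loop shape are assumptions about the library's PTQ flow.

```python
# Sketch: calibrate and quantize a vision model with TensorRT Model Optimizer (assumed API).
import torch
import modelopt.torch.quantization as mtq

def forward_loop(model):
    # Calibration: run a few representative batches through the model.
    for _ in range(8):
        model(torch.randn(1, 3, 224, 224).cuda())

model = torch.hub.load("pytorch/vision", "resnet50", weights=None).cuda().eval()
model = mtq.quantize(model, mtq.INT8_DEFAULT_CFG, forward_loop)  # inserts quantizers
```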
Experimental projects related to TensorRT
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
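The Mixture-of-Agents pattern itself is simple: several proposer models answer independently, then an aggregator model synthesizes a final response. The sketch below is conceptual; `query_model` and the model names are hypothetical placeholders, not the repository's API.

```python
# Conceptual sketch of Mixture-of-Agents (MoA): propose, then aggregate.
def query_model(model_name: str, prompt: str) -> str:
    """Hypothetical helper: send `prompt` to `model_name` and return its reply."""
    raise NotImplementedError

PROPOSERS = ["oss-model-a", "oss-model-b", "oss-model-c"]  # placeholder names
AGGREGATOR = "oss-model-d"

def mixture_of_agents(question: str) -> str:
    proposals = [query_model(m, question) for m in PROPOSERS]
    aggregation_prompt = (
        "Synthesize a single, high-quality answer from these candidate responses:\n\n"
        + "\n\n".join(f"Response {i + 1}: {p}" for i, p in enumerate(proposals))
        + f"\n\nQuestion: {question}"
    )
    return query_model(AGGREGATOR, aggregation_prompt)
```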
An AI search engine inspired by Perplexity
A framework for evaluating function calls made by LLMs
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
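At its core, evaluating a function call means comparing the call a model emits (function name plus arguments) against a ground-truth call. A generic sketch follows; the tool name and fields are illustrative, not the benchmark's exact format.

```python
# Generic sketch: check a model's emitted function call against the expected call.
import json

expected = {"name": "get_weather", "arguments": {"city": "Bangkok", "unit": "celsius"}}
model_output = '{"name": "get_weather", "arguments": {"city": "Bangkok", "unit": "celsius"}}'

def call_matches(raw: str, expected_call: dict) -> bool:
    try:
        call = json.loads(raw)
    except json.JSONDecodeError:
        return False  # malformed output counts as a failed call
    return (
        call.get("name") == expected_call["name"]
        and call.get("arguments") == expected_call["arguments"]
    )

print(call_matches(model_output, expected))  # True
```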
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia
Contrastive Chain-of-Thought Prompting
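Contrastive chain-of-thought prompts pair a valid reasoning chain with an intentionally flawed one, so the model sees both what to do and what to avoid. The template below is illustrative, not the paper's exact wording.

```python
# Illustrative contrastive chain-of-thought prompt template.
prompt = """Q: A shop sells pens at 5 baht each. How much do 12 pens cost?

Correct reasoning: 12 pens × 5 baht = 60 baht. Answer: 60 baht.

Incorrect reasoning (avoid this): 12 + 5 = 17, so the answer is 17 baht.
This is wrong because the quantities should be multiplied, not added.

Q: A box holds 8 apples. How many apples are in 7 boxes?
Correct reasoning:"""
```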
Evaluate the accuracy of LLM-generated outputs
The Triton TensorRT-LLM Backend
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
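A hedged sketch of the high-level Python API, assuming the `tensorrt_llm.LLM` entry point; the model name is illustrative and engine building happens under the hood on first use.

```python
# Sketch: generate text with TensorRT-LLM's high-level LLM API.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
sampling_params = SamplingParams(temperature=0.8, max_tokens=64)

for output in llm.generate(["Explain KV caching in one sentence."], sampling_params):
    print(output.outputs[0].text)
```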
Fast and memory-efficient exact attention
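A minimal sketch of calling the fused kernel directly through flash-attn's Python API; inputs must be fp16/bf16 tensors on a CUDA device with shape (batch, seqlen, num_heads, head_dim).

```python
# Call FlashAttention's fused exact-attention kernel directly.
import torch
from flash_attn import flash_attn_func

q = torch.randn(2, 1024, 16, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 16, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 16, 64, dtype=torch.float16, device="cuda")

out = flash_attn_func(q, k, v, causal=True)  # exact attention, memory-efficient
```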
GitHub Action for advanced repository traffic analysis and reporting