matichon-vultureprime

Mati matichon-vultureprime

5 followers · 3 following

Vultureprime
Bangkok, Thailand
@KMatiDev1

Stars

30 results for source starred repositories

Lightning-AI / lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.

Python 1,278 86 Updated Feb 7, 2025

run-ai / runai-model-streamer

C++ 156 8 Updated Feb 6, 2025

nv-legate / cupynumeric

An Aspiring Drop-In Replacement for NumPy at Scale

Python 825 79 Updated Jan 6, 2025

Jackch-NV / TRTLLM-w4afp8-fp8-mix-inference

Python 4 1 Updated Jul 31, 2024

trancethehuman / ai-workshop-code

Code I wrote for my AI & LLM workshops

Jupyter Notebook 382 135 Updated Feb 7, 2025

QwenLM / Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 4,354 343 Updated Jan 17, 2025

voideditor / void

TypeScript 9,854 546 Updated Feb 7, 2025

vadimdemedes / ink

🌈 React for interactive command-line apps

TypeScript 27,571 629 Updated Nov 29, 2024

faressoft / terminalizer

🦄 Record your terminal and generate animated gif images or share a web player

JavaScript 15,489 503 Updated Aug 29, 2024

NVIDIA / TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

Python 692 50 Updated Jan 31, 2025

NVIDIA / TensorRT-Incubator

Experimental projects related to TensorRT

MLIR 88 14 Updated Feb 6, 2025

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,366 230 Updated Jan 31, 2025

togethercomputer / MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,660 365 Updated Jan 7, 2025

Nutlope / turboseek

An AI search engine inspired by Perplexity

TypeScript 1,343 207 Updated Jan 17, 2025

NVIDIA / cuda-checkpoint

CUDA checkpoint and restore utility

C 286 15 Updated Jan 27, 2025

checkpoint-restore / criu

Checkpoint/Restore tool

C 3,078 623 Updated Feb 4, 2025

measuredco / puck

The visual editor for React

TypeScript 6,055 373 Updated Feb 3, 2025

aymeric-roucher / benchmark_agents

Jupyter Notebook 22 5 Updated Mar 5, 2024

interstellarninja / function-calling-eval

A framework for evaluating function calls made by LLMs

Python 36 4 Updated Jul 23, 2024

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,753 1,037 Updated Feb 6, 2025

apoorvumang / prompt-lookup-decoding

Jupyter Notebook 499 23 Updated Aug 23, 2024

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,287 165 Updated Feb 4, 2025

state-spaces / mamba

Mamba SSM architecture

Python 13,901 1,200 Updated Jan 18, 2025

DAMO-NLP-SG / DAMO-SeaLLMs

[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

JavaScript 158 14 Updated Jul 30, 2024

DAMO-NLP-SG / contrastive-cot

Contrastive Chain-of-Thought Prompting

Python 57 4 Updated Nov 18, 2023

defog-ai / sql-eval

Evaluate the accuracy of LLM generated outputs

Jupyter Notebook 602 65 Updated Feb 2, 2025

triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend

Python 765 113 Updated Feb 7, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,328 1,088 Updated Feb 7, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 15,328 1,443 Updated Feb 4, 2025

jgehrcke / github-repo-stats

GitHub Action for advanced repository traffic analysis and reporting

Python 328 41 Updated Oct 1, 2023
