-
Vultureprime
- Bangkok, Thailand
- @KMatiDev1
-
TRTLLM-w4afp8-fp8-mix-inference Public
Forked from Jackch-NV/TRTLLM-w4afp8-fp8-mix-inferencePython UpdatedNov 19, 2024 -
runai-model-streamer Public
Forked from run-ai/runai-model-streamerC++ Apache License 2.0 UpdatedNov 19, 2024 -
continue Public
Forked from continuedev/continue⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
TypeScript Apache License 2.0 UpdatedOct 4, 2024 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedSep 22, 2024 -
seacrowd-eval Public
Forked from scb-10x/seacrowd-evalSeacrowd eval base-code for ThaiLLM-Leaderboard
Python UpdatedSep 15, 2024 -
TensorRT-Incubator Public
Forked from NVIDIA/TensorRT-IncubatorExperimental projects related to TensorRT
MLIR UpdatedAug 17, 2024 -
-
llama-hub Public
Forked from run-llama/llama-hubA library of data loaders for LLMs made by the community -- to be used with GPT Index and/or LangChain
Jupyter Notebook MIT License UpdatedOct 10, 2023