inference-serving

Popular repositories

  1. energy-inference

    Forked from grantwilkins/energy-inference

    Code for an MPhil thesis at the University of Cambridge

    Python · 1

  2. vllm

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  3. Sequence-Scheduling

    Forked from zhengzangw/Sequence-Scheduling

    PyTorch implementation of the paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline"

    Python

  4. LLM-serving-with-proxy-models

    Forked from James-QiuHaoran/LLM-serving-with-proxy-models

    Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

    Jupyter Notebook

  5. pytorch-transformer

    Forked from hkproj/pytorch-transformer

    A PyTorch implementation of the Transformer from "Attention Is All You Need"

    Jupyter Notebook

  6. text-generation-inference

    Forked from huggingface/text-generation-inference

    Large Language Model Text Generation Inference

    Python

Repositories

The organization has 22 repositories in total.

People

This organization has no public members.
