Skip to content
View pymhq's full-sized avatar

Organizations

@cncf @envoyproxy @knative @knative-extensions @CloudNative-Serverless-Meetup

Block or report pymhq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,066 46 Updated Jul 31, 2024

Code behind Arxiv Papers

Python 501 59 Updated Apr 2, 2024

Carbon Limiting Auto Tuning for Kubernetes

Go 33 8 Updated Nov 11, 2024

A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.

Jupyter Notebook 156 69 Updated Feb 10, 2025

Build complex, serverless, and highly scalable generative AI applications with prompt chaining.

Python 263 80 Updated Feb 4, 2025

Our Open Source Project for MSIS 549 AI and ML Class. This is a Text-to-Image and Image-to-Text Model.

Jupyter Notebook 1 Updated May 2, 2024

WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices,…

C++ 8,938 798 Updated Feb 11, 2025

Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

Python 20 Updated Dec 29, 2024

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

4,750 490 Updated Jul 30, 2024

LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.

JavaScript 8,251 364 Updated Feb 10, 2025

Code for my professional website.

HTML 6 1 Updated Sep 8, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 37,302 5,607 Updated Feb 11, 2025

PyTorch深度学习快速入门教程(绝对通俗易懂!)

Python 3,007 659 Updated Feb 9, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,306 2,754 Updated Aug 15, 2024

Personal Webpage

HTML 259 119 Updated Nov 13, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,121 11,553 Updated Feb 9, 2025

Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.

Python 2,170 189 Updated Feb 11, 2025

The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.

Python 144 18 Updated Feb 10, 2025

Self-calculated rating of problems in leetcode weekly/biweekly contests.

Vue 526 42 Updated Feb 2, 2025

Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.

Python 396 82 Updated Nov 20, 2023

LeetCode C++ solution

C++ 197 81 Updated Feb 1, 2025

Radius is a cloud-native, portable application platform that makes app development easier for teams building cloud-native apps.

Go 1,528 100 Updated Feb 11, 2025

MongoDB storage integration layer for the Rocks storage engine

C++ 400 100 Updated Jun 9, 2022

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

C++ 37,067 7,925 Updated Jan 30, 2025

A library that provides an embeddable, persistent key-value store for fast storage.

C++ 29,108 6,403 Updated Feb 11, 2025

CNCF Landscape Graph, data model, and applications.

Jupyter Notebook 42 11 Updated Feb 11, 2025

A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.

Rust 4,861 184 Updated Feb 10, 2025

Fact-checking LLM outputs with self-ask

Jupyter Notebook 292 40 Updated Oct 23, 2023

Website for the Seattle GNU/Linux Conference

HTML 27 37 Updated Jan 12, 2025
Next