Stars
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
Inference code for the paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".
A curated list for Efficient Large Language Models
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
Official inference repo for FLUX.1 models
Proxy to enable P2P-only cameras to work with standard protocols.
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents", ACL'24 Best Resource Paper.
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
💡 LeetCode in C++20/Java/Python/MySQL/TypeScript (respect coding conventions)
OCR, layout analysis, reading order, table recognition in 90+ languages
verl: Volcano Engine Reinforcement Learning for LLMs
TeleChat2 (星辰语义大模型) is a large language model developed and trained by the China Telecom Artificial Intelligence Research Institute; it is the first open-source hundred-billion-parameter model trained entirely on domestic Chinese compute.
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
Configuration files for OK影视 and tvbox. If you like them, please fork for your own use. Read the repository documentation carefully before use; using them will be taken to mean you have understood it.