Skip to content
View iamdoing's full-sized avatar

Block or report iamdoing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

Python 73 10 Updated Feb 7, 2025

Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"

Python 39 2 Updated Jul 30, 2024

The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

275 18 Updated Jan 21, 2025

A curated list for Efficient Large Language Models

Python 1,505 119 Updated Mar 6, 2025

Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models

Python 103 10 Updated Mar 4, 2025

Official inference repo for FLUX.1 models

Python 20,676 1,459 Updated Feb 6, 2025

Proxy to enable P2P only cameras to work with standard protocols.

C 184 47 Updated Sep 28, 2018
Python 46 2 Updated Mar 7, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,140 73 Updated Jul 14, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,706 196 Updated Mar 4, 2025

OO for LLMs

Python 654 51 Updated Mar 8, 2025

🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.

Python 155 13 Updated Dec 3, 2024

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Python 5 1 Updated Dec 24, 2024

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 202 7 Updated Jan 13, 2025

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Python 137 14 Updated Oct 17, 2024
Python 145 10 Updated Feb 7, 2025

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 362 18 Updated Jun 11, 2024

💡 LeetCode in C++20/Java/Python/MySQL/TypeScript (respect coding conventions)

C++ 1,208 427 Updated Mar 10, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 16,620 1,082 Updated Mar 9, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,532 423 Updated Mar 10, 2025
Python 905 105 Updated Jan 23, 2025

星辰语义大模型TeleChat2是由中国电信人工智能研究院研发训练的大语言模型,是首个完全国产算力训练并开源的千亿参数模型

Python 222 22 Updated Feb 13, 2025
Python 1,341 51 Updated Nov 21, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,298 141 Updated Mar 8, 2025
Python 29 2 Updated Sep 16, 2024

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python 323 45 Updated Dec 11, 2024
Java 217 2,580 Updated Jan 12, 2024
Java 2,120 2,068 Updated Mar 8, 2025

OK影视、tvbox配置文件,如果喜欢,请Fork自用。使用前请仔细阅读仓库说明,一旦使用将被视为你已了解。

JavaScript 4,269 1,621 Updated Mar 9, 2025
Next