Skip to content
View fangyuan-ksgk's full-sized avatar
:electron:
Researching on MARL
:electron:
Researching on MARL

Block or report fangyuan-ksgk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

s1: Simple test-time scaling

Python 5,093 566 Updated Feb 12, 2025

A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.

Python 738 120 Updated Dec 16, 2024

Memento is a Python app that records everything you do on your computer and lets you go back in time, search, and chat with a LLM (Large Language Model) to find back information about what you did.

Python 589 49 Updated Apr 23, 2024

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,230 48 Updated Jan 12, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 9,789 1,269 Updated Feb 1, 2025

RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.

Python 779 50 Updated Feb 12, 2025

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 937 42 Updated Feb 1, 2025

Benchmarking physical understanding in generative video models

Python 108 8 Updated Feb 11, 2025

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 256 8 Updated Jan 15, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,221 556 Updated Oct 19, 2024

Implementation snake game based on Diffusion model

Python 85 5 Updated Jan 9, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 23,772 2,031 Updated Feb 12, 2025

A small open source 3D agent simulator based on LLM.

Python 55 8 Updated Dec 1, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,449 464 Updated Feb 12, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 1,845 108 Updated Feb 6, 2025

experiment with concept encoding in LLM

Jupyter Notebook 1 Updated Jan 3, 2025

An AI Hedge Fund Team

Python 8,142 1,651 Updated Feb 10, 2025

Official Implementation of Iterative Graph Alignment https://arxiv.org/abs/2408.16667

Python 2 Updated Dec 5, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 702 90 Updated Feb 3, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,788 376 Updated Jul 11, 2024

NeurIPS 2024 tutorial on LLM Inference

Jupyter Notebook 39 2 Updated Dec 10, 2024

Next Generation Visual Programming System

TypeScript 3,722 89 Updated Feb 3, 2025

End-to-end Generative Optimization for AI Agents

Python 472 35 Updated Feb 12, 2025

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

Python 741 114 Updated Dec 14, 2023

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 3,905 364 Updated Feb 8, 2025

Visualize any repo or codebase into diagram or animation

Python 13 2 Updated Oct 14, 2024

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 702 56 Updated May 7, 2024

Original implementation of "3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes"

Python 276 10 Updated Jan 15, 2025

Model Context Protocol Servers

JavaScript 8,576 998 Updated Feb 12, 2025

Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?

Python 102 3 Updated Feb 11, 2025
Next