swagger-coder

Follow

swagger-coder swagger-coder

Follow

10 followers · 28 following

Achievements

Achievements

Lists (23)

Sort

3D

agent

aigc

awesome系列

cv

gpt

llama

LLM

Music-LM

🚀 My stack

sing

sudoku

svc

web工程化

不错的教程

多模态

16 repositories

大模型

工具

持续关注的项目

概率建模

目标检测

25 repositories

知识图谱

语音

Stars

nanbingxyz / 5ire

5ire is a cross-platform desktop AI assistant, MCP client. It compatible with major service providers, supports local knowledge base and tools via model context protocol servers .

TypeScript 2,139 146 Updated Mar 27, 2025

drmingdrmer / md2zhihu

convert markdown to zhihu compatible format.

Python 65 9 Updated Sep 23, 2024

PFCCLab / blog

PFCC 社区博客

TypeScript 10 15 Updated Mar 25, 2025

SkyworkAI / Skywork-R1V

Pioneering Multimodal Reasoning with CoT

Python 1,153 108 Updated Mar 26, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 537 17 Updated Mar 18, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 23,467 2,134 Updated Mar 28, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,425 269 Updated Mar 1, 2025

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,096 70 Updated Jan 7, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,096 305 Updated Mar 28, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 458 16 Updated Mar 29, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 6,956 717 Updated Mar 21, 2025

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,598 188 Updated Mar 24, 2025

Liuziyu77 / Visual-RFT

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,455 65 Updated Mar 19, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

520 30 Updated Mar 28, 2025

Quyans / Drag-Your-Gaussian

Officially implement of the paper "Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting".

44 Updated Feb 7, 2025

Fancy-MLLM / R1-Onevision

R1-onevision, a visual language model capable of deep CoT reasoning.

475 15 Updated Mar 26, 2025

luo3300612 / Visualizer

assistant tools for attention visualization in deep learning

Jupyter Notebook 1,125 87 Updated Jun 9, 2022

facebookresearch / detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 31,601 7,613 Updated Jan 14, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,947 230 Updated Mar 4, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,846 585 Updated Mar 29, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,410 1,443 Updated Mar 10, 2025

ycyy / ollama-gradio-webui

ollama gradio webui

Python 9 4 Updated Mar 7, 2024

abi / screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 69,336 8,536 Updated Mar 20, 2025

jiaqihuang01 / DETRIS

[AAAI-2025] The official code of Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

Python 28 Updated Mar 13, 2025

jingyaogong / minimind-v

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 2,005 215 Updated Feb 23, 2025

castorini / daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

Jupyter Notebook 746 64 Updated Apr 5, 2024

helblazer811 / ManimML

ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.

Python 2,525 152 Updated Jun 22, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,639 128 Updated Aug 13, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 5,168 544 Updated Dec 10, 2024

Liyulingyue / DesktopPet

Forked from llq20133100095/DeskTopPet

一个桌面宠物程序，现在似乎发展成为桌面便签了。桌面便签程序见develop-todolist分支。

Python 9 2 Updated Nov 17, 2024