yiming258

yiming258

5 followers · 12 following

Stars

deepseek-ai / DeepSeek-R1

56,115 6,932 Updated Feb 1, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 15,265 1,441 Updated Jan 30, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,202 408 Updated Jan 30, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,881 822 Updated Feb 1, 2025

jianchang512 / pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。

Python 11,655 1,303 Updated Jan 13, 2025

Chenyme / Chenyme-AAVT

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

Python 2,090 178 Updated Aug 23, 2024

buxuku / video-subtitle-master

批量为视频或者音频生成字幕，并可批量将字幕翻译成其它语言。这是一个客户端工具, 跨平台支持 mac 和 windows 系统, 支持百度，火山，deeplx, openai, deepseek, ollama 等多个翻译服务

TypeScript 829 54 Updated Jan 14, 2025

buxuku / VideoSubtitleGenerator

批量为本地视频生成字幕文件，并可将字幕文件翻译成其它语言，跨平台支持 window, mac 系统

JavaScript 648 52 Updated Dec 12, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 13,817 1,155 Updated Jan 1, 2025

gptzm / multibot-chat

MultiBot Chat 是一个基于 Streamlit 的多机器人聊天应用，支持多种大语言模型（LLM）API，包括 OpenAI、AzureOpenAI、ChatGLM、CoZe、Qwen、Ollama、XingHuo、DeepSeek、Moonshot、Yi 和 Groq。这个应用允许用户同时与多个 AI 聊天机器人进行对话，比较不同模型的回答，并进行群聊式的讨论。

Python 13 Updated Jan 22, 2025

ryo-ma / gpt-assistants-api-ui

💬 OpenAI Assistants API chat UI 🛠️ It works easily by setting the ASSISTANT ID 📁 Supports file upload and file download 🏃 Supports Streaming API 🪟 Support to Azure OpenAI

Python 231 189 Updated Sep 23, 2024

THUDM / ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,264 66 Updated Jan 24, 2025

jeinlee1991 / chinese-llm-benchmark

中文大模型能力评测榜单：目前已囊括164个大模型，覆盖chatgpt、gpt-4o、谷歌gemini、Claude3.5、百度文心一言、千问、百川、讯飞星火、商汤senseChat、minimax等商用模型，以及deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、书生internLM2.5等开源大模型。不仅提供能力评分排行榜，也提供所有模型的原始输出结果！

3,375 152 Updated Jan 29, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,199 452 Updated Feb 2, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 4,937 571 Updated Oct 22, 2024

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,075 547 Updated Oct 24, 2024

alibaba / Megatron-LLaMA

Forked from NVIDIA/Megatron-LM

Best practice for training LLaMA models in Megatron-LM

Python 641 56 Updated Jan 2, 2024

RLHFlow / Online-RLHF

A recipe for online RLHF and online iterative DPO.

Python 467 51 Updated Dec 28, 2024

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 1,127 80 Updated Jan 22, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,632 1,483 Updated Jan 27, 2025

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

Python 9,520 934 Updated Feb 2, 2025

X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,401 177 Updated Jan 23, 2025

VisionRush / DeepFakeDefenders

Image forgery recognition algorithm

Python 613 75 Updated Sep 9, 2024

Liuziyu77 / MMDU

Official repository of MMDU dataset

Python 83 1 Updated Sep 29, 2024

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,732 164 Updated Jan 22, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 19,868 1,390 Updated Jan 31, 2025

ai-forever / Kandinsky-3

Python 335 31 Updated Jan 19, 2025

Kwai-Kolors / Kolors

Kolors Team

Python 4,136 308 Updated Nov 13, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,911 528 Updated Dec 25, 2024

alipay / Ant-Multi-Modal-Framework

Research Code for Multimodal-Cognition Team in Ant Group

Python 134 5 Updated Jul 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly