Skip to content
View yiming258's full-sized avatar

Block or report yiming258

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast and memory-efficient exact attention

Python 15,265 1,441 Updated Jan 30, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,202 408 Updated Jan 30, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,881 822 Updated Feb 1, 2025

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。

Python 11,655 1,303 Updated Jan 13, 2025

这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。

Python 2,090 178 Updated Aug 23, 2024

批量为视频或者音频生成字幕,并可批量将字幕翻译成其它语言。这是一个客户端工具, 跨平台支持 mac 和 windows 系统, 支持百度,火山,deeplx, openai, deepseek, ollama 等多个翻译服务

TypeScript 829 54 Updated Jan 14, 2025

批量为本地视频生成字幕文件,并可将字幕文件翻译成其它语言, 跨平台支持 window, mac 系统

JavaScript 648 52 Updated Dec 12, 2024

Faster Whisper transcription with CTranslate2

Python 13,817 1,155 Updated Jan 1, 2025

MultiBot Chat 是一个基于 Streamlit 的多机器人聊天应用,支持多种大语言模型(LLM)API,包括 OpenAI、AzureOpenAI、ChatGLM、CoZe、Qwen、Ollama、XingHuo、DeepSeek、Moonshot、Yi 和 Groq。这个应用允许用户同时与多个 AI 聊天机器人进行对话,比较不同模型的回答,并进行群聊式的讨论。

Python 13 Updated Jan 22, 2025

💬 OpenAI Assistants API chat UI 🛠️ It works easily by setting the ASSISTANT ID 📁 Supports file upload and file download 🏃 Supports Streaming API 🪟 Support to Azure OpenAI

Python 231 189 Updated Sep 23, 2024

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,264 66 Updated Jan 24, 2025

中文大模型能力评测榜单:目前已囊括164个大模型,覆盖chatgpt、gpt-4o、谷歌gemini、Claude3.5、百度文心一言、千问、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、书生internLM2.5等开源大模型。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!

3,375 152 Updated Jan 29, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,199 452 Updated Feb 2, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,937 571 Updated Oct 22, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,075 547 Updated Oct 24, 2024

Best practice for training LLaMA models in Megatron-LM

Python 641 56 Updated Jan 2, 2024

A recipe for online RLHF and online iterative DPO.

Python 467 51 Updated Dec 28, 2024

Recipes to train reward model for RLHF.

Python 1,127 80 Updated Jan 22, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,632 1,483 Updated Jan 27, 2025

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Python 9,520 934 Updated Feb 2, 2025

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,401 177 Updated Jan 23, 2025

Image forgery recognition algorithm

Python 613 75 Updated Sep 9, 2024

Official repository of MMDU dataset

Python 83 1 Updated Sep 29, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,732 164 Updated Jan 22, 2025

Official inference repo for FLUX.1 models

Python 19,868 1,390 Updated Jan 31, 2025
Python 335 31 Updated Jan 19, 2025

Kolors Team

Python 4,136 308 Updated Nov 13, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,911 528 Updated Dec 25, 2024

Research Code for Multimodal-Cognition Team in Ant Group

Python 134 5 Updated Jul 11, 2024
Next