Highlights
- Pro
Stars
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
中文医疗问诊大模型MedChatZH,具有中西医问诊、优秀的对话能力 (Computers in Biology and Medchine 2024)
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Official inference repo for FLUX.1 models
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
An Analytical Evaluation Board of Multi-turn LLM Agents
A generative world for general-purpose robotics & embodied AI learning.
Python tool for converting files and office documents to Markdown.
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
Modular and structured prompt caching for low-latency LLM inference
Question and Answer based on Anything.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
白霜拼音:蒹葭苍苍,白露为霜。白霜拼音使用使用745396750字的高质量语料,进行分词,重新统计字频、词频,归一化,打造纯净、词频准确、智能的词库。白霜词库是目前rime方案下最好的开源词库,立志于打造不输于商业输入法的输入体验。
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence