
AllenAI's post-training codebase

Python 2,829 364 Updated Mar 22, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 16,920 1,104 Updated Mar 20, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 23,217 1,425 Updated Mar 20, 2025

SpatialLM: Large Language Model for Spatial Understanding

Python 1,476 92 Updated Mar 21, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,299 694 Updated Mar 21, 2025

Universal markup converter

Haskell 36,484 3,467 Updated Mar 22, 2025

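An intelligent work-and-life assistant app built with Flutter that supports calling the large-model AI APIs of many cloud platforms. Beyond the usual large-model features, it includes everyday tools such as minimalist bookkeeping, a random dish picker, a cat and dog home, waifu images, MAL anime rankings, BGM anime news, and diet and health tracking. (Continuously updated...)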

Dart 38 7 Updated Mar 21, 2025

The sample app showcasing Tencent Cloud Chat integration with Flutter across iOS, Android, Web, macOS, and Windows platforms.

C 111 45 Updated Mar 10, 2025

The reinforcement learning training code for AgiBot X1.

Python 1,402 437 Updated Oct 23, 2024

The inference module for AgiBot X1.

C++ 1,565 474 Updated Nov 22, 2024

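A chatbot built on large language models that can be connected to WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. It offers a choice of GPT3.5/GPT-4o/GPT-o1/DeepSeek/Claude/ERNIE Bot (文心一言)/iFlytek Spark (讯飞星火)/Tongyi Qianwen (通义千问)/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service bots on top of your own knowledge base.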

Python 35,863 9,019 Updated Feb 5, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 46,099 4,221 Updated Mar 22, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 19,501 2,430 Updated Mar 22, 2025

VoceChat Web App

TypeScript 1,877 193 Updated Feb 23, 2025

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 9,066 1,225 Updated Mar 5, 2025

free online AI resume editor

TypeScript 1,209 155 Updated Mar 20, 2025

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 943 65 Updated Mar 21, 2025

Taming Stable Diffusion for Lip Sync!

Python 3,176 478 Updated Mar 21, 2025

Truly independent web browser

C++ 36,221 1,513 Updated Mar 22, 2025

The official Python API for ElevenLabs Text to Speech.

Python 2,447 296 Updated Mar 13, 2025

Olares: An Open-Source Sovereign Cloud OS for Local AI

Shell 1,924 63 Updated Mar 22, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 9,053 926 Updated Mar 20, 2025

Make websites accessible for AI agents

Python 46,991 4,865 Updated Mar 22, 2025

Making Docker and Kubernetes management easy.

TypeScript 32,389 2,552 Updated Mar 21, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 71,892 7,794 Updated Mar 22, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,344 394 Updated Feb 27, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 31,437 3,177 Updated Jan 7, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,785 2,382 Updated Mar 22, 2025