Skip to content
View wangerzi's full-sized avatar
Busy
Busy

Organizations

@LiberSonora

Block or report wangerzi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀 「大模型」3小时从0训练27M参数的视觉多模态VLM!🌏 Train a 27M-parameter VLM from scratch in just 3 hours!

Python 902 91 Updated Dec 13, 2024

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero

Python 16,530 1,305 Updated Feb 3, 2025

🚀🚀 「大模型」50分钟完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 50 min!

Python 7,947 812 Updated Dec 13, 2024

🎉 Elegant and powerful theme for Hexo.

JavaScript 2,532 452 Updated Jan 27, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,194 1,303 Updated Jan 27, 2025

批量为视频或者音频生成字幕,并可批量将字幕翻译成其它语言。这是一个客户端工具, 跨平台支持 mac 和 windows 系统, 支持百度,火山,deeplx, openai, deepseek, ollama 等多个翻译服务

TypeScript 935 60 Updated Feb 6, 2025

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,378 890 Updated Feb 5, 2025

Sealos is a production-ready Kubernetes distribution. You can create any programming language and any framework development Env, create high availability databases like mysql/pgsql/redis/mongo, and…

TypeScript 14,869 2,157 Updated Feb 7, 2025

本项目是一个用于翻译数据集的工具,支持通过命令行脚本调用进行数据集多语言翻译。

Python 1 Updated Jan 13, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 4,806 405 Updated Jul 30, 2024

《道诡异仙》李火旺 sharegpt 数据集和大模型

Python 2 Updated Jan 16, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 16,109 1,035 Updated Feb 7, 2025

Fast and memory-efficient exact attention

Python 15,345 1,445 Updated Feb 8, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 12,133 1,384 Updated Feb 5, 2025

Flutter移动端+桌面端三方网易云播放器

Dart 1,285 47 Updated Jan 9, 2025

LiberSonora,寓意“自由的声音”,是一个 AI 赋能的、强大的、开源有声书工具集,包含智能字幕提取、AI标题生成、多语言翻译等功能,支持 GPU 加速、批量离线处理。LiberSonora, meaning "The Voice of Freedom," is an AI-powered robust open-source audiobook toolkit.

Python 73 6 Updated Feb 6, 2025

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Python 5,761 1,116 Updated Dec 26, 2024

A list of Free Software network services and web applications which can be hosted on your own servers

215,487 10,195 Updated Feb 5, 2025

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 7,079 1,866 Updated Jun 1, 2024

Vim-fork focused on extensibility and usability

Vim Script 85,985 5,853 Updated Feb 8, 2025

Based on RapidOCR, extract the PDF content.

Python 140 15 Updated Aug 28, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,688 4,871 Updated Feb 8, 2025

一些大语言模型和多模态模型的应用,主要包括Rag,小模型,Agent,跨模态搜索,OCR等等

Python 147 7 Updated Nov 6, 2024

Analysis of Chinese and English layouts 中英文版面分析

Python 161 8 Updated Dec 24, 2024

📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

C++ 522 62 Updated May 15, 2024

检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.

Python 55 Updated Dec 10, 2024

整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.

Python 495 46 Updated Jan 17, 2025
Next