Skip to content
View HuangKexinSpace's full-sized avatar

Highlights

  • Pro

Block or report HuangKexinSpace

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,459 871 Updated May 22, 2025

Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.

Python 4,185 229 Updated May 22, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,708 505 Updated May 22, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 29,002 1,982 Updated Apr 28, 2025
Jupyter Notebook 1,520 168 Updated Mar 17, 2025

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 9,542 1,287 Updated May 1, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,917 1,911 Updated May 22, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 40,193 5,152 Updated Aug 16, 2024
Python 1 Updated Mar 18, 2025

🔥Char detection base on crnn 字符(单字)检测基于CRNN

Python 81 12 Updated May 16, 2023

超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

C++ 12,120 2,285 Updated Aug 14, 2023

南方科技大学研究生学位论文LaTeX模板

TeX 250 55 Updated Apr 24, 2025
Jupyter Notebook 53 5 Updated Jun 10, 2024

A C# implementation of the WebSocket protocol client and server

C# 5,902 1,675 Updated May 4, 2025
JavaScript 2 Updated Apr 20, 2023

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 98,642 14,821 Updated May 23, 2025

Ollama Python library

Python 7,637 689 Updated May 22, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 141,379 11,839 Updated May 23, 2025

A simple implementation of the Google NotebookLM Audio overview function. You can run 💬 DIY Podcast Generator 🎙️ on your PC and generate a podcast video with captions.

Python 9 1 Updated Oct 22, 2024

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 34,851 3,220 Updated May 23, 2025
Python 20 Updated Apr 29, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,054 5,219 Updated Jun 27, 2024

Source code and demo for memory bank and SiliconFriend

Python 278 39 Updated May 24, 2023

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 41,799 5,969 Updated May 23, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 107,981 17,577 Updated May 22, 2025

An enterprise-class UI design language and React UI library

TypeScript 94,702 52,407 Updated May 23, 2025

🤖 Components Library for Quickly Building LLM Chat Interfaces.

TypeScript 837 113 Updated Nov 26, 2024

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

HTML 14,213 43,420 Updated May 14, 2025
Next