-
SUSTech
- Shenzhen
-
13:46
(UTC +08:00) - https://huangkexinspace.github.io/
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Toolkit for linearizing PDFs for LLM datasets/training
Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🔥Char detection base on crnn 字符(单字)检测基于CRNN
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
A C# implementation of the WebSocket protocol client and server
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
A simple implementation of the Google NotebookLM Audio overview function. You can run 💬 DIY Podcast Generator 🎙️ on your PC and generate a podcast video with captions.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Source code and demo for memory bank and SiliconFriend
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🦜🔗 Build context-aware reasoning applications
An enterprise-class UI design language and React UI library
🤖 Components Library for Quickly Building LLM Chat Interfaces.
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.