A fluent UI for ADB on Windows

C# 431 34 Updated Dec 31, 2024

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 21,085 1,664 Updated Jan 4, 2025

PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves the layout, supporting services such as Google/DeepL/Ollama/OpenAI, available via CLI/GUI/Docker

Python 13,486 985 Updated Jan 4, 2025

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 774 58 Updated Oct 11, 2024

Python tool for converting files and office documents to Markdown.

Python 31,543 1,295 Updated Jan 4, 2025

Learning Flow Fields in Attention for Controllable Person Image Generation

Python 834 80 Updated Jan 3, 2025

🚀 Generate a smart commit message with Kimi AI support for IntelliJ, PyCharm, WebStorm, and GoLand

Kotlin 50 5 Updated Apr 29, 2024

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 4,821 239 Updated Jan 3, 2025

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,640 631 Updated Apr 9, 2024

CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.

Python 1,682 134 Updated Aug 25, 2024

Simple, unified interface to multiple Generative AI providers

Python 9,549 856 Updated Jan 2, 2025

An LLM-based Web Navigating Agent (KDD'24)

Python 776 63 Updated Sep 27, 2024
Python 177 8 Updated Nov 22, 2024

GLM-4-Voice | End-to-end Chinese-English speech dialogue model

Python 2,519 201 Updated Dec 5, 2024

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,006 116 Updated Dec 24, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,474 566 Updated Dec 31, 2024

PDF to Markdown with vision models

Python 7,721 457 Updated Dec 18, 2024

Get your documents ready for gen AI

Python 17,283 902 Updated Jan 3, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,401 422 Updated Jan 2, 2025

Implemented with Clean Architecture, Hilt, MVVM, LiveData, RX, Retrofit2, Room, Anko

Kotlin 457 85 Updated Jul 27, 2023

Android project written in Java with pure http handler and RxJava + Hilt + MVVM

Java 2 Updated Feb 1, 2024

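🎹 KingKeyboard is a custom keyboard. It ships with built-in keyboards for a range of input scenarios, including but not limited to mixed, letter, number, phone, ID card, and license plate input. It also supports customization. Integration is simple, and the keyboard is customizable.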

Java 252 35 Updated Jun 20, 2024

Official inference framework for 1-bit LLMs

C++ 12,542 879 Updated Dec 20, 2024

Convert PDF to markdown + JSON quickly with high accuracy

Python 19,013 1,113 Updated Jan 3, 2025

Android permission request framework, adapted for Android 14

Java 5,915 796 Updated Jul 17, 2024

Vision model based document ingestion

Python 1,288 65 Updated Jan 4, 2025

Wireless APK installation over Wi-Fi on Android

Java 623 139 Updated Jun 2, 2020

🧑‍🚀 Summary of the world's best LLM resources.

2,844 332 Updated Jan 4, 2025

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

Python 334 35 Updated Aug 13, 2024

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 7,342 576 Updated Dec 24, 2024