
Starred repositories
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
MTEB: Massive Text Embedding Benchmark
TypeChat is a library that makes it easy to build natural language interfaces using types.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
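Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step to gain a deep understanding of how large models work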
A good looking terminal emulator which mimics the old cathode display...
BARTScore: Evaluating Generated Text as Text Generation
MathEval is a benchmark dedicated to the holistic evaluation of the mathematical capabilities of LLMs.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
Code and Data Repo for ACL'24 Paper "Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models"
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
The official repository of the Omni-MATH benchmark.
🖥 Control your display's brightness & volume on your Mac as if it was a native Apple Display. Use Apple Keyboard keys or custom shortcuts. Shows the native macOS OSDs.
This is the repository of the Ape210K dataset and baseline models.
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
Installer & activator for Microsoft Office on macOS
A repo listing papers related to LLM-based agents
Xiaomi Mobile Phone Kernel OpenSource
Awesome IoT. A collaborative list of great resources about IoT frameworks, libraries, operating systems, and platforms
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
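A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.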