Skip to content
View yysirs's full-sized avatar

Organizations

@cubenlp

Block or report yysirs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Train a 1B LLM with 1T tokens from scratch by personal

Python 492 55 Updated Jan 28, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,419 237 Updated Jan 27, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,655 582 Updated Feb 1, 2025

Financial portfolio optimisation in python, including classical efficient frontier, Black-Litterman, Hierarchical Risk Parity

Jupyter Notebook 4,740 976 Updated Dec 24, 2024

📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance.

Python 3,341 544 Updated May 27, 2024
Python 14 Updated Jul 7, 2023

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,198 452 Updated Feb 1, 2025

【深度学习模型部署框架】支持tf/torch/trt/trtllm/vllm以及更多nn框架,支持dynamic batching、streaming模式,支持python/c++双语言,可限制,可拓展,高性能。帮助用户快速地将模型部署到线上,并通过http/rpc接口方式提供服务。

C++ 152 13 Updated Jan 10, 2025

LLM101n: Let's build a Storyteller

31,232 1,709 Updated Aug 1, 2024

TrustRAG:The RAG Framework within Reliable input,Trusted output

Python 635 59 Updated Jan 23, 2025

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,959 1,016 Updated Jan 14, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 13,495 1,517 Updated Jan 15, 2025

Play ChatGPT and other LLM with Xiaomi AI Speaker

Python 6,417 898 Updated Oct 30, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,730 1,037 Updated Feb 2, 2025

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,132 421 Updated Jul 28, 2024

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…

JavaScript 21,106 4,584 Updated Feb 2, 2025

The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"

Python 259 21 Updated May 9, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,236 185 Updated Jan 16, 2024

A topic-centric list of HQ open datasets.

61,857 10,013 Updated Nov 13, 2024

BIBench:数据分析领域LLM评测基准

14 Updated Mar 2, 2024

Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc.…

Python 2,399 276 Updated Sep 26, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,799 494 Updated Nov 27, 2024

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Python 14,075 1,391 Updated Jan 31, 2025

OCR toolbox from Davar-Lab

Python 744 155 Updated Nov 16, 2023

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,395 475 Updated Jan 27, 2025

An Autonomous LLM Agent for Complex Task Solving

Python 8,137 859 Updated Aug 12, 2024
Python 593 54 Updated Jul 31, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,275 1,089 Updated Feb 2, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 38,617 5,649 Updated Feb 2, 2025
Next