Starred repositories

YuLan: An Open-Source Large Language Model

Python 601 56 Updated Jan 10, 2025

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 723 94 Updated Jan 7, 2025

The book 《大语言模型》 (Large Language Models), by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong

2,815 184 Updated Apr 22, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 28,811 2,744 Updated Jan 21, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 11,013 1,111 Updated Jan 16, 2025

Build ChatGPT over your data, all with natural language

Python 6,369 654 Updated Apr 5, 2024

Parse OFD files with Python: ofd2img, ofd2pdf, pdf2ofd, img2ofd (pure-Python OFD parsing)

Python 259 34 Updated Jan 14, 2025

A Python toolbox that loads 172 public time series datasets for machine/deep learning with a single line of code. Datasets from multiple domains including healthcare, financial, power, traffic, weather,…

Python 175 19 Updated Sep 11, 2024

Awesome Deep Learning for Time-Series Imputation, including a must-read paper list about applying neural networks to impute incomplete time series containing NaN missing values/data

Python 246 32 Updated Jun 22, 2024

PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at…

Python 38 5 Updated Dec 16, 2024

A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputatio…

Python 1,211 122 Updated Dec 16, 2024

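A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.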

17,609 1,689 Updated Sep 19, 2024

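This repository records the source code of the author's award-winning solutions from major data science competitions, along with original baselines for some newer competitions. It mainly covers Kaggle, Alibaba Tianchi, the Huawei Cloud campus competition, Baidu AI Studio, Heywhale (和鲸社区), DataFountain, and others.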

Python 1,334 472 Updated Apr 21, 2023

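A repository for pretraining from scratch and then applying SFT to a small-parameter Chinese LLaMa2; it runs on a single 24 GB GPU and yields a chat-llama2 with basic Chinese question-answering ability.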

Python 2,623 322 Updated May 21, 2024

A self-study guide to computer science

HTML 59,804 7,036 Updated Jan 19, 2025

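Fully automated LLM training that requires no fine-tuning knowledge and has a very low barrier to entry. Well suited to complete beginners (currently supports only glm3; more models will be added in the future).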

Python 15 1 Updated May 15, 2024

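This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications).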

HTML 13,145 1,475 Updated Jan 15, 2025

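⛽️ "Algorithm Clearance Handbook" (算法通关手册): a highly detailed tutorial on the fundamentals of algorithms and data structures for learning algorithms from scratch, with detailed solutions to 850+ LeetCode problems and 200 popular big-tech interview problems.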

Python 6,332 1,145 Updated Jan 16, 2025