Skip to content
View xxzhao1997's full-sized avatar

Block or report xxzhao1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

YuLan: An Open-Source Large Language Model

Python 619 57 Updated Jan 10, 2025

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 766 99 Updated Feb 22, 2025

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

CSS 3,088 206 Updated Mar 5, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 43,018 3,833 Updated Mar 6, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 13,093 1,348 Updated Mar 5, 2025

Build ChatGPT over your data, all with natural language

Python 6,418 659 Updated Apr 5, 2024

use python parse OFD file: ofd2img ofd2pdf pdf2ofd img2ofd ;(纯 python的ofd解析)

Python 309 43 Updated Feb 28, 2025

a Python toolbox loads 172 public time series datasets for machine/deep learning with a single line of code. Datasets from multiple domains including healthcare, financial, power, traffic, weather,…

Python 184 19 Updated Sep 11, 2024

Awesome Deep Learning for Time-Series Imputation, including an unmissable paper list about applying neural networks to impute incomplete time series containing NaN missing values/data

Python 265 33 Updated Feb 10, 2025

PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at…

Python 42 5 Updated Feb 3, 2025

A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputatio…

Python 1,286 128 Updated Mar 6, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

18,685 1,800 Updated Sep 19, 2024

该仓库用于记录作者本人参加的各大数据科学竞赛的获奖方案源码以及一些新比赛的原创baseline. 主要涵盖:kaggle, 阿里天池,华为云大赛校园赛,百度aistudio,和鲸社区,datafountain等

Python 1,336 472 Updated Apr 21, 2023

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,709 325 Updated May 21, 2024

计算机自学指南

HTML 60,833 7,117 Updated Mar 3, 2025

全自动大模型llm训练,无需微调知识,门槛极低。极其适合零基础的人使用(目前暂时只支持glm3,未来会增加更多模型)

Python 16 1 Updated May 15, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 14,975 1,733 Updated Mar 2, 2025

⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。

Python 6,521 1,166 Updated Jan 16, 2025