Skip to content
View liuhaibin's full-sized avatar

Block or report liuhaibin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,362 92 Updated Aug 20, 2024

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Python 3,479 515 Updated Dec 22, 2024

Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.

Python 563 67 Updated Aug 12, 2024

Streamlit — A faster way to build and share data apps.

Python 36,542 3,153 Updated Jan 6, 2025

😱 从源码层面,剖析挖掘互联网行业主流技术的底层实现原理,为广大开发者 “提升技术深度” 提供便利。目前开放 Spring 全家桶,Mybatis、Netty、Dubbo 框架,及 Redis、Tomcat 中间件等

Java 22,375 4,147 Updated Jan 2, 2025

MLX: An array framework for Apple silicon

C++ 18,163 1,046 Updated Jan 5, 2025

深度学习经典、新论文逐段精读

27,696 2,480 Updated Nov 17, 2024

Explore Python's charms by asking WHY questions

1,913 112 Updated Dec 18, 2023

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 18,296 2,215 Updated Nov 13, 2024

记录您对左耳朵耗子(陈皓)的点滴回忆

2,867 208 Updated May 23, 2024

Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!

Python 6,620 1,162 Updated Sep 11, 2024

Book_5_《统计至简》 | 鸢尾花书:从加减乘除到机器学习;上架!

Jupyter Notebook 3,065 633 Updated Sep 11, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,703 2,291 Updated Jun 18, 2024

Location for summaries and analysis of data related to n-CoV 2019, first reported in Wuhan, China

HTML 656 256 Updated Dec 8, 2022

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,552 1,548 Updated May 23, 2024

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,179 1,178 Updated May 28, 2023

Awesome Beancount Resources

HTML 273 37 Updated Dec 28, 2024

The spring.io site and reference application

HTML 3,131 1,506 Updated Apr 19, 2023

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 26,452 8,736 Updated Jan 3, 2025