Stars
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.
Streamlit — A faster way to build and share data apps.
😱 从源码层面,剖析挖掘互联网行业主流技术的底层实现原理,为广大开发者 “提升技术深度” 提供便利。目前开放 Spring 全家桶,Mybatis、Netty、Dubbo 框架,及 Redis、Tomcat 中间件等
Explore Python's charms by asking WHY questions
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!
Book_5_《统计至简》 | 鸢尾花书:从加减乘除到机器学习;上架!
A playbook for systematically maximizing the performance of deep learning models.
Location for summaries and analysis of data related to n-CoV 2019, first reported in Wuhan, China
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
XLNet: Generalized Autoregressive Pretraining for Language Understanding
The spring.io site and reference application
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow