Stars
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and DeepSeek R1-Zero reproduction.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Projects and quiz answers for all courses in Algorithms for Battery Management Systems, including the honors content
A speech interaction system built on open-source models, chaining ASR, LLM, and TTS in sequence. The ASR model is SenseVoice, the LLMs are Qwen2.5-0.5B/1.5B, and there are three …
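RAG interest group: a RAG application written entirely by hand. Most of LangChain's libraries are convenient, but you may not understand the principles behind them, so the code exposes the basic algorithms wherever possible, with the focus on understanding how RAG works
A Guide to Building White-Box Large Models: a fully hand-built Tiny-Universe
A Guide to Using Open-Source Large Models: a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment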
OpenThaiGPT focuses on developing a Thai chatbot system with capabilities equivalent to ChatGPT that can also connect to external systems and retrieve data flexibly. Easil…
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
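A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, covering base models, vertical-domain fine-tunes and applications, datasets, and tutorials.
This project shares the technical principles behind large models along with hands-on experience (LLM engineering and putting LLM applications into production)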
Implementation of the paper DESCN, accepted at SIGKDD 2022.
Experiment code for the SIGKDD '23 paper "Explicit Feature Interaction-aware Uplift Network for Online Marketing"
Reproducing Shalit et al.'s Individual Treatment Effect model. This is a deep neural net that can be applied to various problems in causal inference.
Rankability Enhanced Revenue Uplift Modeling Framework for Online Marketing (KDD 2024)
Python notebooks for the book by Scott Cunningham.
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
An index of algorithms for learning causality with data
🛠 How to apply Causal ML to real-world scenario modeling? How to learn Causal ML? [✔ From Causal ML to uplift modeling in real-world scenarios]
This is a PyTorch implementation of the BST model from Alibaba: https://arxiv.org/pdf/1905.06874.pdf