Retrieval and Retrieval-augmented LLMs

Python 8,676 629 Updated Feb 13, 2025

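This project is an evaluation experiment on the recall techniques and algorithm performance of the Retrieve stage in RAG. The main framework used is LlamaIndex.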

Jupyter Notebook 212 24 Updated Dec 22, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,399 200 Updated Aug 11, 2024

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 984 99 Updated Apr 27, 2024

Example models using DeepSpeed

Python 6,317 1,068 Updated Feb 14, 2025

[NIPS2023] RRHF & Wombat

Python 802 49 Updated Sep 22, 2023

Making large AI models cheaper, faster and more accessible

Python 40,434 4,471 Updated Feb 25, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,588 480 Updated Jan 8, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,694 507 Updated Jul 18, 2024

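Wenda: an LLM invocation platform. It targets efficient content generation for specific environments, while taking into account the limited computing resources of individuals and small and medium-sized enterprises, as well as knowledge security and privacy concerns.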

JavaScript 6,270 811 Updated Jan 23, 2025

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,189 556 Updated Oct 24, 2024

Fine-tune the BLOOM large language model with the LoRA method

Python 31 2 Updated Jun 9, 2023

Due to the license restrictions of LLaMA, we try to reimplement BLOOM-LoRA (the BLOOM license is much less restrictive, see https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json

Jupyter Notebook 185 39 Updated Jun 18, 2023

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 8,058 768 Updated Oct 16, 2024

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Python 911 161 Updated Apr 26, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,813 2,223 Updated Jul 29, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,846 4,063 Updated Jul 17, 2024

A fine-tuning approach based on ChatGLM-6B + LoRA

Python 3,766 445 Updated Nov 25, 2023

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Python 764 80 Updated Jul 29, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,811 1,026 Updated Feb 22, 2025

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 4,132 361 Updated May 6, 2023

Stable Diffusion web UI

Python 148,536 27,757 Updated Feb 18, 2025

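Chinese version of Auto-GPT and enthusiast community; kept in sync with the original project; AI startups; self-media organization; using AI for work, study, creation, and monetization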

Python 2,415 403 Updated Sep 25, 2023

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,026 5,234 Updated Jun 27, 2024

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,716 1,889 Updated Apr 30, 2024

Reverse engineered ChatGPT API

Python 28,062 4,484 Updated Aug 2, 2023

Demonstrates the reasoning behind solutions to LeetCode problems in the form of animations.

Java 75,716 13,985 Updated Aug 14, 2023