opentensorAI-connector-template Public
Forked from opentensor/opentensorAI-connector-template
JavaScript Updated May 9, 2024
axolotl Public
Forked from axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Python Apache License 2.0 Updated Jan 16, 2024
autocrit Public
Forked from CarperAI/autocrit
A repository for transformer critique learning and generation
Python Updated Oct 2, 2023
llama-trl Public
Forked from jasonvanf/llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Jupyter Notebook Apache License 2.0 Updated Sep 25, 2023
OpenLLaMA2 Public
Forked from OpenRLHF/OpenRLHF
A Ray-based High-performance LLaMA2 RLHF framework
Python Apache License 2.0 Updated Sep 25, 2023
safe-rlhf Public
Forked from PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
direct-preference-optimization Public
Forked from eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Python Apache License 2.0 Updated Sep 8, 2023
airoboros Public
Forked from jondurbin/airoboros
Customizable implementation of the self-instruct paper.
Python Apache License 2.0 Updated Sep 7, 2023
substrate-indexer Public
indexer for substrate chain (bt)
TypeScript MIT License Updated Aug 24, 2023
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Aug 16, 2023
text-generation-inference Public
Forked from huggingface/text-generation-inference
Large Language Model Text Generation Inference
langfuse Public
Forked from langfuse/langfuse
open-source observability for LLM applications
TypeScript Other Updated Jul 25, 2023
pfrl Public
Forked from pfnet/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Python MIT License Updated Jul 16, 2023
validators Public
Forked from opentensor/validators
Repository for bittensor validators
Python MIT License Updated Jul 6, 2023
ColossalAI Public
Forked from hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Python Apache License 2.0 Updated Apr 11, 2023
langchain Public
Forked from langchain-ai/langchain
⚡ Building applications with LLMs through composability ⚡
langflow Public
Forked from langflow-ai/langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
TypeScript MIT License Updated Apr 1, 2023
alpaca-weight Public
Forked from clcarwin/alpaca-weight
Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.
alpaca-lora Public
Forked from tloen/alpaca-lora
Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware
Jupyter Notebook Apache License 2.0 Updated Mar 15, 2023
H3 Public
Forked from HazyResearch/H3
Language Modeling with the H3 State Space Model
Assembly Apache License 2.0 Updated Feb 8, 2023
trlx Public
Forked from CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)