Skip to content
View sharkwyf's full-sized avatar

Highlights

  • Pro

Block or report sharkwyf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

    Python Apache License 2.0 Updated Nov 14, 2024
  • Jupyter Notebook MIT License Updated Sep 29, 2024
  • VITA Public

    Forked from VITA-MLLM/VITA

    ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

    Python Other Updated Sep 9, 2024
  • ragflow Public

    Forked from infiniflow/ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

    Python Apache License 2.0 Updated Aug 16, 2024
  • graphrag Public

    Forked from microsoft/graphrag

    A modular graph-based Retrieval-Augmented Generation (RAG) system

    Python MIT License Updated Aug 16, 2024
  • HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

    Jupyter Notebook MIT License Updated Aug 16, 2024
  • Python MIT License Updated Aug 15, 2024
  • cgdt Public

    [AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning

    Python 7 MIT License Updated Aug 1, 2024
  • dify Public

    Forked from langgenius/dify

    An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack r…

    TypeScript Other Updated Jul 29, 2024
  • SimPO Public

    Forked from princeton-nlp/SimPO

    SimPO: Simple Preference Optimization with a Reference-Free Reward

    Python Updated Jun 2, 2024
  • RepoAgent Public

    Forked from OpenBMB/RepoAgent

    An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

    Python 1 Apache License 2.0 Updated Mar 19, 2024
  • langflow Public

    Forked from langflow-ai/langflow

    ⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

    Python MIT License Updated Mar 6, 2024
  • trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python Apache License 2.0 Updated Jan 13, 2024
  • Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

    Python Apache License 2.0 Updated Dec 13, 2023
  • neuralmmo Public

    Forked from NeuralMMO/baselines

    Baselines for Neural MMO -- new users should treat this repo as a starter project

    Python MIT License Updated Nov 15, 2023
  • agenta Public

    Forked from Agenta-AI/agenta

    The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.

    TypeScript MIT License Updated Oct 26, 2023
  • Example models using DeepSpeed

    Python Apache License 2.0 Updated Sep 26, 2023
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python Apache License 2.0 Updated Aug 20, 2023
  • IVR Public

    Forked from ryanxhr/IVR

    Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

    Python MIT License Updated Jul 27, 2023
  • 🕸 A Node app for creating a Feed Reader in Notion.

    JavaScript MIT License Updated Jun 2, 2023
  • Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

    Python Other Updated May 29, 2023
  • PDT Public

    Forked from zhxieml/PDT

    Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer

    Python MIT License Updated May 28, 2023
  • dreamerv3 Public

    Forked from danijar/dreamerv3

    Mastering Diverse Domains through World Models

    Python MIT License Updated Feb 22, 2023
  • FrozenBiLM Public

    Forked from antoyang/FrozenBiLM

    [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

    Python Apache License 2.0 Updated Jan 28, 2023
  • ASE Public

    Forked from nv-tlabs/ASE
    Python Other Updated Jan 25, 2023
  • DI-engine Public

    Forked from opendilab/DI-engine

    OpenDILab Decision AI Engine

    Python Apache License 2.0 Updated Dec 12, 2022
  • basalt_2022 Public

    Python MIT License Updated Oct 31, 2022
  • Online Decision Transformer

    Python Other Updated Oct 17, 2022
  • MineDojo Public

    Forked from MineDojo/MineDojo

    Modified actions space to MineRL style

    Java MIT License Updated Oct 17, 2022
  • MotionCLIP Public

    Forked from GuyTevet/MotionCLIP

    Official Pytorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"

    Python MIT License Updated Oct 12, 2022