Skip to content
View shenfe's full-sized avatar
🌕
I may be slow to respond.
🌕
I may be slow to respond.
  • ByteDance
  • Beijing

Block or report shenfe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,406 382 Updated Jul 16, 2023

Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)

Python 129 16 Updated Sep 17, 2024

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Python 325 14 Updated Apr 15, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,080 486 Updated May 3, 2024

Awesome papers about unifying LLMs and KGs

2,141 160 Updated Oct 20, 2024

Resources of deep learning for mathematical reasoning (DL4MATH).

345 27 Updated Dec 22, 2023

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,622 250 Updated Dec 17, 2024

Safety Score for Pre-Trained Language Models

Python 93 7 Updated Oct 18, 2023

DSPy: The framework for programming—not prompting—language models

Python 20,735 1,565 Updated Jan 3, 2025

Industry leading face manipulation platform

Python 20,717 3,205 Updated Jan 4, 2025

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,418 74 Updated Mar 8, 2024

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 563 58 Updated Jan 4, 2025

Multipack distributed sampler for fast padding-free training of LLMs

Python 182 13 Updated Aug 10, 2024

Model API for GALACTICA

Jupyter Notebook 2,692 278 Updated Mar 5, 2023
Jupyter Notebook 46 5 Updated Nov 17, 2024

[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization

Python 29 4 Updated Sep 12, 2024

Pipeline for pulling and processing online language model pretraining data from the web

Python 175 23 Updated Jul 31, 2023

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

JavaScript 29,694 2,985 Updated Jan 3, 2025

🔥Highlighting the top ML papers every week.

10,615 635 Updated Jan 1, 2025

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 2,058 291 Updated Jan 4, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,199 1,055 Updated Dec 5, 2024

Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.

139 9 Updated Aug 30, 2024

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

1,937 209 Updated Nov 1, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,598 1,878 Updated Apr 30, 2024

Repository for Decomposed Prompting

Python 83 7 Updated Nov 15, 2023

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features…

Python 2,683 362 Updated Jan 5, 2025

Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.

Jupyter Notebook 661 56 Updated Dec 20, 2023

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,387 119 Updated Jun 13, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,624 132 Updated Aug 4, 2024
Next