gritlm Public
Forked from lawliet19189/gritlm
Generative Representational Instruction Tuning
Jupyter Notebook MIT License Updated Nov 5, 2024
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Sep 24, 2024
licensed-pile Public
Forked from r-three/common-pile
Repo to hold code and track issues for the collection of permissively licensed data
Python MIT License Updated May 26, 2024
open_lm Public
Forked from mlfoundations/open_lm
A repository for research on medium sized language models.
Python MIT License Updated Apr 25, 2024
sgpt Public
SGPT: GPT Sentence Embeddings for Semantic Search
open-instruct Public
Forked from allenai/open-instruct
FastChat Public
Forked from lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
bigcode-evaluation-harness Public
Forked from bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Python Apache License 2.0 Updated Feb 5, 2024
alpaca_eval Public
Forked from tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook Apache License 2.0 Updated Dec 25, 2023
mteb Public
Forked from embeddings-benchmark/mteb
Massive Text Embedding Benchmark - Internal Development Git
Python Apache License 2.0 Updated Dec 5, 2023
lm-evaluation-harness Public
Forked from bigscience-workshop/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Python MIT License Updated Oct 14, 2023
vilio Public
🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle
promptsource Public
Forked from bigscience-workshop/promptsource
Toolkit for creating, sharing and using natural language prompts.
FLAN Public
Forked from uSaiPrashanth/FLAN
Provides a minimal implementation to extract FLAN datasets for further processing
Megatron-DeepSpeed Public
Forked from lintangsutawika/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python Other Updated Dec 30, 2022
prompt_semantics Public
Forked from awebson/prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
matrixshapes Public
Language modelling task to infer shapes of matrices - one of the most difficult tasks for models like GPT-3 and GPT-J
ytclipcc Public
Create Captions for any YouTube Clip using Wav2Vec2
t-zero Public
Forked from bigscience-workshop/t-zero
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
bigscience Public
Forked from bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Shell Other Updated Jul 16, 2022
sentence-transformers Public
Forked from UKPLab/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT