tju01

Follow

tju01 tju01

Follow

Organizations

Stars

emrgnt-cmplxty / zero-shot-replication

Python 73 7 Updated Sep 5, 2023

declare-lab / instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Python 544 46 Updated Mar 10, 2024

g588928812 / FastChat_eval

Forked from lm-sys/FastChat

using eval part of FastChat to evaluate the current mess of open-source LLMs

HTML 2 1 Updated Jun 8, 2023