Stars
5
stars
written in Python
Clear filter
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Noise supression using deep filtering
[EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“