You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Make it possible for evaluation metrics that have categorical/text values
Motivation
What problem are you trying to solve: key approach within the company I currently work at is evaluation of LLM's with categorical rankings: e.g. low, medium, high, level of hallucination
**How are you currently solving this problem?:**Currently I generate plots based on the data manually after a run
What are the benefits of this feature?: This would make evaluation of model performance a lot easier
The text was updated successfully, but these errors were encountered:
Proposal summary
Make it possible for evaluation metrics that have categorical/text values
Motivation
What problem are you trying to solve: key approach within the company I currently work at is evaluation of LLM's with categorical rankings: e.g. low, medium, high, level of hallucination
**How are you currently solving this problem?:**Currently I generate plots based on the data manually after a run
What are the benefits of this feature?: This would make evaluation of model performance a lot easier
The text was updated successfully, but these errors were encountered: