Replies: 2 comments
-
Hey @loulblemo - this is intended. Abstentions (the LLM saying that it doesn't know) are a desired quality. That said, we're thinking about ways to handle this case better, notably for groundedness evaluations.
-
Hey @loulblemo - wanted to let you know we've updated this treatment. For answer relevance, abstentions will now be counted as not relevant. You can also use a special version of groundedness that assesses answerability. In that version, answerable abstentions are considered not grounded (score 0), and unanswerable abstentions are still considered grounded (score 1).
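To make the rule above concrete, here is a minimal sketch of the scoring logic as a plain lookup. It is not the library's implementation: `abstention_score` and the metric names are hypothetical, and the numeric value 0.0 for "not relevant" is an assumption (the groundedness scores 0 and 1 are the ones stated above).

```python
def abstention_score(metric: str, answerable: bool) -> float:
    """Score given to an abstaining response such as "I don't know".

    Hypothetical helper for illustration only; the metric names are made up.
    """
    if metric == "answer_relevance":
        # Abstentions count as not relevant, whether or not the question
        # was answerable. Using 0.0 as the numeric value is an assumption.
        return 0.0
    if metric == "groundedness_answerability":
        # Answerable question, but the model abstained: not grounded (0).
        # Unanswerable question: abstaining is still grounded (1).
        return 0.0 if answerable else 1.0
    raise ValueError(f"unknown metric: {metric}")


# The context contains the answer, so an abstention here is "answerable".
print(abstention_score("groundedness_answerability", answerable=True))
# The context does not contain the answer, so abstaining is the grounded response.
print(abstention_score("groundedness_answerability", answerable=False))
```

So only the answerability-aware groundedness variant distinguishes between the two kinds of abstention; answer relevance penalizes both.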
-
f_qa_relevance("My favorite color is blue, what is my favorite color?", "I don't know")