Python Eval - Search News

Support for LiteLLM models as Eval Judges

Is your feature request related to a problem? Please describe. I am using LiteLLM models for agents and would like to use the same models for eval judges. atm, it appears only Google API models are ...

GitHub

Eval fails for non-English languages

When evaluating text in other languages (e.g., Thai, etc.), the eval logic incorrectly returns mismatches (Match score: 0)— even when the evaluated expression ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Support for LiteLLM models as Eval Judges

Eval fails for non-English languages

Trending now