Is your feature request related to a problem? Please describe. I am using LiteLLM models for agents and would like to use the same models for eval judges. At the moment, it appears only Google API models are ...
When evaluating text in other languages (e.g., Thai), the eval logic incorrectly returns mismatches (Match score: 0), even when the evaluated expression ...