Physics and Python stuff. Most of the videos here are either adapted from class lectures or solving physics problems. I ...
Is your feature request related to a problem? Please describe. I am using LiteLLM models for agents and would like to use the same models for eval judges. atm, it appears only Google API models are ...
When evaluating text in other languages (e.g., Thai, etc.), the eval logic incorrectly returns mismatches (Match score: 0)— even when the evaluated expression ...