repo · GitHub
eval-harness-plus
An extensible evaluation harness for LLMs.
⌥ PATH·NHow're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]→LEvaluate a model properly→Reval-harness-plus
Related across the graph
Topics