Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Isabeau Levito was dinged for under-rotating her triple loop and got leveled down for her step sequence, which is where she tends to pick up points on the competition. It left her in eighth place and ...