A structured framework for evaluating local LLMs using Ollama.
Site under construction. In the meantime, see how the two models designed this page: