About this project
A decision-engine proof of concept (POC) for AI unit economics. It applies user-defined weightings for accuracy, latency, and cost to rank candidate models and select the best fit for production. It is built exclusively on Nebius Token Factory and evaluates 40+ open-weight LLMs (including Llama 3, Mixtral, and Qwen) through one unified API.
Technologies
app
benchmark