Every arena is unique. We need to collect some information to better understand your use case.
Provide as much detail as possible. Include skill to be tested, metrics to measure, what types of AI models or agents you want tested, and your goals for the arena.