SLM RAG Arena - Compare and Find the Best Sub-5B Models for RAG
🏟️ This arena evaluates how well small language models (under 5B parameters) answer questions based on document contexts.
📝 Instructions:
- Click the "Get a Question" button to load a random question with context
- Review the query and context to understand the information provided to the models
- Compare the answers generated by two different models for answer quality or appropriate refusal
- Cast your vote for the better response, select 'Tie' if both are equally good, or 'Neither' if both are inadequate
💬 Query - Question About Document Content
Click "Get a Question" to start
This arena tests how well different AI models answer questions from retrieved contexts, using standardized questions and contexts. All models see exactly the same inputs, ensuring a fair comparison.
We don't allow file uploads here as that would change what we're measuring. Instead, check our leaderboard to find top-performing models for your needs. We'll soon launch a separate playground where you can test models with your own files.
📋 Context - Retrieved Content from the Document
🔍 Compare Models - Are These Grounded, Complete Answers or Correct Refusals?
🏅 Cast Your Vote
✅ Vote Submitted!
Model A was:
Model B was:
SLM RAG Leaderboard
About Elo Ratings
The Elo rating system gives a more informative ranking than raw win rates because it accounts for opponent strength:
- All models start at 1500 points
- Points are exchanged after each comparison based on the expected outcome (see the sketch after this list)
- Beating a stronger model earns more points than beating a weaker one
- The ± value shows the 95% confidence interval on each rating
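
As a concrete illustration, here is a minimal sketch of a standard Elo update in Python. The K-factor of 32, the tie score of 0.5, and the function names are illustrative assumptions, not the arena's actual implementation.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

def elo_update(rating_a: float, rating_b: float,
               score_a: float, k: float = 32.0) -> tuple[float, float]:
    """Return the updated (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A wins, 0.0 if A loses, and 0.5 for a tie.
    k is an assumed K-factor; arenas tune this value.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    return rating_a + delta, rating_b - delta

# Both models start at 1500. An upset moves more points than an expected win:
print(elo_update(1400, 1600, score_a=1.0))  # underdog wins -> large gain (~+24)
print(elo_update(1600, 1400, score_a=1.0))  # favorite wins -> small gain (~+8)
```

This is why beating a stronger model earns more points: the expected score against a higher-rated opponent is small, so a win produces a large update.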