Loading
Loading
This benchmark tests how LLM manage semantic and emotional exception in Brazilian Portuguese expressions
The questions are made by Matteo Sisti
62
Total Prompts
725
Scored Responses
4
Contributors
30%
Average Overall Score
| Rank | Model | Avg. Score | Prompts Tested | Avg. Response Time |
|---|---|---|---|---|
| Rank | Model | Avg. Score | Prompts Tested | Avg. Response Time |
|---|---|---|---|---|
1 | x-ai/grok-4.1-fast | 0.64 | 11 | 12ms |
2 | z-ai/glm-4.6 | 0.38 | 47 | 30ms |
3 | qwen/qwen3-235b-a22b-2507 | 0.38 | 47 | 5ms |
4 | deepseek/deepseek-chat | 0.36 | 47 | 4ms |
5 | x-ai/grok-4 | 0.36 | 47 | 55ms |