This benchmark tests how LLMs handle semantic and emotional exceptions in Brazilian Portuguese expressions.
Total Prompts: 47
Scored Responses: 689
Contributors: 2
Avg Score: 0.3
| Rank | Model | Avg. Score | Prompts Tested | Avg. Response Time |
|---|---|---|---|---|
| 🥇 1 | z-ai/glm-4.6 | 0.40 | 47 | 30ms |
| 🥈 2 | deepseek/deepseek-chat | 0.38 | 47 | 4ms |
| 🥉 3 | qwen/qwen3-235b-a22b-2507 | 0.37 | 47 | 5ms |
| 4 | meta-llama/llama-4-maverick | 0.36 | 47 | 5ms |
| 5 | x-ai/grok-4 | 0.35 | 47 | 54ms |
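
For readers who want to work with these figures, here is a minimal Python sketch that reproduces the ranking from the five rows listed above. The names `LEADERBOARD`, `rank_models`, and `top5_mean` are hypothetical and not part of the benchmark's own tooling. The mean it computes covers only these five models, so it need not match the page-wide Avg Score.

```python
from statistics import mean

# Rows copied from the leaderboard table: (model, avg_score, prompts_tested).
# The structure and names here are illustrative assumptions, not the site's schema.
LEADERBOARD = [
    ("z-ai/glm-4.6", 0.40, 47),
    ("deepseek/deepseek-chat", 0.38, 47),
    ("qwen/qwen3-235b-a22b-2507", 0.37, 47),
    ("meta-llama/llama-4-maverick", 0.36, 47),
    ("x-ai/grok-4", 0.35, 47),
]


def rank_models(rows):
    """Sort rows by average score (highest first) and attach a 1-based rank."""
    ordered = sorted(rows, key=lambda row: row[1], reverse=True)
    return [(rank, model, score) for rank, (model, score, _) in enumerate(ordered, start=1)]


ranked = rank_models(LEADERBOARD)

# Mean of the five listed per-model averages, rounded to three decimals.
top5_mean = round(mean(score for _, score, _ in LEADERBOARD), 3)

for rank, model, score in ranked:
    print(f"{rank}. {model}: {score:.2f}")
print(f"Mean of listed models: {top5_mean}")
```

Sorting on the score column alone mirrors the order shown in the table; ties, if any appeared, would keep their input order because Python's sort is stable.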