Semantic and emotional exceptions in Brazilian Portuguese

Public

This benchmark tests how LLMs handle semantic and emotional exceptions in Brazilian Portuguese expressions.

sociology, cc-by-4.0, Public Submissions

Citation Information

The questions were written by Matteo Sisti.

Total Prompts

Scored Responses: 1055

Contributors

Average Overall Score: 32%

Model Leaderboard

Rank	Model	Avg. Score	Prompts Tested	Avg. Response Time
1	x-ai/grok-4.1-fast	0.44	92	14ms
2	x-ai/grok-4-fast	0.37	92	6ms
3	google/gemini-2.5-flash-lite	0.33	92	2ms
4	meta-llama/llama-4-maverick	0.27	92	3ms
5	meta-llama/llama-3.3-70b-instruct:free	0.14	88	3ms

Collaborators & Contributors

Matteo Sisti
Owner, Polytechnic Institute of Turin
b449250d-ff0f-4ecd-afcb-240768028b9f

mik*****@forest-ai.org
Admin, Warsaw University of Technology
28c9e78b-a9a0-4e42-92b2-c81bf439a762

ang*****@acad.ufsm.br
Contributor, Universidade Federal de Santa Maria
20ffe446-054e-4efb-a799-931572afce63

ham*****@leads.edu.pk
Contributor, Lahore Leads University
f6de769c-b75f-4e59-b242-933b017eece1

sev*****@marun.edu.tr
Contributor, Marmara University
ee5fba3b-175f-45e4-9c34-d065f4d6a3c7

