Can LLMs successfully identify and push back on inherently nonsensical prompts,
or do they confidently hallucinate their way through fabricated scenarios?
Green = called out the nonsense. 💩 = took the bait.