MUStReason: A Benchmark for Diagnosing Pragmatic Reasoning in Video-LMs for Multimodal Sarcasm Detection
Anzeige
Ähnliche Artikel
arXiv – cs.AI
•
A Unified Geometric Space Bridging AI Models and the Human Brain
VentureBeat – AI
•
Mistral launches its own AI Studio for quick development with its European open source, proprietary models
arXiv – cs.AI
•
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models
arXiv – cs.AI
•
LM Fight Arena: LMMs im Kampf – neues Benchmark für Echtzeit-Strategie
arXiv – cs.AI
•
Guiding Evolution of Artificial Life Using Vision-Language Models
arXiv – cs.LG
•
Is GPT-4o mini Blinded by its Own Safety Filters? Exposing the Multimodal-to-Unimodal Bottleneck in Hate Speech Detection