Visual Distractor: Vision-Language Models Fooled by Images

Lead
Visual Distractor: Vision-Language Models Fooled by Images

Sep. 2023 – Dec. 2023

  • Proposed a benchmark measuring VLM hallucination vulnerabilities from visual interference in text question answering
  • Built datasets by generating diverse questions on anomalous synthetic images and evaluated VLMs for incorrect responses