ING-VP: MLLMs cannot Play Easy Vision-based Games Yet.

AllVideos News Images Maps Shopping Books

As multimodal large language models (MLLMs) continue to demonstrate increasingly competitive performance across a broad spectrum of tasks, more intricate and comprehensive benchmarks have been developed to assess these cutting-edge models.

_{Oct 9, 2024}

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet - arXiv

arxiv.org › cs

About Featured Snippets

ING-VP: MLLMs Cannot Play Easy Vision-based Games Yet

openreview.net › forum

Sep 26, 2024 · This paper proposed ING-VP, a new benchmark that can be used to test the zero-shot performance of MLLMs on visual interactive games. They ...

Paper page - ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

huggingface.co › papers

Oct 10, 2024 · We present ING-VP, the first INteractive Game-based Vision Planning benchmark, specifically designed to evaluate the spatial imagination and multi-step ...

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

www.researchgate.net › publication › 38...

We present ING-VP, the first INteractive Game-based Vision Planning benchmark, specifically designed to evaluate the spatial imagination and multi-step ...

arXivGPT on X: "🏷️:ING-VP: MLLMs cannot Play Easy Vision-based ...

twitter.com › arXivGPT › status

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet :https://t.co/RaVPoGD3S7.

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

www.aimodels.fyi › papers › arxiv › ing-...

Oct 9, 2024 · The researchers have developed a new benchmark, called ING-VP, to specifically assess the spatial imagination and multi-step reasoning abilities ...

Ge Zhang on X: "[1/n] ### All Current MLLMs Cannot Play Easy Vision ...

twitter.com › GeZhang86038849 › status

Nov 2, 2024 · Highlights of ING-VP： ♻️Multimodal interactive environment ♟️Six classic games: Sokoban, Maze, 8-queens, Sudoku, Tower or Hanoi, 15-puzzles ...

ICLR 2025 Conference Submissions | OpenReview

openreview.net › submissions › Conferen...

ING-VP: MLLMs Cannot Play Easy Vision-based Games Yet. ICLR 2025 Conference ... Legendre-KAN : High Accuracy KA Network Based on Legendre Polynomials.

Hangyu Guo | Papers With Code

paperswithcode.com › author › hangyu-...

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet · 1 code implementation ... To bridge this gap, we present ING-VP, the first INteractive Game-based ...

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

huggingface.co › papers

Nov 25, 2024 · We introduce BALROG, a novel benchmark designed to assess the agentic capabilities of LLMs and VLMs through a diverse set of challenging games.