Abstract: This paper introduces the human-curated Pandas-PlotBench dataset, designed to evaluate language models’ effectiveness as assistants in visual data exploration. Our benchmark focuses on ...
Brick builders and gamers, unite! LEGO has an exciting and unique new challenge coming up this week. Called LEGO Code Break, the new game challenges players to solve clues and find hidden codes ...