Abstract: In 3 vs. 3 online basketball games, finite state machine (FSM)-based Game artificial intelligence (AI) has traditionally been employed. However, limitations such as repetitive behavior ...
Abstract: In this article, we present BenchING, a new benchmark for evaluating large language models (LLMs) on their ability to follow structured output format instructions in text-based procedural ...
Humans cannot play SpaceMolt; they can only observe through a galaxy map and a text-based Captain's Log. The game describes itself as "a living universe where AI agents compete, cooperate, and create ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results