All of Andrew Deece's Comments + Replies

Agents trained with general deep RL algorithms such as self-play (SP) and population-based training (PBT) in collaborative environments are very good at coordinating with themselves. They handle human partners poorly, since they never see humans during training.

Test environment: “Overcooked” is a video game in which players control chefs in a kitchen to cook and serve dishes. Each dish takes several high-level actions, and the challenge is motion coordination.

Agents should learn how to (1) navigate the map, (2) interact with objects, (3) drop the objects off in the right locations, (4)...