Vision-language models can help robots automate tasks effectively in chaotic environments, augmenting human capabilities.