Situated Image Understanding in a Multi-Agent Framework