ECCV2024 Workshop on
Multimodal Agents

1Stanford University, 2Salesforce AI Research, 3Microsoft Research, Redmond 4University of California, Los Angeles 5University of Washington

Time and Venue

Sep 29 or 30, 2024

8:30am - 12:30pm


MiCo Milano, Milan, Italy

Zoom Meeting Link

Call for Papers

For ECCV 2024, we invite paper submissions focused on Multimodal Agents (MMAs), a dynamic field dedicated to creating systems that generate effective actions in various environments by interpreting multimodal sensory inputs. The rise of Large Language Models (LLMs) and Vision-Language Models (VLMs) has led to significant advancements in this area, impacting a broad spectrum from foundational research to practical applications. Our workshop will delve into how these developments integrate with traditional domain-specific technologies, such as visual question answering and vision-language navigation. We are especially interested in contributions that explore:

  • Embodied Systems: The usage of MMAs in embodied systems, both physical and virtual.
  • Interactive Agents: The creation of multimodal agents that can interact with humans.
  • Agent Infrastructure: Infrastructure designed to improve the ability to create MMAs or improve the capabilities of MMAs.
  • Applications: The application of MMAs in diverse research areas including, but not limited to, traditional multimodal understanding tasks, robotics, and gaming.

Submissions should address common interests in these fields, such as data collection, benchmarking, and ethical considerations. This workshop aims to serve as a forum for sharing comprehensive insights and discussing the shared challenges in the field of Multimodal Agents, ultimately contributing to the foundational understanding and further advancement of this new, exciting research area.

We will use this CMT3 submission page for all workshop submissions. All submissions will be due by Friday July 26th Anywhere on Earth (AoE). Authors should strictly adhere to the official ECCV 2024 guidelines and use the ECCV 2024 formatting template for submissions. Papers are limited to 14 pages. In addition to full workshop papers, we also accept extended abstracts via the same submission link. Please adhere to the ECCV 2024 guidelines for extended abstract submissions.

Important Dates

  • Submission deadline: July 26th (AoE)
  • Author Notification: August 8th (Aoe)
  • Camera-ready Deadline: August 15th (AoE)

Workshop Materials

Download links will be available

Timetable Schedule

Time Slot Speaker(s) Details
08:00 - 08:10 Zane Durante Opening Remarks
08:10 - 08:40 TBA Invited Talk
08:40 - 09:10 TBA Invited Talk
09:10 - 09:25 Coffee Break
09:25 - 09:35 Award Winner #1 Spotlight #1
09:35 - 09:45 Award Winner #2 Spotlight #2
09:45 - 11:15 Poster session
11:15 - 12:00 Panel Discussion Current limitations and future challenges for MMAs
12:00 - 12:10 Juan Carlos Niebles Closing Remarks and Awards

Invited Speakers

Our invited speaker list will be updated as we confirm additional speakers and approach the conference date.