Join us in Davies Auditorium, New Haven, CT, USA, April 10th, 2026, for a deep dive into Multi-modal representation learning and foundation models.
Machine learning's progress with isolated modalities—text, vision, audio—masks a fundamental gap: intelligence emerges from integrating diverse data sources. Developing unified models that reason coherently across modalities remains one of AI's defining challenges.
Multimodal learning bridges this gap by enabling systems to jointly learn from and align information across heterogeneous data sources. This paradigm promises richer representations, stronger generalization, and more robust reasoning, with implications for science, healthcare, robotics, and human-centered AI.
This workshop brings together researchers and practitioners exploring the foundations and frontiers of multimodal intelligence. We seek to spark cross-disciplinary discussion on architectures, alignment strategies, and applications that will shape the future of AI systems that understand the world as we do.
We are accepting papers for oral and poster presentations. Paper deadline: March 25, 2026. See the Call for Contributions.
Submit your poster or register for early attendance via the Submission & Registration Form ↗. Early registration deadline: April 3, 2026.
1 page Extended abstract (excl. refs/appendix), non‑archival by default.
| Time | Session |
|---|---|
| 8:15–8:50 | Registration and poster setup |
| 8:50–9:00 | Opening remarks |
| 9:00–9:50 | Keynote: Ruslan Salakhutdinov (Carnegie Mellon University) |
| 9:50–10:40 | Keynote: James Duncan (Yale University) |
| 10:40–11:05 | Coffee break & discussions |
| 11:05–11:55 | Highlighted talks: Manling Li (Northwestern), Ruohan Gao (UMD) |
| 11:55–12:10 | Remarks: Jeffrey Brock, Dean, Yale School of Engineering & Applied Science |
| 12:10–1:40 | Lunch break & poster session |
| 1:40–2:05 | Highlighted talks: Kayhan Batmanghelich (BU) |
| 2:05–2:55 | Keynote: Atlas Wang (UT Austin) |
| 2:55–3:20 | Coffee break & discussions |
| 3:20–4:10 | Keynote: Jian Pei (Duke) |
| 4:10–5:00 | Highlighted talks: Yunzhu Li (Columbia), Yifeng Gao (UTRGV) |
| 5:00–6:00 | Panel: All keynote speakers |
| 6:00–6:10 | Concluding remarks |
| 6:10–8:00 | Social |
Extended organizers shown where noted.
Unfortunately, we do not have an option to attend virtually.
Your submission will not appear in formal proceedings or be indexed; you retain the right to publish elsewhere.
Email us: hiren.madhu@yale.edu
Proudly sponsored by the Yale School of Engineering and Applied Science AI Seed Grant and Futurewei Technologies.