June 27, 2025
Human communication involves a complex interplay of verbal and nonverbal signals, essential for conveying meaning and achieving interpersonal goals. To develop socially intelligent AI technologies, it is crucial to develop models that can both comprehend and generate dyadic behavioral dynamics. To this end, we introduce the Seamless Interaction Dataset, a large-scale collection of over 4,000 hours of face-to-face interaction footage from over 4,000 participants in diverse contexts. This dataset enables the development of AI technologies that understand dyadic embodied dynamics, unlocking breakthroughs in virtual agents, telepresence experiences, and multimodal content analysis tools. We also develop a suite of models that utilize the dataset to generate dyadic motion gestures and facial expressions aligned with human speech. These models can take as input both the speech and visual behavior of their interlocutors. We present a variant with speech from an LLM model and integrations with 2D and 3D rendering methods, bringing us closer to interactive virtual agents. Additionally, we describe controllable variants of our motion models that can adapt emotional responses and expressivity levels, as well as generating more semantically-relevant gestures. Finally, we discuss methods for assessing the quality of these dyadic motion models, which are demonstrating the potential for more intuitive and responsive human-AI interactions.
Written by
Morteza Behrooz
Ning Dong
Jeff Girard
Vasu Sharma
Jan Zikes
Akinniyi Akinyemi
Alex Shcherbyna
Alexander Richard
Alice Rakotoarison
Amia Oberai
Anastasis Stathopoulos
Anna Sun
Antony D'Avirro
Arina Turkatenko
Benjamin Peloquin
Bo Wan
Brandon Han
Carleigh Wood
Chao Wang
Chen Zhang
Christophe Ropers
Christopher Klaiber
Cynthia Gao
Dejan Kovachev
Denise Hernandez
Evonne Ng
Fabian Prada
Fabio Maria Carlucci
Guangyao Ma
Hang Li
Hirofumi Inaguma
Jason Zheng
Jeff Wang
Jie Shen
Jiemin Zhang
Jing Ma
Joe Chuang
Jon Daly
Jovan Popovic
Joy Chen
Julia Buffalini
Zhiyuan Yao
Junming Chen
Kam-Woh Ng
Kathryn Alvero
Louis-Philippe Morency
Lucas Mantovani
Mark Duppenthaler
Martin Gleize
Martin Ma
Mary Williamson
Michael Zollhoefer
Moneish Kumar
Omid Poursaeed
Paden Tomasello
Pavel Litvin
Pavlo Zhyzheria
Praveen Chowdary
Qingyao Jia
Raj Janardhan
Rongjie Huang
Safiyyah Saleem
Sagar Miglani
Sahir Gomez
Sen He
Shiyang Cheng
Somya Jain
Sreyas Mohan
Srivathsan Govindarajan
Tao Xiang
Tu Anh Nguyen
Tuan Tran
Vasu Agrawal
Wei Liu
Xinyue Zhang
Xutai Ma
Yilei Li
Yilin Yang
Yordan Hristov
Zhang Chen
Publisher
arXiv
June 05, 2026
Anshumali Shrivastava, Jason Chen, Qi Ma, Zeyu Yang
June 05, 2026
May 26, 2026
Valentin Wyart, Huy V. Vo, Jean Remi King, Josephine Raugel, Jérémy Rapin, Marc Szafraniec, Max Seitzer, Patrick Labatut, Piotr Bojanowski
May 26, 2026
May 20, 2026
Alvin W. M. Tan, Nicolas Hamilakis, Manel Khentout, Sho Tsuji, Balázs Kégl, Michael C. Frank, Angel Villar Corrales, Charles-Eric Saint-James, Dongyan Lin, Emmanuel Dupoux, Jiayi Shen, Juan Pino, Mahi Luthra, Martin Gleize, Phillip Rust, Rashel Moritz, Sheila Krogh-Jespersen, Surya Parimi, Tom Fizycki, Vanessa Stark, Yosuke Higuchi, Youssef Benchekroun
May 20, 2026
May 18, 2026
Alexandre Rezende, Rohit Patel, Steven McClain
May 18, 2026

Our approach
Latest news
Foundational models