HUMAN & MACHINE INTELLIGENCE

CONVERSATIONAL AI

Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset

June 27, 2025

Abstract

Human communication involves a complex interplay of verbal and nonverbal signals, essential for conveying meaning and achieving interpersonal goals. To develop socially intelligent AI technologies, it is crucial to develop models that can both comprehend and generate dyadic behavioral dynamics. To this end, we introduce the Seamless Interaction Dataset, a large-scale collection of over 4,000 hours of face-to-face interaction footage from over 4,000 participants in diverse contexts. This dataset enables the development of AI technologies that understand dyadic embodied dynamics, unlocking breakthroughs in virtual agents, telepresence experiences, and multimodal content analysis tools. We also develop a suite of models that utilize the dataset to generate dyadic motion gestures and facial expressions aligned with human speech. These models can take as input both the speech and visual behavior of their interlocutors. We present a variant with speech from an LLM model and integrations with 2D and 3D rendering methods, bringing us closer to interactive virtual agents. Additionally, we describe controllable variants of our motion models that can adapt emotional responses and expressivity levels, as well as generating more semantically-relevant gestures. Finally, we discuss methods for assessing the quality of these dyadic motion models, which are demonstrating the potential for more intuitive and responsive human-AI interactions.

Download the Paper

AUTHORS

Written by

Vasu Agrawal

Akinniyi Akinyemi

Kathryn Alvero

Morteza Behrooz

Julia Buffalini

Fabio Maria Carlucci

Joy Chen

Junming Chen

Zhang Chen

Shiyang Cheng

Praveen Chowdary

Joe Chuang

Antony D'Avirro

Jon Daly

Ning Dong

Mark Duppenthaler

Cynthia Gao

Jeff Girard

Martin Gleize

Sahir Gomez

Hongyu Gong

Srivathsan Govindarajan

Brandon Han

Sen He

Denise Hernandez

Yordan Hristov

Rongjie Huang

Hirofumi Inaguma

Somya Jain

Raj Janardhan

Qingyao Jia

Christopher Klaiber

Dejan Kovachev

Moneish Kumar

Hang Li

Yilei Li

Pavel Litvin

Wei Liu

Guangyao Ma

Jing Ma

Martin Ma

Xutai Ma

Lucas Mantovani

Sagar Miglani

Sreyas Mohan

Louis-Philippe Morency

Evonne Ng

Kam-Woh Ng

Tu Anh Nguyen

Amia Oberai

Benjamin Peloquin

Juan Pino

Jovan Popovic

Omid Poursaeed

Fabian Prada

Alice Rakotoarison

Alexander Richard

Christophe Ropers

Safiyyah Saleem

Vasu Sharma

Alex Shcherbyna

Jie Shen

Anastasis Stathopoulos

Anna Sun

Paden Tomasello

Tuan Tran

Arina Turkatenko

Bo Wan

Chao Wang

Jeff Wang

Mary Williamson

Carleigh Wood

Tao Xiang

Yilin Yang

Zhiyuan Yao

Chen Zhang

Jiemin Zhang

Xinyue Zhang

Jason Zheng

Pavlo Zhyzheria

Jan Zikes

Michael Zollhoefer

Publisher

arXiv

Related Publications

June 05, 2026

CONVERSATIONAL AI

RANKING AND RECOMMENDATIONS

Superintelligent Retrieval Agent: The Next Frontier of Agentic Retrieval

Zeyu Yang, Qi Ma, Jason Chen, Anshumali Shrivastava

June 05, 2026

May 26, 2026

HUMAN & MACHINE INTELLIGENCE

THEORY

Misalignment Between Backpropagation and the Hierarchy of Brain Responses to Images

Josephine Raugel, Max Seitzer, Marc Szafraniec, Huy V. Vo, Jérémy Rapin, Patrick Labatut, Piotr Bojanowski, Valentin Wyart, Jean Remi King

May 26, 2026

May 20, 2026

HUMAN & MACHINE INTELLIGENCE

RESEARCH

EgoBabyVLM: Benchmarking Cross-Modal Learning from Naturalistic Egocentric Video Data

Dongyan Lin, Phillip Rust, Angel Villar Corrales, Alvin W. M. Tan, Mahi Luthra, Charles-Eric Saint-James, Rashel Moritz, Sheila Krogh-Jespersen, Vanessa Stark, Surya Parimi, Jiayi Shen, Youssef Benchekroun, Yosuke Higuchi, Martin Gleize, Tom Fizycki, Nicolas Hamilakis, Manel Khentout, Sho Tsuji, Balázs Kégl, Juan Pino, Michael C. Frank, Emmanuel Dupoux

May 20, 2026

May 18, 2026

CONVERSATIONAL AI

RESEARCH

GIM: Evaluating models via tasks that integrate multiple cognitive domains

Rohit Patel, Alexandre Rezende, Steven McClain

May 18, 2026

Help Us Pioneer The Future of AI

We share our open source frameworks, tools, libraries, and models for everything from research exploration to large-scale production deployment.