HUMAN & MACHINE INTELLIGENCE

CONVERSATIONAL AI

Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset

June 27, 2025

Abstract

Human communication involves a complex interplay of verbal and nonverbal signals, essential for conveying meaning and achieving interpersonal goals. To develop socially intelligent AI technologies, it is crucial to develop models that can both comprehend and generate dyadic behavioral dynamics. To this end, we introduce the Seamless Interaction Dataset, a large-scale collection of over 4,000 hours of face-to-face interaction footage from over 4,000 participants in diverse contexts. This dataset enables the development of AI technologies that understand dyadic embodied dynamics, unlocking breakthroughs in virtual agents, telepresence experiences, and multimodal content analysis tools. We also develop a suite of models that utilize the dataset to generate dyadic motion gestures and facial expressions aligned with human speech. These models can take as input both the speech and visual behavior of their interlocutors. We present a variant with speech from an LLM model and integrations with 2D and 3D rendering methods, bringing us closer to interactive virtual agents. Additionally, we describe controllable variants of our motion models that can adapt emotional responses and expressivity levels, as well as generating more semantically-relevant gestures. Finally, we discuss methods for assessing the quality of these dyadic motion models, which are demonstrating the potential for more intuitive and responsive human-AI interactions.

Download the Paper

AUTHORS

Written by

Morteza Behrooz

Ning Dong

Jeff Girard

Vasu Sharma

Jan Zikes

Akinniyi Akinyemi

Alex Shcherbyna

Alexander Richard

Alice Rakotoarison

Amia Oberai

Anastasis Stathopoulos

Anna Sun

Antony D'Avirro

Arina Turkatenko

Benjamin Peloquin

Bo Wan

Brandon Han

Carleigh Wood

Chao Wang

Chen Zhang

Christophe Ropers

Christopher Klaiber

Cynthia Gao

Dejan Kovachev

Denise Hernandez

Evonne Ng

Fabian Prada

Fabio Maria Carlucci

Guangyao Ma

Hang Li

Hirofumi Inaguma

Hongyu Gong

Jason Zheng

Jeff Wang

Jie Shen

Jiemin Zhang

Jing Ma

Joe Chuang

Jon Daly

Jovan Popovic

Joy Chen

Juan Pino

Julia Buffalini

Zhiyuan Yao

Junming Chen

Kam-Woh Ng

Kathryn Alvero

Louis-Philippe Morency

Lucas Mantovani

Mark Duppenthaler

Martin Gleize

Martin Ma

Mary Williamson

Michael Zollhoefer

Moneish Kumar

Omid Poursaeed

Paden Tomasello

Pavel Litvin

Pavlo Zhyzheria

Praveen Chowdary

Qingyao Jia

Raj Janardhan

Rongjie Huang

Safiyyah Saleem

Sagar Miglani

Sahir Gomez

Sen He

Shiyang Cheng

Somya Jain

Sreyas Mohan

Srivathsan Govindarajan

Tao Xiang

Tu Anh Nguyen

Tuan Tran

Vasu Agrawal

Wei Liu

Xinyue Zhang

Xutai Ma

Yilei Li

Yilin Yang

Yordan Hristov

Zhang Chen

Publisher

arXiv

Related Publications

June 05, 2026

CONVERSATIONAL AI

RANKING AND RECOMMENDATIONS

Superintelligent Retrieval Agent: The Next Frontier of Agentic Retrieval

Anshumali Shrivastava, Jason Chen, Qi Ma, Zeyu Yang

June 05, 2026

May 26, 2026

HUMAN & MACHINE INTELLIGENCE

THEORY

Misalignment Between Backpropagation and the Hierarchy of Brain Responses to Images

Valentin Wyart, Huy V. Vo, Jean Remi King, Josephine Raugel, Jérémy Rapin, Marc Szafraniec, Max Seitzer, Patrick Labatut, Piotr Bojanowski

May 26, 2026

May 20, 2026

HUMAN & MACHINE INTELLIGENCE

RESEARCH

EgoBabyVLM: Benchmarking Cross-Modal Learning from Naturalistic Egocentric Video Data

Alvin W. M. Tan, Nicolas Hamilakis, Manel Khentout, Sho Tsuji, Balázs Kégl, Michael C. Frank, Angel Villar Corrales, Charles-Eric Saint-James, Dongyan Lin, Emmanuel Dupoux, Jiayi Shen, Juan Pino, Mahi Luthra, Martin Gleize, Phillip Rust, Rashel Moritz, Sheila Krogh-Jespersen, Surya Parimi, Tom Fizycki, Vanessa Stark, Yosuke Higuchi, Youssef Benchekroun

May 20, 2026

May 18, 2026

CONVERSATIONAL AI

RESEARCH

GIM: Evaluating models via tasks that integrate multiple cognitive domains

Alexandre Rezende, Rohit Patel, Steven McClain

May 18, 2026

Help Us Pioneer The Future of AI

We share our open source frameworks, tools, libraries, and models for everything from research exploration to large-scale production deployment.