HUMAN & MACHINE INTELLIGENCE

CONVERSATIONAL AI

Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset

June 27, 2025

Abstract

Human communication involves a complex interplay of verbal and nonverbal signals, essential for conveying meaning and achieving interpersonal goals. To develop socially intelligent AI technologies, it is crucial to develop models that can both comprehend and generate dyadic behavioral dynamics. To this end, we introduce the Seamless Interaction Dataset, a large-scale collection of over 4,000 hours of face-to-face interaction footage from over 4,000 participants in diverse contexts. This dataset enables the development of AI technologies that understand dyadic embodied dynamics, unlocking breakthroughs in virtual agents, telepresence experiences, and multimodal content analysis tools. We also develop a suite of models that utilize the dataset to generate dyadic motion gestures and facial expressions aligned with human speech. These models can take as input both the speech and visual behavior of their interlocutors. We present a variant with speech from an LLM model and integrations with 2D and 3D rendering methods, bringing us closer to interactive virtual agents. Additionally, we describe controllable variants of our motion models that can adapt emotional responses and expressivity levels, as well as generating more semantically-relevant gestures. Finally, we discuss methods for assessing the quality of these dyadic motion models, which are demonstrating the potential for more intuitive and responsive human-AI interactions.

Download the Paper

AUTHORS

Written by

Morteza Behrooz

Ning Dong

Jeff Girard

Vasu Sharma

Jan Zikes

Akinniyi Akinyemi

Alex Shcherbyna

Alexander Richard

Alice Rakotoarison

Amia Oberai

Anastasis Stathopoulos

Anna Sun

Antony D'Avirro

Arina Turkatenko

Benjamin Peloquin

Bo Wan

Brandon Han

Carleigh Wood

Chao Wang

Chen Zhang

Christophe Ropers

Christopher Klaiber

Cynthia Gao

Dejan Kovachev

Denise Hernandez

Evonne Ng

Fabian Prada

Fabio Maria Carlucci

Guangyao Ma

Hang Li

Hirofumi Inaguma

Hongyu Gong

Jason Zheng

Jeff Wang

Jie Shen

Jiemin Zhang

Jing Ma

Joe Chuang

Jon Daly

Jovan Popovic

Joy Chen

Juan Pino

Julia Buffalini

Zhiyuan Yao

Junming Chen

Kam-Woh Ng

Kathryn Alvero

Louis-Philippe Morency

Lucas Mantovani

Mark Duppenthaler

Martin Gleize

Martin Ma

Mary Williamson

Michael Zollhoefer

Moneish Kumar

Omid Poursaeed

Paden Tomasello

Pavel Litvin

Pavlo Zhyzheria

Praveen Chowdary

Qingyao Jia

Raj Janardhan

Rongjie Huang

Safiyyah Saleem

Sagar Miglani

Sahir Gomez

Sen He

Shiyang Cheng

Somya Jain

Sreyas Mohan

Srivathsan Govindarajan

Tao Xiang

Tu Anh Nguyen

Tuan Tran

Vasu Agrawal

Wei Liu

Xinyue Zhang

Xutai Ma

Yilei Li

Yilin Yang

Yordan Hristov

Zhang Chen

Publisher

arXiv

Related Publications

May 12, 2026

HUMAN & MACHINE INTELLIGENCE

RESEARCH

NeuralSet: A High-Performing Python Package for Neuro-AI

Corentin Bel, Linnea Evanson, Julien Gadonneix, Andrea Santos Revilla, Mingfang (Lucy) Zhang, Julie Bonnaire, Charlotte Caucheteux, Alexandre Défossez, Théo Desbordes, Pablo Diego-Simón, Shubh Khanna, Juliette Millet, Pierre Orhan, Saarang Panchavati, Antoine Ratouchniak, Alexis Thual, Hubert Jacob Banville, Jarod Levy, Jean Remi King, Josephine Raugel, Jérémy Rapin, Katelyn Begany, Marlene Careil, Simon Dahan, Sophia Houhamdi, Stéphane d'Ascoli, Teon Brooks, Yohann Benchetrit

May 12, 2026

May 06, 2026

HUMAN & MACHINE INTELLIGENCE

RESEARCH

NeuralBench: A Unifying Framework to Benchmark NeuroAI Models

Saarang Panchavati, Antoine Ratouchniak, Mingfang (Lucy) Zhang, Elisa Cascardi, Hubert Banville, Jarod Levy, Jean-Rémi King, Jérémy Rapin, Katelyn Begany, Marlene Careil, Simon Dahan, Stéphane d'Ascoli, Teon Brooks, Yohann Benchetrit

May 06, 2026

April 09, 2026

HUMAN & MACHINE INTELLIGENCE

COMPUTER VISION

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Lei Zhang, Junjiao Tian, Kunpeng Li, Jialiang Wang, Weifeng Chen, Yuxiao Bao, Julian McAuley, Manling Li, Zecheng He, Felix Xu, Markos Georgopoulos, Zhipeng Fan

April 09, 2026

March 26, 2026

HUMAN & MACHINE INTELLIGENCE

A foundation model of vision, audition, and language for in-silico neuroscience

Hubert Jacob Banville, Jean Remi King, Josephine Raugel, Jérémy Rapin, Katelyn Begany, Stéphane d'Ascoli, Teon Brooks, Yohann Benchetrit

March 26, 2026

Help Us Pioneer The Future of AI

We share our open source frameworks, tools, libraries, and models for everything from research exploration to large-scale production deployment.