Juan Pino

RESEARCH SCIENTIST | MENLO PARK, UNITED STATES

Juan is a Research Scientist at Facebook AI Research in Menlo Park. He studied machine translation at the University of Cambridge. Juan is currently interested in developing end-to-end models for speech translation as well as models that are simultaneous (i.e. they work like human interpreters and begin generating a translation before consuming the entirety of the input text or the input audio).

Juan's Publications

June 27, 2025

HUMAN & MACHINE INTELLIGENCE

CONVERSATIONAL AI

Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset

Morteza Behrooz, Ning Dong, Jeff Girard, Vasu Sharma, Jan Zikes, Akinniyi Akinyemi, Alex Shcherbyna, Alexander Richard, Alice Rakotoarison, Amia Oberai, Anastasis Stathopoulos, Anna Sun, Antony D'Avirro, Arina Turkatenko, Benjamin Peloquin, Bo Wan, Brandon Han, Carleigh Wood, Chao Wang, Chen Zhang, Christophe Ropers, Christopher Klaiber, Cynthia Gao, Dejan Kovachev, Denise Hernandez, Evonne Ng, Fabian Prada, Fabio Maria Carlucci, Guangyao Ma, Hang Li, Hirofumi Inaguma, Hongyu Gong, Jason Zheng, Jeff Wang, Jie Shen, Jiemin Zhang, Jing Ma, Joe Chuang, Jon Daly, Jovan Popovic, Joy Chen, Juan Pino, Julia Buffalini, Zhiyuan Yao, Junming Chen, Kam-Woh Ng, Kathryn Alvero, Louis-Philippe Morency, Lucas Mantovani, Mark Duppenthaler, Martin Gleize, Martin Ma, Mary Williamson, Michael Zollhoefer, Moneish Kumar, Omid Poursaeed, Paden Tomasello, Pavel Litvin, Pavlo Zhyzheria, Praveen Chowdary, Qingyao Jia, Raj Janardhan, Rongjie Huang, Safiyyah Saleem, Sagar Miglani, Sahir Gomez, Sen He, Shiyang Cheng, Somya Jain, Sreyas Mohan, Srivathsan Govindarajan, Tao Xiang, Tu Anh Nguyen, Tuan Tran, Vasu Agrawal, Wei Liu, Xinyue Zhang, Xutai Ma, Yilei Li, Yilin Yang, Yordan Hristov, Zhang Chen

June 27, 2025

November 30, 2023

SPEECH & AUDIO

NLP

Seamless: Multilingual Expressive and Streaming Speech Translation

Seamless Communication, Elahe Kalbassi, Xutai Ma, Abinesh Ramakrishnan, Alexandre Mourachko, Alice Rakotoarison, Amanda Kallet, Yu-An Chung, Ann Lee, Anna Sun, Artyom Kozhevnikov, Benjamin Peloquin, Bokai Yu, Brian Ellis, Can Balioglu, Carleigh Wood, Changhan Wang, Christophe Ropers, Christophe Touret, Christopher Klaiber, Corinne Wong, Cynthia Gao, Daniel Licht, David Dale, Ethan Ye, Gabriel Mejia Gonzalez, Guillaume Wenzek, Hady Elsahar, Hirofumi Inaguma, Holger Schwenk, Hongyu Gong, Ilia Kulikov, Ivan Evtimov, Jean Maillard, Jeff Wang, John Hoffman, Juan Pino, Justin Haaheim, Justine Kao, Prangthip Hansanti, Kaushik Ram Sadagopan, Kevin Heffernan, Loïc Barrault, Maha Elbayad, Mariano Coria Meglioli, Mark Duppenthaler, Marta R. Costa-jussà, Mary Williamson, Min-Jae Hwang, Ning Dong, Francisco Guzmán, Paden Tomasello, Paul-Ambroise Duquenne, Peng-Jen Chen, Pengwei Li, Pierre Andrews, Pierre Fernandez, Robin San Roman, Ruslan Mavlyutov, Safiyyah Saleem, Skyler Wang, Somya Jain, Sravya Popuri, Tuan Tran, Yilin Yang

November 30, 2023

August 22, 2023

SPEECH & AUDIO

NLP

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Seamless Communication, Safiyyah Saleem, Abinesh Ramakrishnan, Alexandre Mourachko, Alice Rakotoarison, Amanda Kallet, Andy Chung, Ann Lee, Anna Sun, Bapi Akula, Benjamin Peloquin, Bernie Huang, Bokai Yu, Brian Ellis, Can Balioglu, Carleigh Wood, Changhan Wang, Christophe Ropers, Christopher Klaiber, Cynthia Gao, Daniel Li (FAIR), Daniel Licht, David Dale, Elahe Kalbassi, Ethan Ye, Gabriel Mejia Gonzalez, Guillaume Wenzek, Hady Elsahar, Hirofumi Inaguma, Holger Schwenk, Hongyu Gong, Igor Tufanov, Ilia Kulikov, Janice Lam, Jean Maillard, Jeff Wang (PM - AI), John Hoffman, Juan Pino, Justin Haaheim, Justine Kao, Prangthip Hasanti, Kaushik Ram Sadagopan, Kevin Heffernan, Kevin Tran, Loic Barrault, Maha Elbayad, Marta R. Costa-jussa, Mohamed Ramadan, Naji El Hachem, Ning Dong (AI), Onur Çelebi, Paco Guzmán, Paden Tomasello, Paul-Ambroise Duquenne, Peng-Jen Chen, Pengwei Li, Pierre Andrews, Ruslan Mavlyutov, Russ Howes, Skyler Wang, Somya Jain, Sravya Popuri, Tuan Tran, Vish Vogeti, Xutai Ma, Yilin Yang

August 22, 2023

August 19, 2023

NLP

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Changhan Wang, Bowen Shi, Juan Pino, Mohamed Anwar, Vedanuj Goswami, Wei-Ning Hsu

August 19, 2023

December 06, 2021

NLP

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling

Hongyu Gong, Juan Pino, Xian Li, Yun Tang

December 06, 2021

October 26, 2020

SPEECH & AUDIO

NLP

Self-Supervised Representations Improve End-to-End Speech Translation

Anne Wu, Changhan Wang, Jiatao Gu, Juan Miguel Pino

October 26, 2020

October 23, 2020

RESEARCH

SPEECH & AUDIO

Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation

Changhan Wang, Jiatao Gu, Juan Pino

October 23, 2020

April 30, 2020

RESEARCH

NLP

SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

Liezl Puzon, Juan Pino, Arya McCarthy

April 30, 2020

November 25, 2019

Findings of the First Shared Task onMachine Translation Robustness

Xian Li, Juan Pino, Philipp Koehn, Antonios Anastasopoulos, Graham Neubig, Hassan Sajjad, Nadir K.Durrani, Orhan Firat, Paul Michel, Yonatan Belinkov

November 25, 2019

August 23, 2019

RESEARCH

SPEECH & AUDIO

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions

Vishrav Chaudhary, Juan Pino, Paco Guzmán, Philipp Koehn

August 23, 2019

August 15, 2019

RESEARCH

SPEECH & AUDIO

On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models

Xian Li, Juan Pino, Graham Neubig, Paul Michel

August 15, 2019