Francisco (Paco) Guzmán

RESEARCH SCIENTIST | MENLO PARK, UNITED STATES

Paco Guzmán is a Research Scientist working on Translations. His research has been focused on several aspects of Machine Translation including low-resource translation, translation mining, evaluation, and quality estimation.

Over the years, Paco has been a speaker and panelist on several events dedicated to increasing diversity in AI. He co-founded the Facebook-Georgia Tech co-teaching program.

Before joining Facebook in 2016, Paco was a Research Scientist at Qatar Computing Research Institute in Qatar in 2012-2016. He obtained his PhD in 2011 from ITESM (Monterrey Tech) in Mexico. Paco visited Carnegie Mellon University in 2008-2009 where he worked at the Language Technologies Institute.

Francisco's Work

Francisco's Publications

November 30, 2023

SPEECH & AUDIO

NLP

Seamless: Multilingual Expressive and Streaming Speech Translation

Seamless Communication, Elahe Kalbassi, Xutai Ma, Abinesh Ramakrishnan, Alexandre Mourachko, Alice Rakotoarison, Amanda Kallet, Yu-An Chung, Ann Lee, Anna Sun, Artyom Kozhevnikov, Benjamin Peloquin, Bokai Yu, Brian Ellis, Can Balioglu, Carleigh Wood, Changhan Wang, Christophe Ropers, Christophe Touret, Christopher Klaiber, Corinne Wong, Cynthia Gao, Daniel Licht, David Dale, Ethan Ye, Gabriel Mejia Gonzalez, Guillaume Wenzek, Hady Elsahar, Hirofumi Inaguma, Holger Schwenk, Hongyu Gong, Ilia Kulikov, Ivan Evtimov, Jean Maillard, Jeff Wang, John Hoffman, Juan Pino, Justin Haaheim, Justine Kao, Prangthip Hansanti, Kaushik Ram Sadagopan, Kevin Heffernan, Loïc Barrault, Maha Elbayad, Mariano Coria Meglioli, Mark Duppenthaler, Marta R. Costa-jussà, Mary Williamson, Min-Jae Hwang, Ning Dong, Francisco Guzmán, Paden Tomasello, Paul-Ambroise Duquenne, Peng-Jen Chen, Pengwei Li, Pierre Andrews, Pierre Fernandez, Robin San Roman, Ruslan Mavlyutov, Safiyyah Saleem, Skyler Wang, Somya Jain, Sravya Popuri, Tuan Tran, Yilin Yang

November 30, 2023

August 22, 2023

SPEECH & AUDIO

NLP

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Seamless Communication, Safiyyah Saleem, Abinesh Ramakrishnan, Alexandre Mourachko, Alice Rakotoarison, Amanda Kallet, Andy Chung, Ann Lee, Anna Sun, Bapi Akula, Benjamin Peloquin, Bernie Huang, Bokai Yu, Brian Ellis, Can Balioglu, Carleigh Wood, Changhan Wang, Christophe Ropers, Christopher Klaiber, Cynthia Gao, Daniel Li (FAIR), Daniel Licht, David Dale, Elahe Kalbassi, Ethan Ye, Gabriel Mejia Gonzalez, Guillaume Wenzek, Hady Elsahar, Hirofumi Inaguma, Holger Schwenk, Hongyu Gong, Igor Tufanov, Ilia Kulikov, Janice Lam, Jean Maillard, Jeff Wang (PM - AI), John Hoffman, Juan Pino, Justin Haaheim, Justine Kao, Prangthip Hasanti, Kaushik Ram Sadagopan, Kevin Heffernan, Kevin Tran, Loic Barrault, Maha Elbayad, Marta R. Costa-jussa, Mohamed Ramadan, Naji El Hachem, Ning Dong (AI), Onur Çelebi, Paco Guzmán, Paden Tomasello, Paul-Ambroise Duquenne, Peng-Jen Chen, Pengwei Li, Pierre Andrews, Ruslan Mavlyutov, Russ Howes, Skyler Wang, Somya Jain, Sravya Popuri, Tuan Tran, Vish Vogeti, Xutai Ma, Yilin Yang

August 22, 2023

December 21, 2021

Improving Zero-Shot Translation by Disentangling Positional Information

James Cross, Paco Guzmán, Xian Li, Danni Liu, Jan Niehues

December 21, 2021

November 18, 2021

An Exploratory Study on Multilingual Quality Estimation

Vishrav Chaudhary, Adi Renduchintala, Ahmed El-Kishky, Lucia Specia, Paco Guzmán, Shuo Sun, Fred Blain, Marina Fomicheva

November 18, 2021

November 16, 2020

RESEARCH

NLP

CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs

Ahmed Hassan El-Kishky, Paco Guzmán, Vishrav Chaudhary, Philipp Koehn

November 16, 2020

October 01, 2020

BERGAMOT-LATTE Submissions for the WMT20 Quality Estimation Shared Task

Vishrav Chaudhary, Lucia Specia, Paco Guzmán, Shuo Sun, Fred Blain, Lisa Yankovskaya, Marina Fomicheva, Mark Fishel

October 01, 2020

August 23, 2019

RESEARCH

SPEECH & AUDIO

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions

Vishrav Chaudhary, Juan Pino, Paco Guzmán, Philipp Koehn

August 23, 2019

August 02, 2019

RESEARCH

SPEECH & AUDIO

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings

Vishrav Chaudhary, Holger Schwenk, Paco Guzmán, Philipp Koehn, Yuqing Tang

August 02, 2019