Francisco (Paco) Guzmán

RESEARCH SCIENTIST | MENLO PARK, UNITED STATES

Paco Guzmán is a Research Scientist working on Translations. His research has been focused on several aspects of Machine Translation including low-resource translation, translation mining, evaluation, and quality estimation.

Over the years, Paco has been a speaker and panelist on several events dedicated to increasing diversity in AI. He co-founded the Facebook-Georgia Tech co-teaching program.

Before joining Facebook in 2016, Paco was a Research Scientist at Qatar Computing Research Institute in Qatar in 2012-2016. He obtained his PhD in 2011 from ITESM (Monterrey Tech) in Mexico. Paco visited Carnegie Mellon University in 2008-2009 where he worked at the Language Technologies Institute.

Francisco's Work

Francisco's Publications

November 30, 2023

SPEECH & AUDIO

NLP

Seamless: Multilingual Expressive and Streaming Speech Translation

Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alexandre Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson

November 30, 2023

August 22, 2023

SPEECH & AUDIO

NLP

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Seamless Communication, Loic Barrault, Andy Chung, David Dale, Ning Dong (AI), Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Peng-Jen Chen, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Abinesh Ramakrishnan, Alexandre Mourachko, Amanda Kallet, Ann Lee, Anna Sun, Bapi Akula, Benjamin Peloquin, Bernie Huang, Bokai Yu, Brian Ellis, Can Balioglu, Carleigh Wood, Changhan Wang, Christophe Ropers, Cynthia Gao, Daniel Li (FAIR), Elahe Kalbassi, Ethan Ye, Gabriel Mejia Gonzalez, Hirofumi Inaguma, Holger Schwenk, Igor Tufanov, Ilia Kulikov, Janice Lam, Jeff Wang (PM - AI), Juan Pino, Justin Haaheim, Justine Kao, Prangthip Hasanti, Kevin Tran, Maha Elbayad, Marta R. Costa-jussa, Mohamed Ramadan, Naji El Hachem, Onur Çelebi, Paco Guzmán, Paden Tomasello, Pengwei Li, Pierre Andrews, Ruslan Mavlyutov, Russ Howes, Safiyyah Saleem, Skyler Wang, Somya Jain, Sravya Popuri, Tuan Tran, Vish Vogeti, Xutai Ma, Yilin Yang

August 22, 2023

December 21, 2021

Improving Zero-Shot Translation by Disentangling Positional Information

James Cross, Paco Guzmán, Xian Li, Danni Liu, Jan Niehues

December 21, 2021

November 18, 2021

An Exploratory Study on Multilingual Quality Estimation

Vishrav Chaudhary, Adi Renduchintala, Ahmed El-Kishky, Lucia Specia, Paco Guzmán, Shuo Sun, Fred Blain, Marina Fomicheva

November 18, 2021

November 16, 2020

RESEARCH

NLP

CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs

Ahmed Hassan El-Kishky, Paco Guzmán, Vishrav Chaudhary, Philipp Koehn

November 16, 2020

October 01, 2020

BERGAMOT-LATTE Submissions for the WMT20 Quality Estimation Shared Task

Vishrav Chaudhary, Lucia Specia, Paco Guzmán, Shuo Sun, Fred Blain, Lisa Yankovskaya, Marina Fomicheva, Mark Fishel

October 01, 2020

August 23, 2019

RESEARCH

SPEECH & AUDIO

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions

Vishrav Chaudhary, Juan Pino, Paco Guzmán, Philipp Koehn

August 23, 2019

August 02, 2019

RESEARCH

SPEECH & AUDIO

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings

Vishrav Chaudhary, Holger Schwenk, Paco Guzmán, Philipp Koehn, Yuqing Tang

August 02, 2019