Florian Metze

RESEARCH SCIENTIST | NEW YORK CITY, UNITED STATES

Florian joined FAIAR in 2019 to work on multi-modal understanding of speech, video, and language. He has a background in speech and audio processing, including end-to-end processing of conversations, analysis of voice properties, and multi-lingual or low-resource speech recognition. He is also faculty at Carnegie Mellon's Language Technologies Institute (currently on LOA), where he continues to supervise a team of students.

Florian's Work

Florian's Publications

March 15, 2022

COMPUTER VISION

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Amanda Duarte, Shruti Palaskar, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto

March 15, 2022

January 27, 2021

NLP

Support-Set bottlenecks for video-text representation learning

Mandela Patrick, Andrea Vedaldi, Bernie Huang, Florian Metze, Yuki Asano, Alexander Hauptmann, João F. Henriques

January 27, 2021