Products

AI Research

Resources

About

Overview
Open Source
Careers

Florian Metze

RESEARCH SCIENTIST | NEW YORK CITY, UNITED STATES

Florian joined FAIAR in 2019 to work on multi-modal understanding of speech, video, and language. He has a background in speech and audio processing, including end-to-end processing of conversations, analysis of voice properties, and multi-lingual or low-resource speech recognition. He is also faculty at Carnegie Mellon's Language Technologies Institute (currently on LOA), where he continues to supervise a team of students.

Research Areas

Computer Vision

Natural Language Processing (NLP)

Florian's Work

Speech Recognition Virtual Kitchen

End-to-End Speech Recognition

The How2 Challenge

Florian's Publications

March 15, 2022

COMPUTER VISION

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Amanda Duarte, Shruti Palaskar, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto

March 15, 2022

January 27, 2021

NLP

Support-Set bottlenecks for video-text representation learning

Mandela Patrick, Andrea Vedaldi, Bernie Huang, Florian Metze, Yuki Asano, Alexander Hauptmann, João F. Henriques

January 27, 2021

About AI at Meta

Media Generation

Foundational models

Our approach

Our approach About AI at Meta People Careers

Research

Research Infrastructure Resources Demos

Meta AI

Meta AI Assistant Media Generation Vibes AI Studio

Latest news

Latest news Blog Newsletter

Foundational models

Meta © 2026