RESEARCH

SPEECH & AUDIO

Weak-Attention Suppression For Transformer Based Speech Recognition

August 13, 2020

Abstract

Transformers, originally proposed for natural language processing (NLP) tasks, have recently achieved great success in automatic speech recognition (ASR). However, unlike text units, adjacent acoustic units (i.e., frames) are highly correlated, and long-distance dependencies between them are weak. This suggests that ASR is likely to benefit from sparse and localized attention. In this paper, we propose Weak-Attention Suppression (WAS), a method that dynamically induces sparsity in attention probabilities. We demonstrate that WAS leads to consistent Word Error Rate (WER) improvements over strong transformer baselines. On the widely used LibriSpeech benchmark, our proposed method reduces WER by 10% on test-clean and 5% on test-other for streamable transformers, resulting in a new state of the art among streaming models. Further analysis shows that WAS learns to suppress attention to non-critical and redundant continuous acoustic frames, and is more likely to suppress past frames than future ones. This indicates the importance of lookahead in attention-based ASR models.
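
The abstract does not spell out how the sparsity is induced, so the following is only an illustrative sketch of the general idea: zero out weak attention probabilities for each query and renormalize what remains. The thresholding rule used here (per-query mean minus `gamma` times the standard deviation), the function names, and the value of `gamma` are assumptions made for illustration and are not taken from this page.

```python
import numpy as np


def softmax(scores, axis=-1):
    """Numerically stable softmax over the given axis."""
    shifted = scores - scores.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def suppress_weak_attention(probs, gamma=0.5):
    """Zero out weak attention probabilities per query, then renormalize.

    ASSUMPTION: the threshold is mean - gamma * std of each query's
    attention row. This rule and the default gamma are hypothetical.
    gamma must be non-negative so that the largest probability in each
    row always survives (it is never below the row mean).
    """
    mean = probs.mean(axis=-1, keepdims=True)
    std = probs.std(axis=-1, keepdims=True)
    threshold = mean - gamma * std
    # The threshold depends on the current distribution, so the set of
    # suppressed positions changes from query to query.
    kept = np.where(probs >= threshold, probs, 0.0)
    return kept / kept.sum(axis=-1, keepdims=True)


# One query attending over five frames: two strong scores, three weak ones.
scores = np.array([[4.0, 3.5, 0.1, 0.0, -1.0]])
probs = softmax(scores)
sparse = suppress_weak_attention(probs, gamma=0.5)
print(probs.round(4))
print(sparse.round(4))
```

In this toy example the three weakest frames fall below the threshold and receive exactly zero weight, while the two strongest frames share the full probability mass. A larger `gamma` lowers the threshold and suppresses fewer positions.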

AUTHORS

Written by

Ching-Feng Yeh

Christian Fuegen

Chunyang Wu

Duc Le

Frank Zhang

Mike Seltzer

Yongqiang Wang

Publisher

Interspeech

Research Topics

Natural Language Processing (NLP)
