Mostafa Elhoushi

RESEARCH ENGINEER | TORONTO, CANADA

I am a research engineer in FAIR Systems and Machine Learning (SysML) group. I focus on reserach that makes deep learning inference or training faster. Prior to joining FAIR, I was in the compilers team of Meta's Training and Inference Accelerator (MTIA) chip. I graduated from Ain Shams University, Egypt with my Bachelors and Masters, and obtained my PhD from Queen's University, Canada.

Mostafa's Work

Mostafa's Publications

June 14, 2024

NLP

SYSTEMS RESEARCH

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, Basil Hosmer, Bram Wasti, Liangzhen Lai, Bilge Acun, Ahmed Aly, Beidi Chen, Carole-Jean Wu, Ahmed Roman, Nas Mahmoud, Saurabh Agarwal

June 14, 2024

June 03, 2024

SYSTEMS RESEARCH

CHAI: Clustered Head Attention for Efficient LLM Inference

Saurabh Agarwal, Bilge Acun, Basil Hosmer, Mostafa Elhoushi, Yejin Lee, Carole-Jean Wu

June 03, 2024

April 03, 2024

COMPUTER VISION

Sieve: Multimodal Dataset Pruning Using Image Captioning Models

Anas Mahmoud, Mostafa Elhoushi, Amro Abbas, Yu Yang, Newsha Ardalani, Hugh Leather, Ari Morcos

April 03, 2024

July 26, 2023

SYSTEMS RESEARCH

Learning Compiler Pass Orders using Coreset and Normalized Value Prediction

Youwei Liang, Kevin Stone, Chris Cummins, Mostafa Elhoushi, Jiadong Guo, Pengtao Xie, Hugh Leather, Yuandong Tian

July 26, 2023

June 19, 2023

SYSTEMS RESEARCH

MODeL: Memory Optimizations for Deep Learning

Benoit Steiner, Mostafa Elhoushi, Jacob Kahn, James Hegarty

June 19, 2023