ML APPLICATIONS

Objective Mismatch in Model-based Reinforcement Learning

May 01, 2020

Abstract

Model-based reinforcement learning (MBRL) is a powerful framework for data-efficiently learning control of continuous tasks. Recent work in MBRL has mostly focused on using more advanced function approximators and planning schemes, with little development of the general framework. In this paper, we identify a fundamental issue of the standard MBRL framework, which we call objective mismatch. Objective mismatch arises when one objective is optimized in the hope that a second, often uncorrelated, metric will also be optimized. In the context of MBRL, we characterize the objective mismatch between training the forward dynamics model with respect to the likelihood of the one-step-ahead prediction and the overall goal of improving performance on a downstream control task. For example, this issue can emerge with the realization that dynamics models effective for a specific task do not necessarily need to be globally accurate and, vice versa, that globally accurate models might not be sufficiently accurate locally to obtain good control performance on a specific task. In our experiments, we study this objective mismatch issue and demonstrate that the likelihood of one-step-ahead predictions is not always correlated with control performance. This observation highlights a critical limitation in the MBRL framework that will require further research to be fully understood and addressed. We propose an initial method to mitigate the mismatch issue by re-weighting dynamics model training. Building on it, we conclude with a discussion about other potential directions of research for addressing this issue.
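To make the idea of re-weighted dynamics model training concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation: the abstract does not specify the weighting scheme, so the distance-based weights, the `bandwidth` parameter, the reference trajectory, and both function names below are assumptions chosen purely for illustration. The sketch computes a diagonal-Gaussian one-step negative log-likelihood and weights each transition by how close its state lies to a reference trajectory.

```python
import numpy as np


def weighted_one_step_nll(pred_mean, pred_var, next_state, weights):
    """Weighted diagonal-Gaussian NLL of one-step-ahead predictions.

    All array arguments have shape (N, state_dim) except `weights`, shape (N,).
    """
    # Per-transition NLL, summed over state dimensions.
    nll = 0.5 * np.sum(
        np.log(2.0 * np.pi * pred_var) + (next_state - pred_mean) ** 2 / pred_var,
        axis=1,
    )
    # Normalize the weights so the loss is a weighted average.
    w = weights / np.sum(weights)
    return float(np.sum(w * nll))


def distance_weights(states, reference_states, bandwidth=1.0):
    """Hypothetical weighting: down-weight states far from a reference trajectory."""
    # Pairwise Euclidean distances, shape (N, M), then nearest reference point.
    diffs = states[:, None, :] - reference_states[None, :, :]
    nearest = np.linalg.norm(diffs, axis=2).min(axis=1)
    # Weight 1 on the reference trajectory, decaying exponentially away from it.
    return np.exp(-nearest / bandwidth)
```

With uniform weights the loss reduces to the ordinary mean one-step NLL, so the weighting only changes which transitions the model is asked to fit most accurately. Whether such a weighting improves control performance is an empirical question that this sketch does not answer.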

AUTHORS

Written by

Roberto Calandra

Brandon Amos

Nathan Owen Lambert

Omry Yadan

Publisher

Learning for Dynamics & Control (L4DC)

Related Publications

April 14, 2026

COMPUTER VISION

ML APPLICATIONS

TransText: Transparency Aware Image-to-Video Typography Animation

Zijian Zhou, Bohao Tang, Pengfei Liu, Fei Zhang, Frost Xu, Hang Li (BizAI), Semih Gunel, Sen He, Soubhik Sanyal, Tao Xiang, Viktar Atliha, Zhe Wang

August 12, 2025

RESEARCH

NLP

Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions

GenAI and Infra Teams

August 05, 2025

RESEARCH

CORE MACHINE LEARNING

FastCSP: Accelerated Molecular Crystal Structure Prediction with Universal Model for Atoms

Yi Yang, Xiang Fu, Matt Uyttendaele, Andrew J. Ouderkirk, Noa Marom, Xingyu Liu, Ammar Rizvi, Anuroop Sriram, Arman Boromand, Brandon M. Wood, Chiara Daraio, Daniel S. Levine, Keian Noori, Kyle Michel, Lafe J. Purvis, C. Lawrence Zitnick, Luis Barroso-Luque, Misko Dzamba, Muhammed Shuaibi, Meng Gao, Tingling Rao, Vahe Gharakhanyan, Viachaslau Bernat, Zachary W. Ulissi

August 04, 2025

RESEARCH

ML APPLICATIONS

The Open DAC 2025 Dataset for Sorbent Discovery in Direct Air Capture

Logan M. Brabson, Xiaohan Yu, Sihoon Choi, Kareem Abdelmaqsoud, Elias Moubarak, Pim de Haan, Sindy Löwe, Johann Brehmer, John R. Kitchin, Max Welling, Andrew J. Medford, David S. Sholl, Anuroop Sriram, C. Lawrence Zitnick, Zachary Ulissi
