July 06, 2020
We study the Cross-Entropy Method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable variant that enables us to differentiate the output of CEM with respect to the objective function's parameters. In the machine learning setting this brings CEM inside of the end-to-end learning pipeline where this has otherwise been impossible. We show applications in a synthetic energy-based structured prediction task and in non-convex continuous control. In the control setting we show how to embed optimal action sequences into a lower-dimensional space. This enables us to use policy optimization to fine-tune modeling components by differentiating through the CEM-based controller.
April 14, 2026
Zijian Zhou, Bohao Tang, Pengfei Liu, Fei Zhang, Frost Xu, Hang Li (BizAI), Semih Gunel, Sen He, Soubhik Sanyal, Tao Xiang, Viktar Atliha, Zhe Wang
April 14, 2026
August 12, 2025
GenAI and Infra Teams
August 12, 2025
August 05, 2025
Yi Yang, Xiang Fu, Matt Uyttendaele, Andrew J. Ouderkirk, Noa Marom, Xingyu Liu, Ammar Rizvi, Anuroop Sriram, Arman Boromand, Brandon M. Wood, Chiara Daraio, Daniel S. Levine, Keian Noori, Kyle Michel, Lafe J. Purvis, C. Lawrence Zitnick, Luis Barroso-Luque, Misko Dzamba, Muhammed Shuaibi, Meng Gao, Tingling Rao, Vahe Gharakhanyan, Viachaslau Bernat, Zachary W. Ulissi
August 05, 2025
August 04, 2025
Logan M. Brabson, Xiaohan Yu, Sihoon Choi, Kareem Abdelmaqsoud, Elias Moubarak, Pim de Haan, Sindy Löwe, Johann Brehmer, John R. Kitchin, Max Welling, Andrew J. Medford, David S. Sholl, Anuroop Sriram, C. Lawrence Zitnick, Zachary Ulissi
August 04, 2025

Our approach
Latest news
Foundational models