CONVERSATIONAL AI

NLP

Llama 2: Open Foundation and Fine-Tuned Chat Models

July 18, 2023

Abstract

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs.

AUTHORS

Written by

Moya Chen

Marie-Anne Lachaux

Thibaut Lavril

Hugo Touvron

Adina Williams

Alan Schelten

Amjad Almahairi

Andrew Kuan

Andrew Poulton

Angela Fan

Anthony Hartshorn

Artem Korenev

Aurelien Rodriguez

Binh Tang

Brian Fuller

Cristian Canton Ferrer

Cynthia Gao

Dan Bikel

David Esiobu

Diana Liskovich

Xiaoqing Ellen Tan

Eric Michael Smith

Guillem Cucurull

Hakan Inan

Igor Molybog

Iliyan Zarov

Isabel Kloumann

Puxin Xu

Jenya Lee

Jeremy Fu

Jeremy Reizenstein

Jude Fernandes

Kalyan Saladi

Kevin Stone

Louis Martin

Lukas Blecher

Madian Khabsa

Marcin Kardas

Melanie Kambadur

Naman Goyal

Nikolay Bashlykov

Peter Albert

Praj Bhargava

Punit Singh Koura

Pushkar Mishra

Ranjan Subramanian

Rashi Rungta

Robert Stojnic

Ross Taylor

Ruan Silva

Rui Hou

Saghar Hosseini

Sergey Edunov

Sharan Narang

Shruti Bhosale

Soumya Batra

Thomas Scialom

Todor Mihaylov

Vedanuj Goswami

Viktor Kerkez

Wenyin Fu

Xavier Martinet

Yasmine Babaei

Yinghai Lu

Yixin Nie

Yuchen Zhang

Yuning Mao

Zheng Yan

Publisher

arXiv
