Better Alignment with Instruction Back-and-Forth Translation
Authors:
Thao Nguyen
,
Jeffrey Li
,
Sewoong Oh
,
Ludwig Schmidt
,
Jason Weston
,
Luke Zettlemoyer
,
Xian Li
Abstract
We propose a new method, instruction back-and-forth translation, to construct high-quality synthetic data grounded in world knowledge for aligning large language models (LLMs). Given documents from a web corpus, we generate and curate synthetic instructions using the backtranslation approach proposed by Li et al.(2023a), and rewrite the responses to improve their quality further based on the initial documents. https://arxiv.org/abs/2408.04614
Authors:
Thao Nguyen
,
Jeffrey Li
,
Sewoong Oh
,
Ludwig Schmidt
,
Jason Weston
,
Luke Zettlemoyer
,
Xian Li
Abstract
We propose a new method, instruction back-and-forth translation, to construct high-quality synthetic data grounded in world knowledge for aligning large language models (LLMs). Given documents from a web corpus, we generate and curate synthetic instructions using the backtranslation approach proposed by Li et al.(2023a), and rewrite the responses to improve their quality further based on the initial documents. https://arxiv.org/abs/2408.04614