Skip to content

Commit

Permalink
Merge branch 'preference_docs' of github.com:SalmanMohammadi/torchtun…
Browse files Browse the repository at this point in the history
…e into preference_docs
  • Loading branch information
SalmanMohammadi committed Sep 21, 2024
2 parents 5ea2cfd + 853fd9f commit 02ca414
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/source/basics/preference_datasets.rst
Original file line number Diff line number Diff line change
Expand Up @@ -100,7 +100,7 @@ Preference dataset format
-------------------------
Preference datasets are expected to have two columns: *"chosen"*, which indicates the human annotator's preferred response, and *"rejected"*, indicating
the human annotator's dis-preferred response. Each of these columns should contain a list of messages with an identical prompt, followed by a list of messages.
the human annotator's dis-preferred response. Each of these columns should contain a list of messages with an identical prompt.
The list of messages could include a system prompt, an instruction, multiple turns between user and assistant, or tool calls/returns. Let's take a look at
Anthropic's helpfulness/harmlessness dataset `on Hugging Face <https://huggingface.co/datasets/RLHFlow/HH-RLHF-Helpful-standard>`_ as an example of a multi-turn
chat-style format:
Expand Down

0 comments on commit 02ca414

Please sign in to comment.