Search results for: human-feedback
WikiPrefs: human preferences dataset built from text edits
Open Research Data
The WikiPrefs dataset is a human preferences dataset for aligning large language models. It was built using the EditPrefs method from historical edits of Wikipedia featured articles.