We educated “critique-writing” fashions to explain flaws in summaries. Human evaluators discover flaws in summaries far more typically when proven our mannequin’s critiques. Bigger fashions are higher at self-critiquing, with scale bettering critique-writing greater than summary-writing. This reveals promise for utilizing AI methods to help human supervision of AI methods on troublesome duties.
Supply hyperlink