paper · arXiv

Constitutional methods for alignment

Training models to critique and revise their own outputs against principles.

Want the primary source?View original →