Jun 17, 2024 · This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models.
Nov 12, 2024 · We introduce a novel, sociotechnical approach to red teaming that leverages the control of procedural guidance and the accuracy of human ...
Oct 24, 2024 · This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language ...
Jun 18, 2024 · New paper out! Very excited that we're able to share STAR: SocioTechnical Approach to Red Teaming Language Models.