{"id":402,"date":"2023-03-09T10:44:12","date_gmt":"2023-03-09T10:44:12","guid":{"rendered":"https:\/\/todaysainews.com\/index.php\/2023\/03\/09\/our-approach-to-alignment-research-2\/"},"modified":"2025-04-27T07:34:08","modified_gmt":"2025-04-27T07:34:08","slug":"our-approach-to-alignment-research-2","status":"publish","type":"post","link":"https:\/\/todaysainews.com\/index.php\/2023\/03\/09\/our-approach-to-alignment-research-2\/","title":{"rendered":"Our approach to alignment research"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<div>\n<p>There is currently no known indefinitely scalable solution to the alignment problem. As AI progress continues, we expect to encounter a number of new alignment problems that we don\u2019t observe yet in current systems. Some of these problems we anticipate now and some of them will be entirely\u00a0new.<\/p>\n<p>We believe that finding an indefinitely scalable solution is likely very difficult. Instead, we aim for a more pragmatic approach: building and aligning a system that can make faster and better alignment research progress than humans\u00a0can.<\/p>\n<p>As we make progress on this, our AI systems can take over more and more of our alignment work and ultimately conceive, implement, study, and develop better alignment techniques than we have now. They will work together with humans to ensure that their own successors are more aligned with\u00a0humans.<\/p>\n<p>We believe that evaluating alignment research is substantially easier than producing it, especially when provided with evaluation assistance. Therefore human researchers will focus more and more of their effort on reviewing alignment research done by AI systems instead of generating this research by themselves. Our goal is to train models to be so aligned that we can off-load almost all of the cognitive labor required for alignment research.<\/p>\n<p>Importantly, we only need \u201cnarrower\u201d AI systems that have human-level capabilities in the relevant domains to do as well as humans on alignment research. We expect these AI systems are easier to align than general-purpose systems or systems much smarter than\u00a0humans.<\/p>\n<p>Language models are particularly well-suited for automating alignment research because they come \u201cpreloaded\u201d with a lot of knowledge and information about human values from reading the internet. Out of the box, they aren\u2019t independent agents and thus don\u2019t pursue their own goals in the world. To do alignment research they don\u2019t need unrestricted access to the internet. Yet a lot of alignment research tasks can be phrased as natural language or coding\u00a0tasks.<\/p>\n<p>Future versions of\u00a0<a href=\"https:\/\/openai.com\/blog\/webgpt\/\" rel=\"noopener noreferrer\" target=\"_blank\">WebGPT<\/a>,\u00a0<a href=\"https:\/\/openai.com\/blog\/instruction-following\/\" rel=\"noopener noreferrer\" target=\"_blank\">InstructGPT<\/a>, and\u00a0<a href=\"https:\/\/openai.com\/blog\/openai-codex\/\" rel=\"noopener noreferrer\" target=\"_blank\">Codex<\/a>\u00a0can provide a foundation as alignment research assistants, but they aren\u2019t sufficiently capable yet. While we don\u2019t know when our models will be capable enough to meaningfully contribute to alignment research, we think it\u2019s important to get started ahead of time. Once we train a model that could be useful, we plan to make it accessible to the external alignment research\u00a0community.<br class=\"softbreak\"\/><\/p>\n<\/div>\n<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/openai.com\/blog\/our-approach-to-alignment-research\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] There is currently no known indefinitely scalable solution to the alignment problem. As AI progress continues, we<\/p>\n","protected":false},"author":2,"featured_media":403,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[19],"tags":[],"class_list":["post-402","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-openai"],"_links":{"self":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/402","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/comments?post=402"}],"version-history":[{"count":1,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/402\/revisions"}],"predecessor-version":[{"id":2871,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/402\/revisions\/2871"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media\/403"}],"wp:attachment":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media?parent=402"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/categories?post=402"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/tags?post=402"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}