{"id":855,"date":"2023-11-01T14:21:56","date_gmt":"2023-11-01T14:21:56","guid":{"rendered":"https:\/\/todaysainews.com\/index.php\/2023\/11\/01\/learning-robust-real-time-cultural-transmission-without-human-data-2\/"},"modified":"2025-04-27T07:31:53","modified_gmt":"2025-04-27T07:31:53","slug":"learning-robust-real-time-cultural-transmission-without-human-data-2","status":"publish","type":"post","link":"https:\/\/todaysainews.com\/index.php\/2023\/11\/01\/learning-robust-real-time-cultural-transmission-without-human-data-2\/","title":{"rendered":"Learning Robust Real-Time Cultural Transmission without Human Data"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<div class=\"article-cover\">\n<div class=\"article-cover__header\">\n<p class=\"article-cover__eyebrow glue-label\">Research<\/p>\n<dl class=\"article-cover__meta\">\n<dt class=\"glue-visually-hidden\">Published<\/dt>\n<dd class=\"article-cover__date glue-label\">\n              <time datetime=\"2022-03-03\"><br \/>\n                3 March 2022<br \/>\n              <\/time>\n            <\/dd>\n<dt class=\"glue-visually-hidden\">Authors<\/dt>\n<dd class=\"article-cover__authors\">\n<p data-block-key=\"okx18\">Cultural General Intelligence Team<\/p>\n<\/dd>\n<\/dl>\n<section class=\"glue-social glue-social--zippy share share--left article-cover__share\" data-glue-expansion-panel-expand-tooltip=\"Share: Expand to see social channels\" data-glue-expansion-panel-collapse-tooltip=\"Share: Hide social channels\" id=\"share-9acbb526-5ae8-4c27-bee9-415806c5d56d\">\n<\/section><\/div>\n<\/p><\/div>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"gb75o\">Over millennia, humankind has discovered, evolved, and accumulated a wealth of cultural knowledge, from navigation routes to mathematics and social norms to works of art. Cultural transmission, defined as efficiently passing information from one individual to another, is the inheritance process underlying this exponential increase in human capabilities.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\">\n<\/figure>\n<figure class=\"single-media single-media--inline\">\n<\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"60er7\">Our agent, in blue, imitates and remembers the demonstration of both bots (left) and humans (right), in red.<\/p>\n<p data-block-key=\"7sfev\">For more videos of our agents in action, visit our <a href=\"https:\/\/sites.google.com\/view\/dm-cgi\" rel=\"noopener\" target=\"_blank\">website<\/a>.<\/p>\n<p data-block-key=\"qjobl\">In this work, we use deep reinforcement learning to generate artificial agents capable of test-time cultural transmission. Once trained, our agents can infer and recall navigational knowledge demonstrated by experts. This knowledge transfer happens in real time and generalises across a vast space of previously unseen tasks. For example, our agents can quickly learn new behaviours by observing a single human demonstration, without ever training on human data.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"y3bkr\">A summary of our reinforcement learning environment. The tasks are navigational representatives for a broad class of human skills, which require particular sequences of strategic decisions, such as cooking, wayfinding, and problem solving.<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"n7r3g\">We train and test our agents in procedurally generated 3D worlds, containing colourful, spherical goals embedded in a noisy terrain full of obstacles. A player must navigate the goals in the correct order, which changes randomly on every episode. Since the order is impossible to guess, a naive exploration strategy incurs a large penalty. As a source of culturally transmitted information, we provide a privileged \u201cbot\u201d that always enters goals in the correct sequence.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\">\n<\/figure>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"lf337\">Our MEDAL(-ADR) agent outperforms ablations on held-out tasks, in worlds without obstacles (top) and with obstacles (bottom).<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"khwf6\">Via ablations, we identify a minimal sufficient &#8220;starter kit&#8221; of training ingredients required for cultural transmission to emerge, dubbed MEDAL-ADR. These components include memory (M), expert dropout (ED), attentional bias towards the expert (AL), and automatic domain randomization (ADR). Our agent outperforms the ablations, including the state-of-the-art method (ME-AL), across a range of challenging held-out tasks. Cultural transmission generalises out of distribution surprisingly well, and the agent recalls demonstrations long after the expert has departed. Looking into the agent&#8217;s brain, we find strikingly interpretable neurons responsible for encoding social information and goal states.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\">\n<\/figure>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"xpekz\">Our agent generalises outside the training distribution (top) and possesses individual neurons that encode social information (bottom).<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"81pl7\">In summary, we provide a procedure for training an agent capable of flexible, high-recall, real-time cultural transmission, without using human data in the training pipeline. This paves the way for cultural evolution as an algorithm for developing more generally intelligent artificial agents.<\/p>\n<p data-block-key=\"3kzmr\">This authors&#8217; notes is based on joint work by the Cultural General Intelligence Team: Avishkar Bhoopchand, Bethanie Brownfield, Adrian Collister, Agustin Dal Lago, Ashley Edwards, Richard Everett, Alexandre Fr\u00e9chette, Edward Hughes, Kory W. Mathewson, Piermaria Mendolicchio, Yanko Oliveira, Julia Pawar, Miruna P\u00eeslar, Alex Platonov, Evan Senter, Sukhdeep Singh, Alexander Zacherl, and Lei M. Zhang.<\/p>\n<p data-block-key=\"u4yph\">Read the full paper <a href=\"https:\/\/arxiv.org\/abs\/2203.00715\" rel=\"noopener\" target=\"_blank\">here<\/a>.<\/p>\n<\/div><\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/deepmind.google\/discover\/blog\/learning-robust-real-time-cultural-transmission-without-human-data\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] Research Published 3 March 2022 Authors Cultural General Intelligence Team Over millennia, humankind has discovered, evolved, and<\/p>\n","protected":false},"author":2,"featured_media":765,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[21],"tags":[],"class_list":["post-855","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-deepmind-ai"],"featured_image_urls":{"full":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu.jpeg",1200,630,false],"thumbnail":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-150x150.jpeg",150,150,true],"medium":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-300x158.jpeg",300,158,true],"medium_large":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-768x403.jpeg",640,336,true],"large":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-1024x538.jpeg",640,336,true],"1536x1536":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu.jpeg",1200,630,false],"2048x2048":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu.jpeg",1200,630,false],"broadnews-featured":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-1024x538.jpeg",1024,538,true],"broadnews-large":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-825x575.jpeg",825,575,true],"broadnews-medium":["https:\/\/todaysainews.com\/wp-content\/uploads\/2023\/10\/E2UU4A-zCZ9b_oZyE_xQIeAZpDbjpcmw99-QVYVXs81UpJKnBzTo4O81rWapIqIfOAr39WSFMo336ekSH4_Z25BHiDamvwtQEKlSteg260fZhCwJw1200-h630-n-nu-590x410.jpeg",590,410,true]},"author_info":{"info":["Sanna"]},"category_info":"<a href=\"https:\/\/todaysainews.com\/index.php\/category\/deepmind-ai\/\" rel=\"category tag\">DeepMind AI<\/a>","tag_info":"DeepMind AI","comment_count":"0","_links":{"self":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/855","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/comments?post=855"}],"version-history":[{"count":1,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/855\/revisions"}],"predecessor-version":[{"id":2631,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/855\/revisions\/2631"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media\/765"}],"wp:attachment":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media?parent=855"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/categories?post=855"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/tags?post=855"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}