{"id":750,"date":"2023-10-27T12:37:26","date_gmt":"2023-10-27T12:37:26","guid":{"rendered":"https:\/\/todaysainews.com\/index.php\/2023\/10\/27\/robocat-a-self-improving-robotic-agent-2\/"},"modified":"2025-04-27T07:32:35","modified_gmt":"2025-04-27T07:32:35","slug":"robocat-a-self-improving-robotic-agent-2","status":"publish","type":"post","link":"https:\/\/todaysainews.com\/index.php\/2023\/10\/27\/robocat-a-self-improving-robotic-agent-2\/","title":{"rendered":"RoboCat: A self-improving robotic agent"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<div class=\"article-cover article-cover--centered\">\n<div class=\"article-cover__header\">\n<p class=\"article-cover__eyebrow glue-label\">Research<\/p>\n<dl class=\"article-cover__meta\">\n<dt class=\"glue-visually-hidden\">Published<\/dt>\n<dd class=\"article-cover__date glue-label\">\n              <time datetime=\"2023-06-20\"><br \/>\n                20 June 2023<br \/>\n              <\/time>\n            <\/dd>\n<dt class=\"glue-visually-hidden\">Authors<\/dt>\n<dd class=\"article-cover__authors\">\n<p data-block-key=\"0ljwt\">The RoboCat team<\/p>\n<\/dd>\n<\/dl>\n<section class=\"glue-social glue-social--zippy share share--centered article-cover__share\" data-glue-expansion-panel-expand-tooltip=\"Share: Expand to see social channels\" data-glue-expansion-panel-collapse-tooltip=\"Share: Hide social channels\" id=\"share-bef9d1fe-4d8c-4a3a-ae5e-39728f98804b\">\n<\/section><\/div>\n<picture class=\"article-cover__image\"><source media=\"(min-width: 1024px)\" type=\"image\/webp\" width=\"1072\" height=\"603\" srcset=\"https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w1072-h603-n-nu-rw 1x, https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w2144-h1206-n-nu-rw 2x\"\/><source media=\"(min-width: 600px)\" type=\"image\/webp\" width=\"928\" height=\"522\" srcset=\"https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w928-h522-n-nu-rw 1x, https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w1856-h1044-n-nu-rw 2x\"\/><source type=\"image\/webp\" width=\"528\" height=\"297\" srcset=\"https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w528-h297-n-nu-rw 1x, https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w1056-h594-n-nu-rw 2x\"\/><img loading=\"lazy\" decoding=\"async\" alt=\"An image of RoboCat's robotic arm in action.\" height=\"603\" src=\"https:\/\/lh3.googleusercontent.com\/Rz9Xv4TXuTe-eO2UDUD6kDElDB5wDE2b2hEU1liUAi0AyiTwQ81mLMigXg3kueWrHoqeNctRO5-EMprZDRnXcaL8snfqHwDqgQpw_qB3VEvoO_jCCzI=w1072-h603-n-nu\" width=\"1072\"\/>\n    <\/picture>\n<\/p><\/div>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"h5nrk\"><b>New foundation agent learns to operate different robotic arms, solves tasks from as few as 100 demonstrations, and improves from self-generated data.<\/b><\/p>\n<p data-block-key=\"a7gfd\">Robots are quickly becoming part of our everyday lives, but they\u2019re often only programmed to perform specific tasks well. While harnessing recent advances in AI could lead to robots that could help in many more ways, progress in building general-purpose robots is slower in part because of the time needed to collect real-world training data.<\/p>\n<p data-block-key=\"nvc1j\"><a href=\"https:\/\/arxiv.org\/abs\/2306.11706\" rel=\"noopener\" target=\"_blank\">Our latest paper<\/a> introduces a self-improving AI agent for robotics, RoboCat, that learns to perform a variety of tasks across different arms, and then self-generates new training data to improve its technique.<\/p>\n<p data-block-key=\"vzsw2\">Previous research has explored how to develop <a href=\"https:\/\/ai.googleblog.com\/2022\/12\/rt-1-robotics-transformer-for-real.html\" rel=\"noopener\" target=\"_blank\">robots that can learn to multi-task at scale<\/a> and <a href=\"https:\/\/sites.research.google\/palm-saycan\" rel=\"noopener\" target=\"_blank\">combine the understanding of language models with the real-world capabilities<\/a> of a helper robot. RoboCat is the first agent to solve and adapt to multiple tasks and do so across different, real robots.<\/p>\n<p data-block-key=\"jmadg\">RoboCat learns much faster than other state-of-the-art models. It can pick up a new task with as few as 100 demonstrations because it draws from a large and diverse dataset. This capability will help accelerate robotics research, as it reduces the need for human-supervised training, and is an important step towards creating a general-purpose robot.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\">\n<\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<h2 data-block-key=\"g97ba\">How RoboCat improves itself<\/h2>\n<p data-block-key=\"g1j6y\">RoboCat is based on our multimodal model <a href=\"https:\/\/deepmind.google\/discover\/blog\/a-generalist-agent\/\">Gato<\/a> (Spanish for \u201ccat\u201d), which can process language, images, and actions in both simulated and physical environments. We combined Gato\u2019s architecture with a large training dataset of sequences of images and actions of various robot arms solving hundreds of different tasks.<\/p>\n<p data-block-key=\"vpkmo\">After this first round of training, we launched RoboCat into a \u201cself-improvement\u201d training cycle with a set of previously unseen tasks. The learning of each new task followed five steps:<\/p>\n<ol>\n<li data-block-key=\"nuik4\">Collect 100-1000 demonstrations of a new task or robot, using a robotic arm controlled by a human.<\/li>\n<li data-block-key=\"52pdj\">Fine-tune RoboCat on this new task\/arm, creating a specialised spin-off agent.<\/li>\n<li data-block-key=\"zij4y\">The spin-off agent practises on this new task\/arm an average of 10,000 times, generating more training data.<\/li>\n<li data-block-key=\"fwot1\">Incorporate the demonstration data and self-generated data into RoboCat\u2019s existing training dataset.<\/li>\n<li data-block-key=\"n89pj\">Train a new version of RoboCat on the new training dataset.<\/li>\n<\/ol>\n<\/div>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"utlpy\">RoboCat\u2019s training cycle, boosted by its ability to autonomously generate additional training data.<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"d60qm\">The combination of all this training means the latest RoboCat is based on a dataset of millions of trajectories, from both real and simulated robotic arms, including self-generated data. We used four different types of robots and many robotic arms to collect vision-based data representing the tasks RoboCat would be trained to perform.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"jos8g\">RoboCat learns from a diverse range of training data types and tasks: Videos of a real robotic arm picking up gears, a simulated arm stacking blocks and RoboCat using a robotic arm to pick up a cucumber.<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<h2 data-block-key=\"8ogic\">Learning to operate new robotic arms and solve more complex tasks<\/h2>\n<p data-block-key=\"r4rco\">With RoboCat\u2019s diverse training, it learned to operate different robotic arms within a few hours. While it had been trained on arms with two-pronged grippers, it was able to adapt to a more complex arm with a three-fingered gripper and twice as many controllable inputs.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"k1rl5\"><b>Left:<\/b> A new robotic arm RoboCat learned to control<br \/><b>Right:<\/b> Video of RoboCat using the arm to pick up gears<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"vllh1\">After observing 1000 human-controlled demonstrations, collected in just hours, RoboCat could direct this new arm dexterously enough to pick up gears successfully 86% of the time. With the same level of demonstrations, it could adapt to solve tasks that combined precision and understanding, such as removing the correct fruit from a bowl and solving a shape-matching puzzle, which are necessary for more complex control.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"zqaho\">Examples of tasks RoboCat can adapt to solving after 500-1000 demonstrations.<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<h2 data-block-key=\"ovro5\">The self-improving generalist<\/h2>\n<p data-block-key=\"bsgsi\">RoboCat has a virtuous cycle of training: the more new tasks it learns, the better it gets at learning additional new tasks. The initial version of RoboCat was successful just 36% of the time on previously unseen tasks, after learning from 500 demonstrations per task. But the latest RoboCat, which had trained on a greater diversity of tasks, more than doubled this success rate on the same tasks.<\/p>\n<\/div>\n<figure class=\"single-media single-media--inline\"><figcaption class=\"single-media__caption\">\n<p data-block-key=\"wvg68\">The big difference in performance between the initial RoboCat (one round of training) compared with the final version (extensive and diverse training, including self-improvement) after both versions were fine-tuned on 500 demonstrations of previously unseen tasks.<\/p>\n<\/figcaption><\/figure>\n<div class=\"gdm-rich-text rich-text\">\n<p data-block-key=\"6x5ft\">These improvements were due to RoboCat&#8217;s growing breadth of experience, similar to how people develop a more diverse range of skills as they deepen their learning in a given domain. RoboCat\u2019s ability to independently learn skills and rapidly self-improve, especially when applied to different robotic devices, will help pave the way toward a new generation of more helpful, general-purpose robotic agents.<\/p>\n<\/div>\n<aside class=\"button-group\">\n<\/aside>\n<aside class=\"related-posts\">\n<\/aside><\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/deepmind.google\/discover\/blog\/robocat-a-self-improving-robotic-agent\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] Research Published 20 June 2023 Authors The RoboCat team New foundation agent learns to operate different robotic<\/p>\n","protected":false},"author":2,"featured_media":751,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[21],"tags":[],"class_list":["post-750","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-deepmind-ai"],"_links":{"self":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/750","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/comments?post=750"}],"version-history":[{"count":1,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/750\/revisions"}],"predecessor-version":[{"id":2699,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/750\/revisions\/2699"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media\/751"}],"wp:attachment":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media?parent=750"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/categories?post=750"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/tags?post=750"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}