{"id":438,"date":"2023-03-15T03:44:26","date_gmt":"2023-03-15T03:44:26","guid":{"rendered":"https:\/\/todaysainews.com\/index.php\/2023\/03\/15\/learning-to-play-minecraft-with-video-pretraining\/"},"modified":"2025-04-27T07:33:58","modified_gmt":"2025-04-27T07:33:58","slug":"learning-to-play-minecraft-with-video-pretraining","status":"publish","type":"post","link":"https:\/\/todaysainews.com\/index.php\/2023\/03\/15\/learning-to-play-minecraft-with-video-pretraining\/","title":{"rendered":"Learning to play Minecraft with Video PreTraining"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<div>\n<p>The internet contains an enormous amount of publicly available videos that we can learn from. You can watch a person make a gorgeous presentation, a digital artist draw a beautiful sunset, and a Minecraft player build an intricate house. However, these videos only provide a record of\u00a0<em>what<\/em>\u00a0happened but not precisely\u00a0<em>how<\/em>\u00a0it was achieved, i.e., you will not know the exact sequence of mouse movements and keys pressed. If we would like to build large-scale\u00a0<a href=\"https:\/\/arxiv.org\/abs\/2108.07258\" rel=\"noopener noreferrer\" target=\"_blank\">foundation models<\/a>\u00a0in these domains as we\u2019ve done in language with\u00a0<a href=\"https:\/\/proceedings.neurips.cc\/paper\/2020\/hash\/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html\" rel=\"noopener noreferrer\" target=\"_blank\">GPT<\/a>, this lack of action labels poses a new challenge not present in the language domain, where \u201caction labels\u201d are simply the next words in a\u00a0sentence.<\/p>\n<p>In order to utilize the wealth of unlabeled video data available on the internet, we introduce a novel, yet simple, semi-supervised imitation learning method: Video PreTraining (VPT). We start by gathering a small dataset from contractors where we record not only their video, but also the actions they took, which in our case are keypresses and mouse movements. With this data we train an inverse dynamics model (IDM), which predicts the action being taken at each step in the video. Importantly, the IDM can use past\u00a0<em>and future<\/em>\u00a0information to guess the action at each step. This task is much easier and thus requires far less data than the behavioral cloning task of predicting actions given\u00a0<em>past video frames only<\/em>, which requires inferring what the person wants to do and how to accomplish it. We can then use the trained IDM to label a much larger dataset of online videos and learn to act via behavioral\u00a0cloning.<br class=\"softbreak\"\/><\/p>\n<\/div>\n<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/openai.com\/research\/vpt\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] The internet contains an enormous amount of publicly available videos that we can learn from. You can<\/p>\n","protected":false},"author":2,"featured_media":439,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[19],"tags":[],"class_list":["post-438","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-openai"],"_links":{"self":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/438","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/comments?post=438"}],"version-history":[{"count":1,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/438\/revisions"}],"predecessor-version":[{"id":2853,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/438\/revisions\/2853"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media\/439"}],"wp:attachment":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media?parent=438"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/categories?post=438"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/tags?post=438"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}