{"id":430,"date":"2023-03-14T22:36:22","date_gmt":"2023-03-14T22:36:22","guid":{"rendered":"https:\/\/todaysainews.com\/index.php\/2023\/03\/14\/introducing-whisper-2\/"},"modified":"2025-04-27T07:33:58","modified_gmt":"2025-04-27T07:33:58","slug":"introducing-whisper-2","status":"publish","type":"post","link":"https:\/\/todaysainews.com\/index.php\/2023\/03\/14\/introducing-whisper-2\/","title":{"rendered":"Introducing Whisper"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<div>\n<p>Other existing approaches frequently use smaller, more closely paired audio-text training datasets,<span class=\"ui-fn\"><sup class=\"inline-block min-w-[1.5ch] indent-0 not-italic [em_&amp;]:indent-2\"><span class=\"error\">[^reference-1]<\/span><\/sup><!----><\/span> <span class=\"ui-fn\"><sup class=\"inline-block min-w-[1.5ch] indent-0 not-italic [em_&amp;]:indent-2\"><span class=\"error\">[^reference-2]<\/span><\/sup><!----><\/span><span class=\"ui-fn\"><sup class=\"inline-block min-w-[1.5ch] indent-0 not-italic [em_&amp;]:indent-2\"><span class=\"error\">[^reference-3]<\/span><\/sup><!----><\/span> or use broad but unsupervised audio pretraining.<span class=\"ui-fn\"><sup class=\"inline-block min-w-[1.5ch] indent-0 not-italic [em_&amp;]:indent-2\"><span class=\"error\">[^reference-4]<\/span><\/sup><!----><\/span><span class=\"ui-fn\"><sup class=\"inline-block min-w-[1.5ch] indent-0 not-italic [em_&amp;]:indent-2\"><span class=\"error\">[^reference-5]<\/span><\/sup><!----><\/span><span class=\"ui-fn\"><sup class=\"inline-block min-w-[1.5ch] indent-0 not-italic [em_&amp;]:indent-2\"><span class=\"error\">[^reference-6]<\/span><\/sup><!----><\/span>\u00a0Because Whisper was trained on a large and diverse dataset and was not fine-tuned to any specific one, it does not beat models that specialize in LibriSpeech performance, a famously competitive benchmark in speech recognition. However, when we measure Whisper\u2019s zero-shot performance across many diverse datasets we find it is much more robust and makes 50% fewer errors than those\u00a0models.<\/p>\n<p>About a third of Whisper\u2019s audio dataset is non-English, and it is alternately given the task of transcribing in the original language or translating to English. We find this approach is particularly effective at learning speech to text translation and outperforms the supervised SOTA on CoVoST2 to English translation\u00a0zero-shot.<br class=\"softbreak\"\/><\/p>\n<\/div>\n<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/openai.com\/research\/whisper\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] Other existing approaches frequently use smaller, more closely paired audio-text training datasets,[^reference-1] [^reference-2][^reference-3] or use broad but<\/p>\n","protected":false},"author":2,"featured_media":431,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[19],"tags":[],"class_list":["post-430","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-openai"],"_links":{"self":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/430","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/comments?post=430"}],"version-history":[{"count":1,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/430\/revisions"}],"predecessor-version":[{"id":2857,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/posts\/430\/revisions\/2857"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media\/431"}],"wp:attachment":[{"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/media?parent=430"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/categories?post=430"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/todaysainews.com\/index.php\/wp-json\/wp\/v2\/tags?post=430"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}