<!DOCTYPE html><html lang="en" xmlns="http://www.w3.org/1999/xhtml" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" style="font-size:16px;"><head><meta charset="utf-8"/><!--[if !mso]><!--><meta http-equiv="X-UA-Compatible" content="IE=edge"/><!--<![endif]--><meta name="viewport" content="width=device-width,initial-scale=1"/><meta name="x-apple-disable-message-reformatting"/><meta name="format-detection" content="telephone=no,address=no,email=no,date=no,url=no"/><meta name="color-scheme" content="light"/><meta name="supported-color-schemes" content="light"/><title>Training Agents Inside of Scalable World Models</title><!--[if mso]><xml><o:OfficeDocumentSettings><o:AllowPNG/><o:PixelsPerInch>96</o:PixelsPerInch></o:OfficeDocumentSettings></xml><![endif]--><style>
:root { color-scheme: light; supported-color-schemes: light; }
body { margin: 0; padding: 0; min-width: 100%!important; -ms-text-size-adjust: 100% !important; -webkit-transform: scale(1) !important; -webkit-text-size-adjust: 100% !important; -webkit-font-smoothing: antialiased !important; }
.body { word-wrap: normal; word-spacing:normal; }
table.mso { width: 100%; border-collapse: collapse; padding: 0; table-layout: fixed; }
img { border: 0; outline: none; }
table { mso-table-lspace: 0px; mso-table-rspace: 0px; }
td, a, span { mso-line-height-rule: exactly; }
#root [x-apple-data-detectors=true],
a[x-apple-data-detectors=true],
#MessageViewBody a { color: inherit !important; text-decoration: inherit !important; font-size: inherit !important; font-family: inherit !important; font-weight: inherit !important; line-height: inherit !important; }
span.MsoHyperlink { color: inherit !important; mso-style-priority: 99 !important; }
span.MsoHyperlinkFollowed { color: inherit !important; mso-style-priority: 99 !important; }
.a { background-color:#dedede; }
.b { background-color:#2a2a2a; }
.c { background-color:#ffffff; }
.d { background-color:#fff0c8; }
.d2 { background-color:#FFFFFF; }
.d3 { background-color:#FFFFFF; }
h1 a { text-decoration:none;color:#2C81E5;font-style:italic; }
h2 a { text-decoration:none;color:#2C81E5;font-style:italic; }
h3 a { text-decoration:none;color:#2C81E5;font-style:italic; }
h4 a { text-decoration:none;color:#2C81E5;font-style:italic; }
h5 a { text-decoration:none;color:#2C81E5;font-style:italic; }
h6 a { text-decoration:none;color:#2C81E5;font-style:italic; }
h1, h1 a, h2, h2 a, h3, h3 a, h4, h4 a, h5, h5 a, h6, h6 a, ul, li, ol, p, p a { margin: 0;padding: 0; }
h1 { font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif;font-weight:700;font-size:28px;color:#2A2A2A;line-height:42px;padding-bottom:4px;padding-top:16px;mso-margin-top-alt:16px;mso-margin-bottom-alt:4px }
h2 { font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif;font-weight:700;font-size:24px;color:#2A2A2A;line-height:36px;padding-bottom:4px;padding-top:16px;mso-margin-top-alt:16px;mso-margin-bottom-alt:4px }
h3 { font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif;font-weight:400;font-size:20px;color:#2A2A2A;line-height:30px;padding-bottom:4px;padding-top:16px;mso-margin-top-alt:16px;mso-margin-bottom-alt:4px }
h4 { font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif;font-weight:400;font-size:18px;color:#2A2A2A;line-height:27px;padding-bottom:4px;padding-top:16px;mso-margin-top-alt:16px;mso-margin-bottom-alt:4px }
h5 { font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif;font-weight:400;font-size:16px;color:#2A2A2A;line-height:24px;padding-bottom:4px;padding-top:16px;mso-margin-top-alt:16px;mso-margin-bottom-alt:4px }
h6 { font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif;font-weight:400;font-size:14px;color:#2A2A2A;line-height:21px;padding-bottom:4px;padding-top:16px;mso-margin-top-alt:16px;mso-margin-bottom-alt:4px }
p { font-family:'Georgia','Times New Roman',serif;font-weight:400;color:#2D2D2D;font-size:16px;line-height:24px;padding-bottom:8px;padding-top:8px;mso-margin-top-alt:8px;mso-margin-bottom-alt:8px; }
p a, .e a, ul a, li a, .h a, .h2 a, .h3 a { word-break:break-word;color:#2C81E5 !important;text-decoration:none;font-style:italic; }
p a span, .e a span, ul a span, li a span { color: inherit }
p .bold { font-weight:bold;color:#2D2D2D; }
p span[style*="font-size"] { line-height: 1.6; }
.f p { font-size:12px;line-height:15px;color:#2D2D2D;padding:0; }
.f p a { color:#2D2D2D !important; }
.g p { font-family:'Helvetica',Arial,sans-serif;font-size:14px;line-height:20px;font-weight:normal;margin:0; }
.g p a { text-decoration: underline; }
.i p { font-family:'Helvetica',Arial,sans-serif;line-height:23px;font-size:15px;color:#2D2D2D; }
.i p a { color:#2D2D2D !important; }
.i2 p { font-family:'Helvetica',Arial,sans-serif;line-height:23px;font-size:15px;color:#2D2D2D; }
.i2 p a { color:#2D2D2D !important; }
.i3 p { font-family:'Helvetica',Arial,sans-serif;line-height:43px;font-size:24px;color:#2D2D2D; }
.i3 p a { color:#2D2D2D !important; }
.h p a { color:#595959 !important; }
.h2 p a { color:#595959 !important; }
.h3 p a { color:#595959 !important; }
.f p a, .i p a, .i2 p a, .i3 p a, .h p a, .h2 p a, .h3 p a { text-decoration:underline; }
.j { border-top:3px solid #ffeb2d; }
.k p { padding-left:15px;padding-bottom:0px;padding-top:6px;mso-margin-top-alt:6px;mso-margin-bottom-alt:0px;mso-margin-left-alt:15px; }
.o { background-color:#FFFFFF;border:1px solid #F1F1F1;border-radius:5px; }
.o p { font-family:'Helvetica',Arial,sans-serif;padding:0px;margin:0px; }
.l p,
.l p a, .l a { font-size:14px;line-height:20px;font-weight: bold;color:#2D2D2D;padding-bottom:6px;mso-margin-bottom-alt:6px;text-decoration:none; }
.m p,
.m p a { font-size:13px;line-height:18px;font-weight:400;color:#2D2D2D;padding-bottom:6px;mso-margin-bottom-alt:6px;text-decoration:none; }
.n p,
.n p a { font-size:12px;line-height:17px;font-weight:400;color:#2D2D2D;padding-bottom:6px;mso-margin-bottom-alt:6px;text-decoration:none; }
.p { background-color:#FFFFFF;max-width:520px;border:1px solid #E1E8ED;border:1px solid rgba(80, 80, 80, 0.3);border-radius:5px; }
.q { font-size:16px;font-family:Helvetica,Roboto,Calibri,sans-serif !important;border:1px solid #e1e8ed;border:1px solid rgba(80, 80, 80, 0.3);border-radius:10px;background-color:#FFFFFF; }
.q p { font-size:16px;font-family:system-ui,Helvetica,Roboto,Calibri,sans-serif !important;color:#222222;padding:4px 0; }
.r { border:1px solid #E1E8ED !important;border-radius:5px; }
.s p { font-size: 14px; line-height: 17px; font-weight: 400; color: #697882; text-decoration: none; }
.t p { font-family:'Helvetica',Arial,sans-serif;font-size:12px;line-height:18px;font-weight:400;color:#000000;font-style:italic;padding:4px 0px 0px; }
.v { border-radius:10px;border:solid 0px #DFD150;background-color:#2C81E5;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans Unicode',sans-serif;color:#FFFFFF; }
.v a { text-decoration:none;display:block;color:#FFFFFF; }
.w p { font-size:12px;line-height:15px;font-weight:400;color:#FFFFFF; }
.w p a { text-decoration: underline !important;color:#FFFFFF !important; }
ul { font-family:'Helvetica',Arial,sans-serif;margin:0px 0px 0px 25px !important;padding:0px !important;color:#2D2D2D;line-height:24px;list-style:disc;font-size:16px; }
ul > li { font-family:'Helvetica',Arial,sans-serif;margin:10px 0px 0px 0px !important;padding: 0px 0px 0px 0px !important; color: #2D2D2D; list-style:disc; }
ol { font-family:'Helvetica',Arial,sans-serif;margin: 0px 0px 0px 25px !important;padding:0px !important;color:#2D2D2D;line-height:24px;list-style:decimal;font-size:16px; }
ol > li { font-family:'Helvetica',Arial,sans-serif;margin:10px 0px 0px 0px !important;padding: 0px 0px 0px 0px !important; color: #2D2D2D; }
.e h3,
.e p,
.e span { padding-bottom:0px;padding-top:0px;mso-margin-top-alt:0px;mso-margin-bottom-alt:0px; }
.e span,
.e li { font-family:'Helvetica',Arial,sans-serif;font-size:16px;color:#2D2D2D;line-height:24px; }
.rec { font-family: ui-sans-serif, system-ui, -apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, "Helvetica Neue", Arial, "Noto Sans", sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol", "Noto Color Emoji" !important; }
.rec__button:hover { background-color: #f9fafb !important; }
.copyright a {color: inherit !important; text-decoration: none !important; font-size: inherit !important; font-family: inherit !important; font-weight: inherit !important; line-height: inherit !important;}
.txt_social p { padding: 0; word-break: break-all; }
.table, .table-c, .table-h { border: 1px solid #C0C0C0; }
.table-c { padding:5px; background-color:#FFFFFF; }
.table-c p { color: #2D2D2D; font-family:'Helvetica',Arial,sans-serif !important;overflow-wrap: break-word; }
.table-h { padding:5px; background-color:#F1F1F1; }
.table-h p { color: #2A2A2A; font-family:'Trebuchet MS','Lucida Grande',Tahoma,sans-serif !important;overflow-wrap: break-word; }
@media only screen and (max-width:667px) {
.aa, .w100pc { width: 100% !important; }
.bb img { width: 100% !important; height: auto !important; max-width: none !important; }
.cc { padding: 0px 8px !important; }
.ee { padding-top:10px !important;padding-bottom:10px !important; }
.ff ul, .ff ol { margin: 0px 0px 0px 10px !important;padding: 0px !important; }
.ff li { margin:10px 0px 0px 10px !important; }
.r {height:140px !important;}
.s p { font-size:13px !important;line-height:15px !important; }
.mob-hide {display:none !important;}
.mob-show {display: block !important; width: auto !important; overflow: visible !important; float: none !important; max-height: inherit !important; line-height: inherit !important;}
.mob-stack {width:100% !important;display:block !important;}
.mob-w-full {width:100% !important;}
.mob-block {display:block !important;}
.embed-img {padding:0px 0px 12px 0px !important;}
.socialShare {padding-top:15px !important;}
.rec { padding-left:15px!important;padding-right:15px!important; }
.bodyWrapper { padding:7px 4px 7px 4px !important; }
.social-mobile {float:left !important;margin-top:10px !important;}
}
@media screen and (max-width: 480px) {
u + .a .gg { width: 100% !important; width: 100vw !important; }
.tok-heart { padding-top:75% !important; }
.tok-play { padding-top: 250px !important; }
}
@media screen and (max-width: 320px) {
.tok-heart { padding-top:65% !important; }
}
.u { border: 1px solid #CACACA !important; border-radius: 2px !important; background-color: #ffffff !important; padding: 0px 13px 0px 13px !important; font-family:ui-sans-serif,system-ui,-apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,"Helvetica Neue",Arial,"Noto Sans",sans-serif !important;font-size: 12px !important; color: #767676 !important; }
.u a { text-decoration: none; display: block !important; color: #767676 !important; margin: 0px !important; }
.u span, .u img { color: #767676 !important;margin:0px !important; max-height:32px !important;background-color:#ffffff !important; }
</style><!--[if mso]><style type="text/css">
h1, h2, h3, h4, h5, h6 {font-family: Arial, sans-serif !important;}
body, table, td, p, a, span {font-family: Arial, sans-serif !important;}
sup { font-size: 100% !important;vertical-align: .5em !important;mso-text-raise: -1.5% !important;line-height: 0 !important; }
ul { margin-left:0px !important; margin-right:10px !important; margin-top:20px !important; margin-bottom:20px !important; }
ul li { margin-left: 0px !important; mso-special-format: decimal; }
ol { margin-left:0px !important; margin-right:10px !important; margin-top:20px !important; margin-bottom:20px !important; }
ol li { margin-left: 0px !important; mso-special-format: decimal; }
li.listItem { margin-left:15px !important; margin-top:0px !important; }
.paddingDesktop { padding: 10px 0 !important; }
.edm_outlooklist { margin-left: -20px !important; }
.embedImage { display:none !important; }
</style><![endif]--><!-- __merge_tags_in_links__ --><style>
@font-face {
font-family: 'Open Sans';
font-style: normal;
font-weight: 700;
font-display: swap;
src: url('https://fonts.gstatic.com/s/opensans/v40/memSYaGs126MiZpBA-UvWbX2vVnXBbObj2OVZyOOSr4dVJWUgsg-1x4gaVIUwaEQbjA.woff2') format('woff2');
}
@font-face {
font-family: 'Open Sans';
font-style: italic;
font-weight: 700;
font-display: swap;
src: url('https://fonts.googleapis.com/css2?family=Open+Sans:ital,wght@1,700&display=swap') format('woff2');
}
</style></head><body class="a" style="margin:0px auto;padding:0px;word-wrap:normal;word-spacing:normal;background-color:#dedede;"><div role="article" aria-roledescription="email" aria-label="email_name" lang="en" style="font-size:1rem"><div style="display:none;max-height:0px;overflow:hidden;"> Plus more about Polychromic Objectives for Reinforcement Learning and Stochastic activations  ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ </div><table role="none" width="100%" border="0" cellspacing="0" align="center" cellpadding="0" class="gg"><tr><td align="center" valign="top"><table role="none" width="670" border="0" cellspacing="0" cellpadding="0" class="aa" style="width:670px;table-layout:fixed;"><tr><td class="bodyWrapper" align="center" valign="top" style="padding:7px 7px 7px 7px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="center" valign="top" style="border-width:0px 0px 0px 0px;border-style: solid; border-color: #2a2a2a;border-radius:10px 10px 0px 0px;background-color:#ffffff;" class="c"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr id="header"><td style="padding:15px 15px 0px 15px;"><div style="padding-top:0px;padding-right:0px;padding-bottom:20px;padding-left:0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td class="f" align="right" valign="top"><p> October 07, 2025 | <a 
href="https://elink4f7.mail.bycloud.ai/ss/c/u001.c6q0w4g5sodbtO4I1B_pxSdB5RCIH6yy1Fm1CYma3EzYxKDC8yJqRsf_5e1gzhSqgUcjdNru3ohxDr6ggsMTJSGP1DCsCYD2wgCjXInxoXVjgY96vJew103YqM9QR6SoCrn8sp7Uco-j6_5c5tEBy7kVxML-sJF1lO9bFS_Hes-51TZKLh2MLZO_DK7DNWFlkOT1TeXhdq3vd2T_xbsgE9BHG4c5xB7imwBneV4ItHA85Mktd0ee2M-J4nmc9gbkZulsbWQLLYle6z5LRTIevUcdcOnvc25EhXwb4pjVMhmMC-GbtFJkNU6sdCZGL1mDFBDivRnxr5rjUedNZeb7c6EG7eZplVapZ4JOBvLwkNVr0xsmGevgD9EYaYcBNJAs61jwu9s2QpgS1EZu1-N_rSQDS2mJLdRvprqkWaJcr0CCYStlYEHcSWSUeDTsqncwMFIauSTmaauF3p3pch-CM7YkBDkrHPtmxv5I_oaS5ao7Nw2w799OaonzUb7_w6KFP-_CfGGr2YkGGV6IgoJFTdK3qy3tG6g4vhXOH0mF4KpiFSmTZ7VnGh4_GGaYZmXeBnrb0yXavNH7rqGqOeXt7c2CrMSmqipZEMM82A_prMcYA65xos5KX8j6-Eq3ubkPQ2DoUcLHltUBUMyBugQ_p6zxuybRmpEDhuibodwya88UdJd24nDi6Wr5qw576G4Oa7uRHdGNtjWE2Snry0dZE55YfTmYajNNyqKrB-wAyI6iG-gtvDVKn4-jFQfEaOl3t-kIVBwJCv2Ac-XYtwyYZ9nLHw5VzPof7sOAS5u0ztfreywM9sTLXRLACboeO21O63d3XknB5AHlBixaD_B8tw/4kj/GbeUl1zbTuO24U6eh_os5Q/h0/h001.ZpMEe8MU264GWNSzrOEkxP0TybpHg9OUQQnGyN-lSfI"><span class="translation_missing" title="translation missing: en.templates.posts.email.header.read_online">Read Online</span></a></p></td></tr><tr><td class="dd" align="center" valign="top" style="padding:15px 0;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="center" valign="top"><h1 style="text-align:left;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans Unicode',sans-serif;font-weight:Bold;font-size:32px;color:#2A2A2A;padding:2px 0;line-height:38px;"> Training Agents Inside of Scalable World Models </h1><p style="text-align:left;font-family:'Helvetica',Arial,sans-serif;font-weight:normal;font-size:20px;color:#3E3E3E;padding:5px 0;line-height:24px;"> Plus more about Polychromic Objectives for Reinforcement Learning and Stochastic activations </p></td></tr></table></td></tr><tr><td style="line-height:0;"><div data-open-tracking="true"> <img 
src="https://elink4f7.mail.bycloud.ai/ss/o/u001.3wmUuY8gEWd4_869a_eXcg/4kj/GbeUl1zbTuO24U6eh_os5Q/ho.gif" alt="" width="1" height="1" border="0" style="height:1px !important;width:1px !important;border-width:0 !important;margin-top:0 !important;margin-bottom:0 !important;margin-right:0 !important;margin-left:0 !important;padding-top:0 !important;padding-bottom:0 !important;padding-right:0 !important;padding-left:0 !important;"/> </div></td></tr></table></div></td></tr><tr id="content-blocks"><td class="email-card-body" align="center" valign="top" style="padding-bottom:15px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td id="nov-18-th-nov-24-th-33-latest-ai-re" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h6 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:87.5%;"><i>Sep 29th ~ Oct 6th</i><br><i>#76 Latest AI Research Explained Simply</i></h6></td></tr><tr><td><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" style=""><tr><td bgcolor="#222222" style="background-color:#222222;padding:0.0px 0.0px 0.0px 0.0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"></p></td></tr></table></td></tr></table></td></tr><tr><td id="industry-news-in-1-line" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:Bold;padding:0px 28px;text-align:left;"><h2 style="color:#2A2A2A;font-weight:Bold;mso-line-height-alt:150.0%;">🗞️ Industry News in 1 Line</h2></td></tr><tr><td style="padding-bottom:12px;padding-left:50px;padding-right:40px;padding-top:12px;" class="ee"><div style="margin-left:0px;" class="edm_outlooklist"><ol start="1" style="list-style-type:decimal;margin:0px 0px;padding:0px 0px 0px 0px;"><li class="listItem ultext"><p 
style="mso-line-height-alt:150.0%;padding:0px;text-align:left;word-break:break-word;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;">♥ 5.2k</span></span> Zhipu AI has released GLM-4.6, a new LLM with a longer context window of 200K tokens, superior coding performance, and advanced reasoning capabilities. GLM-4.6 also has more capable agentic functions and an improved writing style that better aligns with human preferences. You can try out GLM-4.6 on the <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.ZsobsZmG6kUZ4LjqczYBVKVDlAmESQcL-bONs-iLk77GSKR5tNm7R1KP_-Y9txPTkAniDD_eLzm6jAmlXXjzHJ2S-qoUiPJdOw_Jre3ui9RR8tI8QlkoNLSv4IokU1Kexo9oMsOn246lryYxmfGhVzYkJUHdut1jV-Oz-zxEcwqCazawsUBeHudbmBExF6Yem3HdhIMOQw-PVMyXW-av9soTFO1SUkoZmV_JGFa6FaHjq4j9EAMj7WGn4smxx92b/4kj/GbeUl1zbTuO24U6eh_os5Q/h1/h001.AuIEUKIouwmImbaUZ1s9Q4oTJjRW4DrbDWiSOEDYkjo" target="_blank" rel="noopener noreferrer nofollow"><span>Z.ai website</span></a>, use it <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.9ggl6Mt0xphuuMReR5gVpSrFFzDetpQb3JmVcgzZn5b5izuuoOf2dGk4pNKDNQwh1-SJS_weGPv480bIIwOwoP8yqpRL_FF53jmJQId-vJIyUeNfuM3zF5PqhDYKNdQiy1g1yY9uPVaTidu_cUDEffUleR4vF5-wLSNVxIwW3LyEu37rq71aHU3pP9I6TGe5opFoMJK5PhL4PA0eXIDad5XrCK0njE6HtYAlk7-UNpU-WgjzhqTHlZHuEZuDFmoZ-y2pnzzj3olLXXeHfv236A/4kj/GbeUl1zbTuO24U6eh_os5Q/h2/h001.dL2kvyFMJlZrdJRk6sT_1Mav378lLfpkxI9qSydQ3So" target="_blank" rel="noopener noreferrer nofollow"><span>via their API</span></a>, or <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.CxDkkVpJsBdVoe83c_tBWr6LdnAmKyWSp5j0x8wnkgGqldVpXBIMyvs9SNhydLeqa1iw9QQmtKBwk8KAcX6teZWktuL-865sAuw4UpnwAELCFGzzwscx8nx_-_MnTgLH9y48nRDK5sFFNSeHTXGeWscJ29iz_WJuwz8IvnJwYZe1LibUtrGsSx8MMPsWFVlhkgm7REx9ZekcaPXEJua9GaHdmxFm03cKbpRSIwkeaS8hYyS3kNQmLAh6W3Tt1N77EYSELsSBTzn-02ZxsgLRsA/4kj/GbeUl1zbTuO24U6eh_os5Q/h3/h001.iaCIzDj-SSlNvDX56JEOtXMjTcuu3U7rJNGuoYI20TY" target="_blank" rel="noopener noreferrer nofollow"><span>download 
the model weights</span></a> from Hugging Face to run it locally. </p><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/3637a68d-fb97-4345-a703-3ea07701b37b/coding_benchmark.png?t=1759857231" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr></table></li><li class="listItem ultext"><p style="mso-line-height-alt:150.0%;padding:0px;text-align:left;word-break:break-word;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;">♥ 11k</span></span> IBM has introduced <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.DUiN96-Eq7pUHzwEhy5j259JbYiT19QpsBdhuHcIW0StLGLb321gBTrH8yRkCuJuBNbYybCfI4l6rF43pA1RWm6k9wZcIYliCyAhuqnO8Z-SmzxTKYGeDemf-_YM_DtrVZsC7V4yW3C9KWRAy8E55j2XJs-EVZucQ2GCulkXd2X9aE9s_XjbMbtb2CdeWs8DGJ3Oz99e7hT1rNpPAeh7df_h0DpUipSNsrTu1o81gBPmT6dgrsj5zh7SaKqaw_SDFMGzPXYtsmSkmIzknd-Qw1AG91oV-c4whPGXFMxLlQinJoOVf32l5oXp_5JfszxqmUYCuCKr6rfJMVVhja_JdjWZwl7qb3sLQugQsuHCjlQ/4kj/GbeUl1zbTuO24U6eh_os5Q/h4/h001.u_AvNojeQaZsR7wwZORpzB4U-18LSMPvU3BalapeDec" target="_blank" rel="noopener noreferrer nofollow"><span>Granite 4.0</span></a>, a new generation of hyper-efficient, high-performance hybrid models that use a <b>Mamba/transformer architecture</b>, which significantly reduces memory requirements. The Granite 4.0 models are open-sourced under an <b>Apache 2.0 license</b>, and they are the first open models to receive <b>ISO 42001 certification</b> and be cryptographically signed, ensuring their security and trustworthiness. 
You can access Granite 4.0 on IBM <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.DUiN96-Eq7pUHzwEhy5j26wmLriuTd7tPlt55uDRV88qFdMdMAifWdRyQPXOkGcglcOHyyerasbOpvcpOEE2sWRDNy4rS6woji6yevtlkk_pYMvVhrVwg9lltuJJ0fneoTF2KM1K69X5-GnU-R49u1WWxTVFFWsCgRytZd37nHZfcHZVgMmKyIu5NwAnIQf6h3yMTjLLvpy0DfBg6tzAyXPcoJRXB0qbIJZe0KadiN-WMbPifdRw1upEpAYcc0ZARCDE_reJBd-F_ErFFqgJyA/4kj/GbeUl1zbTuO24U6eh_os5Q/h5/h001.cKr6zNyBlNl5mUXux1gkcTjNHG4xsExPfPg-yEY7gAY" target="_blank" rel="noopener noreferrer nofollow"><span>watsonx.ai</span></a> or via platform partners, including Hugging Face, Docker Hub, and NVIDIA NIM. </p><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/6b64ff65-db35-489b-960b-0157272f96fd/image_1363588221?t=1759857470" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr></table></li><li class="listItem ultext"><p style="mso-line-height-alt:150.0%;padding:0px;text-align:left;word-break:break-word;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;">♥ 8.3k</span></span> <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.DUiN96-Eq7pUHzwEhy5j2wWIFm9TgZ3SHguHucUZoaV5A9Q8ff-AHDtQUdCK2jt1d68npQxVK3-0P7ak8pA9hYIspDpD2UJjGb86--sDIQiFrN5-Es2s6EvWlR5EnNLy057q1zeZVE-ibV0hn3HMU31UXjZOm90lRrOyNOuPsy0YnmFplGoEVCezPn2V815cOaZbCGJMexXRzOkOKSzCbRPOZyhpNwOtMluddbGHP_WmgNQ2Kr5wYqnK2S4xjtYTMkGcUpVILxAHdspVsq_h4Q/4kj/GbeUl1zbTuO24U6eh_os5Q/h6/h001.Z9KyRp3h9_NqhC-M4rxAeUa3HcZ63KBdTWnFx5BcC48" target="_blank" rel="noopener noreferrer nofollow"><span>Perplexity has launched Comet</span></a>, a new browser with a built-in personal AI assistant designed to work for you. 
Comet can help you with a variety of tasks, such as understanding how different news outlets are covering a topic, organizing your tabs, drafting emails, and even building a basic website. </p></li><li class="listItem ultext"><p style="mso-line-height-alt:150.0%;padding:0px;text-align:left;word-break:break-word;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;">♥ 2.4k</span></span> Tencent has released <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.CxDkkVpJsBdVoe83c_tBWkSfgzu7eg1o_HUVTBLBU_tAf_VQ2-zcXBDviTfibeid4w_t13OaSw2JxQgq4i4OJf8Gh09-N67kmYzlhtI1qrxIS40goYIqd_k8IKPAhQYA32SrgyXU78BzxY6wIbX16cXtZOYrNc0VNEpWHSstq46eJYk4s00M_O_xW17vFE7VKfJcN5FETYe04wtrDD07kAYwF-ck0Uuv4qwq7ID2pamp3cjU8TWNUgZhB3qsBI9l/4kj/GbeUl1zbTuO24U6eh_os5Q/h7/h001.Hw0Utelb_pBzlpe5dJlTAGfIinNZ8yniqQlEGV7h65I" target="_blank" rel="noopener noreferrer nofollow"><span>Hunyuan Image 3.0</span></a>, and this new version has enhanced dual encoders and advanced RLHF optimization to produce stunning, high-quality images. Hunyuan Image 3.0 <b>supports both Chinese and English prompts</b> and offers flexible aspect ratios for your creative projects. You can get started or see a demo on the <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.CxDkkVpJsBdVoe83c_tBWkSfgzu7eg1o_HUVTBLBU_tAf_VQ2-zcXBDviTfibeid4w_t13OaSw2JxQgq4i4OJf8Gh09-N67kmYzlhtI1qrxIS40goYIqd_k8IKPAhQYA32SrgyXU78BzxY6wIbX16cXtZOYrNc0VNEpWHSstq46eJYk4s00M_O_xW17vFE7Vv64M5bqPfQimuDQBkxnueaFSsljX_kJTvgcOLG2ToJK3qSfTM7SetrE_O4BT4ZAJ/4kj/GbeUl1zbTuO24U6eh_os5Q/h8/h001.kxroFuiF8ONfU9-vfc-wsKGMs6XVFndgnHYyz2CixZM" target="_blank" rel="noopener noreferrer nofollow"><span>Hunyuan Image website</span></a>. 
</p><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:438px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/6cf6a3b4-1565-46bc-ba84-664356e8138d/image.png?t=1759857757" alt="" height="auto" width="438" style="display:block;width:100%;" border="0"/></td></tr></table></li><li class="listItem ultext"><p style="mso-line-height-alt:150.0%;padding:0px;text-align:left;word-break:break-word;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;">♥ 14k</span></span> <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.DIqyAo9xTeoWriogq2VlWeUmi9WmFR4pnC4wMSHAHOEi0s_79rCOu0Ids4VjFrRaM6MGxOzAW-171L7McQM2kFFjCnBfNv6K5qliP9b95f06NXlb7AM1Ayeq_q_gOU5SxlUWZ7mJT331E8juP62sfBQtajRLkJhjcptudllExZEp8NzXiPBA8x6iSOQnZ9WnRjBGebfYe9LzZbxayzqXW1SZF1kKoP1OKTKkLQ5Q23wYa7ukDG-pAcx-SapAhTR4vRjyDTKU3lTLaiklLQQpWA/4kj/GbeUl1zbTuO24U6eh_os5Q/h9/h001.ie2zBqjL_UilAv-9Gcj5PFoJQfPhsNXhzyjowb9eDWE" target="_blank" rel="noopener noreferrer nofollow"><span>OpenAI has announced Sora 2</span></a>, a video and audio generation model that can generate videos with synchronized dialogue and sound effects. It can even insert real-world people, animals, or objects into generated scenes with remarkable fidelity. You can use Sora 2 by downloading the new "Sora" social iOS app, which is now available in the U.S. and Canada. 
</p><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:500px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/2fd39c28-6a76-4f7b-a55d-56ca5a71d9f3/image.png?t=1759857984" alt="" height="auto" width="500" style="display:block;width:100%;" border="0"/></td></tr></table></li><li class="listItem ultext"><p style="mso-line-height-alt:150.0%;padding:0px;text-align:left;word-break:break-word;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;">♥ 425</span></span> <a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.9ggl6Mt0xphuuMReR5gVpTanKmQxfkMnn7OJKz54OPHbrSKDY2zKc0fKntGskBM0wOo3QArklugneX51NxGG_sCXszyYZk2ALwTvg0nbvC8GkJl2GXrZkBA_bu4PJep2wA1AfDWjVqoJ_xA4ukK2t7H3I-0sB2BctHuq7KOBXwk3sKAtqg1V0bIlTh6ZC6Jx5z8CWGBZr8iDrdD_fMRgVFPZq4J--jOqlx_yQao2x3M4fKHGvNjzOHD9iqFU74sxXPCpHq-n0U3tADjJ1aQiM3tUMcn-3n65wDn-76d3ObBumt72mtKeUPwY0RRWlk6JMa0BePl29E7Zd9OOlM09aBeNlVa4Snzezu3KhwZcxATNSrfrIEG_CwTS1x7bLYAl/4kj/GbeUl1zbTuO24U6eh_os5Q/h10/h001.4vcOkO_GvPdzNmYe7kDnGHygzpcnsm9pLY9T1Wyj-T0" target="_blank" rel="noopener noreferrer nofollow"><span>Google has launched Jules Tools</span></a>, which is a command-line interface for their asynchronous coding agent, Jules. This tool allows developers to interact with Jules directly from their terminal to perform tasks like writing tests, building new features, and fixing bugs. You can install Jules Tools via npm by installing the <code>@google/jules</code> package. 
</p></li></ol></div></td></tr><tr><td><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" style=""><tr><td bgcolor="#222222" style="background-color:#222222;padding:0.0px 0.0px 0.0px 0.0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"></p></td></tr></table></td></tr></table></td></tr><tr><td><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" style=""><tr><td bgcolor="transparent" style="background-color:transparent;border-color:#2C81E5;border-style:solid;border-width:5px;padding:0.0px 0.0px 0.0px 0.0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:Bold;padding:0px 28px;text-align:left;"><h2 style="color:#2A2A2A;font-weight:Bold;mso-line-height-alt:150.0%;"><span style="">Support My Newsletter</span></h2></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><span style="color:rgb(34, 34, 34);font-family:Georgia, 'Times New Roman', serif;font-size:16px;">As I aim to keep this newsletter free forever, your support means a lot. 
If you like reading The AI Timeline, consider forwarding it to another research enthusiast. It helps us keep this up for free!</span></p></td></tr><tr><td align="center" valign="top"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="center" valign="top" style="font-size:0px;line-height:0px;padding:30px 0px 30px;" class="dd"><table class="j" role="none" width="50%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td> </td></tr></table></td></tr><tr><td class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:Bold;padding:0px 28px;text-align:left;"><h2 style="color:#2A2A2A;font-weight:Bold;mso-line-height-alt:150.0%;">Share The AI Timeline</h2></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> You currently have <strong>0</strong> referrals. </p></td></tr><tr><td align="left" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; display:none;width:0px;max-height:0px;overflow:hidden;mso-hide:all;height:0;font-size:0;max-height:0;line-height:0;margin:0 auto;" class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 0;"><tr><td align="center" valign="top" style="width:313px;"><a 
href="https://elink4f7.mail.bycloud.ai/ss/c/u001.c6q0w4g5sodbtO4I1B_pxWc4htTObwdorovK0nFHVH-4pUdVE0ELYH5DsNemk732SjNwhPNJ25r0O8B5vYifsGNUqyW5TiZkyMsF1yreu0byy2KW36J1wDdpoLuXg2TU1F1OW8OHoHaU4-ZmrZpPU4RN-crQCEimD190CSn9fPuxpIRojBJyu1VfV5KtQD3QMVdSg2JrjEj5-xm4r4E12Whf08itqPCb9Q5W0X4rt3ubYkqCmWnLeZpmb3_RZcbIk0UE5wZnFLCQJHLFs0qZ0OGpXp89o1HU4mWIBur5Or4tQGm5M_Y8m5PvTEfYfxLRyrcRv7GyVs5oLtFfiySZ2SqtZypLA-h50h61p0uPiA7iA_PiMqlVLtM-87XL33VZi05_O3UTpWE_0nAzFRJ4TW1ayz3_vn4Zlp9IERdbnnAd_1kPLD4lAQcR5PRXgtpC5pzW8aEG85gM-il9p7IzuzX-KyLoJ054k24kHzZR5xsxcn3TfSZoJdKlB15d9sFNm3EOmxXzmknrXlnpEdwrIsGdgqKqrcLTdqi3CnUchubI2ftVeyI2IcWbhb9EniEGzc02uJGggG4y_cnqg-PZppOI32XB2yTjNvHzgjJVcWrW48VtixA_ailq4H8Ep9KE/4kj/GbeUl1zbTuO24U6eh_os5Q/h11/h001.KQxjh__0z7OcbqTd4XVr9_QOXZNo5RmDYKaKSRnrZO0" rel="noopener noreferrer nofollow" style="text-decoration:none;" target="_blank"><img src="" alt="" height="auto" width="313" style="display:block;width:100%;" border="0"/></a></td></tr></table></td></tr><tr class="btn_row"><td valign="top" style="padding-bottom:14px;padding-left:28px;padding-right:28px;padding-top:14px;text-align:left;width:100%;word-break:break-word;" class="dd"><table width="100%" role="none" border="0" cellspacing="0" cellpadding="0" style="margin:14px auto 14px auto;"><tr><td align="left" valign="middle"><table role="none" border="0" cellspacing="0" cellpadding="0"><tr><td style="background-color:#2C81E5;border-radius:8px;mso-padding-alt:14px 20px;" class="btn"><a 
href="https://elink4f7.mail.bycloud.ai/ss/c/u001.c6q0w4g5sodbtO4I1B_pxWc4htTObwdorovK0nFHVH-4pUdVE0ELYH5DsNemk732SjNwhPNJ25r0O8B5vYifsGNUqyW5TiZkyMsF1yreu0byy2KW36J1wDdpoLuXg2TU1F1OW8OHoHaU4-ZmrZpPU4RN-crQCEimD190CSn9fPuxpIRojBJyu1VfV5KtQD3QMVdSg2JrjEj5-xm4r4E12Whf08itqPCb9Q5W0X4rt3ubYkqCmWnLeZpmb3_RZcbIk0UE5wZnFLCQJHLFs0qZ0OGpXp89o1HU4mWIBur5Or4tQGm5M_Y8m5PvTEfYfxLRyrcRv7GyVs5oLtFfiySZ2SqtZypLA-h50h61p0uPiA7iA_PiMqlVLtM-87XL33VZi05_O3UTpWE_0nAzFRJ4TW1ayz3_vn4Zlp9IERdbnnAd_1kPLD4lAQcR5PRXgtpC5pzW8aEG85gM-il9p7IzuzX-KyLoJ054k24kHzZR5xsxcn3TfSZoJdKlB15d9sFNm3EOmxXzmknrXlnpEdwrIsGdgqKqrcLTdqi3CnUchubI2ftVeyI2IcWbhb9EniEGzc02uJGggG4y_cnqg-PZppOI32XB2yTjNvHzgjJVcWrW48VtixA_ailq4H8Ep9KE/4kj/GbeUl1zbTuO24U6eh_os5Q/h12/h001.uq15xGsVwPBXLiyrQPHZE6ACJux3_u5jkjdja0UPGJ0" target="_blank" rel="noopener noreferrer nofollow" style="background-color:#2C81E5;border-radius:8px;color:#FFFFFF;display:inline-block;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans Unicode',sans-serif;font-size:16px;font-weight:normal;line-height:18px;padding:14px 20px;text-decoration:none;"> Click to Share </a></td></tr></table></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> Or copy and paste this link to others: <a class="link" href="https://mail.bycloud.ai/subscribe?ref=6SqUHb8KiF&_bhlid=bf7a73b936aab597b0df9777ef50b28c5a049d32" target="_blank" rel="noopener noreferrer nofollow" clicktracking="off"><span>https://mail.bycloud.ai/subscribe?ref=6SqUHb8KiF</span></a></p></td></tr><tr><td align="center" valign="top" style="font-size:0px;line-height:0px;padding:30px 0px 30px;" class="dd"><table class="j" role="none" width="50%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td> </td></tr></table></td></tr></table></td></tr><tr class="btn_row"><td valign="top" 
style="padding-bottom:14px;padding-left:28px;padding-right:28px;padding-top:14px;text-align:center;width:100%;word-break:break-word;" class="dd"><table width="100%" role="none" border="0" cellspacing="0" cellpadding="0" style="margin:14px auto 14px auto;"><tr><td align="center" valign="middle"><table role="none" border="0" cellspacing="0" cellpadding="0"><tr><td style="background-color:#2C81E5;border-radius:8px;mso-padding-alt:14px 20px;" class="btn"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.zNfxTwpJFmrsCuJJphGRkKSrCVph9-fOYkcjx4VfJRyUw-Iv7GHKoTyxc57iFdcabeJrUAXVgdJXAkTcc7bS82ZF6NEkQHkUBgqGaM66RDbyMBpTK8pOBl6aVCc1cb8uqQ6Q6Hx1l4RRodS0U5r9FmhUZ14ZjtkmpX8RvpvmdrkQmS65XBl4NbS99bb7EC7UVb3euNeok7Xg0jmZ-omf2y-OPcf947lvqYs_7oasodS72wTyoCeTA5yNiEz3-vIc1QmjMXMyKLTlNBDgfcdotA/4kj/GbeUl1zbTuO24U6eh_os5Q/h13/h001.mYe13oiXARNALeaqOrPoP4hW9O7RqBaxiHiV94Qtqvc" target="_blank" rel="noopener noreferrer nofollow" style="background-color:#2C81E5;border-radius:8px;color:#FFFFFF;display:inline-block;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans Unicode',sans-serif;font-size:16px;font-weight:normal;line-height:18px;padding:14px 20px;text-decoration:none;"> Check Out My Patreon </a></td></tr></table></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><span style=""><a class="link" href="https://elink4f7.mail.bycloud.ai/ss/c/u001.tLfGW26lAwaS9gFg17HSoGymQ3NNPtd5dE5MV_8UgjLbPKYFbBPtV6oAT4VYSncNiXOMe0ETHKViEemkGKRuti97gDsqlNJXOC9cMEoZt4vqGEMzd3CYIoAvubE-GTMM6e4ZzHl1I3WRxsRBZpXpRXRM3801MtJ-jquBlKHue59zy8deLb2EpA7wBE1a-68dnTKLaQhjlZ_Car0MiBvKmNznyZT4y4D035n6XKtafu9Ep5G6Z3jZOSJwNAzXb0Vfdp4CEqGdtLfO-BU0LIxjBQ/4kj/GbeUl1zbTuO24U6eh_os5Q/h14/h001.0R6FvcjVTBuY8i19UEOPfoLuB9lAuxzoq2f9TSu-5do" target="_blank" rel="noopener noreferrer nofollow"><span>Advertise with The AI Timeline! 
</span></a></span></p></td></tr></table></td></tr></table></td></tr><tr><td><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" style=""><tr><td bgcolor="#222222" style="background-color:#222222;padding:0.0px 0.0px 0.0px 0.0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"></p></td></tr></table></td></tr></table></td></tr><tr><td id="training-agents-inside-of-scalable-" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:Bold;padding:0px 28px;text-align:left;"><h2 style="color:#2A2A2A;font-weight:Bold;mso-line-height-alt:150.0%;">Training Agents Inside of Scalable World Models</h2></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><i>Hafner</i><span style=""><i> et al. [</i></span><i>Google DeepMind</i><span style=""><i>]</i></span></p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;"> ♥ 485 </span></span><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> </span><span style="background-color:#e0e0e0;"><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> LLM World Models </span></span></p></td></tr><tr><td id="introduction-to-world-models-and-dr" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Introduction to World Models and Dreamer 4</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> We all know about language models, but that’s not the only 
kind of model out there. Researchers have developed world models, which learn from videos and simulate experiences to train intelligent behaviors. However, previous world models have struggled to accurately predict complex object interactions, especially in rich environments like video games. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This paper introduces Dreamer 4, which addresses this challenge by learning a fast and accurate world model that enables reinforcement learning inside simulated experience. </p></td></tr><tr class="embed-gen-img-r"><td align="center" valign="top" style="padding:12px 27px 12px 27px;" class="dd"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="center" valign="top" class="o" style="padding:12px 12px 12px 12px;;background-color:#FFFFFF;border-color:#F1F1F1;border-radius:5px 5px 5px 5px;border-width:1px 1px 1px 1px;"><!--[if !mso]><!--><div style="display:none; float:left; overflow:hidden; width:0; max-height:0; line-height:0;" class="mob-show"><table role="none" border="0" cellspacing="0" cellpadding="0" align="right" width="100%"><tr><td align="center" valign="top"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.9ggl6Mt0xphuuMReR5gVpQSxa46C2NVBKURDNuOjDh-TvqwJc6MoxM0Gg_hOzK5COBXdj98oGsQK_rZNjQ26DtdgcslmN5FPyY7_-uggqYDFLUjeFx4gSb9q1GiCchJ-7hPG0r3GWNaSAlBMC1BVASIMRBLjUibrtJKcHEhzQhQfvO6GtBWose2filvqm9Xh7bEi256YMyU1_Y_wFT7R5wrSdBlFuJUEGiN94wd9uF2R2H_M0CzRRJ2dy6avS8A9785aW9FdkClcJ9XfkZWnSg/4kj/GbeUl1zbTuO24U6eh_os5Q/h15/h001.XD4cyqDIrDLpVzusL8GdPucBR0Nc1-7eC300w_MyorI" target="_blank"><img src="" width="100%" style="height:auto;display:block;"/></a></td></tr><tr><td height="16" style="font-size:16px;line-height:16px;"> </td></tr></table></div><!--<![endif]--><table role="none" border="0" cellspacing="0" cellpadding="0" align="right" width="100%"><tr><td width="57%" 
align="center" valign="middle" class="mob-stack"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="left" valign="middle" class="l"><p><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.9ggl6Mt0xphuuMReR5gVpQSxa46C2NVBKURDNuOjDh-TvqwJc6MoxM0Gg_hOzK5COBXdj98oGsQK_rZNjQ26DtdgcslmN5FPyY7_-uggqYDFLUjeFx4gSb9q1GiCchJ-7hPG0r3GWNaSAlBMC1BVASIMRBLjUibrtJKcHEhzQhQfvO6GtBWose2filvqm9Xh7bEi256YMyU1_Y_wFT7R51Hfi5mFhGW9lnXEWZDXViI1aaRwMohvPvAvf7L1OqBbm4b-ELIDmsmRDAI0ozISyw/4kj/GbeUl1zbTuO24U6eh_os5Q/h16/h001.iyKtBdcXZxBXVx6y66dnDV2JDQ8MiDElvhsRTn51QbQ" style="text-decoration:none;font-style:normal;color:#2D2D2D !important;font-size:14px;line-height:20px;" target="_blank"> Training Agents Inside of Scalable World Models <tr><td align="left" valign="top" class="m"><p style="font-size:13px;line-height:19px;color:#2D2D2D;"> Introducing Dreamer 4 </p></td></tr><tr><td align="left" valign="bottom" class="n" style="vertical-align:bottom;padding-top:12px;"><p style="word-break:break-word;">danijar.com/project/dreamer4</p></td></tr></a></p></td></tr></table></td><td width="3%" style="font-size:16px;line-height:16px;" class="mob-hide"> </td><td width="40%" align="left" valign="top" class="mob-hide"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.9ggl6Mt0xphuuMReR5gVpQSxa46C2NVBKURDNuOjDh-TvqwJc6MoxM0Gg_hOzK5COBXdj98oGsQK_rZNjQ26DtdgcslmN5FPyY7_-uggqYDFLUjeFx4gSb9q1GiCchJ-7hPG0r3GWNaSAlBMC1BVASIMRBLjUibrtJKcHEhzQhQfvO6GtBWose2filvqm9Xh7bEi256YMyU1_Y_wFT7R52LVypzN8M4-b8LM5__OPCJlHdvQ29IlyGaZFeFFAGIV9HIZrUgT38Yshrc1GFKx-Q/4kj/GbeUl1zbTuO24U6eh_os5Q/h17/h001.90-sxqVc8PTuh2IDJ02Y1Ww8G7Ev8BOvSOxLoLn0pQs" target="_blank"><img src="" width="230" style="height:auto;display:block;"/></a></td></tr></table></td></tr></table></td></tr><tr><td id="inner-workings-of-dreamer-4" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 
style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Inner Workings of Dreamer 4</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> Dreamer 4 consists of two main components: a tokenizer and a dynamics model, both built on an efficient transformer architecture. The tokenizer compresses video frames into compact representations, while the dynamics model predicts future representations based on past actions and observations. </p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/b92593b7-b0a8-41cb-81d0-c5d50f8f2ea9/image.png?t=1759852393" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>World model design. Dreamer 4 consists of a causal tokenizer and an interactive dynamics model, which both use the same block-causal transformer architecture.</p></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This setup allows the model to handle multiple modalities, like images and actions, within a single transformer. It uses a shortcut forcing objective, which helps the model generate frames quickly and accurately with just four sampling steps per frame. This approach reduces error accumulation over long video sequences and supports real-time inference on a single GPU. 
</p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/415b77c1-87d6-412c-910d-0e89f8a2536f/image.png?t=1759852340" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The model is trained in three phases. First, the tokenizer and dynamics model are pretrained on videos, with or without action labels. Next, the model is fine-tuned with task-specific inputs to predict actions and rewards. Finally, the policy is improved through imagination training, where the agent practices decision-making inside the world model using reinforcement learning. </p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/57bafb15-2a86-4d0f-9484-2c87cb4682a7/image.png?t=1759852368" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Agent performance in Minecraft without environment interaction. 
</p></td></tr></table></td></tr><tr><td id="evaluation-and-results-of-dreamer-4" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Evaluation and Results of Dreamer 4</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> Dreamer 4 reaches a major milestone: it is the first agent to obtain diamonds in Minecraft using only offline data, without any environment interaction. This task requires executing over 20,000 low-level mouse and keyboard actions from raw pixels. Dreamer 4 significantly outperforms previous methods, such as OpenAI’s VPT agent, despite using <b>100 times less data</b>. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> It also surpasses agents that leverage large vision-language models like Gemma 3, nearly tripling the success rate for crafting iron pickaxes. The world model itself accurately predicts object interactions and game mechanics, successfully completing 14 out of 16 complex tasks in human-in-the-loop evaluations. 
</p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/d6ce7302-fb6b-48ce-99d9-a76795b56d61/image.png?t=1759852479" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Action generalization.</p></td></tr></table></td></tr><tr class="btn_row"><td valign="top" style="padding-bottom:14px;padding-left:28px;padding-right:28px;padding-top:14px;text-align:center;width:100%;word-break:break-word;" class="dd"><table width="100%" role="none" border="0" cellspacing="0" cellpadding="0" style="margin:14px auto 14px auto;"><tr><td align="center" valign="middle"><table role="none" border="0" cellspacing="0" cellpadding="0"><tr><td style="background-color:#2C81E5;border-radius:8px;mso-padding-alt:14px 20px;" class="btn"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.fUNb4GdFo9D3F8WuLArtoV5sElgytBlvJRzI9WtI92ZDCTLU_mLW-baYc7sqX1l7verPAbAHpAtffYqhzaGjmRrlf5NkTdrEEr37BTalFdKxTJ6QW4oDOkZ1QREpNIdFJkW9Pp1jyhOta5uU7V-60SQFw5r-Bqf42ovxhOj_ZXnAX8SM4GyTV7zE3KKm6pkXlPZXUBFk7XMgMK_HIBpaXEqKGIOE7heUZARhoGnqYgBARmcFF73r1N2qfxE9Ky-A9E9usCYXhOC9tt0rcFkVxA/4kj/GbeUl1zbTuO24U6eh_os5Q/h18/h001.ICzx1hyW35FzWrwvbPjIZpL1dvjFFQTE57QZBnG8cjQ" target="_blank" rel="noopener noreferrer nofollow" style="background-color:#2C81E5;border-radius:8px;color:#FFFFFF;display:inline-block;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans Unicode',sans-serif;font-size:16px;font-weight:normal;line-height:18px;padding:14px 20px;text-decoration:none;"> Read Full Paper 
</a></td></tr></table></td></tr></table></td></tr><tr><td><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" style=""><tr><td bgcolor="#222222" style="background-color:#222222;padding:0.0px 0.0px 0.0px 0.0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"></p></td></tr></table></td></tr></table></td></tr><tr><td id="stochastic-activations" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:Bold;padding:0px 28px;text-align:left;"><h2 style="color:#2A2A2A;font-weight:Bold;mso-line-height-alt:150.0%;">Stochastic activations</h2></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><i>Lomeli et al. [Meta FAIR, Ecole Normale Supérieure Paris Saclay, Paris Cité University]</i></p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;"> ♥ 22k </span></span><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> </span><span style="background-color:#e0e0e0;"><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> LLM Activation Functions </span></span><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> </span></p></td></tr><tr><td id="introduction-to-stochastic-activati" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Introduction to Stochastic Activations</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> Many AI architectures use ReLU activation because it 
produces sparse outputs (many zeros), which can speed up inference by reducing the number of calculations needed. However, ReLU has a known weakness: for negative inputs, its gradient is zero, which can halt learning in parts of the network during training. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> On the other hand, the SILU activation avoids this problem and generally leads to better model accuracy, but it doesn’t produce sparsity. So, how can we get the best of both worlds? </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This paper introduces stochastic activations. Instead of committing to one activation function, this approach randomly chooses between ReLU and SILU during training. </p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/5354dab7-c1fe-4de6-addb-7a09f29e764b/image.png?t=1759854065" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Stochastic activation randomly selects one of two activations when x &lt; 0: (1) RELU selected with probability 1-p; otherwise (2) another activation, in particular SILU.</p></td></tr></table></td></tr><tr><td id="inner-workings-of-stochastic-activa" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 
style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Inner Workings of Stochastic Activations</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The core idea behind stochastic activations is to randomly switch between ReLU and SILU when the input to the activation function is negative. This is controlled by a Bernoulli random variable: with probability p, the model uses SILU, and with probability 1-p, it uses ReLU. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> For positive inputs, the function is fixed: it acts either as the identity (as in ReLU) or as SILU, depending on the configuration. This stochastic behavior during training ensures the network experiences both the smooth gradients of SILU, which help optimization, and the sparsity-inducing pattern of ReLU, which is useful later at inference time. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The authors combined this technique with a fine-tuning step called Swi+FT to prepare the model for efficient inference. Here, the model is first pre-trained mostly with SILU or a stochastic mix, and then, for the last 5–10% of training steps, it switches entirely to ReLU. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This final tuning phase adapts the weights to ReLU’s behavior, so the model performs well when ReLU is used at inference time. 
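</p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The mechanism described above can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' implementation: the function names, the per-element Bernoulli draw, the <code>positive</code> switch, the ReLU-only inference mode, and the 10% ReLU tail in <code>swi_ft_activation</code> are assumptions made for the example. </p><pre style="font-family:monospace;font-size:13px;line-height:18px;white-space:pre-wrap;margin:0;">
```python
import numpy as np


def silu(x):
    """SiLU (swish): x * sigmoid(x)."""
    return x / (1.0 + np.exp(-x))


def relu(x):
    return np.maximum(x, 0.0)


def stochastic_activation(x, p, rng, positive="identity", training=True):
    """Hypothetical sketch of a stochastic ReLU/SiLU activation.

    Negative inputs: SiLU with probability p, ReLU (zero) with probability 1 - p.
    Non-negative inputs: fixed branch, either identity or SiLU.
    Assumption: one independent Bernoulli draw per element.
    """
    x = np.asarray(x, dtype=float)
    if not training:
        # Assumed inference mode: plain ReLU, so negative inputs give exact zeros.
        return relu(x)
    use_silu = p > rng.random(x.shape)  # True with probability p
    negative_branch = np.where(use_silu, silu(x), 0.0)
    positive_branch = x if positive == "identity" else silu(x)
    return np.where(x >= 0, positive_branch, negative_branch)


def swi_ft_activation(step, total_steps, relu_fraction=0.1):
    """Swi+FT-style schedule (assumed form): SiLU first, ReLU for the final fraction of steps."""
    switch_step = total_steps - int(round(total_steps * relu_fraction))
    return "relu" if step >= switch_step else "silu"


rng = np.random.default_rng(0)
x = np.array([-2.0, -0.5, 0.0, 1.5])
print(stochastic_activation(x, p=0.5, rng=rng))
print(swi_ft_activation(950, 1000))
```
</pre><p style="mso-line-height-alt:150.0%;"> With <code>p=0</code> the sketch reduces to plain ReLU, and with <code>p=1</code> and <code>positive="silu"</code> it reduces to plain SILU; values in between mix the two on negative inputs only. 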
</p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/af7ab321-565c-42b2-8d3c-15087d242fac/image.png?t=1759853926" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Losses during the last 500 steps of the training loss of LM1.</p></td></tr></table></td></tr><tr><td id="evaluation-and-results-of-stochasti" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Evaluation and Results of Stochastic Activations</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The researchers tested this method on models with 1.5 billion and 3 billion parameters, and evaluated it on a range of tasks, including code generation, common-sense reasoning, and question answering. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> When using ReLU at inference time, models trained with stochastic activations and fine-tuning (Swi+FT) achieved validation losses close to those of SILU-trained models, and significantly better than models trained with ReLU alone. 
</p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/25df734b-0e00-4afc-85bd-3ec36be74e25/image.png?t=1759854131" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Performance per benchmark of RELU and SILU.</p></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> With ReLU at inference time, the method achieved <b>1.65 times faster inference</b> on a CPU, with around 90% of activation outputs being zero. This shows that the approach balances accuracy and computational cost. Additionally, when stochastic activations were used directly during inference for text generation, they provided a way to sample diverse outputs. In some tasks, this stochastic generation <b>outperformed standard temperature sampling</b>, though results varied across benchmarks. 
</p></td></tr><tr class="btn_row"><td valign="top" style="padding-bottom:14px;padding-left:28px;padding-right:28px;padding-top:14px;text-align:center;width:100%;word-break:break-word;" class="dd"><table width="100%" role="none" border="0" cellspacing="0" cellpadding="0" style="margin:14px auto 14px auto;"><tr><td align="center" valign="middle"><table role="none" border="0" cellspacing="0" cellpadding="0"><tr><td style="background-color:#2C81E5;border-radius:8px;mso-padding-alt:14px 20px;" class="btn"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.fUNb4GdFo9D3F8WuLArtoV5sElgytBlvJRzI9WtI92alJxe5vTvLSL9GvS37oekahx1xliCfzsbSjE5FXmAcxVS69VWQc0NBNND14XT71i9CDS89RP8AndJ8eEnksijpcwG58jJxcSsmWls5PcEf1TqTPH3XcwP_UkdZxr0iOTgLXKbu6nxka9l1fEoubvU0w71kejQlig-6ojOGLYjC_OKBt4p5O58uVjGWi7NU9Nfi-9gyWrbSVg1ulsZy-KBrA6h0RtrFtNw3sVYABLevQg/4kj/GbeUl1zbTuO24U6eh_os5Q/h19/h001.RKwe_X-N0LJj4xoZvSdmBaWgVMVdBozRH2L-itik1Ew" target="_blank" rel="noopener noreferrer nofollow" style="background-color:#2C81E5;border-radius:8px;color:#FFFFFF;display:inline-block;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans Unicode',sans-serif;font-size:16px;font-weight:normal;line-height:18px;padding:14px 20px;text-decoration:none;"> Read Full Paper </a></td></tr></table></td></tr></table></td></tr><tr><td><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" style=""><tr><td bgcolor="#222222" style="background-color:#222222;padding:0.0px 0.0px 0.0px 0.0px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0"><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"></p></td></tr></table></td></tr></table></td></tr><tr><td id="polychromic-objectives-for-reinforc" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:Bold;padding:0px 28px;text-align:left;"><h2 
style="color:#2A2A2A;font-weight:Bold;mso-line-height-alt:150.0%;">Polychromic Objectives for Reinforcement Learning</h2></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><i>Hamid</i><span style=""><i> et al. [</i></span><i>Stanford University</i><span style=""><i>]</i></span></p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"><span style="background-color:#e0e0e0;"><span style="color:rgb(255, 58, 58);font-size:0.6rem;"> ♥ 424 </span></span><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> </span><span style="background-color:#e0e0e0;"><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> LLM Reinforcement Learning </span></span><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> </span><span style="background-color:#e0e0e0;"><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> bycloud’s pick </span></span><span style="color:rgb(44, 129, 229);font-size:0.6rem;"> </span></p></td></tr><tr><td id="introduction-to-polychromic-objecti" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Introduction to Polychromic Objectives in RLFT</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> Reinforcement learning fine-tuning (RLFT) is a powerful technique for adapting pretrained AI models, but it often comes with a hidden cost. As policies are refined for better performance, they can lose the rich diversity of behaviors they once had. 
</p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This entropy collapse makes it harder for models to explore new solutions and adapt to unfamiliar tasks. To tackle this issue, researchers have developed polychromic objectives, a new approach that explicitly encourages policies to maintain and refine a wide range of behaviors during fine-tuning. </p></td></tr><tr><td id="inner-workings-of-polychromic-ppo" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Inner Workings of Polychromic PPO</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This method shifts the focus from optimizing individual trajectories to evaluating entire sets of them. Instead of rewarding a single successful path, the policy is trained to generate groups of trajectories that collectively exhibit high performance and diversity. This broader perspective helps prevent the model from over-specializing on just a few high-reward actions. 
</p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/420a76ac-4968-4708-9f3e-7d2eab692859/image.png?t=1759854763" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> This reinforcement learning approach uses the polychromic objective, which scores a set based on both the average reward of its trajectories and a measure of their diversity. For example, in grid-world tasks, diversity might be defined by whether trajectories visit different rooms, while in algorithmic tasks, it could involve generating unique sequences. By combining these elements, the objective ensures that policies are incentivized to explore varied strategies without sacrificing success. </p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> To implement this in practice, the researchers adapted proximal policy optimization (PPO) into polychromic PPO. They use vine sampling to collect multiple rollouts from key states during training, and then compute a shared advantage signal for all trajectories in a set. This means that every action in a diverse and successful set receives the same positive update, reinforcing the policy to produce such sets consistently. The result is a fine-tuning process that naturally balances exploitation of known good behaviors with exploration of new ones. 
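</p></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The set-level scoring described above can be illustrated with a small, self-contained Python sketch. It is not the authors' implementation: combining average reward and diversity by multiplication, measuring diversity as the fraction of distinct trajectories, enumerating all subsets of a fixed size, and averaging set advantages per trajectory are all assumptions chosen to keep the example short. </p><pre style="font-family:monospace;font-size:13px;line-height:18px;white-space:pre-wrap;margin:0;">
```python
from itertools import combinations
from statistics import mean


def diversity(trajectories):
    """Assumed diversity measure: fraction of distinct trajectories in the set."""
    return len(set(trajectories)) / len(trajectories)


def polychromic_score(trajectories, rewards):
    """Set-level score: average reward times diversity (assumed combination)."""
    return mean(rewards) * diversity(trajectories)


def shared_advantages(rollouts, set_size):
    """Per-rollout advantages derived from set-level scores.

    rollouts: list of (trajectory, reward) pairs sampled from one state,
    where each trajectory is hashable (here, a tuple of visited rooms).
    """
    sets = list(combinations(range(len(rollouts)), set_size))
    scores = [
        polychromic_score(
            [rollouts[i][0] for i in members],
            [rollouts[i][1] for i in members],
        )
        for members in sets
    ]
    baseline = mean(scores)  # assumed baseline: mean score over all sets
    advantages = []
    for i in range(len(rollouts)):
        # Every member of a set shares that set's advantage; a rollout that
        # appears in several sets gets the average of those shared values.
        shared = [s - baseline for members, s in zip(sets, scores) if i in members]
        advantages.append(mean(shared))
    return advantages


# Toy rollouts: two identical successes, one distinct success, one failure.
rollouts = [
    (("A", "B"), 1.0),
    (("A", "B"), 1.0),
    (("A", "C"), 1.0),
    (("D",), 0.0),
]
print(shared_advantages(rollouts, set_size=2))
```
</pre><p style="mso-line-height-alt:150.0%;"> In this toy example the two duplicated successful rollouts end up with an advantage of roughly zero, the distinct successful rollout gets a positive advantage, and the failed rollout gets a negative one. 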
</p></td></tr><tr><td id="evaluation-and-results-of-polychrom" class="dd" align="left" valign="top" style="color:#2A2A2A;font-weight:normal;padding:0px 28px;text-align:left;"><h3 style="color:#2A2A2A;font-weight:normal;mso-line-height-alt:125.0%;">Evaluation and Results of Polychromic PPO</h3></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> The researchers tested this approach on BabyAI, Minigrid, and Algorithmic Creativity tasks. Polychromic PPO achieves higher success rates and better coverage of different environment configurations than standard RL methods. In pass@k evaluations, where models get k attempts to solve a task, polychromic PPO shows substantially improved performance as k increases. In challenging tasks like BossLevel, it achieved up to <b>15% higher pass rates</b> with more attempts, while baselines plateaued early. </p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/e8b9e220-572d-4a40-9c01-ab34078ba291/image.png?t=1759854783" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Average reward and success rate (%) on BabyAI tasks.</p></td></tr></table></td></tr><tr><td class="dd" align="left" style="padding:0px 28px;text-align:left;word-break:break-word;"><p style="mso-line-height-alt:150.0%;"> While there is a slight trade-off in some validity metrics, the overall gains in creativity and coverage make polychromic objectives 
a promising direction for future AI systems that need to explore and innovate. </p></td></tr><tr><td align="center" valign="top" style="padding-bottom:20px;padding-left:15px;padding-right:15px;padding-top:20px; " class="dd"><table role="none" border="0" cellspacing="0" cellpadding="0" style="margin:0 auto 0 auto;"><tr><td align="center" valign="top" style="width:626px;"><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/6ac135d9-e483-4769-911d-72db3d2d6db8/image.png?t=1759854835" alt="" height="auto" width="626" style="display:block;width:100%;" border="0"/></td></tr><tr><td align="center" valign="top" class="t" style="width:626px; padding: 4px 0px 4px 0px;"><p>Average pass rate (%) in one attempt on BabyAI tasks under large initial-state perturbations.</p></td></tr></table></td></tr><tr class="btn_row"><td valign="top" style="padding-bottom:14px;padding-left:28px;padding-right:28px;padding-top:14px;text-align:center;width:100%;word-break:break-word;" class="dd"><table width="100%" role="none" border="0" cellspacing="0" cellpadding="0" style="margin:14px auto 14px auto;"><tr><td align="center" valign="middle"><table role="none" border="0" cellspacing="0" cellpadding="0"><tr><td style="background-color:#2C81E5;border-radius:8px;mso-padding-alt:14px 20px;" class="btn"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.fUNb4GdFo9D3F8WuLArtoV5sElgytBlvJRzI9WtI92a0IWcKNjHgG7_a8MncsqclTP7ROTN0iuHDi558QNGPr_kW6u7AdrXtDV91XwFUh2xbaPcVbJ4_eqOUCzLgimnAcSPO-TaJEMLhKNj9bkiLyiamgrEozzZIPm8WL-WjQaeASWtlZjNK1Td60705St03pE5f5t0ylj-tDLZbLxpfIqZ_3OZ-fAAS7TscCqq1-crGwolN-E5JykTU8YXUiBmWF6CxlzFAliuZGaVfLrGvhQ/4kj/GbeUl1zbTuO24U6eh_os5Q/h20/h001.rDQh1vsthteUUiDMYjAd8tRAFcn_a37BXslJ4yJwEpQ" target="_blank" rel="noopener noreferrer nofollow" style="background-color:#2C81E5;border-radius:8px;color:#FFFFFF;display:inline-block;font-family:'Open Sans','Segoe UI','Apple SD Gothic Neo','Lucida Grande','Lucida Sans 
Unicode',sans-serif;font-size:16px;font-weight:normal;line-height:18px;padding:14px 20px;text-decoration:none;"> Read Full Paper </a></td></tr></table></td></tr></table></td></tr><tr><td class="dd" align="center" valign="top" style="padding:20px;"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.amatuKKICSickUKplYJXmCaj1gILtNUrL6iGf9RrElO4oFgGthO_ewXJh08mOaANnbf6PGNnJngjKpU6KjEm07kyypo6I6AiKRCNrUjJx0Tyo44JaupsV-2NSsqaxqInC8SeJBa6rMRG9T74Qa1aXm4IThrd8b4fv-9pLPs4JEFusZHQISqo-xF3A0Pozl4dKhcGKGZohADU0fX9fMazQ1b7A7G0RDciSaFBX7uqbab5U3r-JHl8UXoFLI7DmhPYRo58D33aEukaYEHb-3YCRlBPIwF6vmqL7SFOrCr9J3o/4kj/GbeUl1zbTuO24U6eh_os5Q/h21/h001.jSEL5yzQSPET3Y1TYazEQFlQnZBSYvAnBv8IMoIjnQg" style="text-decoration:none;"><table align="center" width="100%" cellpadding="0" cellspacing="0" border="0" role="none" style="max-width:520px;margin:0 auto;"><tr><td class="p" width="100%" style="padding:2px;border:none;"><table width="100%" cellpadding="0" cellspacing="0" border="0" role="none"><tr><td align="center" valign="top" style="width:100%;"><div style="max-height:0;position:relative;opacity:0.999;width:100%;mso-hide:all;"><div style="display:inline-block;width:100%;padding-top:25%;"><img width="20%" height="auto" loading="lazy" alt="" style="border:0;" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/static_assets/youtube_play_icon.png"/></div></div><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.amatuKKICSickUKplYJXmCaj1gILtNUrL6iGf9RrElO4oFgGthO_ewXJh08mOaANnbf6PGNnJngjKpU6KjEm07kyypo6I6AiKRCNrUjJx0Tyo44JaupsV-2NSsqaxqInC8SeJBa6rMRG9T74Qa1aXm4IThrd8b4fv-9pLPs4JEFusZHQISqo-xF3A0Pozl4dKhcGKGZohADU0fX9fMazQ_95OXGoFTt8zFyUe6kTzvzzAnyBB8CMdXOzQ9tyIk4DP-cX5lKd4aOZzTa2hhal3EKNBEAQnrAM_Ra74o9OVsw/4kj/GbeUl1zbTuO24U6eh_os5Q/h22/h001.KOlW1UVuGoTUNwsc_arLbNqA3d12CgVx9co9VsK7qlI" style="text-decoration:none;"><img src="https://i.ytimg.com/vi/SvIJ-BIAPNI/maxresdefault.jpg" width="480" height="auto" loading="lazy" alt="YouTube video by bycloud" 
style="display:block;height:auto;border:0;outline:none;text-decoration:none;background-color:#000000;width:100%;"/></a></td></tr><tr><td><p style="font-size:12px;font-weight:500;font-style:italic;font-family:Helvetica, Calibri, sans-serif;color: #686a6d; padding-top:0 !important;padding-bottom:6px !important; padding-left:4px !important;"> The Art of Making AI Wrapper: Context Engineering Explained </p></td></tr></table></td></tr></table></a></td></tr></table></td></tr></table></td></tr><tr><td align="center" valign="top"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td><tr><td class="b" align="center" valign="top" bgcolor="#2a2a2a" style="padding:0px 0px 0px 0px;border-style:solid;border-width: 0px 0px 0px 0px;border-color: #2a2a2a;border-bottom-left-radius:10px;border-bottom-right-radius:10px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="center" valign="top" bgcolor="#73ddff" style="padding:12px"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td><span style="padding-left:1px;"></span></td><td align="center" valign="middle" width="75" style="width:75px;"><a href="https://elink4f7.mail.bycloud.ai/ss/c/u001.1muhFWIqieRYpaJ-FbWSCQqcWoV4NNHHr5SkP9THApWUO4S9eWSDBFDMKQ83N4CY1l4kXQTU9YnEEqXRrg_2uhS94rQOKDl60C6UO57Zu1mJCFi_zhfD-a_hnJHdTQ7EUXDUzRIzxOBEtaZIBf69jNKhk7VnP1IUE8ZJPEsmhjXaGU8H5-GcqnEgZU38Qr3CfnPTtshChZ9qeXvST54jkzTT_WPIZU6wTp8Nw2WoMjiuCJlDK2HCetIxFRrYIMnyRI2w5ffBvpMbOqyWc9kkJw/4kj/GbeUl1zbTuO24U6eh_os5Q/h23/h001.eN6I4TfTq6cpNk_V1XUsIaBGdhOnsQA6A6gVDkxGxBE" style="text-decoration:none;"><img width="22" height="22" alt="tw" border="0" style="display:block;max-width:22px;color:Dark" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/static_assets/x_dark.png"/></a></td><td align="center" valign="middle" width="75" style="width:75px;"><a 
href="https://elink4f7.mail.bycloud.ai/ss/c/u001.amatuKKICSickUKplYJXmBoQnQ9VXnB2zTxBG4HeHBi5iti4l06m5fR1UTFq_vFgQaGMmutCjJbuBFU8WHbRj6heToGsiZHlry3dxu5DEimeQbpBAMyhKdSbaWrmIf3btIDXcFX6niTd9QL-ES2rrRq48ZRC4g8w9Bs2KPxEr9x5mhdtKy1xhZmWvSaNZt2cDdi0LNmu2JEuISs4wJu2s6tBMvuftqbTRhIab5VuLfzWyO1zolTeYjGXyBzsk1HQ7f1JmrVjBllnpZ5atf_WNg/4kj/GbeUl1zbTuO24U6eh_os5Q/h24/h001.nEEn0R9G0t_tYT_xW9-IOTIv2R6mnEzysRT6V-LPU_E" style="text-decoration:none;"><img width="22" height="16" alt="yt" border="0" style="display:block;max-width:22px;color:Dark" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/static_assets/youtube_dark.png"/></a></td><td><span style="padding-left:1px;"></span></td></tr></table></td></tr><tr><td height="10" style="line-height:1px;font-size:1px;height:10px;"> </td></tr><tr><td class="w" align="center" valign="top" style="padding:15px 15px 15px 15px;"><table role="none" width="100%" border="0" cellspacing="0" cellpadding="0" align="center"><tr><td align="center" valign="top"><p class="copyright" style="font-family:'Verdana',Geneva,sans-serif;color:#FFFFFF!important;"> © 2025 bycloudai </p><p style="font-family:'Verdana',Geneva,sans-serif;color:#FFFFFF!important;"> 228 Park Ave S, #29976, New York, New York 10003, United States </p></td></tr></table></td></tr></table></td></tr></td></tr></table></td></tr></table></td></tr></table></td></tr></table></div></body></html>