News

Katie Parrott in Working Overtime As AI races ahead, we try to step back from the fray every once in a while. Each quarter, we gather for a "think week” to reflect on our work from the previous ...
Mixture-of-Recursions (MoR) is a new AI architecture that promises to cut LLM inference costs and memory use without ...
Ever since researchers began noticing a slowdown in improvements to large language models using traditional training methods, ...
Multiagent reinforcement learning (MARL) has received increasing attention and been used to solve cooperative multiagent decision-making and learning control tasks. However, the high complexity of the ...
In this research work authors have experimentally validated a blend of Machine Learning and Nonlinear Model Predictive Control (NMPC) framework designed to track the temperature profile in a Batch ...
Warfarin is a commonly prescribed anticoagulant with a narrow therapeutic window, which requires frequent and specialized monitoring. This work aims to develop standardized optimal warfarin dose ...
We use Reinforcement Learning from Verifiable Rewards (RLVR) to train MemAgent, extending the DAPO algorithm to support end-to-end optimization of Agent Workflows with multi-turn context-independent ...