Model Based Algorithm in RL

News

29d

DeepCoder delivers top coding performance in efficient 14B open model

DeepCoder-14B competes with frontier models like o3 and o1—and the weights, code, and optimization platform are open source.

12d

30 seconds vs. 3: The d1 reasoning framework that’s slashing AI response times

Researchers from UCLA and Meta AI have introduced d1, a novel framework using reinforcement learning (RL) to significantly enhance the reasoning capabilities of diffusion-based large language models ...

International Monetary Fund2y

AI and Macroeconomic Modeling: Deep Reinforcement Learning in an RBC model

This study seeks to construct a basic reinforcement learning-based AI-macroeconomic simulator. We use a deep RL (DRL) approach (DDPG ... variables or sectors to the model or by incorporating different ...

Tech Xplore on MSN10d

Reinforcement learning boosts reasoning skills in new diffusion-based language model d1

A team of AI researchers at the University of California, Los Angeles, working with a colleague from Meta AI, has introduced d1, a diffusion-large-language-model-based framework that has been improved ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results