News

Apple recently introduced its open-source DCLM-7B model, showcasing the potential of data curation in enhancing model performance. However, DCLM-7B performs poorly against Microsoft's Phi-3.
Apple has just released an AI model that, rather than generating code from left to right, does it out of order and all at ...
Cavil-Qwen3-4B is an open-source Large Language Model (LLM) designed by SUSE to automate legal compliance within the ...
The OpenAI Evals platform includes a sizable open-source collection of difficult evaluations, which may be used to test many aspects of LLM performance. These evaluations are adaptable to particular ...
This fully open-source model is available in two versions—Base and Chat—and achieves the highest MOF classification, “open science.” With a 32k token context size and features like grouped-query ...
LiteLLM is an open-source project that tackles this fragmentation head-on by providing a unified interface (and gateway) to call more than 100 LLM APIs using a single, consistent format.
MiMo-7B is Xiaomi's first open-source AI model focused on reasoning and code. With only 7B parameters, it matches larger LLMs in performance.
Nvidia introduced its new NVLM 1.0 family in a recently released white paper, spearheaded by the 72-billion-parameter NVLM-D-72B model. “We introduce NVLM 1.0, a family of ...
DeepSeek's new DeepSeek-V3 model is not only open source; the company also claims it was trained with only a fraction of the effort required by competing models while performing significantly better.
On January 20, 2025, Chinese AI startup DeepSeek unveiled R1, an open-source large language model (LLM) that is redefining industry expectations.
Chart 1: Development timeline of key open- and closed-source LLMs, 2019-2025. LLM business model characteristics. Chart 2: Characteristics of open- and closed-source LLMs. Hybrid model trends. Low ...