Mistral AI and the Allen Institute for AI today released new large language models that they claim are among the most advanced in their respective categories. Mistral’s model is called Mistral Small 3 ...
On January 20, 2025, Chinese AI startup DeepSeek unveiled R1, an open-source large language model (LLM) that is redefining industry expectations. Designed to offer performance on par with proprietary ...
Large language models (LLMs) are transforming how enterprises operate, but their "black box" nature often leaves enterprises grappling with unpredictability. Addressing this critical challenge, ...
For the first time since GPT-2 in 2019, OpenAI is releasing new open-weight large language models. It's a major milestone for a company that has increasingly been accused of forgoing its original ...
Some of the world’s most widely used open-weight generative AI (GenAI) services are profoundly susceptible to so-called “multi-turn” prompt injection or jailbreaking cyber attacks, in which a ...
As demand for private AI infrastructure accelerates, LLM.co introduces a streamlined hub for discovering and deploying open-source language ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Cory Benfield discusses the evolution of ...
The future of large language models is likely to be open source, according to Marc Benioff, co-founder and longstanding CEO of Salesforce.… Despite the hype around LLM builders — OpenAI sucked in $40 ...
I’ve been writing about the democratic future of large language models (LLMs). Will this tech turn out to be an inherently centralized, authoritarian technology like nuclear power, or a more ...
Huawei’s Computing Systems Lab in Zurich has introduced a new open-source quantization method for large language models (LLMs) aimed at reducing memory demands without sacrificing output quality.