Categories: LATEST NEWS

IBM Expands Granite Model Family with New Multi-Modal and Reasoning AI Built for the Enterprise

ARMONK, N.Y., Feb. 26, 2025 /PRNewswire/ — IBM (NYSE: IBM) today debuted the next generation of its Granite large language model (LLM) family, Granite 3.2, in a continued effort to deliver small, efficient, practical enterprise AI for real-world impact.

All Granite 3.2 models are available under the permissive Apache 2.0 license on Hugging Face. Select models are available today on IBM watsonx.ai, Ollama, Replicate, and LM Studio, and expected soon in RHEL AI 1.5 – bringing advanced capabilities to businesses and the open-source community. Highlights include:

A new vision language model (VLM) for document understanding tasks which demonstrates performance that matches or exceeds that of significantly larger models – Llama 3.2 11B and Pixtral 12B – on the essential enterprise benchmarks DocVQA, ChartQA, AI2D and OCRBench¹. In addition to robust training data, IBM used its own open-source Docling toolkit to process 85 million PDFs and generated 26 million synthetic question-answer pairs to enhance the VLM’s ability to handle complex document-heavy workflows.
Chain of thought capabilities for enhanced reasoning in the 3.2 2B and 8B models, with the ability to switch reasoning on or off to help optimize efficiency. With this capability, the 8B model achieves double-digit improvements from its predecessor in instruction-following benchmarks like ArenaHard and Alpaca Eval without degradation of safety or performance elsewhere². Furthermore, with the use of novel inference scaling methods, the Granite 3.2 8B model can be calibrated to rival the performance of much larger models like Claude 3.5 Sonnet or GPT-4o on math reasoning benchmarks such as AIME2024 and MATH500.³
Slimmed-down size options for Granite Guardian safety models that maintain performance of previous Granite 3.1 Guardian models at 30% reduction in size. The 3.2 models also introduce a new feature called verbalized confidence, which offers more nuanced risk assessment that acknowledges ambiguity in safety monitoring.

IBM’s strategy to deliver smaller, specialized AI models for enterprises continues to demonstrate efficacy in testing, with the Granite 3.1 8B model recently yielding high marks on accuracy in the Salesforce LLM Benchmark for CRM.

The Granite model family is supported by a robust ecosystem of partners, including leading software companies embedding the LLMs into their technologies.

“At CrushBank, we’ve seen first-hand how IBM’s open, efficient AI models deliver real value for enterprise AI – offering the right balance of performance, cost-effectiveness, and scalability,” said David Tan, CTO, CrushBank. “Granite 3.2 takes it further with new reasoning capabilities, and we’re excited to explore them in building new agentic solutions.”

Granite 3.2 is an important step in the evolution of IBM’s portfolio and strategy to deliver small, practical AI for enterprises. While chain of thought approaches for reasoning are powerful, they require substantial compute power that is not necessary for every task. That is why IBM has introduced the ability to turn chain of thought on or off programmatically. For simpler tasks, the model can operate without reasoning to reduce unnecessary compute overhead. Additionally, other reasoning techniques like inference scaling have shown that the Granite 3.2 8B model can match or exceed the performance of much larger models on standard math reasoning benchmarks. Evolving methods like inference scaling remains a key area of focus for IBM’s research teams.⁴

Alongside Granite 3.2 instruct, vision, and guardrail models, IBM is releasing the next generation of its TinyTimeMixers (TTM) models (sub 10M parameters), with capabilities for longer-term forecasting up to two years into the future. These make for powerful tools in long-term trend analysis, including finance and economics trends, supply chain demand forecasting and seasonal inventory planning in retail.

“The next era of AI is about efficiency, integration, and real-world impact – where enterprises can achieve powerful outcomes without excessive spend on compute,” said Sriram Raghavan, VP, IBM AI Research. “IBM’s latest Granite developments focus on open solutions demonstrate another step forward in making AI more accessible, cost-effective, and valuable for modern enterprises.”

To learn more about Granite 3.2, read this technical article.

Liat

Next Tria to launch five new product families powered by Qualcomm at Embedded World »

Previous « IBM Completes Acquisition of HashiCorp, Creates Comprehensive, End-to-End Hybrid Cloud Platform

Published by

Liat

11 months ago

Quantum Machines to Establish Flagship Hub at the Illinois Quantum and Microelectronics Park

New collaboration will establish a quantum-control–enabled center at the IQMP to accelerate and scale fault-tolerant…

9 hours ago

LATEST NEWS

SENAI raises $6.2M to launch real-time intelligence for threats hiding in online video content

With a seed round led by 10D Ventures, SENAI emerges from stealth to help government…

9 hours ago

LATEST NEWS

Dassault Systèmes and NVIDIA Partner to Build Industrial AI Platform Powering Virtual Twins

Shared industrial AI architecture combines Virtual Twins and AI infrastructure deployable at scale. Science-validated world…

9 hours ago

NEW PRODUCTS

New Power Module Enhances AI Data Center Power Density and Efficiency

Microchip’s MCPF1525 power module with PMBus™ delivers 25A DC-DC power, stackable up to 200A The…

2 days ago

LATEST NEWS

Datarails Launches Spend Control to Give CFOs Full Visibility on Contracts and Eliminate Zombie Subscriptions

New AI-powered platform – the first with full ERP integration – includes an AI agent…

2 days ago

LATEST NEWS

AccuLine reports 94% sensitivity in clinical trial of its 4-minute cardiac diagnostic system

The study validated the CORA system’s ability to rule out coronary artery disease with a…

4 days ago

IBM Expands Granite Model Family with New Multi-Modal and Reasoning AI Built for the Enterprise

Related Post

Recent Posts

Quantum Machines to Establish Flagship Hub at the Illinois Quantum and Microelectronics Park

SENAI raises $6.2M to launch real-time intelligence for threats hiding in online video content

Dassault Systèmes and NVIDIA Partner to Build Industrial AI Platform Powering Virtual Twins

New Power Module Enhances AI Data Center Power Density and Efficiency

Datarails Launches Spend Control to Give CFOs Full Visibility on Contracts and Eliminate Zombie Subscriptions

AccuLine reports 94% sensitivity in clinical trial of its 4-minute cardiac diagnostic system