Large language models (LLMs) have dominated the AI conversation, fueled by the popularity of ChatGPT and similar gen AI apps. However, small language models (SLMs) are on the rise. More compact and efficient, SLMs use less memory and processing power. This makes them well-suited for resource-constrained environments.
Their ability to produce unique content and insights from vast datasets has captivated the world and spurred new tools and apps, further cementing their cultural impact, especially in the enterprise. With Dr. Juan Bernabe Moreno, Director of IBM Research® Europe for UK and Ireland, we’ll explore how SLMs can benefit enterprise AI adoption.
Generative artificial intelligence (gen AI) holds significant productivity potential for enterprises. However, when implementing AI models, bigger isn’t always better. There are 2 major LLM issues preventing enterprise adoption.
First, many LLMs are general-purpose, limiting their value. “Average” company models are ineffective because no “average” company exists. Effective AI models must be built, tuned and deployed to specific organizational needs.
Second, many proprietary LLMs are ‘black boxes’—a closed model in which only the company that owns it can see the components of—lacking data transparency and hindering tuning with enterprise data, where AI’s true value lies. This leaves enterprises responsible for model performance without insight or control.
These issues prevent trust and understanding of model safety. Without effectively choosing gen AI solutions to meet industry, legal and regulatory requirements, companies can’t fully use gen AI’s power.
For successful gen AI, enterprises need 3 things:
To meet these needs and maximize AI value, enterprises are turning to SLMs and discovering the power of small. SLMs offer a compelling alternative to general-purpose LLMs.
SLMs have received far less attention and fanfare, but they offer a compelling option for organizations of all sizes. Their energy efficiency, data transparency and strong performance—often matching or exceeding larger models—unlock responsible gen AI adoption without hindering innovation.
Generally, anything under 30 billion parameters is considered an SLM. Key advantages include lower costs, reduced energy consumption and improved data transparency and integrity. A new generation of smaller models, such as IBM® Granite™, built on cleaned, filtered datasets for specific tasks, reduce risks such as bias and inappropriate output while increasing data visibility.
This trustworthy base model enables confident integration of proprietary data, unlocking AI’s true value. IBM provides an intellectual property (IP) indemnity for all Granite models further boosting confidence in merging their data with the models.
IBM believes open-source base models empower organizations to create specialized, data-infused models. To support this, IBM open sourced its Granite family of customizable SLMs, trained on transparent, filtered datasets.
Combining a small Granite model with enterprise data can achieve task-specific performance rivaling larger models at a fraction of the cost. Early proofs-of-concept show IBM Granite models costing significantly less (between 3 and 23 times less) than large frontier models, while outperforming or matching similarly sized competitors on key benchmarks.
Also, new techniques such as InstructLab—introduced by IBM® and RedHat® in May 2024—simplify enterprise data infusion into LLMs. InstructLab enables enterprises to customize AI models using far less human-generated information and computing resources than traditional retraining.
SLMs, including Granite models, are already making an impact. Global sports institutions use Granite models, tuned with their own domain data, to enhance fan experiences with AI-generated commentary. Internally, IBM uses Granite models to power its human resources (HR) service platform, AskHR. By using natural language prompts, IBMers can access HR services in one place, saving time for both employees and HR professionals.
Vast LLMs are not the only way to benefit from gen AI. Smaller, more accessible, specialized models offer needed efficiency, trust, flexibility and performance at a lower cost, financially and environmentally.
We surveyed 2,000 organizations about their AI initiatives to discover what’s working, what’s not and how you can get ahead.
IBM® Granite™ is our family of open, performant and trusted AI models tailored for business and optimized to scale your AI applications. Explore language, code, time series and guardrail options.
Access our full catalog of over 100 online courses by purchasing an individual or multi-user subscription today, enabling you to expand your skills across a range of our products at a low price.
Led by top IBM thought leaders, the curriculum is designed to help business leaders gain the knowledge needed to prioritize the AI investments that can drive growth.
IBM web domains
ibm.com, ibm.org, ibm-zcouncil.com, insights-on-business.com, jazz.net, mobilebusinessinsights.com, promontory.com, proveit.com, ptech.org, s81c.com, securityintelligence.com, skillsbuild.org, softlayer.com, storagecommunity.org, think-exchange.com, thoughtsoncloud.com, alphaevents.webcasts.com, ibm-cloud.github.io, ibmbigdatahub.com, bluemix.net, mybluemix.net, ibm.net, ibmcloud.com, galasa.dev, blueworkslive.com, swiss-quantum.ch, blueworkslive.com, cloudant.com, ibm.ie, ibm.fr, ibm.com.br, ibm.co, ibm.ca, community.watsonanalytics.com, datapower.com, skills.yourlearning.ibm.com, bluewolf.com, carbondesignsystem.com, openliberty.io