OpenSearch is an open source search and analytics engine used to index, query and analyze data from a wide range of data sources.
Built on Apache Lucene and originally derived from Elasticsearch—another search and analytics engine—OpenSearch provides a scalable and distributed architecture for real-time search, observability, log analytics and security analytics use cases.
OpenSearch includes OpenSearch Dashboards for data visualization and application monitoring. It also features a broad ecosystem of plugins, application programming interfaces (APIs) and clients that support analytics workflows across modern data environments.
Because it is developed as an open source project with a community-driven roadmap, organizations can use OpenSearch without licensing restrictions or vendor lock-in. Its compatibility with earlier versions of Elasticsearch—along with its extensible plugin framework—allows teams to adopt OpenSearch as a flexible analytics engine for operational workloads, machine learning pipelines and search applications.
Industry newsletter
Get curated insights on the most important—and intriguing—AI news. Subscribe to our weekly Think newsletter. See the IBM Privacy Statement.
Your subscription will be delivered in English. You will find an unsubscribe link in every newsletter. You can manage your subscriptions or unsubscribe here. Refer to our IBM Privacy Statement for more information.
Today’s organizations generate significant volumes of data that can be invaluable, but only if the data is indexed, searchable and available in real time. OpenSearch delivers this functionality through an open source search architecture designed for scale, cost efficiency and interoperability.
In practice, OpenSearch offers:
Companies gain full visibility into OpenSearch’s codebase and roadmap, allowing them to customize the platform to meet internal requirements.
OpenSearch maintains API and query syntax compatibility with open source Elasticsearch, meaning organizations can adopt or modernize workloads without extensive rewrites.
OpenSearch can ingest logs, metrics and traces at scale, powering the operational dashboards used for troubleshooting and analysis.
With built-in authentication and access control, teams can apply search capabilities across security workloads.
As open source software, OpenSearch can be deployed on-premises, across cloud providers or through managed service offerings.
OpenSearch started as a community response to licensing changes for Elasticsearch and Kibana, a popular visualization layer. Earlier versions of Elasticsearch were released under the Apache 2.0 license, but subsequent releases adopted the Server Side Public License (SSPL) and Elastic License. These licenses restricted open source reuse, creating challenges for organizations that relied on freely deployable and redistributable search software.
To preserve an open search ecosystem, Amazon Web Services (AWS) forked (that is, created an independent copy of) the last Apache 2.0 versions of Elasticsearch and Kibana, creating the OpenSearch Project. The project introduced new features and enhancements under an open governance model, and expanded compatibility with Elasticsearch APIs and client libraries to simplify migration.
Since then, the OpenSearch Project has evolved independently. It features a community-driven roadmap, contributions from multiple providers and a growing ecosystem of plugins hosted on GitHub. While it remains compatible with many Elasticsearch patterns, OpenSearch has expanded its feature set with plugins for vector search, anomaly detection and advanced observability tools.
While both projects share a common origin, their paths have diverged. Elasticsearch continues under SSPL and Elastic License with a proprietary feature development strategy. OpenSearch, by contrast, remains Apache 2.0–licensed, prioritizing openness, extensibility and operational visibility. As a result, organizations choosing between the two now evaluate not just features, but also governance models, licensing terms and long-term ecosystem direction.
Compatibility continues to be an important bridge between the projects: OpenSearch still supports many Elasticsearch APIs, query patterns and client libraries from earlier versions, helping teams migrate with minimal refactoring. It also preserves similar repository structures and index formats, maintaining familiarity for users transitioning from Elasticsearch.
OpenSearch is built on a distributed architecture designed for scale and real-time performance. Its core components include clusters, nodes, indices, shards and documents—all working together to store and retrieve data efficiently.
Nodes are servers or containerized instances that perform indexing, querying and storage operations. Common node types include:
A cluster is a collection of one or more nodes that work together to manage data and execute queries. Clusters provide redundancy and load balancing so that node failures do not affect overall performance. Each cluster maintains metadata about indices, shards and routing information.
An index is a logical namespace similar to a relational database table. It contains mappings that define the structure of JSON documents and references to the shards that store those documents. The term “index” is also used as a verb to describe the act of populating an index with data.
Documents are JSON objects that represent individual records. Put simply, it’s the data being stored and searched for. When indexed, fields within each document are analyzed, tokenized and stored in inverted indices.
Shards are the fundamental storage units in OpenSearch where documents live. Each index consists of primary shards and optional replica shards.
Because each shard is a standalone Lucene instance (a self-contained search engine library), OpenSearch distributes shards across nodes to parallelize search operations and scale performance.
So, how does this all come together? When a document is indexed, OpenSearch analyzes the content and applies text analyzers and tokenizers. After processing, it writes the terms into the appropriate shard.
Indexing is handled by data nodes and can be distributed across the cluster for speed and reliability. Queries are then submitted to a coordinating node, which identifies the shards containing relevant data, forwards the query to those shards and aggregates the results.
Think of it as a restaurant kitchen with different stations. Indexing is like prepping ingredients and sending them to the right station so it’s ready when the order comes in. When a query arrives, the coordinating node acts like the expediter—calling out what’s needed, gathering each station’s contribution and delivering one finished plate.
OpenSearch includes built-in features for search, analytics and observability. Plugins and extensions expand functionality, allowing teams to tailor the platform for specialized workloads.
While not exhaustive, these popular extensions enable advanced analytics, machine learning (ML) and observability scenarios:
Organizations that prefer a managed experience can also use Amazon OpenSearch Service, which automates scaling, backups, node replacement and maintenance for OpenSearch clusters on AWS.
OpenSearch Dashboards is the visualization and analytics interface for OpenSearch. It provides an interactive environment for exploring indexed data, building visualizations and creating operational dashboards used across observability, security analytics and application monitoring workflows. For instance, teams can leverage Dashboards to visualize trends in metrics and investigate anomalies in near real time.
OpenSearch Dashboards supports the creation of charts, tables, maps, notebooks and custom panels. It also includes features designed to streamline analysis. Notebooks allow users to combine visualizations and text into a single narrative, while operational panels organize observability visualizations created with Piped Processing Language into a unified display.
Because OpenSearch Dashboards shares a user interface (UI) heritage with Kibana, many data teams find the workflow familiar. However, it is developed under its own roadmap and includes capabilities that reflect the broader OpenSearch feature set.
OpenSearch supports a wide range of use cases across industries, including:
Teams index logs from applications, infrastructure and cloud services to analyze performance issues and troubleshoot outages. OpenSearch supports high-volume ingest and real-time analytics, which makes it suitable for distributed production systems, such as a multinational e-commerce site.
With support for metrics, logs and traces, OpenSearch provides an integrated observability platform. Trace Analytics visualizes service interactions, while application analytics correlates telemetry to understand system behavior and pinpoint latency or failures. Dashboards and PPL queries allow teams to investigate issues quickly and create reusable operational views.
OpenSearch’s anomaly detection and ML Commons algorithms enable organizations to apply search and analytics techniques across security operations. Teams use it to detect unusual patterns in authentication logs or application behavior, as well as trigger notifications when conditions or thresholds are met.
Organizations use OpenSearch as the search engine behind websites, product catalogs and enterprise content systems. Full-text search, autocomplete, phrase matching and vector search support a range of user experience and recommendation use cases.
OpenSearch Dashboards provides interactive visualizations, reporting and notebooks that help teams explore data, monitor trends, track KPIs and share insights with stakeholders.
With ML Commons, teams can run model-driven operations inside OpenSearch, such as clustering, classification and forecasting. These capabilities support use cases like fraud detection, demand prediction, customer segmentation and enrichment of downstream data pipelines.
Scale analytics and AI with all your data, wherever it resides, through an open, hybrid and governed data store.
Design a data strategy that eliminates data silos, reduces complexity, and improves data quality for exceptional customer and employee experiences.
Successfully scale AI with the right strategy, data, security and governance.