15 April 2025
Explainer
What Is Data Curation?
Data curation is the process of creating and managing datasets so that people can find, access, use and reuse data as necessary.
Data curation

11 April 2025
Explainer
What Is Real-Time Data Integration?
Real-time data integration involves capturing and processing data from multiple sources as soon as it's available, and then immediately integrating it into a target system.
Data integration

09 April 2025
Insights
The real cost of delayed data in an always-on world
Companies that embrace real-time streaming data integration are outpacing their peers, unlocking faster insights, optimizing operations and delivering superior customer experiences.
Data integration

03 April 2025
Explainer
What Is Metadata Management?
Metadata management refers to organizing, optimizing and using metadata to improve the accessibility and quality of an organization’s data.
Metadata management

21 March 2025
Explainer
What Is a Data Product?
A data product is a reusable asset that contains ready-to-use, processed data for various business use cases.
Data product

20 March 2025
Explainer
What Is a Primary Key?
A primary key is a column or columns in a database table with values that uniquely identify each row or record.
Primary key

11 March 2025
Explainer
What Is Data Processing?
Data processing is the conversion of raw data into usable information through structured steps such as data collection, preparation, analysis and storage.
Data processing

06 March 2025
Explainer
What Is a Directed Acyclic Graph (DAG)?
A directed acyclic graph (DAG) is a type of graph in which nodes are linked by one-way connections that do not form any cycles.
Directed acyclic graph

28 February 2025
Data Intelligence
Learn how data intelligence can help organizations get the most out of their data
Data management

24 February 2025
Explainer
What Is the Modern Data Stack?
Modern data stack (MDS) refers to integrated, cloud-based tools and technologies that enable the collection, ingestion, storage, cleaning, transformation, analysis and governance of data.
Modern data stack

21 February 2025
Explainer
What Is an AI Data Center?
An AI data center is a facility that houses the specific IT infrastructure needed to train, deploy and deliver AI applications and services.
Data centers

12 February 2025
Explainer
What are Data Silos?
Data silos are isolated collections of data that make it hard to share data between different departments, systems and business units.
Data silos

10 February 2025
Explainer
What Is Data Strategy?
A data strategy is a detailed plan for leveraging data to improve decision-making, optimize business processes and achieve business goals.
Data strategy

07 February 2025
Explainer
Structured vs. Unstructured Data: What’s the Difference?
A look into structured and unstructured data, their key differences, definitions, use cases and more.
Structured data

06 February 2025
Explainer
What Is a Resilient Distributed Dataset (RDD)?
A Resilient Distributed Dataset (RDD) is an immutable, fault-tolerant collection of elements distributed across multiple nodes for parallel processing.
RDDs

05 February 2025
Tutorial
Using the watsonx.ai Time Series Forecasting API to predict energy demand
In this tutorial, you will learn how to perform time series forecasting using the watsonx.ai Time Series Forecasting API and SDK to predict energy demand. The notebook uses a pre-trained time series foundation model for multivariate forecasting tasks and walks through a variety of features available with Time Series Foundation Models.
Time series forecasting

31 January 2025
Explainer
What is Delta Lake?
Delta Lake is an open-source data storage format that combines Apache Parquet data files with a robust metadata log.
Data lake

24 January 2025
Insights
Is B2Bi or MFT better for your company?
For IT and business leaders, understanding the subtle differences between B2Bi and MFT is the key to achieving operational efficiency.
Data exchange

21 January 2025
News
Why maintaining data cleanliness is essential to cybersecurity
Data cleaning

20 January 2025
Insights
Unlock new capabilities with a vertical data platform
Data products—powered by vertical data platforms—help overcome these challenges, unlocking new opportunities and enabling data-driven business models.
Data platform

16 January 2025
Explainer
What Is a Data Lake?
A data lake is a low-cost data storage environment designed to handle massive amounts of raw data in any format.
Data lake

13 January 2025
Explainer
What Is a Data Cloud?
A data cloud is a data management system that unifies various data sources so they can be used more effectively by organizations.
Data cloud

06 January 2025
Explainer
What is Milvus?
Milvus is an open source vector database known for its scalable storage for vector embeddings and high-performance similarity searches of vector data.
Vector databases

30 December 2024
Explainer
What is Streaming Data?
Streaming data is the continuous flow of real-time data from various sources.
Streaming data

20 December 2024
Insights
How to craft a comprehensive data cleanliness policy
As data cleanliness becomes more critical for your organization's security and efficiency, these five steps help keep your data accurate and safe.
Data cleaning

13 December 2024
Explainer
What is Data Intelligence?
Data intelligence combines core data management principles with AI and other tools to understand how enterprise data is produced and used.
Data intelligence

12 December 2024
Explainer
What Is Data Synchronization?
Data synchronization, or data sync, is the continuous process of keeping data records accurate and uniform across network systems and devices.
Data synchronization

10 December 2024
Explainer
What is a Dataset?
A dataset is a collection of data typically organized in tables, arrays or other formats for easy retrieval and analysis.
Data sets

06 December 2024
Insights
Synthetic Data Generation
Synthetic data is artificially generated information that can supplement or even replace real-world data when training or testing artificial intelligence (AI) models. To help enterprises get the most out of artificial data, here are 8 best practices for synthetic data generation.
Synthetic data

03 December 2024
Insights
3 reasons why you need data observability for your streaming data pipelines
The rise of fast commerce, characterized by rapid order fulfillment and delivery, has further amplified the need for businesses to harness the power of streaming data.
Data observability
