Active loading indicator
1 – 30 of 300 items
[]
300 results
15 April 2025
Explainer
What Is Data Curation?

Data curation is the process of creating and managing datasets so that people can find, access, use and reuse data as necessary.

Data curation
11 April 2025
Explainer
What Is Real-Time Data Integration?

Real-time data integration involves capturing and processing data from multiple sources as soon as it's available, and then immediately integrating it into a target system.

Data integration
09 April 2025
Insights
The real cost of delayed data in an always-on world

Companies that embrace real-time streaming data integration are outpacing their peers, unlocking faster insights, optimizing operations and delivering superior customer experiences.

Data integration
03 April 2025
Explainer
What Is Metadata Management?

Metadata management refers to organizing, optimizing and using metadata to improve the accessibility and quality of an organization’s data.

Metadata management
21 March 2025
Explainer
What Is a Data Product?

A data product is a reusable asset that contains ready-to-use, processed data for various business use cases.

Data product
20 March 2025
Explainer
What Is a Primary Key?

A primary key is a column or columns in a database table with values that uniquely identify each row or record.

Primary key
11 March 2025
Explainer
What Is Data Processing?

Data processing is the conversion of raw data into usable information through structured steps such as data collection, preparation, analysis and storage.

Data processing
06 March 2025
Explainer
What Is a Directed Acyclic Graph (DAG)?

A directed acyclic graph (DAG) is a type of graph in which nodes are linked by one-way connections that do not form any cycles.

Directed acyclic graph
28 February 2025
Data Intelligence

Learn how data intelligence can help organizations get the most out of their data

Data management
24 February 2025
Explainer
What Is the Modern Data Stack?

Modern data stack (MDS) refers to integrated, cloud-based tools and technologies that enable the collection, ingestion, storage, cleaning, transformation, analysis and governance of data.

Modern data stack
21 February 2025
Explainer
What Is an AI Data Center?

An AI data center is a facility that houses the specific IT infrastructure needed to train, deploy and deliver AI applications and services.

Data centers
12 February 2025
Explainer
What are Data Silos?

Data silos are isolated collections of data that make it hard to share data between different departments, systems and business units.

Data silos
10 February 2025
Explainer
What Is Data Strategy?

A data strategy is a detailed plan for leveraging data to improve decision-making, optimize business processes and achieve business goals.

Data strategy
07 February 2025
Explainer
Structured vs. Unstructured Data: What’s the Difference?

A look into structured and unstructured data, their key differences, definitions, use cases and more.

Structured data
06 February 2025
Explainer
What Is a Resilient Distributed Dataset (RDD)?

A Resilient Distributed Dataset (RDD) is an immutable, fault-tolerant collection of elements distributed across multiple nodes for parallel processing.

RDDs
05 February 2025
Tutorial
Using the watsonx.ai Time Series Forecasting API to predict energy demand

In this tutorial, you will discover how to perform timeseries forecasting using the watsonx.ai Timeseries Forecasting API and SDK to predict energy demand. This notebook demonstrates the usage of a pre-trained time series foundation model for multivariate forecasting tasks and demonstrates a variety of features available using Time Series Foundation Models.

Time series forecasting
31 January 2025
Explainer
What is Delta Lake?

Delta Lake is an open-source data storage format that combines Apache Parquet data files with a robust metadata log.

Data lake
24 January 2025
Insights
Is B2Bi or MFT better for your company

For IT and business leaders, understanding the subtle differences between B2Bi and MFT is the key to achieving operational efficiency.

Data exchange
21 January 2025
News
Why maintaining data cleanliness is essential to cybersecurity

Why maintaining data cleanliness is essential to cybersecurity | IBM

Data cleaning
20 January 2025
Insights
Unlock new capabilities with a vertical data platform

Data products—powered by vertical data platforms—help overcome these challenges, unlocking new opportunities and enabling data-driven business models.

Data platform
16 January 2025
Explainer
What Is a Data Lake?

A data lake is a low-cost data storage environment designed to handle massive amounts of raw data in any format.

Data lake
13 January 2025
Explainer
What Is a Data Cloud?

A data cloud is a data management system that unifies various data sources so they can be used more effectively by organizations.

Data cloud
06 January 2025
Explainer
What is Milvus?

Milvus is an open source vector database known for its scalable storage for vector embeddings and high-performance similarity searches of vector data.

Vector databases
30 December 2024
Explainer
What is Streaming Data?

Streaming data is the continuous flow of real-time data from various sources.

Streaming data
20 December 2024
Insights
How to craft a comprehensive data cleanliness policy

As data cleanliness becomes more critical for your organization's security and efficiency, these five steps help keep your data accurate and safe.

Data cleaning
13 December 2024
Explainer
What is Data Intelligence?

Data intelligence combines core data management principles with AI and other tools to understand how enterprise data is produced and used.

Data intelligence
12 December 2024
Explainer
What Is Data synchronization?

Data synchronization, or data sync, is the continuous process of keeping data records accurate and uniform across network systems and devices.

Data synchronization
10 December 2024
Explainer
What is a Dataset?

A dataset is a collection of data typically organized in tables, arrays or other formats for easy retrieval and analysis.

Data sets
06 December 2024
Insights
Synthetic Data Generation

Synthetic data is artificially generated information that can supplement or even replace real-world data when training or testing artificial intelligence (AI) models. To help enterprises get the most out of artificial data, here are 8 best practices for synthetic data generation.

Synthetic data
03 December 2024
Insights
3 reasons why you need data observability for your streaming data pipelines

The rise of fast commerce, characterized by rapid order fulfillment and delivery, has further amplified the need for businesses to harness the power of streaming data.

Data observability
1 – 30 of 300 items