What is big data?

Big Data is being generated at all times. Every digital process and social media exchange produces it. Systems, sensors and mobile devices transmit it. Much of this data is coming to us in an unstructured form, making it difficult to put into structured tables with rows and columns. To extract insights from this complex data, Big Data projects often rely on cutting edge analytics involving data science and machine learning. Computers running sophisticated algorithms can help enhance the veracity of information by sifting through the noise created by Big Data's massive volume, variety, and velocity.

Volume

Big data - Scale of the data

Scale of the data

The ability to process large amounts of data and what you do with that data.

Variety

Big data - Different forms of data

Different forms of data

Making sense out of unstructured data by trying to capture all of the data that pertains to our decision-making process.

Velocity

Big data - Analysis of streaming data

Analysis of streaming data

The rate at which data arrives at the enterprise and the time that it takes the enterprise to process and understand that data.

Veracity

Big data - Uncertainty of the data

Uncertainty of the data

The quality or trustworthiness of the data. The quality or trustworthiness of the data. Tools that help handle big data’s veracity discard “noise” and transform the data into trustworthy insights.

What is changing in the realm of big data?

Big data is changing the way people within organizations work together. It is creating a culture in which business and IT leaders must join forces to realize value from all data. Insights from big data can enable all employees to make better decisions—deepening customer engagement, optimizing operations, preventing threats and fraud, and capitalizing on new sources of revenue. But escalating demand for insights requires a fundamentally new approach to architecture, tools and practices.

Browse all big data products

IBM BigInsights Logo

IBM BigInsights

IBM BigInsights

An industry standard Hadoop offering that combines the best of open source software with enterprise-grade capabilities. It helps organizations to cost effectively manage and analyze all kinds of data, including semi-structured and unstructured.

IBM Spark Analytics Logo

IBM Analytics for Apache Spark

IBM Analytics for Apache Spark

Increase your analytics agility with the power of open source Apache Spark. Process large data volumes at great speed in a hosted, managed, secure environment.

IBM Cloudant Logo

IBM Cloudant

IBM Cloudant

Give your application uninterrupted data access, offline and online, anywhere in the world, with a fully managed NoSQL database service. Let IBM manage the database layer so you can build more, grow more and sleep more.

IBM Streams Logo

IBM Streams

IBM Streams

Helps to capture and analyze streaming data, make decisions while events are happening. IBM Streams offers a complete solution with a development environment, runtime and analytics toolkits.

IBM dashDB Logo

IBM dashDB

IBM dashDB

Analyze your data where it resides—in the cloud—with a fully managed columnar data warehouse service. Leverage in-database predictive analytics and massively parallel processing (MPP) to do more with your data.

IBM Data Science Experience Logo

IBM Data Science Experience

IBM Data Science Experience

Cloud-based, social workspace that helps data scientists consolidate their use of and collaborate across multiple open source tools such as R and Python.

IBM Compose Logo

IBM Compose

IBM Compose

Run web and mobile apps on fully managed, hand-picked open source databases with an integrated database-as-a-platform service. Gain flexibility and scale without losing cycles to database management.

IBM InfoSphere Big Match Logo

IBM InfoSphere Big Match

IBM InfoSphere Big Match

Helps analyze big volumes of structured and unstructured data to provide complete and accurate customer information—without increasing risk of errors or data loss when moving data from source to source.

IBM BigInsights BigIntegrate Logo

IBM BigInsights BigIntegrate

IBM BigInsights BigIntegrate

A data integration solution that provides connectivity, transformation, and data delivery features that execute on the data nodes of a Hadoop cluster.

IBM BigInsights BigQuality Logo

IBM BigInsights BigQuality

IBM BigInsights BigQuality

Helps ensure information quality and provides the ability to quickly adapt to strategic business changes by stewardship and monitoring of data and application of data quality rules for your Hadoop data.

IBM Information Governance Catalog Logo

IBM Information Governance Catalog

IBM Information Governance Catalog

Provides comprehensive information integration capabilities to help you understand and govern your information.

IBM Informix Logo

IBM Informix

IBM Informix

A secure embeddable database, optimized for OLTP, IoT and is forging new frontiers with its unique ability to seamlessly integrate SQL, NoSQL/JSON, time series and spatial data.

How can you realize the greatest value from big data?

Data Scientists

Data scientist

With connected devices and social media transforming the way people live, work and buy, today’s data is increasingly “born in the cloud.” Capturing the true value of data means acting fast with the latest analytic tools and spending less time managing your infrastructure.

Learn more

Application developer

Application developer

Use powerful, open source database technologies to power your apps—providing flexibility, scalability, and geospatial capabilities in a fully managed service. Make your web and mobile applications more scalable and available to users, wherever they are.

Learn more

IT Architect

Enterprise architect

Modernize and extend your online transaction processing (OLTP) databases and data warehouses to a hybrid cloud architecture. Business users can gain valuable insights easily and more cost-effectively with the most complete and integrated set of data and analytics services.

Learn more

Big data resources

 

The Data Warehouse Evolved: A Foundation for Analytical Excellence

Explore a Best-in-Class approach to data management and how companies are prioritizing data technologies to drive growth and efficiency.

 

Forrester: Big Data Fabric Drives Innovation and Growth

Learn how next-generation big data management enables self-service and agility

 

Understanding big data beyond the hype

Stay on top of all the changes including, Hadoop-based analytics, streaming analytics, warehousing (including BigSQL), data asset discovery, integration, and governance