My IBM

What is structured query language (SQL)?

26 June 2025

Authors

Alexandra Jonker

Editorial Content Lead

Tim Mucci

IBM Writer

Gather

What is structured query language?

Structured query language (SQL) is a domain-specific, standardized programming language used to interact with relational database management systems (RDBMS) such as MySQL, SQL Server, IBM Db2, PostgreSQL and Oracle Database.

SQL queries (also known as SQL commands or SQL statements) allow users to easily add, retrieve, update, delete, aggregate and otherwise manage data in a relational database (or SQL database). In these systems, structured data is ordered into rows and columns, which together form tables. Data is typically structured across multiple related tables that can be linked using primary or foreign keys.

Unlike other programming languages, SQL is a declarative language, which means it describes what the user wants the computer to do rather than how to achieve it. This contrasts with imperative or procedural languages (such as Java and COBOL) which require step-by-step instructions.

For example, when users write an SQL query to fetch data, they don't need to outline the steps the database should take to gather that data; they just describe what the result should look like:

SELECT name, email

FROM customers

WHERE city = 'New York' ;

SQL is a fundamental and versatile tool in the modern tech stack, known for its data manipulation capabilities, rapid query processing and strong security features. It also offers compatibility across different systems, scalability to handle growing data needs and robust open source and community support.

The history of SQL

In the 1970s, IBM scientists Donald Chamberlin and Raymond Boyce developed and introduced SQL. It originated from the concept of relational models and was initially called structured English query language (SEQUEL) before being shortened to SQL. It became commercially available in 1979 and has since become the global standard for relational database management systems.

SQL was standardized by the American National Standards Institute (ANSI) in 1986 and the International Organization for Standardization (ISO) in 1987. Despite being a standard, SQL has various dialects, such as T-SQL for Microsoft SQL Server and PL/SQL for Oracle Database. These SQL dialects meet specific system needs while remaining compliant with the core ANSI standard commands such as SELECT, UPDATE, DELETE, INSERT and WHERE.

Industry newsletter

The latest tech news, backed by expert insights

Stay up to date on the most important—and intriguing—industry trends on AI, automation, data and beyond with the Think newsletter. See the IBM Privacy Statement.

Why is SQL important?

Since its development in the 1970s, SQL has become the backbone of modern database systems.

Unlike general-purpose programming languages, SQL is purpose-built for relational databases—and relational databases are, in turn, optimized for SQL. This mutual design renders SQL a highly efficient data management tool.

SQL’s declarative nature makes it accessible even to users with limited programming experience, making it an ideal language for beginners. Its widespread use and integration with other programming languages like Python and Java also make it a valuable skill in broader programming and data environments.

Data analysts, data scientists and database administrators regularly use SQL because it excels at tasks such as data processing, data definition, access control, data sharing, data integration and big data analytics.

In data science, SQL is used to create databases that store large data sets needed for data analysis. Its ability to manipulate and retrieve data from these vast, structured datasets is also crucial in the development of artificial intelligence (AI) and machine learning (ML) applications, which depend on high-quality data for training.

By adhering to ACID properties—atomicity, consistency, isolation and durability—SQL helps ensure reliable transaction processing for critical use cases and sensitive data handling. It also supports more accurate data-driven decision-making, advanced analytics and enhanced business intelligence.

Learn SQL

What are the advantages of using SQL?

SQL offers many advantages, which is why it has remained one of the most widely used and enduring programming languages.

Easy data manipulation

SQL’s simple commands (such as GROUP BY, ORDER BY, GRANT and REVOKE) empower users of all skill levels to work with databases.

Rapid query processing

SQL indexes and query optimization techniques improve the speed of data retrieval, and subsequently, enhance database performance.

Robust data security

SQL databases include security features such as user authentication, access controls and encryption to protect data.

Commonality and compatibility

SQL adheres to ANSI and ISO SQL standards, which help ensure compatibility with various systems and platforms, including cloud environments and big data tools.

Scalability

SQL can effectively manage both small and large databases, adapting to growing data needs without significant performance loss.

Open source support

Many SQL databases are open source and supported by a large, active community that contributes to continuous improvement and problem-solving.

How does a SQL query work?

A relational database organizes data in a tabular format (rows and columns) and facilitates relationships between different tables. For instance, a customer service database might use separate tables for customer information, purchases, product codes and contacts, linked by keys like a unique customer ID.

SQL allows users to write queries (and subqueries) to manipulate this data. These commands run through several software components during the SQL process:

A parser verifies the correctness of SQL statements and converts them into a format that the database can understand, such as tokenized symbols. This step involves syntax analysis and semantic checking. The parser will also help ensure the user is authorized to perform the operation.

Then, a relational engine—also known as a query optimizer—plans the most efficient data retrieval, modification or addition strategies. It does so by evaluating different query execution plans. It writes the plan in bytecode, which is a virtual machine language. This step is crucial for optimizing database performance and resource use.

Finally, a storage engine processes the bytecode, runs the SQL statement and manages physical data storage. It handles the physical representation of data, including file formats and data buffering. It also returns the result to the user or app. This step helps ensure efficient data access and updates on the disk. This linkage often involves relationships, such as one-to-many or many-to-many, established using primary and foreign keys to help ensure data integrity.

Mixture of Experts | 4 July, episode 62

Decoding AI: Weekly News Roundup

Join our world-class panel of engineers, researchers, product leaders and more as they cut through the AI noise to bring you the latest in AI news and insights.

Watch the latest podcast episodes

Key components of SQL systems

Relational database management systems (also called SQL systems) consist of many components, including:

Databases: A digital repository for storing, managing and securing organized collections of data.
Database tables: Data formatted into rows and columns; each contains information on one type of entity.
SQL queries: SQL queries are instructions written in SQL used to manipulate data within a relational database.
SQL constraints: Rules that control data in database columns or tables to maintain data integrity.
Stored procedures: SQL commands that are saved for continued reuse.
Transactions: One or more SQL commands bundled as a single unit of work or operation.
Data types: Rules that define the type of data that can be stored in a column.
Indexes: A database object that speeds up data retrieval by reducing the number of disk accesses needed for a query.
Views: Virtual tables based on SQL queries that simplify complex queries and improve security by restricting access to underlying data.
Security and permissions: Functions to manage user access, while backup and recovery mechanisms protect data against loss or corruption.

Types of SQL commands: DDL, DML, DQL, DCL, and TCL

SQL commands are traditionally divided into the following categories:

Data definition language (DDL)
Data manipulation language (DML)
Data control language (DCL)
Data query language (DQL)
Transaction control language (TCL)

Data definition language (DDL)

Data definition language manages database objects like tables, views and indexes. It defines the structure and organization of the stored data and the relationships among stored data items.

Data manipulation language (DML)

Data manipulation language manages data within databases through operations like INSERT, UPDATE and OUTER JOIN—which add, modify and combine data.

Data control language (DCL)

Data control language controls data access through commands like GRANT (to give permissions) and REVOKE (to remove permissions). It can restrict a user's ability to retrieve, add and modify data.

Data query language (DQL)

Data query language executes data queries to retrieve information, typically using the SELECT command. It can retrieve specific data items or a range of items.

Transaction control language (TCL)

Transaction control language manages transaction changes to help ensure data integrity and supports ROLLBACK and COMMIT operations for undoing or storing changes, respectively. It is used to coordinate data sharing by concurrent users.

What are the most common SQL commands?

SQL databases support various SQL statements for data operations. However, SQL commands can vary depending on the database, which may use its own SQL syntax.

Basic SQL commands include:

SELECT

Retrieves data from one or more tables.

SELECT name, email

FROM customers

WHERE city = 'New York' ;

This statement retrieves the name and email of all customers who live in New York from the customers table.

INSERT

Adds new rows to a table.

INSERT INTO customers (name, email, city)

VALUES ('Jane Doe', 'jane.doe@example.com', 'Los Angeles') ;

This statement adds a new row to the customers table with the name 'Jane Doe', email 'jane.doe@example.com' and city 'Los Angeles.'

UPDATE

Modifies existing data in a table.

UPDATE customers

SET email = 'new.email@example.com'

WHERE name = 'John Doe' ;

This statement updates the email of the customer named 'John Doe' in the customers table to 'new.email@example.com.'

DELETE

Removes rows from a table based on a condition.

DELETE FROM customers

WHERE city = 'Boston' ;

This statement deletes all rows from the customers table where the city is 'Boston.'

CREATE TABLE

Defines a new table and its structure.

CREATE TABLE products (
product_id INT PRIMARY KEY,
name VARCHAR(100),
price DECIMAL(10, 2)
) ;

This statement creates a new table called products with three columns: product_id as an integer primary key, name as a variable character string up to 100 characters and price as a decimal with ten digits and two decimal places.

ALTER TABLE

Modifies the structure of an existing table.

ALTER TABLE customers

ADD COLUMN birthday DATE ;

This statement adds a new column birthday of type DATE to the existing customers table.

DROP TABLE

Deletes a table and all its data.

DROP TABLE old_customers ;

This statement deletes the old_customers table along with all its data.

JOIN

Combines rows from two or more tables based on a related column.

SELECT c.name, p.name AS product_name
FROM customers c
JOIN orders o ON c.customer_id = o.customer_id
JOIN products p ON o.product_id = p.product_id
WHERE c.city = 'New York' ;

The SQL JOIN statement retrieves the names of customers and the names of the products they ordered. It joins the customers, orders and products tables based on the customer_id and product_id, selecting only those customers who live in New York.

SQL vs. NoSQL databases

SQL databases are relational databases, where structured data is stored in rows and tables that are linked in various ways. SQL is the standard language for interacting with these databases.

NoSQL databases (or non-relational databases) emerged in the late 2000s to handle data with less structure. These types of databases (such as MongoDB) offer more flexible data models compared to SQL databases.

Key differences include:

Scalability
Structure
Performance
Use cases
Knowledge and community
Maintenance and management

Scalability

NoSQL databases are horizontally scalable, managing higher traffic by adding more servers. In contrast, SQL databases are traditionally vertically scalable, requiring more powerful hardware to handle increased load.

Structure

SQL databases use a table-based structure ideal for multi-row transactions and complex queries across related data, thanks to robust indexing and joining capabilities. NoSQL offers various structures, such as key-value, document, graph or wide column stores, catering to different needs and allowing for more flexibility with semi-structured or unstructured data.

Performance

SQL databases are optimized for complex queries with strict data consistency, following the ACID principles. NoSQL databases, which follow the BASE principles (basically available, soft state, eventual consistency), provide faster performance for specific types of data but with different consistency guarantees.

Use cases

SQL databases are often chosen for applications requiring complex transactions, consistent data and strict schema adherence, like financial systems, e-commerce platforms or CRM databases. NoSQL is preferred for rapidly changing, large-scale or semi-structured data, such as in social networks, real-time analytics or content management systems.

Knowledge and community

SQL databases have a more extensive range of resources, such as SQL tutorials and community support due to its longer history and widespread adoption. NoSQL often requires less upfront design and can be easier to scale but often requires more custom development for complex querying and data consistency.

Maintenance and management

SQL databases require careful schema design and can be demanding in terms of maintenance for schema changes. A NoSQL DBMS offers easier scalability and adaptability for schema changes without extensive downtime or restructuring.

SQL vs. NoSQL Databases: What’s the difference?

What is SQL injection?

Despite the security strengths of many SQL databases, other enterprise applications can be vulnerable to security issues—such as weak authentication, insecure design and misconfiguration. Due to these vulnerabilities, SQL injection remains a real-world threat to organizations.

SQL injection occurs when hackers manipulate SQL queries to access or corrupt database information. Understanding these vulnerabilities and implementing robust security measures is critical for safeguarding SQL data.

Four steps to better business forecasting with analytics

Use the power of analytics and business intelligence to plan, forecast and shape future outcomes that best benefit your company and customers.

Resources

The hybrid, open data lakehouse for AI

Simplify data access and automate data governance. Discover the power of integrating a data lakehouse strategy into your data architecture, including cost-optimizing your workloads and scaling AI and analytics, with all your data, anywhere.

Data management for AI and analytics

Access our guide to learn how to use the right databases for applications, analytics and generative AI.

Managing data for AI and analytics at scale

Learn how an open data lakehouse approach can provide trustworthy data and faster analytics and AI projects execution.

Gartner® Predicts 2024: How AI will impact analytics users

Gain unique insights into the evolving landscape of ABI solutions, highlighting key findings, assumptions and recommendations for data and analytics leaders.

Increase AI adoption with AI-ready data

Discover why AI-powered data intelligence and data integration are critical to drive structured and unstructured data preparedness and accelerate AI outcomes.

What is structured query language (SQL)?

Authors

What is structured query language?

The history of SQL

The latest tech news, backed by expert insights

Thank you! You are subscribed.

Why is SQL important?

What are the advantages of using SQL?

How does a SQL query work?

Decoding AI: Weekly News Roundup

Key components of SQL systems

Types of SQL commands: DDL, DML, DQL, DCL, and TCL

Data definition language (DDL)

Data manipulation language (DML)

Data control language (DCL)

Data query language (DQL)

Transaction control language (TCL)

What are the most common SQL commands?

SELECT

INSERT

UPDATE

DELETE

CREATE TABLE

ALTER TABLE

DROP TABLE

JOIN

SQL vs. NoSQL databases

Scalability

Structure

Performance

Use cases

Knowledge and community

Maintenance and management

What is SQL injection?

Resources

Related solutions