Data ingestion techniques
You can use various methods to ingest data into Db2® Big SQL. The method that you use, the size of the data files, and the file format can all have an impact on query performance.
The following table summarizes the uses, strengths, and limitations of different approaches to ingesting data into Db2 Big SQL.
| Method | Uses | Strengths | Limitations |
|---|---|---|---|
| Adding files to HDFS |
|
|
|
| Syncing the Db2 Big SQL and Hive catalogs |
|
|
|
| Using the INSERT...SELECT statement |
|
|
|
| Using the LOAD HADOOP statement |
|
|
|
| CREATE TABLE...AS |
|
|
|
| Sqoop |
|
|
|
|
Important: Avoid using any data ingestion technique that results in many small files,
because this can have a negative impact on query performance.
|
|||
Best practices:
- Overview of data ingestion techniques
- The following article presents an overview of various ingestion techniques and explains some best practices for considering number of files, file sizes, and placement of files:
- Specific ingestion techniques
- Specific ingestion techniques and best practices are described in the following articles:
- Additional documentation
- See the following Knowledge Center topics: