Spark libraries
You can enhance your Spark capabilities with libraries in Analytics Engine powered by Apache Spark instances.
Analytics Engine powered by Apache Spark provides these libraries:
- The data skipping library can significantly boost the performance of SQL queries by skipping over irrelevant data objects or files based on a summary metadata associated with each object. See Using the data skipping libraries.
- The time series library allows you to perform various key operations on time series data, including segmentation, forecasting, joins, transforms, and reducers. See Time series analysis.
- The spatio-temporal library expands your data science analysis to include location analytics by gathering, manipulating and displaying imagery, GPS, satellite photography and historical data. See Using the geospatio-temporal library.
- Parquet modular encryption protects sensitive information in Parquet files. See Parquet encryption.
Parent topic: Apache Spark