DSS and Spark¶
- Usage of Spark in DSS
- Spark on Kubernetes
- Setting up (without Kubernetes)
- Spark configurations
- Interacting with DSS datasets
- Spark pipelines
- Limitations and attention points
Spark is a general engine for distributed computation. Once Spark integration is setup, DSS will offer settings to choose Spark as a job’s execution engine in various components.