Dataiku Documentation
  • Academy
    • Join the Academy
      Benefit from guided learning opportunities →
      • Quick Starts
      • Learning Paths
      • New Features
      • Certifications
      • Academy Discussions
  • Community
      • Explore the Community
        Discover, share, and contribute →
      • Learn About Us
      • Ask A Question
      • What's New?
      • Discuss Dataiku
      • Using Dataiku
      • Setup And Configuration
      • General Discussion
      • Plugins & Extending Dataiku
      • Product Ideas
      • Programs
      • Frontrunner Awards
      • Dataiku Neurons
      • Community Resources
      • Community Feedback
      • User Research
  • Documentation
    • Reference Documentation
      Comprehensive specifications of Dataiku →
      • User's Guide
      • Specific Data Processing
      • Automation & Deployment
      • APIs
      • Installation & Administration
      • Other Topics
  • Knowledge
    • Knowledge Base
      Articles and tutorials on Dataiku features →
      • User Guide
      • Admin Guide
      • Dataiku Solutions
      • Dataiku Cloud
  • Developer
    • Developer Guide
      Tutorials and articles for developers and coder users →
      • Getting Started
      • Concepts and Examples
      • Tutorials
      • API Reference
  • User's Guide
  • DSS concepts
  • Connecting to data
  • Exploring data
  • Charts
  • The Flow
  • Data preparation
  • Visual recipes
  • Code recipes
  • Schemas, storage types and meanings
  • Generative AI and LLM Mesh
  • Machine learning
  • MLOps
  • Interactive statistics
  • Code notebooks
  • Code Studios
  • Webapps
  • Collaboration
  • AI Assistants
  • Dashboards
  • Workspaces
  • Stories
  • Data Catalog
  • Dataiku Applications
  • Working with partitions
  • DSS and SQL
  • DSS and Python
  • DSS and R
  • DSS and Spark
  • Code environments
  • Specific Data Processing
  • Time Series
  • Geographic data
  • Text & Natural Language Processing
  • Images
  • Audio
  • Video
  • Automation & Deployment
  • Metrics, checks and Data Quality
  • Automation scenarios
  • Production deployments and bundles
  • API Node & API Deployer: Real-time APIs
  • Governance
  • APIs
  • Python APIs
  • R API
  • Public REST API
  • Additional APIs
  • Installation & Administration
  • Installing and setting up
  • Elastic AI computation
  • DSS in the cloud
    • DSS in AWS
    • DSS in Azure
    • DSS in GCP
  • DSS and Hadoop
  • Metastore catalog
  • Operating DSS
  • Security
  • User Isolation
  • Email Notifications
  • Other topics
  • Plugins
  • Streaming data
  • Formula language
  • Custom variables expansion
  • Sampling methods
  • Accessibility
  • Troubleshooting
  • Release notes
  • Other Documentation
  • Third-party acknowledgements
Dataiku DSS
You are viewing the documentation for version 13 of DSS.
  • »
  • DSS in the cloud Open page in a new tab

DSS in the cloud¶

DSS can run fully in the cloud.

When running in the cloud, DSS features advanced integrations with the managed services of the cloud providers, allowing deployment of complex architectures in a fully managed fashion.

DSS has advanced support for:

  • Amazon Web Services (AWS)

  • Microsoft Azure

  • Google Cloud Platform (GCP)

This section documents the various integration points, and provides some reference architectures for fully-managed cloud services

  • DSS in AWS
    • Reference architecture: managed compute on EKS with Glue and Athena
      • Overview
      • Architecture diagram
      • Security
      • Main steps
        • Prepare the instance
        • Setup connectivity to AWS
        • Install DSS
        • Setup container configuration in DSS
        • Setup Spark and Metastore in DSS
        • Setup S3 connections
        • Setup Athena connections
        • Install EKS plugin
        • Create your first cluster
        • Use it
  • DSS in Azure
    • Reference architecture: manage compute on AKS and storage on ADLS gen2
      • Overview
      • Security
      • Main steps
        • Prepare the instance
        • Install DSS
        • Setup containerized execution configuration in DSS
        • Setup Spark and metastore in DSS
        • Setup ADLS gen2 connections
        • Install AKS plugin
        • Create your first cluster
        • Use your cluster
  • DSS in GCP
    • Reference architecture: managed compute on GKE and storage on GCS
      • Overview
      • Security
      • Main steps
        • Prepare the instance
        • Install DSS
        • Setup containerized execution configuration in DSS
        • Setup Spark and metastore in DSS
        • Setup GCS connections
        • Install GKE plugin
        • Create your first cluster
        • Use your cluster
Next Previous

© Copyright 2025, Dataiku

Built with Sphinx using a theme provided by Read the Docs.