Dataiku Documentation
  • Discussions
    • Setup & Configuration
    • Using Dataiku DSS
    • Plugins & Extending Dataiku DSS
    • General Discussion
    • Job Board
    • Community Resources
    • Product Ideas
  • Knowledge
    • Getting Started
    • Knowledge Base
    • Documentation
  • Academy
    • Quick Start Programs
    • Learning Paths
    • Certifications
    • Course Catalog
    • Academy Discussions
  • Community Programs
    • Upcoming User Events
    • Find a User Group
    • Past Events
    • Community Conundrums
    • Dataiku Neurons
    • Banana Data Podcast
  • What's New
  • User's Guide
  • DSS concepts
  • Connecting to data
  • Exploring your data
  • Schemas, storage types and meanings
  • Data preparation
  • Charts
  • Interactive statistics
  • Machine learning
  • The Flow
  • Visual recipes
  • Recipes based on code
  • Code notebooks
  • MLOps
  • Webapps
  • Code Studios
  • Code reports
  • Dashboards
  • Workspaces
  • Dataiku Applications
  • Working with partitions
  • DSS and SQL
  • DSS and Python
  • DSS and R
  • DSS and Spark
  • Code environments
  • Collaboration
  • Specific Data Processing
  • Time Series
  • Geographic data
  • Text & Natural Language Processing
  • Images
  • Audio
  • Video
  • Automation & Deployment
  • Automation scenarios, metrics, and checks
  • Production deployments and bundles
  • API Node & API Deployer: Real-time APIs
  • Governance
  • APIs
  • Python APIs
  • R API
  • Public REST API
  • Additional APIs
  • Installation & Administration
  • Installing and setting up
  • Elastic AI computation
  • DSS in the cloud
    • DSS in AWS
    • DSS in Azure
    • DSS in GCP
  • DSS and Hadoop
  • Metastore catalog
  • Operating DSS
  • Security
  • User Isolation
  • Other topics
  • Plugins
  • Streaming data
  • Formula language
  • Custom variables expansion
  • Sampling methods
  • Accessibility
  • Troubleshooting
  • Release notes
  • Other Documentation
  • Third-party acknowledgements
Dataiku DSS
You are viewing the documentation for version 11 of DSS.
  • »
  • DSS in the cloud

DSS in the cloudΒΆ

DSS can run fully in the cloud.

When running in the cloud, DSS features advanced integrations with the managed services of the cloud providers, allowing deployment of complex architectures in a fully managed fashion.

DSS has advanced support for:

  • Amazon Web Services (AWS)

  • Microsoft Azure

  • Google Cloud Platform (GCP)

This section documents the various integration points, and provides some reference architectures for fully-managed cloud services

  • DSS in AWS
    • Reference architecture: managed compute on EKS with Glue and Athena
      • Overview
      • Architecture diagram
      • Security
      • Main steps
        • Prepare the instance
        • Setup connectivity to AWS
        • Install DSS
        • Setup container configuration in DSS
        • Setup Spark and Metastore in DSS
        • Setup S3 connections
        • Setup Athena connections
        • Install EKS plugin
        • Create your first cluster
        • Use it
  • DSS in Azure
    • Reference architecture: manage compute on AKS and storage on ADLS gen2
      • Overview
      • Security
      • Main steps
        • Prepare the instance
        • Install DSS
        • Setup containerized execution configuration in DSS
        • Setup Spark and metastore in DSS
        • Setup ADLS gen2 connections
        • Install AKS plugin
        • Create your first cluster
        • Use your cluster
  • DSS in GCP
    • Reference architecture: managed compute on GKE and storage on GCS
      • Overview
      • Security
      • Main steps
        • Prepare the instance
        • Install DSS
        • Setup containerized execution configuration in DSS
        • Setup Spark and metastore in DSS
        • Setup GCS connections
        • Install GKE plugin
        • Create your first cluster
        • Use your cluster
Next Previous

© Copyright 2022, Dataiku

Built with Sphinx using a theme provided by Read the Docs.