Migration operations

Migrating the base port

To change the base port of an existing Data Science Studio (DSS) instance, edit the installation configuration file and regenerate the DSS configuration. In the commands below, DATADIR is the path to the DSS data directory.

  • Stop DSS

    DATADIR/bin/dss stop
    
  • Edit the DATADIR/install.ini installation configuration file

    [server]
    port = NEW_BASE_PORT
    
  • Regenerate DSS configuration

    DATADIR/bin/dssadmin regenerate-config
    
  • Restart DSS

    DATADIR/bin/dss start
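
The edit to install.ini in the steps above can also be scripted. The following is a minimal sketch, not a DSS tool: set_base_port is a hypothetical helper name, and it assumes a port entry already exists in the [server] section, as in the example above. It rewrites that entry only and leaves any port key in other sections untouched.

```shell
#!/bin/sh
# Hypothetical helper (not part of DSS): set "port = ..." in the [server]
# section of an install.ini file. Assumes the port line already exists.
set_base_port() {
    ini_file=$1
    new_port=$2
    tmp_file=$(mktemp) || return 1
    if awk -v port="$new_port" '
        # Track whether the current line belongs to the [server] section.
        /^\[/ { in_server = ($0 ~ /^\[server\]/) }
        in_server && /^[ \t]*port[ \t]*=/ { print "port = " port; next }
        { print }
    ' "$ini_file" > "$tmp_file"; then
        # Write back through cat so the file keeps its owner and permissions.
        cat "$tmp_file" > "$ini_file"
        status=$?
    else
        status=1
    fi
    rm -f "$tmp_file"
    return $status
}

# Usage, with DSS stopped (DATADIR and the port number are placeholders):
#   set_base_port DATADIR/install.ini 12000
```

The configuration still has to be regenerated with dssadmin regenerate-config before DSS is restarted.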
    

Migrating the installation directory

To change the installation directory of an existing Data Science Studio instance, move the installed kit and rerun the installer in “upgrade” mode:

  • Stop DSS

    DATADIR/bin/dss stop
    
  • Move the installed kit to its new location (or unpack the .tar.gz distribution archive to a new location)

    mv OLD_DIR/dataiku-dss-VERSION NEW_DIR/
    # or
    cd NEW_DIR
    tar xf /PATH/TO/dataiku-dss-VERSION.tar.gz
    
  • Run the installer in upgrade mode

    NEW_DIR/dataiku-dss-VERSION/installer.sh -d DATADIR -u
    
  • Restart DSS

    DATADIR/bin/dss start
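
The sequence above can be wrapped in a small script so that the commands can be reviewed before anything is changed. This is a sketch under stated assumptions: run, DRY_RUN and migrate_install_dir are hypothetical names, not part of DSS, and only the "move the kit" variant is covered. By default the commands are printed rather than executed.

```shell
#!/bin/sh
# Sketch only: run(), DRY_RUN and migrate_install_dir() are hypothetical
# helpers, not part of DSS. With DRY_RUN unset or set to 1, commands are
# printed instead of executed.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Arguments: data directory, current kit directory, new parent directory.
migrate_install_dir() {
    datadir=$1
    old_kit=$2
    new_dir=$3
    kit_name=$(basename "$old_kit")
    # Each step runs only if the previous one succeeded.
    run "$datadir/bin/dss" stop &&
    run mv "$old_kit" "$new_dir/" &&
    run "$new_dir/$kit_name/installer.sh" -d "$datadir" -u &&
    run "$datadir/bin/dss" start
}

# Usage (paths are placeholders); set DRY_RUN=0 to actually run the steps:
#   migrate_install_dir DATADIR OLD_DIR/dataiku-dss-VERSION NEW_DIR
```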
    

Migrating the data directory

To change the path of the data directory of an existing Data Science Studio instance, move the directory and rerun the installer in “upgrade” mode. The Python virtual environment has to be rebuilt after the migration, because Python virtual environments embed their installation path in various places. The steps below include this rebuild.

  • Stop DSS

    DATADIR/bin/dss stop
    
  • Save the list of locally-installed Python packages

    DATADIR/bin/pip freeze -l >local-python-packages.txt
    
  • Move the data directory to its new location

    mv DATADIR NEWDATADIR
    
  • Remove the Python virtualenv, keeping a backup copy

    mv NEWDATADIR/pyenv NEWDATADIR/pyenv.backup
    
  • Run the installer in upgrade mode. This recreates the Python virtualenv

    dataiku-dss-VERSION/installer.sh -d NEWDATADIR -u
    
  • Reinstall locally-installed Python packages (if any)

    NEWDATADIR/bin/pip install -r local-python-packages.txt
    
  • If you are in Multi-user security mode:

    • As root, edit the file NEWDATADIR/security/security-config.ini and, in section dirs, update the path set for dss_data_dir
    • Still as root, update the paths in the sudoers file /etc/sudoers.d/dataiku-dss-<dss_user>-<dss_instance_id>. Alternatively, run NEWDATADIR/bin/dssadmin install-impersonation DSSUSER, which should update the file for you, then review the file to make sure the paths were updated correctly.
  • Restart DSS

    NEWDATADIR/bin/dss start
    
  • When everything is considered stable, remove the backup

    rm -rf NEWDATADIR/pyenv.backup
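
Before removing the backup, it may be worth checking that every package saved in local-python-packages.txt is present again in the rebuilt virtualenv. The following sketch compares two pip freeze outputs. missing_packages is a hypothetical helper name, and the comparison is a plain line-by-line match, so a package reinstalled at a different version is reported as missing.

```shell
#!/bin/sh
# Hypothetical helper (not part of DSS): print the lines of the saved
# package list that do not appear in the current package list.
missing_packages() {
    saved_list=$1
    current_list=$2
    saved_sorted=$(mktemp) || return 1
    current_sorted=$(mktemp) || { rm -f "$saved_sorted"; return 1; }
    # comm requires both inputs to be sorted.
    sort "$saved_list" > "$saved_sorted"
    sort "$current_list" > "$current_sorted"
    # -23 keeps only the lines unique to the first file.
    comm -23 "$saved_sorted" "$current_sorted"
    rm -f "$saved_sorted" "$current_sorted"
}

# Usage (NEWDATADIR is a placeholder; after.txt is an arbitrary file name):
#   NEWDATADIR/bin/pip freeze -l > after.txt
#   missing_packages local-python-packages.txt after.txt
```

An empty output means that every saved line has an identical counterpart in the new environment.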
    

Note

If you could not save the list of locally-installed Python packages before migration (step 2 above), you can reconstruct it by listing the package installation directory of the backup virtualenv. The python2.7 component of the path depends on the Python version of the virtualenv.

ls -l NEWDATADIR/pyenv.backup/lib/python2.7/site-packages
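
The listing above can be turned into a rough requirements file by reading the package metadata directory names. This is a sketch under assumptions: list_backup_packages is a hypothetical helper, and it assumes the packages left the usual NAME-VERSION.dist-info or NAME-VERSION[-pyX.Y].egg-info entries in site-packages. Names appear as they are stored on disk (typically with underscores), and every package with metadata in the directory is listed, so the output should be reviewed before it is passed to pip install -r.

```shell
#!/bin/sh
# Hypothetical helper (not part of DSS): derive "name==version" lines from
# the metadata entries found in a site-packages directory.
list_backup_packages() {
    site_packages=$1
    for meta in "$site_packages"/*.dist-info "$site_packages"/*.egg-info; do
        # Skip the literal pattern when nothing matches.
        [ -e "$meta" ] || continue
        base=$(basename "$meta")
        base=${base%.dist-info}
        base=${base%.egg-info}
        # Drop a trailing "-pyX.Y" tag that egg-info names may carry.
        base=$(printf '%s\n' "$base" | sed 's/-py[0-9][0-9.]*$//')
        case $base in
            # Text before the first hyphen is the name, the rest the version.
            *-*) printf '%s==%s\n' "${base%%-*}" "${base#*-}" ;;
            # No version in the entry name: print the name alone.
            *)   printf '%s\n' "$base" ;;
        esac
    done | sort
}

# Usage (NEWDATADIR is a placeholder):
#   list_backup_packages NEWDATADIR/pyenv.backup/lib/python2.7/site-packages
```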