PI System / PIWebAPI server¶
You can create datasets based on data located in an OSIsoft PI System.
This capability is provided by the “pi-system” plugin, which you need to install. Please see Installing plugins.
The PI System servers you want to access must have PIWebAPI enabled.
This plugin is Not supported
Setup the authentication preset¶
Choose an authentification method : Basic or NTLM
Define a default server. Note that this setting can be overwritten later on by Dataiku users.
Select Users can override server URL to allow Dataiku users to use another server than the default one defined for this preset.
Select Users can disable SSL checks to allow Dataiku users to disable the SSL checks for this preset.
If your PI Server WebApi use a custom SSL certificate, set the path to a it. The certificate file and its containing directory must be readable by all system users of the Dataiku instance.
Setup authentication per user¶
Go to your Dataiku profile page > Credentials > Name of the preset. Click on the corresponding edit button and enter your PI System username / password.
Attribute search Dataset¶
Select the server and database you want to search attributes on. Attributes can be searched based on their name or description. Wildcards (“*”, “?”) can be used in the search box.
Two types of datasets can be produced:
A list of assets paths, which can be used later on to download the actual assets metrics (using the Asset metrics downloader recipe)
A list of assets and metrics:
Event frames search Dataset¶
Select a server and a database. Event frames can be search based on:
their name. Wildcards (“*”, “?”) can be used in the search box.
start / end time, using the available time formats
Two types of datasets can be produced:
A list of Event frames, which can be used later on to download the linked assets and metrics (using the Event frames downloader Recipe recipe)
A list of assets and metrics linked to the event frame. To do so:
Select Retrieve metrics
Select the type of data expected
Select one or several events from in the ‘Event frame to retrieve’ box. If none are selected, all events matching the search will be retrieved.
PIWebAPI Toolbox Dataset¶
This dataset allows a quick and direct access to a database, an element, an attribute or a tag, provided its full path is known. With knowledge of PIWebAPI, partial paths can be used for exploration. If the preset used has writing credentials to the attribute, the obtained dataset can also be used to write information back from Dataiku to PI System.
In the ‘Object path / Tag’ box, set either:
the path to a database (\\server_name\database)
the path to an element (\\server_name\database\element_1 .. \element_n)
the path to an attribute (\\server_name\database\element_1 .. \element_n|attribute)
an attribute tag (\\server_name\tag.name)
Select the type of data expected. Complete the missing element (Start / end times, interval, sync time…) accordingly.
Assets metrics downloader Recipe¶
The recipe outputs a dataset containing all the values for each of the attributes present in the input. Select the input dataset’s column containing the list of paths.
Event frames downloader Recipe¶
The recipe outputs the dataset with all the assets involved in the event, as well as all the metrics for the duration of the event.
Select the column containing the webids and the type of data expected.
Transpose & Synchronize Recipe¶
Data retrieved from PI server may contain timelines of several attributes stacked one after another (also called “long format”). To perform some analyses, it can be necessary to convert from long format to a timestamp/path array. Also, timestamps are probably slightly out of sync with one another, and in order to compare the values, they first have to be temporaly aligned. Both these steps can be done in one go using the Transpose & Synchronize recipe.
Select the columns containing the timestamps, the assets paths, and the metric’s values. Type in the full path of the asset to use as a time reference.
The recipe outputs the dataset with:
a timestamp column which contains the timestamps of the time reference asset
a column per asset, containing the last available measure at the time indicated by the reference timestamp
Activating the “Show advanced parameters” option let you:
Disable the SSL check, if this is allowed by the preset’s configuration
Point to a .pem file containing an SSL certificate. Note that this file and its containing directory must be visible to all accounts on the Dataiku server.
Speed up values retrieval by activating the batch mode. The default value for the batches size is 500 rows. Increasing this number will speed up transfer but also increase the risk of getting a ratio error from the PI-System server.
Interpolated returns the interpolated values across the specified time range with a chosen sampling interval. A start time anchor can also be set.
Plot returns the values across a specified time range
Recorded returns the compressed values for the selected time range.
Value returns the current value.
Summary returns the time weighted data summary across the last day.
The maximum number of points that can be retrieved for a given attribute is limited by PiWebAPI. The default limit is usually set to 1000 elements. Once this number of rows is reached, the search time span has to be reduced or split across several search operations.
Date and time can be entered using two different formats:
Absolute dates are using the ISO 8601 format, and should follow this pattern: YYYY-MM-DDThh:mm:ssZ
Relative times and dates are also possible using the OSIsoft’s Time String format