Geocoding and reverse geocoding¶
Geocoding (sometimes called Forward Geocoding) is the process of transforming an address into geographic coordinates
Reverse geocoding is the process geographic coordinates into administrative information (country, region, city, …)
It should be noted that geocoding and reverse geocoding are always best-effort activities. It is not always possible to perform either of these activities, data may be incomplete depending on locations. Dataiku is not able to provide any guarantee as to the completeness of correctness of any geocoding-related data.
Geocoding¶
Note
This capability is provided by the “Geocoder plugin, which you need to install. Please see Installing plugins.
This plugin is Not supported
The Geocoder uses online geocoding service providers. Your DSS instance needs to have outgoing Internet access.
You will need an API key for most of these providers. Some providers have some free plans, sometimes with limits, sometimes with various usage policies. Please make sure to review the usage policy of each provider before using it.
Not all providers have the same level of coverage of the world, so you should use providers depending on their coverage.
Providers with “Batch available” usually have significantly better performance
Provider |
Optimal for |
Usage Policy |
Batch available |
---|---|---|---|
ArcGIS |
World |
||
Baidu |
China |
API key |
|
Bing |
World |
API key |
yes |
CanadaPost |
Canada |
API key |
|
FreeGeoIP |
World |
||
Gaode |
China |
API key |
|
Geocoder.ca (Geolytica) |
CA & US |
Rate Limit |
|
GeocodeFarm |
World |
Policy |
|
GeoNames |
World |
Username |
|
GeoOttawa |
Ottawa |
||
Gisgraphy |
World |
API key |
|
World |
Rate Limit, Policy |
||
HERE |
World |
API key |
|
IPInfo |
World |
Rate Limit, Plans |
|
Komoot (OSM powered) |
World |
||
LocationIQ |
World |
API Key |
|
Mapbox |
World |
API key |
|
MapQuest |
World |
API key |
yes |
MaxMind |
World |
||
OpenCage |
World |
API key |
|
OpenStreetMap |
World |
Policy |
|
Tamu |
US |
API key |
|
TGOS |
Taiwan |
||
TomTom |
World |
API key |
|
USCensus |
US |
yes |
|
What3Words |
World |
API key |
|
Yahoo |
World |
||
Yandex |
Russia |
The plugin provides a recipe. You can use this recipe multiple times in a row using different providers, for exemple in case the previous providers failed on some inputs. The recipe will only try to recompute rows for which outputs are not already filled.
Reverse geocoding¶
DSS provides two different reverse geocoding capabilities:
A native one, which reverse geocodes at the city level, and uses bundled data. This reverse geocoder is available as a preparation processor. It does not use any external provider, does not need any API key or payment, and is fast
The ability to call external providers, which require API keys or payments, requires Internet access, and is usually significantly slower, but that provide better resolution (up to the address level)
Bundled-data city-level reverse geocoder¶
Note
This capability is provided by the “Reverse geocoding” plugin, which you need to install. Please see Installing plugins.
This plugin is covered by Tier 2 support
The reverse geocoding prepare processor takes geographic coordinates as input and extracts the different levels of administrative boundary to which it belongs (country, region, city …). The administrative boundaries we use are the ones defined in Open Street Map. The type of administrative boundary for each level depends on the country. For more information please refer to the Open Street Map documentation.
You need a column containing a Geo Point or a Geometry as input.
The processor outputs two columns per administrative level for which you provide a column name: one with the local name of the administrative entity and one with the English name.
Selecting “Output the smallest selected administrative area” will output the shape of the administrative entity. This polygon is encoded using WKT format. It is displayed as a third column to the administrative level for which you provided a column name. In case several levels are selected, only the smallest in size is displayed (for instance if both city and country are selected, it will return the shape of the city).
Online reverse geocoder¶
Note
This capability is provided by the “Geocoder plugin, which you need to install. Please see Installing plugins.
This plugin is Not supported
The Reverse Geocoder uses online reverse geocoding service providers. Your DSS instance needs to have outgoing Internet access.
You will need an API key for most of these providers. Some providers have some free plans, sometimes with limits, sometimes with various usage policies. Please make sure to review the usage policy of each provider before using it.
Not all providers have the same level of coverage of the world, so you should use providers depending on their coverage.
Providers with “Batch available” usually have significantly better performance
Provider |
Optimal for |
Usage Policy |
Batch available |
---|---|---|---|
ArcGIS |
World |
||
Baidu |
China |
API key |
|
Bing |
World |
API key |
yes |
Gaode |
China |
API key |
|
GeocodeFarm |
World |
Policy |
|
Gisgraphy |
World |
API key |
|
World |
Rate Limit, Policy |
||
HERE |
World |
API key |
|
Komoot (OSM powered) |
World |
||
LocationIQ |
World |
API Key |
|
Mapbox |
World |
API key |
|
MapQuest |
World |
API key |
|
OpenCage |
World |
API key |
|
OpenStreetMap |
World |
Policy |
|
USCensus |
US |
||
What3Words |
World |
API key |
|
Yandex |
Russia |
The plugin provides a recipe. You can use this recipe multiple times in a row using different providers, for exemple in case the previous providers failed on some inputs. The recipe will only try to recompute rows for which outputs are not already filled.
Zipcode geocoding¶
Zipcode geocoding provides “Country + zipcode” –> “Geographic coordinates” resolution, at the city-level resolution.
Note
This capability is provided by the “Zipcode geocoding” plugin, which you need to install. Please see Installing plugins.
This plugin is covered by Tier 2 support
It is available as a preparation recipe processor
You need a column containing the country (name or ISO code) and a column containing the zipcode
The processor outputs a Geo Point column