InfoSum Developer API (2.0)

InfoSum Support: support@infosum.com Terms of Service

This page contains the documentation for the InfoSum Developer API. The methods allow you to try making requests and inspect the input and output formats.

Official Documentation

Accounts

Service for mapping users to their display name and companies.

Get a company's display name.

Authorizations:

API_Keys

query Parameters

company_id

string

The ID of the company to look up the display name for.

Responses

Response samples

Content type

application/json

{"company_display_name": "string"
}
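
As a minimal sketch of how the `company_id` query parameter and the API key might be attached to a request, the following builds a request object without sending it. The base URL, the endpoint path, and the header carrying the key are all hypothetical placeholders; this page does not state them, so check the OpenAPI specification for the real values.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical values: this page does not state the base URL, the endpoint
# path, or the name of the header that carries the API key.
BASE_URL = "https://api.example.invalid"
COMPANY_NAME_PATH = "/accounts/company-display-name"
API_KEY_HEADER = "Authorization"


def build_company_name_request(company_id: str, api_key: str) -> Request:
    """Build (but do not send) a GET request with company_id as a query parameter."""
    query = urlencode({"company_id": company_id})
    request = Request(f"{BASE_URL}{COMPANY_NAME_PATH}?{query}", method="GET")
    request.add_header(API_KEY_HEADER, api_key)
    request.add_header("Accept", "application/json")
    return request
```

The returned object can be passed to `urllib.request.urlopen` once the placeholders are replaced with real values.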

Get a user's account details.

Authorizations:

API_Keys

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"company": {"display_name": "string",
"id": "string"
},
"display_name": "string",
"id": "string"
}

List users' display name and company name.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
filter	string The filter to restrict the list of returned users.

Responses

Response samples

Content type

application/json

{"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
},
"users": [{"company_id": "string",
"company_name": "string",
"user_display_name": "string",
"user_id": "string"
}
]
}
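
The `pagination.page`, `pagination.per_page`, and `next_page` fields suggest a simple paging loop. The sketch below assumes that a `next_page` of 0 (or a missing value) marks the last page, which this page does not confirm. `fetch_page` is a hypothetical callable that you supply; it performs the HTTP call and returns the decoded JSON body.

```python
from typing import Any, Callable, Dict, Iterator

Page = Dict[str, Any]


def iter_users(
    fetch_page: Callable[[int, int], Page], per_page: int = 100
) -> Iterator[Dict[str, Any]]:
    """Yield every user across pages, following pagination.next_page."""
    # The documented maximum for pagination.per_page is 500.
    if not 1 <= per_page <= 500:
        raise ValueError("per_page must be between 1 and 500")
    page = 1  # The documented default first page.
    while True:
        body = fetch_page(page, per_page)
        yield from body.get("users", [])
        next_page = body.get("pagination", {}).get("next_page", 0)
        # Assumption: 0 or a missing next_page means there are no more pages.
        if not next_page:
            break
        page = next_page
```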

Connections

Service for estimating intersections between datasets to establish if they are connected by common keys.

Calculate match rate

Calculates the match rate between two PAIR enabled datasets.

Authorizations:

API_Keys

Request Body schema: application/json
required

CalculateMatchRateRequest is the input to the CalculateMatchRate method.

advertiser_dataset_id	string AdvertiserDatasetId is the ID of the dataset belonging to the advertiser.
publisher_dataset_id	string PublisherDatasetId is the ID of the dataset belonging to the publisher.

Responses

Request samples

Payload

Content type

application/json

{"advertiser_dataset_id": "string",
"publisher_dataset_id": "string"
}

Response samples

Content type

application/json

{"match_rate_percentage": "string"
}
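
Because `match_rate_percentage` is returned as a string, a client will likely want to convert it before comparing values. This is a small sketch that assumes the string holds a plain decimal number such as "42.5"; the exact format is not stated here.

```python
import json


def build_match_rate_payload(advertiser_dataset_id: str, publisher_dataset_id: str) -> str:
    """Serialize the request body for the match rate calculation."""
    return json.dumps(
        {
            "advertiser_dataset_id": advertiser_dataset_id,
            "publisher_dataset_id": publisher_dataset_id,
        }
    )


def parse_match_rate(response_body: str) -> float:
    """Read match_rate_percentage from the JSON response as a float."""
    # Assumption: the value is a plain decimal string, for example "42.5".
    return float(json.loads(response_body)["match_rate_percentage"])
```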

Queries

Service for executing queries against datasets in the InfoSum platform.

InfoSum Support Portal Article

Create a PAIR enabled activation query

Create a PAIR enabled activation query.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreatePairQueryRequest is the input to the CreatePairQuery method.

collaborator_dataset_id	string CollaboratorDatasetId is the ID of the dataset you are collaborating with.
column_identifier	object ColumnIdentifier contains details of the column to be used as the identifier.
dataset_id	string DatasetId is the ID of the dataset you want the query to be run against.
query_name	string QueryName is the name of the query.

Responses

Request samples

Payload

Content type

application/json

{"collaborator_dataset_id": "string",
"column_identifier": {"column_name": "string",
"column_type": "PAIR_COLUMN_TYPE_INVALID"
},
"dataset_id": "string",
"query_name": "string"
}

Response samples

Content type

application/json

{"query_id": "string"
}

Create new private path collaboration

Create a new private path collaboration between the given datasets.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreatePrivatePathRequest is the input to the CreatePrivatePath method.

dataset_ids

Array of strings

DatasetIds is the list of datasets allowed in the collaboration.

Responses

Request samples

Payload

Content type

application/json

{"dataset_ids": ["string"
]
}

Response samples

Content type

application/json

{"collaboration_id": "string"
}

Get details of an existing private path collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

CollaborationId is the unique ID generated for the private path collaboration.

Responses

Response samples

Content type

application/json

{"private_path_datasets": [{"dataset_id": "string",
"use_time": "2019-08-24T14:15:22Z"
}
]
}

Reports

Service for generating and managing data insight reports.

List insight reports.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string The ordering for returned insight reports.
filter	string A string containing logic that can specify a restricted set of insight reports to be returned, e.g. `name eq 'Test' or name co 'Acme'`.
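
Filter strings such as `name eq 'Test' or name co 'Acme'` can be assembled from small helpers rather than concatenated by hand. The sketch below only covers the `eq` and `co` operators and the `or` connective that appear in the example above. It rejects values containing a single quote because the escaping rules are not described on this page.

```python
def condition(field: str, operator: str, value: str) -> str:
    """Format one filter condition, e.g. name eq 'Test'."""
    # Only the operators shown in the documented examples are accepted here.
    if operator not in ("eq", "co"):
        raise ValueError(f"unsupported operator: {operator}")
    # Assumption: quote escaping is undocumented, so quoted values are refused.
    if "'" in value:
        raise ValueError("values containing single quotes are not supported")
    return f"{field} {operator} '{value}'"


def any_of(*conditions: str) -> str:
    """Join conditions with 'or', as in the documented example."""
    return " or ".join(conditions)
```

For example, `any_of(condition("name", "eq", "Test"), condition("name", "co", "Acme"))` produces the filter string shown in the parameter description.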

Responses

Response samples

Content type

application/json

{"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
},
"reports": [{"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"owner_id": "string",
"progress": {"completion_percentage": 0.1
},
"report_id": "string",
"report_name": "string",
"report_type": "string",
"status": "NO_STATUS"
}
]
}

Create a new insight report.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreateRequest is the Create rpc input.

baseline	object Baseline defines the dataset/audience to be used as the baseline data.
baseline_filter	string An IQL filter to define how to filter the baseline.
categories	Array of strings The list of category names to include in the insight report.
name	string Name is the user-given name to assign to an insight report.
seed	object Seed defines the dataset/audience to be used as the seed data.
seed_filter	string An IQL filter to define how to filter the seed.

Responses

Request samples

Payload

Content type

application/json

{"baseline": {"dataset_id": "string"
},
"baseline_filter": "string",
"categories": ["string"
],
"name": "string",
"seed": {"dataset_id": "string"
},
"seed_filter": "string"
}

Response samples

Content type

application/json

{"report_id": "string"
}

Delete an insight report.

Authorizations:

API_Keys

path Parameters

report_id

required

string

The uniquely identifying ID of an insight report.

Responses

Response samples

Content type

application/json

{ }

Get details of an insight report.

Authorizations:

API_Keys

path Parameters

report_id

required

string

ReportID uniquely specifies an insight report.

Responses

Response samples

Content type

application/json

{"baseline": {"dataset_id": "string"
},
"baseline_filter": "string",
"categories": ["string"
],
"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"owner_id": "string",
"report_id": "string",
"report_name": "string",
"seed": {"dataset_id": "string"
},
"seed_filter": "string",
"status": "NO_STATUS"
}

Get a cross category result for a completed report.

Authorizations:

API_Keys

path Parameters

report_id

required

string

Report ID uniquely specifies an insight report.

query Parameters

category_1	string The name of the first category to get the cross category result for.
category_2	string The name of the second category to get the cross category result for.

Responses

Response samples

Content type

application/json

{"result": {"baseline_background": "string",
"category_name_1": "string",
"category_name_2": "string",
"intersection_background": "string",
"values": [{"baseline_foreground": "string",
"category_value_1": "string",
"category_value_2": "string",
"index": 0,
"intersection_foreground": "string"
}
]
}
}
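
The `values` array in a cross category result is easier to query once it is keyed by the pair of category values. The following sketch builds such a lookup from the response shape shown above; the function name is illustrative only.

```python
from typing import Any, Dict, Tuple


def index_by_value_pair(response: Dict[str, Any]) -> Dict[Tuple[str, str], Any]:
    """Map (category_value_1, category_value_2) to the row's index field."""
    return {
        (row["category_value_1"], row["category_value_2"]): row["index"]
        for row in response["result"]["values"]
    }
```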

List the single category results of a completed insight report.

Authorizations:

API_Keys

path Parameters

report_id

required

string

Report ID uniquely specifies an insight report.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string The ordering for returned insight reports.
filter	string A string containing logic that can specify a restricted set of insight report results to be returned, e.g. `category_name eq 'Category1' or category_name co 'string'`.

Responses

Response samples

Content type

application/json

{"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
},
"results": [{"baseline_background": "string",
"category_name": "string",
"intersection_background": "string",
"values": [{"baseline_foreground": "string",
"category_value": "string",
"index": 0,
"intersection_foreground": "string"
}
]
}
]
}

Get the status of an insight report.

Authorizations:

API_Keys

path Parameters

report_id

required

string

ReportID uniquely specifies an insight report.

Responses

Response samples

Content type

application/json

{"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"progress": {"completion_percentage": 0.1
},
"status": "NO_STATUS"
}
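
A client will typically poll this endpoint until the report finishes. The sketch below keeps that loop generic: `fetch_status` is a hypothetical callable that returns the decoded status response, and `is_finished` is a predicate you supply, since the status values other than `NO_STATUS` and the scale of `completion_percentage` are not listed on this page.

```python
import time
from typing import Any, Callable, Dict

Status = Dict[str, Any]


def wait_for_report(
    fetch_status: Callable[[], Status],
    is_finished: Callable[[Status], bool],
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Status:
    """Poll fetch_status until is_finished accepts the response."""
    for attempt in range(max_attempts):
        status = fetch_status()
        if is_finished(status):
            return status
        # Do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError("report did not finish within the allowed attempts")
```

Passing `sleep` as a parameter keeps the function testable without real delays.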

Automation Configs

Service for managing configurations for scheduling and automating actions in the InfoSum platform.

List Automation Configs

Lists configs belonging to your company subject to the provided filter.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string Default: "updated_date_time DESC" Comma separated list of sorting parameters and ordering in the form of `{sort_column} {ASC/DESC}`. Supported sort fields are: `id`, `config_id`, `executed_by`, `started_date_time`, `last_updated_date_time`, `current_step`, `current_step_status`, `cloud_vault_id`.
filter	string Filter string as per the docs. Supported filter fields are: `id`, `config_id`, `executed_by`, `started_date_time`, `last_updated_date_time`, `current_step`, `current_step_status`, `cloud_vault_id`.

Responses

Response samples

Content type

application/json

{"configs": [{"automation_steps": [{"import": {"config_id": "string"
},
"normalization": {"config_id": "string"
},
"publish": {"config_id": "string",
"dataset_id": "string"
},
"recordset_creation": {"config_id": "string"
}
}
],
"cloud_vault_id": "string",
"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"cron_timer": "0 12 * * *",
"description": "string",
"enabled": true,
"end_date_time": "2019-11-22T22:30:00Z",
"id": "string",
"name": "string",
"start_date_time": "2019-11-22T22:30:00Z",
"timezone_identifier": "Europe/London",
"updated_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create Automation Config

Creates and validates an Automation Config based on the provided list of ordered tasks.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreateConfigRequest is the information provided when creating an Automation Config.

cloud_vault_id	string The ID of the Cloud Vault to be used for reading and writing data during Automation Execution.
cron_timer	string The execution schedule of the Automation in UNIX CRON format as described in https://man7.org/linux/man-pages/man5/crontab.5.html. The schedule is executed relative to the supplied `timezone_identifier` for the Automation.
description	string The user-friendly description of the Automation Config.
enabled	boolean The 'enabled' flag allows Automation Execution triggers to be paused and resumed without adjusting the rest of the config or schedule.
end_date_time	string <date-time> Optional date-time to cease triggering Automation Executions. Format is RFC3339 which defaults to UTC. Offsets relative to Zulu time must be provided to convey local times. This parameter is unaffected by `timezone_identifier`. An empty value indicates that the schedule should never cease.
name	string The user-friendly name of the Automation Config. Valid characters are alphanumeric, '_', '-' and spaces.
start_date_time	string <date-time> Optional date-time to begin triggering Automation Executions based on the CRON schedule. Format is RFC3339 which defaults to UTC. Offsets relative to Zulu time must be provided to convey local times. This parameter is unaffected by `timezone_identifier`. An empty value indicates that the schedule should begin as soon as possible.
steps	Array of objects (Automation_Configs.AutomationStep) The sequence of tasks which constitute an Execution of this Automation Config.
timezone_identifier	string The Timezone Identifier to schedule Automation Executions relative to using the `cron_timer`. This parameter does not affect the start or end times for scheduling. The format is from the IANA TZ database: https://en.wikipedia.org/wiki/List_of_tz_database_time_zones. Be aware that scheduling near the lost or gained hour in a timezone which implements daylight savings time (DST) can lead to schedule disruption during transitions.
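
Two of the constraints above lend themselves to client-side checks before a request is sent: the allowed characters in `name`, and the requirement that local times in `start_date_time` and `end_date_time` carry an explicit offset. The sketch below assumes "alphanumeric" means ASCII letters and digits, and it chooses to reject timestamps without an offset; both are illustrative choices, not documented server behavior.

```python
import re
from datetime import datetime, timezone

# Assumption: "alphanumeric" is read as ASCII letters and digits.
_NAME_PATTERN = re.compile(r"[A-Za-z0-9_\- ]+")


def is_valid_config_name(name: str) -> bool:
    """Check that name uses only alphanumerics, '_', '-' and spaces."""
    return _NAME_PATTERN.fullmatch(name) is not None


def to_utc_rfc3339(local_timestamp: str) -> str:
    """Convert an offset-bearing timestamp to its UTC (Zulu) form."""
    parsed = datetime.fromisoformat(local_timestamp)
    # Illustrative choice: refuse naive timestamps rather than guess a zone.
    if parsed.tzinfo is None:
        raise ValueError("an explicit UTC offset is required to convey a local time")
    return parsed.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
```

With these helpers, `to_utc_rfc3339("2019-11-22T18:30:00-04:00")` yields `2019-11-22T22:30:00Z`, the same instant expressed in UTC.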

Responses

Request samples

Payload

Content type

application/json

{"cloud_vault_id": "string",
"cron_timer": "0 12 * * *",
"description": "string",
"enabled": true,
"end_date_time": "2019-11-23T18:30:00-04:00",
"name": "string",
"start_date_time": "2019-11-22T18:30:00-04:00",
"steps": [{"import": {"config_id": "string"
},
"normalization": {"config_id": "string"
},
"publish": {"config_id": "string",
"dataset_id": "string"
},
"recordset_creation": {"config_id": "string"
}
}
],
"timezone_identifier": "Europe/London"
}

Response samples

Content type

application/json

{"config": {"automation_steps": [{"import": {"config_id": "string"
},
"normalization": {"config_id": "string"
},
"publish": {"config_id": "string",
"dataset_id": "string"
},
"recordset_creation": {"config_id": "string"
}
}
],
"cloud_vault_id": "string",
"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"cron_timer": "0 12 * * *",
"description": "string",
"enabled": true,
"end_date_time": "2019-11-22T22:30:00Z",
"id": "string",
"name": "string",
"start_date_time": "2019-11-22T22:30:00Z",
"timezone_identifier": "Europe/London",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Delete Automation Config

Remove the specified Automation Config along with any execution information and history. No new executions can be triggered for deleted Automation Configs.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Automation Config to be deleted.

Responses

Response samples

200
401
403
500
default

Content type

application/json

{ }

Get Automation Config

Fetches an Automation Config by its Config ID.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Automation Config to retrieve.

Responses

Response samples

Content type

application/json

{"config": {"automation_steps": [{"import": {"config_id": "string"
},
"normalization": {"config_id": "string"
},
"publish": {"config_id": "string",
"dataset_id": "string"
},
"recordset_creation": {"config_id": "string"
}
}
],
"cloud_vault_id": "string",
"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"cron_timer": "0 12 * * *",
"description": "string",
"enabled": true,
"end_date_time": "2019-11-22T22:30:00Z",
"id": "string",
"name": "string",
"start_date_time": "2019-11-22T22:30:00Z",
"timezone_identifier": "Europe/London",
"updated_date_time": "2019-08-24T14:15:22Z"
},
"last_execution_id": "string"
}

Update Automation Config

Updates the specified Automation Config, overriding any old fields with new ones provided on the request.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Automation Config to be updated.

Request Body schema: application/json
required

Config is the config the user wants to update.

cron_timer	string The execution schedule of the Automation in UNIX CRON format as described in https://man7.org/linux/man-pages/man5/crontab.5.html. The schedule is executed relative to the supplied `timezone_identifier` for the Automation.
description	string The user-friendly description of the Automation Config.
enabled	boolean The 'enabled' flag allows Automation Execution triggers to be paused and resumed without adjusting the rest of the config or schedule.
end_date_time	string <date-time> Optional date-time to cease triggering Automation Executions. Format is RFC3339 which defaults to UTC. Offsets relative to Zulu time must be provided to convey local times. This parameter is unaffected by `timezone_identifier`. An empty value indicates that the schedule should never cease.
name	string The user-friendly name of the Automation Config. Valid characters are alphanumeric, '_', '-' and spaces.
start_date_time	string <date-time> Optional date-time to begin triggering Automation Executions based on the CRON schedule. Format is RFC3339 which defaults to UTC. Offsets relative to Zulu time must be provided to convey local times. This parameter is unaffected by `timezone_identifier`. An empty value indicates that the schedule should begin as soon as possible.
timezone_identifier	string The Timezone Identifier to schedule Automation Executions relative to using the `cron_timer`. This parameter does not affect the start or end times for scheduling. The format is from the IANA TZ database: https://en.wikipedia.org/wiki/List_of_tz_database_time_zones. Be aware that scheduling near the lost or gained hour in a timezone which implements daylight savings time (DST) can lead to schedule disruption during transitions.

Responses

Request samples

Payload

Content type

application/json

{"cron_timer": "0 12 * * *",
"description": "string",
"enabled": true,
"end_date_time": "2019-11-22T18:30:00-04:00",
"name": "string",
"start_date_time": "2019-11-22T18:30:00-04:00",
"timezone_identifier": "Europe/London"
}

Response samples

Content type

application/json

{"config": {"automation_steps": [{"import": {"config_id": "string"
},
"normalization": {"config_id": "string"
},
"publish": {"config_id": "string",
"dataset_id": "string"
},
"recordset_creation": {"config_id": "string"
}
}
],
"cloud_vault_id": "string",
"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"cron_timer": "0 12 * * *",
"description": "string",
"enabled": true,
"end_date_time": "2019-11-22T22:30:00Z",
"id": "string",
"name": "string",
"start_date_time": "2019-11-22T22:30:00Z",
"timezone_identifier": "Europe/London",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Automation Execution

Service for fetching previous automation runs and triggering a manual run of an automation configuration.

Execute Automation Config

Execute Automation Config manually initiates the Execution of an Automation Config ignoring its schedule.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Automation Config to manually trigger an Execution of.

Responses

Response samples

Content type

application/json

{"execution_id": "string"
}

List Executions

List Executions retrieves Automation Config Executions subject to the provided filters.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list that defines sorting, e.g. 'name,created_time DESC'. Each entry has the form '{column} {ASC/DESC}', with the column and direction separated by a space. Sortable values: none
filter	string Filter is a string containing logic that can specify a restricted set of Executions to be returned, e.g. `name co 'Test' or name co 'Acme'`. Supported fields for filtering: none

Responses

Response samples

Content type

application/json

{"executions": [{"cloud_vault_id": "string",
"config_id": "string",
"current_automation_step": "TASK_TYPE_INVALID",
"current_step_status": "NO_STATUS",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"executed_by": "string",
"id": "string",
"last_updated_date_time": "2019-08-24T14:15:22Z",
"started_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Get Execution

Get Execution fetches details of an Automation Execution by its ID.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

The ID of the Automation Execution to retrieve.

Responses

Response samples

Content type

application/json

{"execution": {"cloud_vault_id": "string",
"config_id": "string",
"current_automation_step": "TASK_TYPE_INVALID",
"current_step_status": "NO_STATUS",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"executed_by": "string",
"id": "string",
"last_updated_date_time": "2019-08-24T14:15:22Z",
"started_date_time": "2019-08-24T14:15:22Z"
},
"execution_steps": [{"automation_execution_id": "string",
"current_task_status": "NO_STATUS",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"started_date_time": "2019-08-24T14:15:22Z",
"task_config_id": "string",
"task_id": "string",
"task_type": "TASK_TYPE_INVALID",
"updated_date_time": "2019-08-24T14:15:22Z"
}
]
}

Collaboration Connections

List Connections

Returns a list of connections between datasets in the given collaboration. The output is affected by provided filters.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to get connection details from.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list that defines sorting, e.g. 'created_date_time,name DESC'. Each entry has the form '{column} {ASC/DESC}', with the column and direction separated by a space. Valid columns: `created_date_time`, `updated_date_time`, `owner`, `status`, `name`
filter	string Supported filters are `from_dataset_id`, `to_dataset_id`
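
An `order_by` value in the '{column} {ASC/DESC}' form can be built from column and direction pairs. The sketch below checks columns against the valid columns listed for this endpoint; the helper name is illustrative.

```python
from typing import Tuple

# Valid columns as listed for the List Connections endpoint.
VALID_COLUMNS = {"created_date_time", "updated_date_time", "owner", "status", "name"}


def build_order_by(*terms: Tuple[str, str]) -> str:
    """Join (column, direction) pairs into a comma-separated order_by string."""
    parts = []
    for column, direction in terms:
        direction = direction.upper()
        if column not in VALID_COLUMNS:
            raise ValueError(f"unsupported column: {column}")
        if direction not in ("ASC", "DESC"):
            raise ValueError(f"unsupported direction: {direction}")
        parts.append(f"{column} {direction}")
    return ",".join(parts)
```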

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"connections": [{"from_dataset_id": "string",
"highest_overlap_key": {"id": "string",
"intersection": "string",
"intersection_percentage": 0.1,
"name": "string"
},
"id": "string",
"to_dataset_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Get Connection

Returns the details for the requested connection ID.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration to get connection details from.
connection_id required	string The ID of the connection to get details about.

Responses

Response samples

Content type

application/json

{"connection": {"from_dataset_id": "string",
"highest_overlap_key": {"id": "string",
"intersection": "string",
"intersection_percentage": 0.1,
"name": "string"
},
"id": "string",
"keys": [{"id": "string",
"intersection": "string",
"intersection_percentage": 0.1,
"name": "string"
}
],
"to_dataset_id": "string"
}
}
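
The `keys` array lists every key shared by the two datasets along with its overlap. As a sketch of working with that response, the helper below picks the key with the largest `intersection_percentage`. The response already reports this as `highest_overlap_key`, so this is mainly useful as a template for other selections, such as filtering by a threshold.

```python
from typing import Any, Dict, Optional


def best_key(connection_response: Dict[str, Any]) -> Optional[Dict[str, Any]]:
    """Return the key with the highest intersection_percentage, or None."""
    keys = connection_response["connection"].get("keys", [])
    if not keys:
        return None
    return max(keys, key=lambda key: key["intersection_percentage"])
```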

Collaboration Datasets

List Datasets

Returns a list of datasets in the given collaboration. The output is affected by provided filters.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to get dataset details from.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list that defines sorting, e.g. 'created_date_time,name DESC'. Each entry has the form '{column} {ASC/DESC}', with the column and direction separated by a space. Valid columns: `created_date_time`, `updated_date_time`, `owner`, `name`
filter	string Supported filters are `dataset_id`, `owner_id`

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"datasets": [{"attributes": [{"label": "string",
"type": "string"
}
],
"collaboration_dataset_name": "string",
"id": "string",
"keys": [{"id": "string",
"label": "string"
}
],
"output_columns": ["string"
],
"owner_id": "string",
"public_dataset_name": "string",
"row_count": "string",
"type": "DATASET_TYPE_INVALID"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Get Dataset

Returns the details for the requested dataset within the given collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration the dataset belongs to.
dataset_id required	string The ID of the dataset you want information about.

Responses

Response samples

Content type

application/json

{"attributes": [{"fill_rate": 0.1,
"id": "string",
"label": "string",
"type": "string"
}
],
"collaboration_dataset_name": "string",
"id": "string",
"keys": [{"fill_rate": 0.1,
"id": "string",
"label": "string"
}
],
"output_columns": ["string"
],
"owner_id": "string",
"public_dataset_name": "string",
"published_date_time": "2019-08-24T14:15:22Z",
"region": "UNKNOWN",
"row_count": "string",
"type": "DATASET_TYPE_INVALID"
}

Get Attribute Stats

Get detailed statistics for the attributes in the given dataset in the context of the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration to get dataset stats from.
dataset_id required	string The ID of the dataset to get summary stats for.

Responses

Response samples

Content type

application/json

{"id_to_representations_stats": {"property1": {"representations_stats": [{"distinct_values": "string",
"multi_value": true,
"representation_id": "string",
"valid_values": "string"
}
]
},
"property2": {"representations_stats": [{"distinct_values": "string",
"multi_value": true,
"representation_id": "string",
"valid_values": "string"
}
]
}
}
}

Get Export Column Stats

Get detailed statistics for the export columns in the given dataset in the context of the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration to get dataset stats from.
dataset_id required	string The ID of the dataset to get export column stats for.

Responses

Response samples

Content type

application/json

{"label_to_stats": {"property1": {"distinct_values": "string",
"multi_values": true,
"valid_values": "string"
},
"property2": {"distinct_values": "string",
"multi_values": true,
"valid_values": "string"
}
}
}

Get Key Stats

Get detailed statistics for the keys in the given dataset in the context of the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration to get dataset stats from.
dataset_id required	string The ID of the dataset to get summary stats for.

Responses

Response samples

Content type

application/json

{"key_id_to_stats": {"property1": {"distinct_identifiers": "string",
"distinct_values": "string",
"identifiers": "string",
"multi_value": true,
"valid_values": "string"
},
"property2": {"distinct_identifiers": "string",
"distinct_values": "string",
"identifiers": "string",
"multi_value": true,
"valid_values": "string"
}
}
}

Get Dataset Stats

Get the counts for dataset properties like number of keys, categories or rows in the context of the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration to get dataset stats from.
dataset_id required	string The ID of the dataset to get summary stats for.

Responses

Response samples

Content type

application/json

{"attribute_columns": "string",
"export_columns": "string",
"key_columns": "string",
"total_rows": "string"
}
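
The counts in this response are typed as strings. A client that needs to do arithmetic on them can convert them first. This sketch assumes each value is a base-10 integer string such as "1000000", which this page does not state explicitly.

```python
from typing import Dict


def parse_dataset_stats(stats: Dict[str, str]) -> Dict[str, int]:
    """Convert the string-typed counts in a dataset stats response to integers."""
    # Assumption: every value is a base-10 integer string.
    return {name: int(value) for name, value in stats.items()}
```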

Collaboration Management

List Collaborations

Returns a list of collaboration summaries where the requesting user's company is the owner or a member. The output is affected by provided filters.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list that defines sorting, e.g. 'created_date_time,name DESC'. Each entry has the form '{column} {ASC/DESC}', with the column and direction separated by a space. Valid columns: `created_date_time`, `updated_date_time`, `owner`, `status`, `name`
filter	string Supported filters are `id`, `name`, `description`, `created_date_time`, `updated_date_time`, `owner`, `members`, `status`

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"collaborations": [{"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"members": [{"display_name": "string",
"id": "string",
"rights": {"contributor": true,
"detailed_logs": true,
"operator": true
}
}
],
"name": "string",
"owner": {"display_name": "string",
"id": "string",
"rights": {"contributor": true,
"detailed_logs": true,
"operator": true
}
},
"private_path_enabled": true,
"status": "STATUS_INVALID",
"updated_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create Collaboration

Creates a new collaboration with the provided company IDs as members. Members cannot be added after the collaboration is created.

Authorizations:

API_Keys

Request Body schema: application/json
required

collaborators	Array of objects (Collaboration_Management.CreateCollaborationMember) The companies and rights of the other members in the collaboration.
creator_rights	object The rights that you grant yourself under this collaboration.
description	string An optional description for the collaboration, e.g. the purpose or goal.
name	string A name for this collaboration.

Responses

Request samples

Payload

Content type

application/json

{"collaborators": [{"company_id": "string",
"rights": {"contributor": true,
"detailed_logs": true,
"operator": true
}
}
],
"creator_rights": {"contributor": true,
"detailed_logs": true,
"operator": true
},
"description": "string",
"name": "string"
}

Response samples

Content type

application/json

{"collaboration_id": "string"
}

Get Collaboration

Returns the details for the requested collaboration ID.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to get details about.

Responses

Response samples

Content type

application/json

{"collaboration": {"activation_datasets": [{"collaboration_dataset_name": "string",
"dataset_id": "string",
"exportable_to_members": ["string"
],
"intersection_only": true,
"owner_id": "string"
}
],
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"destination_ids": ["string"
],
"id": "string",
"insight_datasets": [{"activation_allowed": true,
"allowed_attributes": ["string"
],
"attribute_restriction": true,
"collaboration_dataset_name": "string",
"dataset_id": "string",
"intersection_only": true,
"owner_id": "string"
}
],
"members": [{"display_name": "string",
"id": "string",
"rights": {"contributor": true,
"detailed_logs": true,
"operator": true
}
}
],
"name": "string",
"owner": {"display_name": "string",
"id": "string",
"rights": {"contributor": true,
"detailed_logs": true,
"operator": true
}
},
"status": "STATUS_INVALID",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Add Activation Dataset

Adds an activation dataset to the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration the dataset should be added to.

Request Body schema: application/json
required

collaboration_dataset_name	string A name, unique within this collaboration, used to refer to the dataset in queries, audience builder, etc. Note: the value should be between 1 and 17 characters long and alphanumeric only.
dataset_id	string The ID of the activation dataset to be added to the collaboration.
exportable_to_members	Array of strings The collaboration members whose destinations can be used as a target to export activation query results to from this dataset. If this field is empty then no query results from this dataset can be exported.
intersection_only	boolean Whether the activation dataset should only allow intersection queries.

Responses

Request samples

Payload

Content type

application/json

{"collaboration_dataset_name": "string",
"dataset_id": "string",
"exportable_to_members": ["string"
],
"intersection_only": true
}

Response samples

Content type

application/json

{ }
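The naming rule for `collaboration_dataset_name` can be checked on the client before the request is sent. The sketch below assumes that "alphanumeric" means ASCII letters and digits; the function names are hypothetical and not part of the API.

```python
import re

# Assumption: "alphanumeric" means ASCII letters and digits only.
_NAME_PATTERN = re.compile(r"[A-Za-z0-9]{1,17}")


def is_valid_collaboration_dataset_name(name):
    """Return True if the name is 1 to 17 ASCII alphanumeric characters."""
    return _NAME_PATTERN.fullmatch(name) is not None


def build_add_activation_dataset_payload(name, dataset_id,
                                         exportable_to_members=(),
                                         intersection_only=False):
    """Assemble the request body for adding an activation dataset."""
    if not is_valid_collaboration_dataset_name(name):
        raise ValueError("collaboration_dataset_name must be 1-17 alphanumeric characters")
    return {
        "collaboration_dataset_name": name,
        "dataset_id": dataset_id,
        # An empty list means no query results from this dataset can be exported.
        "exportable_to_members": list(exportable_to_members),
        "intersection_only": intersection_only,
    }
```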

Remove Activation Dataset

Removes an activation dataset owned by the caller from the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration the dataset should be removed from.
dataset_id required	string The ID of the activation dataset to be removed from the collaboration.

Responses

Response samples

Content type

application/json

{ }

Close Collaboration

Closes a collaboration you own. This action is irreversible: the collaboration can no longer be used for new queries or exports, but previous activity can still be seen.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to close.

Responses

Response samples

Content type

application/json

{ }

Add Destination

Adds a destination to the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration the destination should be added to.

Request Body schema: application/json
required

destination_id

string

The ID of the destination to add.

Responses

Request samples

Payload

Content type

application/json

{"destination_id": "string"
}

Response samples

Content type

application/json

{ }

Remove Destination

Removes a destination owned by the caller from the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration the destination should be removed from.
destination_id required	string The ID of the destination to remove.

Responses

Response samples

Content type

application/json

{ }

List Collaboration Events

Lists the events associated with this collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to fetch events for.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'created_time,name DESC'. Each value takes the form '{column} {ASC/DESC}', with the column and direction separated by a space. Valid columns: `event_date_time`
filter	string Supported filters are `event_type` and `company_id`

Responses

Response samples

Content type

application/json

{"events": [{"company_id": "string",
"entities": [{"company_id": "string",
"id": "string",
"name": "string",
"type": "ENTITY_INVALID"
}
],
"event_date_time": "2019-08-24T14:15:22Z",
"message": "string",
"metadata": {"property1": "string",
"property2": "string"
},
"type": "TYPE_INVALID"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
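The `pagination` object in the response can drive a simple loop over all pages. The sketch below is one possible approach; `fetch_page` is a hypothetical caller-supplied function that performs the HTTP request, and the stop condition (a `next_page` of 0 or a missing value means no further pages) is an assumption, not documented behaviour.

```python
def iter_events(fetch_page, per_page=100):
    """Yield every event by following `pagination.next_page`.

    `fetch_page(page, per_page)` must return the decoded JSON response
    as a dict. Assumption: a `next_page` of 0 (or a missing value)
    means there are no more pages.
    """
    if not 1 <= per_page <= 500:
        raise ValueError("per_page must be between 1 and 500")
    page = 1
    while page:
        response = fetch_page(page, per_page)
        yield from response.get("events", [])
        page = response.get("pagination", {}).get("next_page", 0)


# Stand-in for a real HTTP call, returning two pages of fake events.
_fake_pages = {
    1: {"events": [{"message": "a"}, {"message": "b"}], "pagination": {"next_page": 2}},
    2: {"events": [{"message": "c"}], "pagination": {"next_page": 0}},
}
messages = [e["message"] for e in iter_events(lambda page, per_page: _fake_pages[page])]
```

The same loop shape should apply to the other list endpoints that return a `pagination` object, with `events` replaced by the relevant response key.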

Export Collaboration Events

Exports the events associated with this collaboration as a CSV file.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to export events for.

query Parameters

order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'created_time,name DESC'. Each value takes the form '{column} {ASC/DESC}', with the column and direction separated by a space. Valid columns: `event_date_time`
filter	string Supported filters are `event_type` and `company_id`

Responses

Response samples

Content type

application/json

"string"

Add Insight Dataset

Adds an insight dataset to the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration the dataset should be added to.

Request Body schema: application/json
required

activation_allowed	boolean If true, allows the dataset to be used in activation queries within the collaboration.
allowed_attributes	Array of strings A list of the attributes in the insight dataset that can be queried against.
attribute_restriction	boolean If true, restricts the allowed attributes to only those in the `allowed_attributes` list. If false, then all attributes (present and future) will be allowed.
collaboration_dataset_name	string A name, unique within this collaboration, used to refer to the dataset in queries, audience builder, etc. Note: the value should be between 1 and 17 characters long and alphanumeric only.
dataset_id	string The ID of the insight dataset to be added to the collaboration.
intersection_only	boolean Whether the insight dataset should only allow intersection queries.

Responses

Request samples

Payload

Content type

application/json

{"activation_allowed": true,
"allowed_attributes": ["string"
],
"attribute_restriction": true,
"collaboration_dataset_name": "string",
"dataset_id": "string",
"intersection_only": true
}

Response samples

Content type

application/json

{ }
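The interaction between `attribute_restriction` and `allowed_attributes` can be made explicit when building the request body. In this sketch (the function name is hypothetical), passing `None` leaves all attributes queryable, while passing a list restricts queries to that list.

```python
def build_add_insight_dataset_payload(name, dataset_id, allowed_attributes=None,
                                      activation_allowed=False,
                                      intersection_only=False):
    """Assemble the request body for adding an insight dataset.

    `allowed_attributes=None` means no restriction: all attributes
    (present and future) are allowed. A list turns the restriction on.
    """
    restricted = allowed_attributes is not None
    return {
        "activation_allowed": activation_allowed,
        "allowed_attributes": list(allowed_attributes) if restricted else [],
        "attribute_restriction": restricted,
        "collaboration_dataset_name": name,
        "dataset_id": dataset_id,
        "intersection_only": intersection_only,
    }
```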

Remove Insight Dataset

Removes an insight dataset owned by the caller from the collaboration.

Authorizations:

API_Keys

path Parameters

collaboration_id required	string The ID of the collaboration the dataset should be removed from.
dataset_id required	string The ID of the insight dataset to be removed from the collaboration.

Responses

Response samples

Content type

application/json

{ }

Leave Collaboration

After leaving a collaboration, the leaving company can no longer see any details about the collaboration, including the event logs.

Authorizations:

API_Keys

path Parameters

collaboration_id

required

string

The ID of the collaboration to leave.

Responses

Response samples

Content type

application/json

{ }

Datasets

Service for managing datasets in the InfoSum platform.

Returns a list of datasets and their summary information.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name,created_time DESC'. Each value takes the form '{column} {ASC/DESC}', with the column and direction separated by a space. Sortable values: dataset_id, public_name, private_id, description, owner_id, created_by, dataset_type (DATASET_TYPE_INSIGHT / DATASET_TYPE_IDENTITY), region, recordset_id, execution_id, publish_date_time (in RFC3339 format), created_date_time (in RFC3339 format), expiration_date_time (in RFC3339 format).
filter	string Filter is a string containing logic that can specify a restricted set of datasets to be returned, e.g. `name co 'Test' or name co 'Acme'`. Filterable fields: dataset_id, public_name, private_id, description, owner_id, created_by, dataset_type (DATASET_TYPE_INSIGHT / DATASET_TYPE_IDENTITY), region, execution_id, recordset_id, created_date_time (in RFC3339 format), expiration_date_time (in RFC3339 format), publish_date_time (in RFC3339 format).

Responses

Response samples

Content type

application/json

{"datasets": [{"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"dataset_id": "string",
"dataset_type": "DATASET_TYPE_INVALID",
"description": "string",
"expiration": {"do_not_expire": true,
"expiration_date_time": "2019-08-24T14:15:22Z"
},
"owner_id": "string",
"private_id": "string",
"public_name": "string",
"publish_details": {"execution_id": "string",
"publish_date_time": "2019-08-24T14:15:22Z",
"recordset_id": "string"
},
"published_by": "string",
"region": "UNKNOWN"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
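An `order_by` value for this endpoint could be composed and URL-encoded as in the sketch below. The helper name `build_order_by` is hypothetical; the column list is copied from the parameter description above, and the encoding uses Python's standard `urllib.parse.urlencode`.

```python
from urllib.parse import urlencode

# Sortable columns as listed in the order_by description.
SORTABLE_DATASET_COLUMNS = {
    "dataset_id", "public_name", "private_id", "description", "owner_id",
    "created_by", "dataset_type", "region", "recordset_id", "execution_id",
    "publish_date_time", "created_date_time", "expiration_date_time",
}


def build_order_by(*terms):
    """Join (column, direction) pairs into '{column} {ASC/DESC}' terms."""
    parts = []
    for column, direction in terms:
        if column not in SORTABLE_DATASET_COLUMNS:
            raise ValueError(f"not a sortable column: {column}")
        direction = direction.upper()
        if direction not in ("ASC", "DESC"):
            raise ValueError(f"direction must be ASC or DESC, got: {direction}")
        parts.append(f"{column} {direction}")
    return ",".join(parts)


order_by = build_order_by(("public_name", "asc"), ("created_date_time", "desc"))
query = urlencode({
    "pagination.page": 1,
    "pagination.per_page": 50,
    "order_by": order_by,
})
```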

Creates a dataset.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreateDatasetRequest is the input used by the CreateDataset rpc.

dataset_type	string Default: "DATASET_TYPE_INVALID" Enum: "DATASET_TYPE_INVALID" "DATASET_TYPE_INSIGHT" "DATASET_TYPE_IDENTITY" DatasetType denotes whether the dataset is insight or activation.
description	string Description is a user defined description for the dataset. Description has a max length of 1000 characters.
expiration	object Expiration specifies information about how the dataset should expire.
private_id	string PrivateId is the name of the dataset visible only within the owner company, used for querying. PrivateIds can only contain upper- and lower-case letters and numbers, with a minimum length of 1 and a maximum length of 17.
public_name	string PublicName is the name of the dataset as seen by other companies with permission to the dataset. PublicNames can only contain alphanumeric (ascii) characters, including dash, underscore and spaces with a max length of 100 and a minimum length of 1.
region	string Default: "UNKNOWN" Enum: "UNKNOWN" "AWS_EU_WEST_2" "AWS_US_EAST_1" "IBM_EU_CENTRAL_1" "AWS_EU_CENTRAL_1" "AWS_AP_SOUTHEAST_2" Region is the cloud vault region the dataset can accept input data from.

Responses

Request samples

Payload

Content type

application/json

{"dataset_type": "DATASET_TYPE_INVALID",
"description": "string",
"expiration": {"do_not_expire": true,
"expiration_date_time": "2019-08-24T14:15:22Z"
},
"private_id": "string",
"public_name": "string",
"region": "UNKNOWN"
}

Response samples

Content type

application/json

{"dataset_id": "string"
}
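The documented length and character limits can be checked locally before a dataset is created. The following is a rough sketch: the function name is hypothetical, and the regular expressions reflect one reading of the field descriptions (ASCII letters and digits for `private_id`; ASCII letters, digits, dash, underscore and space for `public_name`).

```python
import re

PRIVATE_ID_PATTERN = re.compile(r"[A-Za-z0-9]{1,17}")
# Assumption: letters, digits, dash, underscore and space are the only allowed characters.
PUBLIC_NAME_PATTERN = re.compile(r"[A-Za-z0-9 _-]{1,100}")
MAX_DESCRIPTION_LENGTH = 1000


def validate_create_dataset(payload):
    """Return a list of problems found in the payload; empty means none found."""
    problems = []
    if not PRIVATE_ID_PATTERN.fullmatch(payload.get("private_id", "")):
        problems.append("private_id must be 1-17 letters or digits")
    if not PUBLIC_NAME_PATTERN.fullmatch(payload.get("public_name", "")):
        problems.append("public_name must be 1-100 allowed characters")
    if len(payload.get("description", "")) > MAX_DESCRIPTION_LENGTH:
        problems.append("description must be at most 1000 characters")
    return problems
```

Passing these checks does not guarantee that the server accepts the request; it only catches the constraints stated in the schema.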

Deletes a dataset and any underlying compute resource.

Authorizations:

API_Keys

path Parameters

dataset_id

required

string

DatasetId is the ID of the dataset to be deleted.

Responses

Response samples

Content type

application/json

{ }

Returns a dataset and the summary information.

Authorizations:

API_Keys

path Parameters

dataset_id

required

string

DatasetId is the ID of the dataset to be retrieved.

Responses

Response samples

Content type

application/json

{"dataset": {"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"dataset_id": "string",
"dataset_type": "DATASET_TYPE_INVALID",
"description": "string",
"expiration": {"do_not_expire": true,
"expiration_date_time": "2019-08-24T14:15:22Z"
},
"owner_id": "string",
"private_id": "string",
"public_name": "string",
"publish_details": {"execution_id": "string",
"publish_date_time": "2019-08-24T14:15:22Z",
"recordset_id": "string"
},
"published_by": "string",
"region": "UNKNOWN"
}
}

Updates the top level information about a dataset.

Authorizations:

API_Keys

path Parameters

dataset_id

required

string

DatasetId is the ID of the dataset to be updated.

Request Body schema: application/json
required

Dataset contains details of the update which can be partial.

description	string Description is a user defined description for the dataset. Description has a max length of 1000 characters.
expiration	object Expiration specifies information about how the dataset should expire.
private_id	string PrivateId is the name of the dataset visible only within the owner company, used for querying. PrivateIds can only contain upper- and lower-case letters and numbers, with a minimum length of 1 and a maximum length of 17.
public_name	string PublicName is the name of the dataset as seen by other companies with permission to the dataset. PublicNames can only contain alphanumeric (ascii) characters, including dash, underscore and spaces with a max length of 100 and a minimum length of 1.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"expiration": {"do_not_expire": true,
"expiration_date_time": "2019-08-24T14:15:22Z"
},
"private_id": "string",
"public_name": "string"
}

Response samples

Content type

application/json

{ }
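Because the update can be partial, a request body only needs the fields being changed. A small sketch, assuming that fields omitted from the body are left unchanged by the server (the function name is hypothetical):

```python
_UPDATABLE_FIELDS = ("description", "expiration", "private_id", "public_name")


def build_dataset_update(**changes):
    """Return a body containing only the fields the caller wants to change."""
    unknown = set(changes) - set(_UPDATABLE_FIELDS)
    if unknown:
        raise ValueError(f"not updatable: {sorted(unknown)}")
    # Drop fields passed as None so they are not sent at all.
    return {key: value for key, value in changes.items() if value is not None}


# Change only the description and the expiration policy.
update_body = build_dataset_update(
    description="Refreshed weekly",
    expiration={"do_not_expire": True},
)
```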

Dataset Stats

Service for inspecting the key and category statistics of published datasets.

Dataset Lineage

Service for establishing the provenance of published datasets.

Get the lineage of a published dataset

Get all the configs used to publish a dataset

Authorizations:

API_Keys

path Parameters

dataset_id

required

string

The ID of the published dataset to get lineage for.

Responses

Response samples

Content type

application/json

{"cloud_vault_id": "string",
"lineage": [{"import": {"config_id": "string"
},
"normalization": {"config_id": "string"
},
"publish": {"config_id": "string"
},
"recordset_creation": {"config_id": "string"
}
}
]
}

Destination Instances

List Destination Instances

Returns a list of Destinations owned by the caller.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string Supported order_by fields are id, created_time, name, updated_time, destination_type_id, creator_id
filter	string Supported filters are name, owner_id, destination_type_id, outgoing_permission_id, creator_id
permissionable_to	string PermissionableTo is an optional company ID which if set, filters the returned destinations based on the ability to attach the instance to a permission sent to that recipient company.

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"destinations": [{"created_time": "2019-08-24T14:15:22Z",
"creator_id": "string",
"description": "string",
"destination_type_id": "string",
"id": "string",
"incoming_share_id": "string",
"name": "string",
"outgoing_share_ids": ["string"
],
"owner_id": "string",
"permission_ids": ["string"
],
"permissioned_to_permitted_company": true,
"updated_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create Destination Instance

Accepts a configuration conforming to the JSON-Schema of the Destination type to be created. Returns Destination ID.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreateDestinationInstanceRequest contains all parameters required to create a Destination instance.

config	object Config is the config for the destination.
description	string Description is an optional description given to this destination.
destination_type_id	string DestinationTypeId is the InfoSum type ID of this destination.
editable_paths	Array of strings EditablePaths is the list of schema paths that can be edited at push time.
name	string Name is the display name given to this destination.

Responses

Request samples

Payload

Content type

application/json

{"config": { },
"description": "string",
"destination_type_id": "string",
"editable_paths": ["string"
],
"name": "string"
}

Response samples

Content type

application/json

{"destination_id": "string"
}

Delete Destination Instance

Deletes an owned destination instance.

Authorizations:

API_Keys

path Parameters

destination_id

required

string

Destination instance ID.

Responses

Response samples

Content type

application/json

{ }

Get Destination Instance

Returns details about an owned Destination Instance.

Authorizations:

API_Keys

path Parameters

destination_id

required

string

Destination instance ID.

Responses

Response samples

Content type

application/json

{"destination": {"config": { },
"created_time": "2019-08-24T14:15:22Z",
"creator_id": "string",
"description": "string",
"destination_type_id": "string",
"editable_paths": ["string"
],
"id": "string",
"incoming_share_id": "string",
"name": "string",
"outgoing_share_ids": ["string"
],
"owner_id": "string",
"permission_ids": ["string"
],
"permissioned_to_permitted_company": true,
"updated_time": "2019-08-24T14:15:22Z"
}
}

Update Destination Instance

Update an owned destination instance.

Authorizations:

API_Keys

path Parameters

destination_id

required

string

destination_id is the ID of the Destination instance.

Request Body schema: application/json
required

DestinationInstanceUpdate is the update to the Destination instance configuration.

config	object Complete destination config. If set, this field acts as a PUT on the underlying config. All fields must be provided even if they are unchanged in the update. The values for previously set `x-infosum-secret` properties can be retained by re-submitting the `X-INFOSUM-REDACTED` value.
description	string Description is an optional description given to this destination.
editable_paths	Array of strings EditablePaths is the list of schema paths that can be edited at push time.
name	string Name is the display name given to this destination.

Responses

Request samples

Payload

Content type

application/json

{"config": { },
"description": "string",
"editable_paths": ["string"
],
"name": "string"
}

Response samples

Content type

application/json

{"destination": {"config": { },
"created_time": "2019-08-24T14:15:22Z",
"creator_id": "string",
"description": "string",
"destination_type_id": "string",
"editable_paths": ["string"
],
"id": "string",
"incoming_share_id": "string",
"name": "string",
"outgoing_share_ids": ["string"
],
"owner_id": "string",
"permission_ids": ["string"
],
"permissioned_to_permitted_company": true,
"updated_time": "2019-08-24T14:15:22Z"
}
}
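Since `config` acts as a PUT, an update has to resend the whole object. One way to do this is to start from the config returned by Get Destination Instance and overlay the changes, as sketched below. This assumes that previously set secret values appear in that config as the `X-INFOSUM-REDACTED` placeholder; the property names in the example are invented for illustration.

```python
REDACTED = "X-INFOSUM-REDACTED"


def build_full_config(current_config, changes):
    """Return a complete config: current values overlaid with the changes.

    This is a shallow merge, so a nested object in `changes` replaces the
    existing nested object as a whole.
    """
    full = dict(current_config)
    full.update(changes)
    return full


# Hypothetical config as it might be returned for an existing destination.
current = {"bucket": "old-bucket", "prefix": "exports/", "secret_key": REDACTED}
new_config = build_full_config(current, {"bucket": "new-bucket"})
```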

Attach Destination Instance to Permission

Attach the specified Destination Instance to the given Permission. The destination must be either owned by, or shared with your company.

Authorizations:

API_Keys

path Parameters

destination_id

required

string

Destination Instance ID to attach.

Request Body schema: application/json
required

permission_id

string

Permission ID to attach to.

Responses

Request samples

Payload

Content type

application/json

{"permission_id": "string"
}

Response samples

Content type

application/json

{"permission_ids": ["string"
]
}

Detach Destination Instance from Permission

Removes the link between the specified Destination Instance and the given Permission. The destination must be either owned by, or shared with your company.

Authorizations:

API_Keys

path Parameters

destination_id

required

string

Destination Instance ID to detach.

Request Body schema: application/json
required

permission_id

string

Permission ID to detach from.

Responses

Request samples

Payload

Content type

application/json

{"permission_id": "string"
}

Response samples

Content type

application/json

{"permission_ids": ["string"
]
}

Duplicate Destination Instance

Creates a new copy of the specified, owned destination instance. Returns the Destination ID of the copy.

Authorizations:

API_Keys

path Parameters

destination_id

required

string

Destination Instance ID to duplicate.

Request Body schema: application/json
required

object (Destination_Instances.DuplicateDestinationInstanceBody)

Responses

Request samples

Payload

Content type

application/json

{ }

Response samples

Content type

application/json

{"destination_id": "string"
}

List Destination types

Returns a list of supported Destination types available to the user. Returned Destination type IDs can be used to fetch the schema and create a destination.

Authorizations:

API_Keys

Responses

Response samples

Content type

application/json

{"destination_types": [{"id": "string",
"name": "string"
}
]
}

Get Destination Type External Config

Returns details required when configuring the third-party service to receive data for this Destination type. Returns empty if the Destination type doesn't require any configuration related to the InfoSum user.

Authorizations:

API_Keys

path Parameters

destination_type_id

required

string

Destination type identifier.

Responses

Response samples

Content type

application/json

{"external_config": { }
}

Get Destination Type Schema

Returns the configuration JSON-Schema for the requested Destination type ID. The JSON-Schema can be used to define a configuration for creating a Destination.

Authorizations:

API_Keys

path Parameters

destination_type_id

required

string

Destination type identifier.

Responses

Response samples

Content type

application/json

{"connection_specification": { }
}

Destination Shares

List Destination Shares

Returns a list of Destinations shared by or with the company of the user.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'created_time,name DESC'. Each value takes the form '{column} {ASC/DESC}', with the column and direction separated by a space. Valid columns: name, created_time
filter	string Supported filters are `owner_id`, `recipient_company_id`, `destination_id`

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"destination_shares": [{"comment": "string",
"destination_id": "string",
"other_permitted_companies": ["string"
],
"recipient_email_address": "string",
"sender_company_id": "string",
"share_id": "string",
"shared_at_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create Destination Share

Accepts a Destination Instance ID and email to share the Destination Instance with. Sharing a Destination allows the recipient to attach it on a dataset permission back to you.

Authorizations:

API_Keys

Request Body schema: application/json
required

comment	string An optional comment explaining why this destination is being shared.
destination_id	string ID of the Destination Instance to be shared.
other_permitted_companies	Array of strings The IDs of other companies that the sender of this share allows the recipient to send permissions to, with the shared destination associated. Currently these must be InfoSum company IDs. Using this field is currently only available where the sender and recipient of the destination share are managed service customers.
recipient_email_address	string Recipient email address used to find the company to share the Destination Instance with.

Responses

Request samples

Payload

Content type

application/json

{"comment": "string",
"destination_id": "string",
"other_permitted_companies": ["string"
],
"recipient_email_address": "string"
}

Response samples

Content type

application/json

{"share_id": "string"
}

Remove Destination Share

Stops sharing a Destination with a single company.

Authorizations:

API_Keys

path Parameters

share_id

required

string

ID of the Share that should be removed.

Responses

Response samples

Content type

application/json

{ }

Get Destination Share

Returns the shared destination for a specific share ID.

Authorizations:

API_Keys

path Parameters

share_id

required

string

The ID of the Destination Share to get details about.

Responses

Response samples

Content type

application/json

{"destination_share": {"comment": "string",
"destination_id": "string",
"other_permitted_companies": ["string"
],
"recipient_email_address": "string",
"sender_company_id": "string",
"share_id": "string",
"shared_at_date_time": "2019-08-24T14:15:22Z"
}
}

Activation Destinations

List Activation Destinations

List the destinations which can be used for the provided activation query.

Authorizations:

API_Keys

path Parameters

query_id

required

string

QueryID is the activation query ID to get valid destination targets for.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy specifies ordering parameters. Valid parameters are: name, created_time, destination_type_id. Default: created_time DESC
filter	string Supported filters are `owner_id`, `recipient_company_id`, `destination_id`

Responses

Response samples

Content type

application/json

{"destinations": [{"created_time": "2019-08-24T14:15:22Z",
"description": "string",
"destination_type_id": "string",
"id": "string",
"name": "string",
"owner_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Push Activation Result to Destination

Pushes results of a query to a specific destination instance.

Authorizations:

API_Keys

path Parameters

query_id required	string ID of the activation query.
destination_id required	string Destination instance ID.

Request Body schema: application/json
required

push_config

object

Configuration following the push time schema for the destination instance. Default values from the underlying instance can be found in the push schema default annotations. Defaults will not be used unless they are re-submitted as part of this config. Submit the X-INFOSUM-REDACTED keyword for x-infosum-secret properties to retain their default value from the underlying destination instance.

Responses

Request samples

Payload

Content type

application/json

{"push_config": { }
}

Response samples

Content type

application/json

{"push_id": "string"
}
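Because defaults are not applied unless they are re-submitted, a `push_config` can be seeded from the push schema. The sketch below assumes the push schema is a JSON Schema object with a top-level `properties` map and that secret properties carry a truthy `x-infosum-secret` annotation; the actual schema shape may differ, and the property names in the example are invented.

```python
REDACTED = "X-INFOSUM-REDACTED"


def push_config_from_schema(push_schema, overrides=None):
    """Build a push_config from schema defaults, then apply overrides."""
    config = {}
    for name, prop in push_schema.get("properties", {}).items():
        if prop.get("x-infosum-secret"):
            # Keep the secret stored on the underlying destination instance.
            config[name] = REDACTED
        elif "default" in prop:
            # Defaults are only used if they are re-submitted.
            config[name] = prop["default"]
    config.update(overrides or {})
    return config


# Hypothetical push schema for illustration.
schema = {
    "properties": {
        "audience_name": {"type": "string", "default": "My audience"},
        "api_token": {"type": "string", "x-infosum-secret": True},
        "notes": {"type": "string"},
    }
}
push_body = {"push_config": push_config_from_schema(schema, {"audience_name": "Spring campaign"})}
```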

Get Activation Destination Push Schema

Returns the push time schema for a specific destination instance subject to constraints (like query ID).

Authorizations:

API_Keys

path Parameters

query_id required	string ID of the Activation Query to be pushed.
destination_id required	string Destination instance ID.

Responses

Response samples

Content type

application/json

{"push_schema": { }
}

Activation Pushes

List Activation Query Pushes

Returns a list of pushes for the specified activation query.

Authorizations:

API_Keys

path Parameters

query_id

required

string

The activation query ID.

query Parameters

order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'destination_type,started_at_date_time DESC'. Each value takes the form '{column} {ASC/DESC}', with the column and direction separated by a space.
filter	string Filter is a string containing logic that can specify a restricted set of Pushes to be returned. e.g. 'destination_id eq a_specific_destination_id' or 'push_successful eq true'
pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.

Responses

Response samples

Content type

application/json

{"activation_pushes": [{"completed_at_date_time": "2019-08-24T14:15:22Z",
"destination_id": "string",
"destination_type_id": "string",
"push_error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"push_id": "string",
"push_successful": true,
"query_id": "string",
"started_at_date_time": "2019-08-24T14:15:22Z",
"status": "string"
}
],
"pagination_response": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
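A response of this shape can be scanned for pushes that reported an error. The sketch below relies only on the field names in the sample response; it assumes `push_error` is populated only for failed pushes, and the function name is hypothetical.

```python
def failed_pushes(response):
    """Return (push_id, error message) pairs for pushes that carry a push_error."""
    failures = []
    for push in response.get("activation_pushes", []):
        error = push.get("push_error")
        if error:
            message = error.get("error", {}).get("message", "")
            failures.append((push.get("push_id"), message))
    return failures


# Fake response with one successful and one failed push.
sample = {
    "activation_pushes": [
        {"push_id": "p1", "push_successful": True},
        {"push_id": "p2", "push_successful": False,
         "push_error": {"error": {"code": "E1", "message": "destination rejected file"}}},
    ]
}
failures = failed_pushes(sample)
```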

Get Activation Query Push

Returns the details for a specific activation push subject to constraints (like query ID).

Authorizations:

API_Keys

path Parameters

query_id required	string The ID of the query that has been pushed.
push_id required	string The ID of the activation push.

Responses

Response samples

Content type

application/json

{"push_details": {"completed_at_date_time": "2019-08-24T14:15:22Z",
"destination_id": "string",
"destination_type_id": "string",
"push_config": { },
"push_error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"push_id": "string",
"push_successful": true,
"query_id": "string",
"started_at_date_time": "2019-08-24T14:15:22Z",
"status": "string"
}
}

Cloud Vaults

Service for managing Cloud Vault data staging areas within the InfoSum platform.

InfoSum Support Portal Article

List Cloud Vaults.

List available Cloud Vaults.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name,created_time DESC'. Each value takes the form '{column} {ASC/DESC}', with the column and direction separated by a space.
filter	string Filter is a string containing logic that can specify a restricted set of Cloud Vaults to be returned. e.g. `name co 'Test' or name co 'Acme'`

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"cloud_vaults": [{"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"region": "UNKNOWN",
"updated_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create a new Cloud Vault.

Create a Cloud Vault to hold data and connect to a bunker.

Authorizations:

API_Keys

Request Body schema: application/json
required

description	string Description is an optional description given to this Cloud Vault.
name required	string Name is the display name given to this Cloud Vault.
region required	string Default: "UNKNOWN" Enum: "UNKNOWN" "AWS_EU_WEST_2" "AWS_US_EAST_1" "IBM_EU_CENTRAL_1" "AWS_EU_CENTRAL_1" "AWS_AP_SOUTHEAST_2" Region identifies the region that this Cloud Vault is located in. Expected format: cloudprovider + region e.g. aws-eu-west-2 Note: ibm-eu-central-1 is a deprecated value.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"name": "string",
"region": "UNKNOWN"
}

Response samples

Content type

application/json

{"cloud_vault": {"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"region": "UNKNOWN",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Update a Cloud Vault

Updates the provided fields in a specific Cloud Vault.

Authorizations:

API_Keys

path Parameters

cloud_vault.id

required

string

Output only. ID is the unique identifier of the Cloud Vault.

Request Body schema: application/json
required

description	string Description is an optional description given to this Cloud Vault.
name	string Name is the display name given to this Cloud Vault.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"name": "string"
}

Response samples

Content type

application/json

{"cloud_vault": {"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"region": "UNKNOWN",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Delete a Cloud Vault.

Deletes a specific Cloud Vault.

Authorizations:

API_Keys

path Parameters

id

required

string

query Parameters

force

boolean

Force will, if set, delete a Cloud Vault that is associated with a bunker or is currently in use. If it is not set, an error will be returned in these cases instead.

Responses

Response samples

Content type

application/json

{ }

Get a Cloud Vault.

Get the details of a specific Cloud Vault.

Authorizations:

API_Keys

path Parameters

id

required

string

Responses

Response samples

Content type

application/json

{"cloud_vault": {"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"region": "UNKNOWN",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Files

Service for managing files within a Cloud Vault.

Delete file

Delete the specified file

Authorizations:

API_Keys

path Parameters

cloud_vault_id required	string The globally unique identifier of the Cloud Vault that contains the file to delete.
name required	string.+ The path of the file to be deleted. If the path ends in '/*', up to 1000 files with that path prefix are deleted. Check files_remaining in the response for the status of the prefix.

Responses

Response samples

200
401
403
500
default

Content type

application/json

{"files_remaining": true
}
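Because a prefix delete removes at most 1000 files per call, clearing a large folder may take several requests. The sketch below repeats the call until `files_remaining` is false. It assumes that `files_remaining: true` means more files with the prefix still exist. `delete_file` is a hypothetical caller-supplied function that performs the HTTP request and returns the decoded JSON response; the fake used here only simulates it.

```python
def delete_prefix(delete_file, cloud_vault_id, prefix, max_calls=100):
    """Repeat the prefix delete until no files remain; return the call count."""
    # Ensure the name ends in '/*' so the request is treated as a prefix delete.
    name = prefix if prefix.endswith("/*") else prefix.rstrip("/") + "/*"
    for calls in range(1, max_calls + 1):
        response = delete_file(cloud_vault_id, name)
        # Assumption: a false (or absent) flag means the prefix is now empty.
        if not response.get("files_remaining", False):
            return calls
    raise RuntimeError(f"files still remain under {name!r} after {max_calls} calls")


# Stand-in for the real request: pretends 2500 files exist under the prefix
# and that each call removes up to 1000 of them.
remaining = {"count": 2500}


def fake_delete_file(cloud_vault_id, name):
    remaining["count"] = max(0, remaining["count"] - 1000)
    return {"files_remaining": remaining["count"] > 0}


calls_made = delete_prefix(fake_delete_file, "vault-123", "exports/2024")
print(calls_made)
```

The `max_calls` cap is there so that an unexpected response cannot cause an endless loop.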

List files

List available files with any metadata

Authorizations:

API_Keys

path Parameters

cloud_vault_id required	string CloudVaultId is the id of the Cloud Vault to list files from.
path required	string.+ Path is the requested path within the Cloud Vault. This will often be blank (no path = list the entire Cloud Vault), but could also be the path of a single file or directory.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name,created_date_time DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space. The sortable values are: name, size, created_date_time (in RFC3339 format), metadata.updated_date_time (in RFC3339 format), metadata.origin.task_id, metadata.origin.task_type, metadata.origin.properties.
filter	string Filter is a string containing logic that can specify a restricted set of Files to be returned, e.g. `name co 'Test' or name co 'Acme'`. The supported fields for filtering are: name, size, created_date_time (in RFC3339 format), metadata.origin.task_type, metadata.origin.task_id, metadata.updated_date_time (in RFC3339 format), metadata.resource_type, metadata.origin.properties.{property_name} (dynamic filtering based on the properties).

Responses

Response samples

Content type

application/json

{"files": [{"created_date_time": "2019-08-24T14:15:22Z",
"metadata": {"origin": {"properties": {"property1": "string",
"property2": "string"
},
"task_id": "string",
"task_type": "string"
},
"resource_type": "RESOURCE_TYPE_INVALID",
"updated_date_time": "2019-08-24T14:15:22Z"
},
"name": "string",
"size": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
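The `pagination` object in the response can drive a loop over all pages. The following is a minimal sketch, assuming that `next_page` is 0 (or absent) once the last page has been returned; this page does not state that explicitly. `fetch_page` is a hypothetical caller-supplied function that sends the list request with the given query parameters and returns the decoded JSON response.

```python
def iter_files(fetch_page, per_page=100):
    """Yield every file entry across all pages of a list response."""
    per_page = max(1, min(per_page, 500))  # documented maximum is 500
    page = 1  # documented default first page
    while page:
        body = fetch_page({"pagination.page": page, "pagination.per_page": per_page})
        yield from body.get("files", [])
        # Assumption: next_page is 0 or missing when there are no more pages.
        next_page = body.get("pagination", {}).get("next_page", 0)
        # Only move forward; this guards against looping on the same page.
        page = next_page if next_page > page else 0


# Stand-in data for two pages of results.
PAGES = {
    1: {"files": [{"name": "a.csv"}, {"name": "b.csv"}], "pagination": {"next_page": 2}},
    2: {"files": [{"name": "c.csv"}], "pagination": {"next_page": 0}},
}
names = [f["name"] for f in iter_files(lambda params: PAGES[params["pagination.page"]])]
print(names)
```

The same loop shape applies to the other list methods on this page, since they share the `pagination` object.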

Import Connectors

Service for managing import connector configurations to connect to a remote data source.

InfoSum Support Portal Article

List Import Connector Configs

List Import Connector Configs belonging to your company subject to the provided filter.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string Default: "created_date_time DESC" Comma separated list of sorting parameters and ordering in the form of `{sort_column} {ASC/DESC}`. Supported sort fields are: `name`, `connector_type`, `created_date_time`, `updated_date_time`
filter	string Filter string as per the docs. Supported filter fields are: `id`, `name`, `description`, `connector_type`, `created_date_time`, `updated_date_time`.

Responses

Response samples

Content type

application/json

{"configs": [{"connector_type": "CONNECTOR_TYPE_INVALID",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"gcs_connector": {"bucket": "string",
"filenames": ["string"
],
"prefix": "string"
},
"gpg": {"do_not_expire": true,
"expiration_datetime": "2019-08-24T14:15:22Z",
"public_key": "string"
},
"id": "string",
"is_gpg": true,
"name": "string",
"owner_id": "string",
"s3_connector": {"access_keys": { },
"bucket": "string",
"cross_account": {"external_id": "string",
"principal": "string",
"session_name": "string",
"user_arn": "string"
},
"filenames": ["string"
],
"prefix": "string"
},
"sftp_connector": {"filenames": ["string"
],
"host": "string",
"host_keys": "string",
"parsed_host_keys": "string",
"port": 0,
"root_directory": "string",
"username": "string"
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
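The `order_by` value for this method is a comma-separated list of `{sort_column} {ASC/DESC}` terms. The sketch below builds such a value and checks each column against the supported sort fields listed above. The helper name `build_order_by` is hypothetical.

```python
# Sort fields listed as supported for Import Connector Configs.
SORT_FIELDS = {"name", "connector_type", "created_date_time", "updated_date_time"}


def build_order_by(*terms):
    """Build an order_by string from (column, direction) pairs."""
    parts = []
    for column, direction in terms:
        if column not in SORT_FIELDS:
            raise ValueError(f"unsupported sort field: {column!r}")
        direction = direction.upper()
        if direction not in ("ASC", "DESC"):
            raise ValueError(f"direction must be ASC or DESC, got {direction!r}")
        parts.append(f"{column} {direction}")
    return ",".join(parts)


order_by = build_order_by(("name", "asc"), ("created_date_time", "desc"))
print(order_by)
```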

Create Import Connector Config

Creates and validates an Import Connector Config based on the provided connector details.

Authorizations:

API_Keys

Request Body schema: application/json
required

The input to the CreateImportConnector method.

config

object

The details used to create a new Import Connector Config.

description	string An optional description given to this Import Connector Config.
gcs_connector	object Google Cloud Storage specific information.
gpg_key_expiration	object Details about the expiration for the GPG key associated with the Import Connector Config.
is_gpg	boolean Enables importing of GPG encrypted files.
name	string The display name given to the Import Connector Config.
s3_connector	object AWS S3 specific information.
sftp_connector	object SFTP server specific information.

Responses

Request samples

Payload

Content type

application/json

{"config": {"description": "string",
"gcs_connector": {"bucket": "string",
"filenames": ["string"
],
"prefix": "string",
"secret_auth": {"credentials_json": "string"
}
},
"gpg_key_expiration": {"do_not_expire": true,
"expiration_datetime": "2019-08-24T14:15:22Z"
},
"is_gpg": true,
"name": "string",
"s3_connector": {"access_keys": {"access_key_id": "string",
"secret_access_key": "string",
"session_token": "string"
},
"bucket": "string",
"cross_account": {"session_name": "string",
"user_arn": "string"
},
"filenames": ["string"
],
"prefix": "string"
},
"sftp_connector": {"filenames": ["string"
],
"host": "string",
"host_keys": "string",
"password": "string",
"port": 0,
"private_key": "string",
"private_key_passphrase": "string",
"root_directory": "string",
"username": "string"
}
}
}

Response samples

Content type

application/json

{"config_id": "string"
}
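As an illustration of the request shape, the sketch below builds a create body for an SFTP connector using the field names from the request sample above. The helper name `build_sftp_connector_config` is hypothetical. Requiring exactly one of `password` or `private_key`, and leaving out the other connector blocks, are choices made by this sketch rather than documented rules. The host, user name, and credential values are placeholders.

```python
def build_sftp_connector_config(name, host, port, username, host_keys, *,
                                password=None, private_key=None,
                                root_directory=None, filenames=(),
                                description=""):
    """Return a create-request body for an SFTP connector (hypothetical helper)."""
    # Sketch-level rule, not a documented one: exactly one credential type.
    if (password is None) == (private_key is None):
        raise ValueError("provide exactly one of password or private_key")
    connector = {
        "host": host,
        "host_keys": host_keys,
        "port": port,
        "username": username,
    }
    # Optional fields are only included when the caller supplies them.
    if filenames:
        connector["filenames"] = list(filenames)
    if root_directory is not None:
        connector["root_directory"] = root_directory
    if password is not None:
        connector["password"] = password
    else:
        connector["private_key"] = private_key
    return {
        "config": {
            "description": description,
            "is_gpg": False,
            "name": name,
            "sftp_connector": connector,
        }
    }


body = build_sftp_connector_config(
    "Nightly SFTP", "sftp.example.com", 22, "importer",
    host_keys="<value returned by the host key lookup>",
    password="example-password",
)
print(sorted(body["config"]["sftp_connector"]))
```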

Refresh the ICC GPG Keys

Refreshes the GPG key pair of an Import Connector Config (ICC). Users will have to re-encrypt their files with the new public GPG key.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Import Connector Config that you wish to refresh the GPG keys for.

Request Body schema: application/json
required

gpg_key_expiration

object

Details about the expiration for the GPG key associated with the Import Connector Config.

do_not_expire	boolean Set the GPG key to never expire. This will override any datetime provided.
expiration_datetime	string <date-time> The UTC datetime when the GPG key will expire.

Responses

Request samples

Payload

Content type

application/json

{"gpg_key_expiration": {"do_not_expire": true,
"expiration_datetime": "2019-08-24T14:15:22Z"
}
}

Response samples

Content type

application/json

{"public_key": "string"
}
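Since `do_not_expire` overrides any datetime provided, a client can pick one of the two forms explicitly. The following is a minimal sketch of building the refresh body, with the expiry rendered as a UTC timestamp in the same style as the sample. The helper name `build_gpg_refresh_body` is hypothetical.

```python
from datetime import datetime, timezone


def build_gpg_refresh_body(expires_at=None):
    """Return the refresh body; passing no expiry means the key never expires."""
    if expires_at is None:
        return {"gpg_key_expiration": {"do_not_expire": True}}
    if expires_at.tzinfo is None:
        # A naive datetime has no defined UTC instant, so refuse it.
        raise ValueError("expires_at must be timezone-aware")
    stamp = expires_at.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return {
        "gpg_key_expiration": {
            "do_not_expire": False,
            "expiration_datetime": stamp,
        }
    }


body = build_gpg_refresh_body(datetime(2030, 1, 31, 12, 0, tzinfo=timezone.utc))
print(body)
```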

Get server host key.

Returns the first valid public SFTP server host key found. Supported key algorithms are ECDSA, RSA and ED25519.

Authorizations:

API_Keys

path Parameters

host required	string The hostname or IP address of the server to get public keys from.
port required	integer <int32> The port to access the server on. For example, 22.

Responses

Response samples

Content type

application/json

{"host_key": "string"
}

Update Import Connector Config

Updates the specified Import Connector Config, overriding any old fields with new ones provided on the request.

Authorizations:

API_Keys

path Parameters

config.id

required

string

The unique ID of the Import Connector Config.

Request Body schema: application/json
required

description	string An optional description given to this Import Connector Config.
gcs_connector	object Google Cloud Storage specific information.
gpg_key_expiration	object Details about the expiration for the GPG key associated with the Import Connector Config.
is_gpg	boolean Enables importing of GPG encrypted files.
name	string The display name given to the Import Connector Config.
s3_connector	object AWS S3 specific information.
sftp_connector	object SFTP server specific information.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"gcs_connector": {"bucket": "string",
"filenames": ["string"
],
"prefix": "string",
"secret_auth": {"credentials_json": "string"
}
},
"gpg_key_expiration": {"do_not_expire": true,
"expiration_datetime": "2019-08-24T14:15:22Z"
},
"is_gpg": true,
"name": "string",
"s3_connector": {"access_keys": {"access_key_id": "string",
"secret_access_key": "string",
"session_token": "string"
},
"bucket": "string",
"cross_account": {"session_name": "string",
"user_arn": "string"
},
"filenames": ["string"
],
"prefix": "string"
},
"sftp_connector": {"filenames": ["string"
],
"host": "string",
"host_keys": "string",
"password": "string",
"port": 0,
"private_key": "string",
"private_key_passphrase": "string",
"root_directory": "string",
"username": "string"
}
}

Response samples

Content type

application/json

{"config": {"connector_type": "CONNECTOR_TYPE_INVALID",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"gcs_connector": {"bucket": "string",
"filenames": ["string"
],
"prefix": "string"
},
"gpg": {"do_not_expire": true,
"expiration_datetime": "2019-08-24T14:15:22Z",
"public_key": "string"
},
"id": "string",
"is_gpg": true,
"name": "string",
"owner_id": "string",
"s3_connector": {"access_keys": { },
"bucket": "string",
"cross_account": {"external_id": "string",
"principal": "string",
"session_name": "string",
"user_arn": "string"
},
"filenames": ["string"
],
"prefix": "string"
},
"sftp_connector": {"filenames": ["string"
],
"host": "string",
"host_keys": "string",
"parsed_host_keys": "string",
"port": 0,
"root_directory": "string",
"username": "string"
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Delete Import Connector Config

Remove the specified Import Connector Config. Import Connector Configs can only be removed if they are not in use by an Importer.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Import Connector Config to be deleted.

Responses

Response samples

Content type

application/json

{ }

Get Import Connector Config

Fetches an Import Connector Config by its Config ID.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the Import Connector Config to retrieve.

Responses

Response samples

Content type

application/json

{"config": {"connector_type": "CONNECTOR_TYPE_INVALID",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"gcs_connector": {"bucket": "string",
"filenames": ["string"
],
"prefix": "string"
},
"gpg": {"do_not_expire": true,
"expiration_datetime": "2019-08-24T14:15:22Z",
"public_key": "string"
},
"id": "string",
"is_gpg": true,
"name": "string",
"owner_id": "string",
"s3_connector": {"access_keys": { },
"bucket": "string",
"cross_account": {"external_id": "string",
"principal": "string",
"session_name": "string",
"user_arn": "string"
},
"filenames": ["string"
],
"prefix": "string"
},
"sftp_connector": {"filenames": ["string"
],
"host": "string",
"host_keys": "string",
"parsed_host_keys": "string",
"port": 0,
"root_directory": "string",
"username": "string"
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Importers

Service for managing importers, which combine an import connector configuration with file patterns to allow for easy re-importing of data.

InfoSum Support Portal Article

List Importers

List available Imports.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string A comma-separated value that defines sorting, e.g. 'name,created_time DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space.
filter	string A string containing logic that can specify a restricted set of Importers to be returned, e.g. `name co 'Test' or name co 'Acme'`.

Responses

Response samples

Content type

application/json

{"imports": [{"cloud_vault_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"data_filter": {"filter": ["string"
]
},
"description": "string",
"icc_id": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"target_folder": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create Importer

Create a configuration for importing data.

Authorizations:

API_Keys

Request Body schema: application/json
required

The input to the CreateImport method.

import

object

The configuration to create a new Import with.

cloud_vault_id	string The identifier of the Cloud Vault into which the files will be imported.
data_filter	object The filtered data to import from the import connector.
description	string A description for the import.
icc_id	string The ID of the Import Connector Config used by the import.
name	string The name given to the import.
owner_id	string
target_folder	string The target folder for the import. This field can be a path containing subfolders. Valid characters are alphanumeric, '_', '-', '.', '/'. This field cannot start or end with a '/'. The 'Recordsets' folder cannot be imported into. This field will be mandatory in the near future.

Responses

Request samples

Payload

Content type

application/json

{"import": {"cloud_vault_id": "string",
"data_filter": {"filter": ["string"
]
},
"description": "string",
"icc_id": "string",
"name": "string",
"owner_id": "string",
"target_folder": "string"
}
}

Response samples

Content type

application/json

{"import_id": "string"
}
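The `target_folder` rules listed for the request body can be checked on the client before a request is made. The sketch below is one way to do that, under three assumptions: "alphanumeric" means ASCII letters and digits, the 'Recordsets' restriction applies when the first path segment is `Recordsets`, and an empty value is treated as invalid (the field is currently optional but is described as becoming mandatory). The helper name `validate_target_folder` is hypothetical.

```python
import re

# Allowed characters per the field description: alphanumeric, '_', '-', '.', '/'.
_ALLOWED = re.compile(r"[A-Za-z0-9_\-./]+")


def validate_target_folder(path):
    """Return path unchanged if it satisfies the documented rules, else raise."""
    if not path:
        raise ValueError("target_folder is empty")
    if not _ALLOWED.fullmatch(path):
        raise ValueError(f"target_folder contains invalid characters: {path!r}")
    if path.startswith("/") or path.endswith("/"):
        raise ValueError("target_folder cannot start or end with '/'")
    # Assumption: the restriction concerns the top-level 'Recordsets' folder.
    if path.split("/", 1)[0] == "Recordsets":
        raise ValueError("the 'Recordsets' folder cannot be imported into")
    return path


checked = validate_target_folder("imports/2024-01/crm.v2")
print(checked)
```

The server remains the authority on what is accepted; a local check like this only catches obvious mistakes earlier.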

Update Importer

Updates the provided fields in a specific Import.

Authorizations:

API_Keys

path Parameters

import.id

required

string

The globally unique ID of the import.

Request Body schema: application/json
required

cloud_vault_id	string The identifier of the Cloud Vault into which the files will be imported.
data_filter	object The filtered data to import from the import connector.
description	string A description for the import.
icc_id	string The ID of the Import Connector Config used by the import.
name	string The name given to the import.
owner_id	string
target_folder	string The target folder for the import. This field can be a path containing subfolders. Valid characters are alphanumeric, '_', '-', '.', '/'. This field cannot start or end with a '/'. The 'Recordsets' folder cannot be imported into. This field will be mandatory in the near future.

Responses

Request samples

Payload

Content type

application/json

{"cloud_vault_id": "string",
"data_filter": {"filter": ["string"
]
},
"description": "string",
"icc_id": "string",
"name": "string",
"owner_id": "string",
"target_folder": "string"
}

Response samples

Content type

application/json

{"import": {"cloud_vault_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"data_filter": {"filter": ["string"
]
},
"description": "string",
"icc_id": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"target_folder": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Delete Importer

Deletes a specific Import.

Authorizations:

API_Keys

path Parameters

import_id

required

string

The ID of the Import to delete.

Responses

Response samples

Content type

application/json

{ }

Get Importer

Get the details of a specific Import.

Authorizations:

API_Keys

path Parameters

import_id

required

string

The ID of the Import to retrieve.

Responses

Response samples

Content type

application/json

{"import": {"cloud_vault_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"data_filter": {"filter": ["string"
]
},
"description": "string",
"icc_id": "string",
"id": "string",
"name": "string",
"owner_id": "string",
"target_folder": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Import Executions

List Import Executions

List all import executions.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string A comma-separated value that defines sorting, e.g. 'current_status DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space.
filter	string A string containing logic that can specify a restricted set of Executions to be returned. e.g. `current_status eq 'WAITING'`

Responses

Response samples

Content type

application/json

{"executions": [{"cloud_vault_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"id": "string",
"import_id": "string",
"progress": {"completion_percentage": 0.1,
"speed": "string",
"total_bytes": "string",
"transferred_bytes": "string"
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
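Filter strings such as `current_status eq 'WAITING'` can be composed from small helpers instead of being written by hand. The sketch below covers only the operators that appear in the examples on this page (`eq`, `co`, and `or`). The helper names are hypothetical, and because quote escaping is not described here, values containing a single quote are refused rather than guessed at.

```python
def condition(field, operator, value):
    """Render a single `field operator 'value'` condition."""
    if operator not in ("eq", "co"):
        raise ValueError(f"operator not covered by this sketch: {operator!r}")
    if "'" in value:
        # Escaping rules are not documented on this page, so do not guess.
        raise ValueError("values containing a single quote are not supported")
    return f"{field} {operator} '{value}'"


def any_of(*conditions):
    """Join conditions with `or`, as in the documented filter examples."""
    return " or ".join(conditions)


waiting = condition("current_status", "eq", "WAITING")
by_name = any_of(condition("name", "co", "Test"), condition("name", "co", "Acme"))
print(waiting)
print(by_name)
```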

Get Import Execution

Gets a specific import execution.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

The ID of the Import execution task to retrieve.

Responses

Response samples

Content type

application/json

{"execution_details": {"execution": {"cloud_vault_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"id": "string",
"import_id": "string",
"progress": {"completion_percentage": 0.1,
"speed": "string",
"total_bytes": "string",
"transferred_bytes": "string"
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
},
"history": [{"date_time": "2019-08-24T14:15:22Z",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"message": "string",
"status": "NO_STATUS"
}
]
}
}
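The byte counters in `progress` are returned as strings, so they need converting before any arithmetic. The following is a minimal sketch that condenses the response above into a small summary. It assumes the `history` list is in chronological order, so the last entry carrying an `error` is the most recent one. The helper name `summarize_execution` and the sample values are hypothetical.

```python
def summarize_execution(response):
    """Condense an execution details response into a small summary dict."""
    details = response["execution_details"]
    execution = details["execution"]
    progress = execution.get("progress", {})
    # Byte counts arrive as strings; treat missing or empty values as 0.
    total = int(progress.get("total_bytes") or 0)
    transferred = int(progress.get("transferred_bytes") or 0)
    errors = [e["error"] for e in details.get("history", []) if e.get("error")]
    # Assumption: history is chronological, so the last error is the latest.
    last_error = errors[-1].get("error", {}).get("message") if errors else None
    return {
        "status": execution["current_status"],
        "transferred_bytes": transferred,
        "total_bytes": total,
        "fraction_done": (transferred / total) if total else None,
        "last_error_message": last_error,
    }


sample = {
    "execution_details": {
        "execution": {
            "current_status": "WAITING",
            "progress": {"total_bytes": "2000", "transferred_bytes": "500"},
        },
        "history": [{"status": "WAITING", "message": "queued"}],
    }
}
summary = summarize_execution(sample)
print(summary)
```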

Cancel Import Execution

Cancels a specific import execution.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

The ID of the Import execution task to cancel.

Responses

Response samples

Content type

application/json

{ }

Execute Import

Executes an import for the specified import ID.

Authorizations:

API_Keys

path Parameters

import_id

required

string

The ID of the Import to execute.

query Parameters

target_folder

string

An optional destination folder. If provided, all imported files will be copied into this folder, overriding the Importer's target folder.

Responses

Response samples

Content type

application/json

{"execution_id": "string"
}

Global Schema

Service presenting the global schema keys and categories that columns can be mapped to during normalization.

InfoSum Support Portal Article

List Global Schema Categories.

List global schema categories

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space. The following values are permitted: name.
filter	string Filter is a string containing logic that can specify a restricted set of categories to be returned, e.g. `name co 'Test' or name co 'Acme'`. The following values are permitted: name.

Responses

Response samples

Content type

application/json

{"categories": [{"description": "string",
"id": "string",
"key_contributor": true,
"name": "string",
"properties": [{"desc": "string",
"is_column": true,
"name": "string",
"optional": true,
"possible_values": {"default_value": "string",
"possible_values": ["string"
]
},
"type": "TYPE_INVALID",
"user_input": true
}
]
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

List Global Schema Keys.

List global schema keys

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space. The following values are permitted: name.
filter	string Filter is a string containing logic that can specify a restricted set of keys to be returned, e.g. `name co 'Test' or name co 'Acme'`. The following values are permitted: name.
category_ids	Array of strings <int64> [ items <int64> ] CategoryIds is the list of Category IDs that are available in the current Category assignment and specifies what keys should be returned.
type	Array of strings Items Enum: "TYPE_INVALID" "INT64" "UINT64" "DOUBLE" "TEXT" "BOOLEAN" "BYTES" "INT64RANGE" "TEXTARRAY" "INT64ARRAY" Type describes the data type assigned to a given category. INT64: the signed 64-bit integer type; UINT64: the unsigned 64-bit integer type; DOUBLE: the floating point type; TEXT: the string type; BOOLEAN: the boolean type; BYTES: the bytes type; INT64RANGE: the range of signed 64-bit integers type; TEXTARRAY: the text array type; INT64ARRAY: the int64 array type.

Responses

Response samples

Content type

application/json

{"keys": [{"categories": [{"category_name": "string",
"is_present": true
}
],
"id": "string",
"name": "string",
"representation_version_ids": ["string"
]
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
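This method takes two array-valued query parameters, `category_ids` and `type`. The sketch below encodes them as repeated parameters using the standard library. That is an assumption about how the server expects arrays to be serialised, since this page does not say. The helper name `build_keys_query` is hypothetical.

```python
from urllib.parse import urlencode

# Values copied from the documented enum for the `type` parameter.
KEY_TYPES = {
    "TYPE_INVALID", "INT64", "UINT64", "DOUBLE", "TEXT",
    "BOOLEAN", "BYTES", "INT64RANGE", "TEXTARRAY", "INT64ARRAY",
}


def build_keys_query(category_ids=(), types=(), page=1, per_page=100):
    """Return a query string for listing global schema keys."""
    for value in types:
        if value not in KEY_TYPES:
            raise ValueError(f"unknown type: {value!r}")
    params = {
        "pagination.page": page,
        "pagination.per_page": min(per_page, 500),  # documented maximum
        "category_ids": list(category_ids),
        "type": list(types),
    }
    # doseq=True emits one key=value pair per list item (assumed array style).
    return urlencode(params, doseq=True)


query = build_keys_query(category_ids=["12", "34"], types=["TEXT", "INT64"])
print(query)
```

Empty lists produce no parameters at all, so calling the helper without arguments yields only the pagination values.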

List Global Schema Key Categories.

List global schema key categories

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space. The following values are permitted: name.
filter	string Filter is a string containing logic that can specify a restricted set of key categories to be returned, e.g. `name co 'Test' or name co 'Acme'`. The following values are permitted: name.

Responses

Response samples

Content type

application/json

{"categories": [{"description": "string",
"id": "string",
"key_contributor": true,
"name": "string",
"properties": [{"desc": "string",
"is_column": true,
"name": "string",
"optional": true,
"possible_values": {"default_value": "string",
"possible_values": ["string"
]
},
"type": "TYPE_INVALID",
"user_input": true
}
]
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Normalization Config

Service for managing configurations for normalizing data.

InfoSum Support Portal Article

List company Normalization Configurations.

List company normalization configurations

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines sorting, e.g. 'name,created_date_time DESC'. Each value has the form '{column} {ASC/DESC}', separated by a space. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).
filter	string Filter is a string containing logic that can specify a restricted set of Normalization Configs to be returned, e.g. `name co 'Test' or name co 'Acme'`. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).

Responses

Response samples

Content type

application/json

{"configs": [{"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Creates a normalization config.

Creates a normalization config

Authorizations:

API_Keys

Request Body schema: application/json
required

description	string Description is the given description for the config.
name	string Name is the name given to the config.
normalization_config	object Config is the normalization configuration defined by the user.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"name": "string",
"normalization_config": {"categories": [{"category_id": "string",
"column_type": {"property1": "TYPE_INVALID",
"property2": "TYPE_INVALID"
},
"columns": ["string"
],
"conversion_delimiter": "string",
"convert_to_multivalue": true,
"custom_category_def": {"is_key": true,
"name": "string",
"type": {"int_set": {"values": ["string"
]
},
"integer_lists": {"values": ["string"
]
},
"integer_ranges": {"ranges": [{"lower": "string",
"upper": "string"
}
]
},
"integer_values": { },
"string_lists": {"values": ["string"
]
},
"string_set": {"validator": {"validation": "string",
"warning": "string"
},
"values": ["string"
]
},
"string_values": {"validator": {"validation": "string",
"warning": "string"
}
}
}
},
"ignore_warnings": true,
"mappings": [{"elements": [{"key": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": [ ]
},
"int64range": {"lower": null,
"upper": null
},
"null": { },
"text": "string",
"textarray": {"values": [ ]
},
"uint64": "string"
},
"value": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": [ ]
},
"int64range": {"lower": null,
"upper": null
},
"null": { },
"text": "string",
"textarray": {"values": [ ]
},
"uint64": "string"
}
}
],
"id": "string"
}
],
"properties": {"property1": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": ["string"
]
},
"int64range": {"lower": "string",
"upper": "string"
},
"null": { },
"text": "string",
"textarray": {"values": ["string"
]
},
"uint64": "string"
},
"property2": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": ["string"
]
},
"int64range": {"lower": "string",
"upper": "string"
},
"null": { },
"text": "string",
"textarray": {"values": ["string"
]
},
"uint64": "string"
}
},
"script": "string"
}
],
"identities": ["string"
],
"ignored_columns": ["string"
],
"incremental": {"full_refresh": true,
"primary_key_columns": ["string"
],
"ttl_column": "string"
},
"redaction_threshold": 0,
"rounding_threshold": 0,
"selected_key_ids": ["string"
]
}
}

Response samples

Content type

application/json

{"id": "string"
}

Delete a Normalization Config.

Delete a normalization config

Authorizations:

API_Keys

path Parameters

config_id

required

string

NormalizationConfigID is the ID of the Normalization Configuration the user wants to delete.

Responses

Response samples

Content type

application/json

{ }

Get a Normalization Configuration.

Get a normalization configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the Normalization Configuration the user wants to get.

Responses

Response samples

Content type

application/json

{"config": {"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"normalization_config": {"categories": [{"category_id": "string",
"column_type": {"property1": "TYPE_INVALID",
"property2": "TYPE_INVALID"
},
"columns": ["string"
],
"conversion_delimiter": "string",
"convert_to_multivalue": true,
"custom_category_def": {"is_key": true,
"name": "string",
"type": {"int_set": {"values": ["string"
]
},
"integer_lists": {"values": ["string"
]
},
"integer_ranges": {"ranges": [{"lower": null,
"upper": null
}
]
},
"integer_values": { },
"string_lists": {"values": ["string"
]
},
"string_set": {"validator": {"validation": "string",
"warning": "string"
},
"values": ["string"
]
},
"string_values": {"validator": {"validation": "string",
"warning": "string"
}
}
}
},
"ignore_warnings": true,
"mappings": [{"elements": [{"key": {"bool": null,
"bytes": null,
"double": null,
"int64": null,
"int64array": null,
"int64range": null,
"null": null,
"text": null,
"textarray": null,
"uint64": null
},
"value": {"bool": null,
"bytes": null,
"double": null,
"int64": null,
"int64array": null,
"int64range": null,
"null": null,
"text": null,
"textarray": null,
"uint64": null
}
}
],
"id": "string"
}
],
"properties": {"property1": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": ["string"
]
},
"int64range": {"lower": "string",
"upper": "string"
},
"null": { },
"text": "string",
"textarray": {"values": ["string"
]
},
"uint64": "string"
},
"property2": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": ["string"
]
},
"int64range": {"lower": "string",
"upper": "string"
},
"null": { },
"text": "string",
"textarray": {"values": ["string"
]
},
"uint64": "string"
}
},
"script": "string"
}
],
"identities": ["string"
],
"ignored_columns": ["string"
],
"incremental": {"full_refresh": true,
"primary_key_columns": ["string"
],
"ttl_column": "string"
},
"redaction_threshold": 0,
"rounding_threshold": 0,
"selected_key_ids": ["string"
]
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Updates a normalization config.

Updates a normalization config. This endpoint uses the POST method instead of PATCH, so the complete updated config, including all details, is required to replace the existing one.

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the config to update.

Request Body schema: application/json
required

payload

object

Payload is the full normalization config payload to fully replace the existing config.

config	object Config is the complete updated normalization config.
description	string Description is the updated normalization config description.
name	string Name is the updated normalization config name.

Responses

Request samples

Payload

Content type

application/json

{"payload": {"config": {"categories": [{"data_type": "DATATYPE_INVALID",
"functions": [{ }
],
"input_column": "string",
"output_column": "string"
}
],
"custom_keys": [{"functions": [{ }
],
"input_column": "string",
"output_column": "string"
}
],
"enrichments": [{"uk_addr_mapper": {"house_number_column": "string",
"postcode_column": "string",
"street_name_column": "string"
},
"us_addr_mapper": {"city_column": "string",
"first_addr_line_column": "string",
"house_number_column": "string",
"secondary_addr_line_column": "string",
"state_column": "string",
"third_addr_line_column": "string",
"zipcode_column": "string"
}
}
],
"export_columns": [{"functions": [{ }
],
"input_column": "string",
"output_column": "string"
}
],
"keys": [{"functions": [{ }
],
"global_schema_id": 0,
"input_column": "string",
"user_params": {"property1": "string",
"property2": "string"
}
}
]
},
"description": "string",
"name": "string"
}
}

Response samples

Content type

application/json

{"config": {"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"id": "string",
"name": "string",
"normalization_config": {"categories": [{"category_id": "string",
"column_type": {"property1": "TYPE_INVALID",
"property2": "TYPE_INVALID"
},
"columns": ["string"
],
"conversion_delimiter": "string",
"convert_to_multivalue": true,
"custom_category_def": {"is_key": true,
"name": "string",
"type": {"int_set": {"values": ["string"
]
},
"integer_lists": {"values": ["string"
]
},
"integer_ranges": {"ranges": [{"lower": null,
"upper": null
}
]
},
"integer_values": { },
"string_lists": {"values": ["string"
]
},
"string_set": {"validator": {"validation": "string",
"warning": "string"
},
"values": ["string"
]
},
"string_values": {"validator": {"validation": "string",
"warning": "string"
}
}
}
},
"ignore_warnings": true,
"mappings": [{"elements": [{"key": {"bool": null,
"bytes": null,
"double": null,
"int64": null,
"int64array": null,
"int64range": null,
"null": null,
"text": null,
"textarray": null,
"uint64": null
},
"value": {"bool": null,
"bytes": null,
"double": null,
"int64": null,
"int64array": null,
"int64range": null,
"null": null,
"text": null,
"textarray": null,
"uint64": null
}
}
],
"id": "string"
}
],
"properties": {"property1": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": ["string"
]
},
"int64range": {"lower": "string",
"upper": "string"
},
"null": { },
"text": "string",
"textarray": {"values": ["string"
]
},
"uint64": "string"
},
"property2": {"bool": true,
"bytes": "string",
"double": 0.1,
"int64": "string",
"int64array": {"values": ["string"
]
},
"int64range": {"lower": "string",
"upper": "string"
},
"null": { },
"text": "string",
"textarray": {"values": ["string"
]
},
"uint64": "string"
}
},
"script": "string"
}
],
"identities": ["string"
],
"ignored_columns": ["string"
],
"incremental": {"full_refresh": true,
"primary_key_columns": ["string"
],
"ttl_column": "string"
},
"redaction_threshold": 0,
"rounding_threshold": 0,
"selected_key_ids": ["string"
]
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Normalizations

Service for normalizing data using a normalization configuration ready for publishing to a dataset.

InfoSum Support Portal Article

Start a Normalization.

Start a normalization

Authorizations:

API_Keys

Request Body schema: application/json
required

NormalizeRequest is used to start a normalization.

cloud_vault_id	string CloudVaultID is the ID of the Cloud Vault to perform the normalization in.
input_recordset_id	string InputRecordsetID specifies the input as a recordset.
normalization_config_id	string NormalizationConfigID is the ID of the Normalization Config the user wants to use to normalize the input files.
output_recordset_name	string OutputRecordsetName specifies the name of the output recordset.
processing_speed	integer <int64> ProcessingSpeed is the number of shards the user wants to use for the normalization request.

Responses

Request samples

Payload

Content type

application/json

{"cloud_vault_id": "string",
"input_recordset_id": "string",
"normalization_config_id": "string",
"output_recordset_name": "string",
"processing_speed": 0
}

Response samples

Content type

application/json

{"execution_id": "string"
}
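A minimal Python sketch of assembling the NormalizeRequest body described above. The field names come from the schema; the helper name `build_normalize_request` and the example IDs are hypothetical, and the sketch assumes `processing_speed` may be omitted. How the body is sent (URL, authentication header) is left out because this section does not specify it.

```python
import json
from typing import Optional


def build_normalize_request(
    cloud_vault_id: str,
    input_recordset_id: str,
    normalization_config_id: str,
    output_recordset_name: str,
    processing_speed: Optional[int] = None,
) -> dict:
    """Return a dict shaped like the NormalizeRequest sample payload."""
    body = {
        "cloud_vault_id": cloud_vault_id,
        "input_recordset_id": input_recordset_id,
        "normalization_config_id": normalization_config_id,
        "output_recordset_name": output_recordset_name,
    }
    # processing_speed is the number of shards. Assumption: the field is
    # optional, so it is only included when the caller supplies a value.
    if processing_speed is not None:
        body["processing_speed"] = processing_speed
    return body


# Placeholder IDs for illustration only.
payload = build_normalize_request(
    "cv-123", "rs-456", "nc-789", "normalized_customers", processing_speed=4
)
print(json.dumps(payload, indent=2))
```

The response to a successful request carries an `execution_id`, which is the value the Get and Cancel operations take as their path parameter.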

List company Normalizations.

List company normalizations

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is the ordering for returned normalization executions.
filter	string Filter is a string containing logic that can specify a restricted set of normalizations to be returned, e.g. `history.status co "READY"`.

Responses

Response samples

Content type

application/json

{"normalizations": [{"cloud_vault_id": "string",
"config_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"id": "string",
"input_recordset_id": "string",
"output_recordset_id": "string",
"output_recordset_name": "string",
"processing_speed": 0,
"progress": {"completion_percentage": 0.1
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
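The list operations in this reference share the same `pagination.page` / `pagination.per_page` parameters and the same `pagination` object in the response. The sketch below walks all pages with a caller-supplied `fetch_page` function (hypothetical, standing in for the actual HTTP call). It assumes that `next_page` equal to 0 means there are no further pages, which the sample suggests but this section does not state.

```python
from typing import Callable, Iterator


def iter_pages(
    fetch_page: Callable[[int, int], dict], items_key: str, per_page: int = 100
) -> Iterator[dict]:
    """Yield every item across all pages of a list response."""
    if not 1 <= per_page <= 500:  # documented maximum is 500
        raise ValueError("per_page must be between 1 and 500")
    page = 1
    while True:
        response = fetch_page(page, per_page)
        yield from response.get(items_key, [])
        next_page = response.get("pagination", {}).get("next_page", 0)
        # Assumption: next_page == 0 (or missing) signals the last page.
        if not next_page:
            break
        page = next_page


def fake_fetch(page: int, per_page: int) -> dict:
    """In-memory stand-in for the list call, serving five records."""
    data = [{"id": f"n{i}"} for i in range(5)]
    start = (page - 1) * per_page
    chunk = data[start:start + per_page]
    has_more = start + per_page < len(data)
    return {
        "normalizations": chunk,
        "pagination": {
            "next_page": page + 1 if has_more else 0,
            "results": len(chunk),
            "total_results": len(data),
        },
    }


ids = [n["id"] for n in iter_pages(fake_fetch, "normalizations", per_page=2)]
print(ids)
```

Passing the response key (`"normalizations"` here) as a parameter lets the same helper serve the other list operations, whose items live under keys such as `"configs"` or `"recordsets"`.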

Get a Normalization.

Get a normalization

Authorizations:

API_Keys

path Parameters

execution_id

required

string

NormalizationID is the ID of the normalization.

Responses

Response samples

Content type

application/json

{"normalization_details": {"history": [{"date_time": "2019-08-24T14:15:22Z",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"message": "string",
"status": "NO_STATUS"
}
],
"normalization": {"cloud_vault_id": "string",
"config_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"id": "string",
"input_recordset_id": "string",
"output_recordset_id": "string",
"output_recordset_name": "string",
"processing_speed": 0,
"progress": {"completion_percentage": 0.1
},
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}
}
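The `history` array in this response records each status change, optionally with a nested `error` object. A small sketch of flattening it into rows for logging follows. The helper name `summarize_history` is hypothetical, and the preference for the inner `error.error.message` over `error.details.message` is an assumption, since the sample shows both without explaining the difference.

```python
from typing import List, Optional, Tuple


def summarize_history(response: dict) -> List[Tuple[str, str, Optional[str]]]:
    """Return (date_time, status, error_message) for each history entry."""
    details = response.get("normalization_details", {})
    rows = []
    for entry in details.get("history", []):
        error = entry.get("error") or {}
        # Assumption: prefer the inner "error" message, fall back to "details".
        message = (error.get("error") or {}).get("message") or (
            error.get("details") or {}
        ).get("message")
        rows.append((entry.get("date_time", ""), entry.get("status", ""), message))
    return rows


# Sample shaped like the documented response; values are placeholders.
sample = {
    "normalization_details": {
        "history": [
            {"date_time": "2019-08-24T14:15:22Z", "status": "NO_STATUS",
             "message": "queued"},
            {"date_time": "2019-08-24T14:20:00Z", "status": "NO_STATUS",
             "error": {"error": {"code": "E1", "message": "bad column"}}},
        ]
    }
}

for row in summarize_history(sample):
    print(row)
```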

Cancel a Normalization.

Cancel a normalization

Authorizations:

API_Keys

path Parameters

execution_id

required

string

NormalizationID is the ID of the normalization.

Responses

Response samples

Content type

application/json

{ }

Prepare Configs

Service for managing configurations to prepare a normalised recordset for publishing.

InfoSum Support Portal Article

List company Dataset Prepare Configurations.

List company dataset prepare configurations

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list of sort terms, e.g. 'name,created_time DESC'. Each term has the form '{column} {ASC/DESC}', with column and direction separated by a space. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).
filter	string Filter is a string containing logic that can specify a restricted set of dataset prepare configs to be returned, e.g. `name co 'Test' or name co 'Acme'`. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).

Responses

Response samples

Content type

application/json

{"configs": [{"created_date_time": "2019-08-24T14:15:22Z",
"dataset_publish_config_summary": {"all_categories": true,
"all_keys": true,
"category_column_count": 0,
"config_id": "string",
"export_column_count": 0,
"incremental_full_refresh": true,
"key_column_count": 0,
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"description": "string",
"id": "string",
"name": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
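The `order_by` parameter for this operation takes comma-separated '{column} {ASC/DESC}' terms drawn from a fixed set of columns. The sketch below builds such a string and rejects unsupported columns before the request is sent. The helper name `build_order_by` is hypothetical; the permitted columns are the three listed in the parameter description.

```python
from typing import Iterable, Tuple

PERMITTED_COLUMNS = {"name", "created_date_time", "updated_date_time"}


def build_order_by(terms: Iterable[Tuple[str, str]]) -> str:
    """Join (column, direction) pairs into an order_by query value."""
    parts = []
    for column, direction in terms:
        if column not in PERMITTED_COLUMNS:
            raise ValueError(f"unsupported sort column: {column}")
        direction = direction.upper()
        if direction not in ("ASC", "DESC"):
            raise ValueError(f"direction must be ASC or DESC, got {direction}")
        # Column and direction are separated by a single space.
        parts.append(f"{column} {direction}")
    # Terms are separated by commas.
    return ",".join(parts)


print(build_order_by([("name", "asc"), ("created_date_time", "DESC")]))
```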

Creates a reusable dataset prepare config.

Creates a reusable dataset prepare config

Authorizations:

API_Keys

Request Body schema: application/json
required

config	object (Config is the dataset prepare config - can be updated after creation) DatasetPublishConfig configures a prepare task.
description	string (Description is the user assigned description for the config)
name	string (Name is the user assigned name of the config)

Responses

Request samples

Payload

Content type

application/json

{"config": {"all_categories": true,
"all_keys": true,
"category_selection": ["string"
],
"export_column_selection": ["string"
],
"incremental_full_refresh": true,
"key_selection": ["string"
],
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"description": "string",
"name": "string"
}

Response samples

Content type

application/json

{"id": "string"
}
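As a sketch of how the request body above might be assembled in Python, the dataclass below mirrors the fields of the `config` object in the sample payload. The class and function names are hypothetical, and the default values (everything off, thresholds at 0) are assumptions chosen for the example, not documented server defaults.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class DatasetPublishConfig:
    """Mirror of the `config` object in the sample payload."""
    all_categories: bool = False
    all_keys: bool = False
    category_selection: List[str] = field(default_factory=list)
    export_column_selection: List[str] = field(default_factory=list)
    incremental_full_refresh: bool = False
    key_selection: List[str] = field(default_factory=list)
    purge_expired_rows_only: bool = False
    redaction_threshold: int = 0
    rounding_threshold: int = 0


def build_prepare_config_request(
    name: str, description: str, config: DatasetPublishConfig
) -> dict:
    """Return a dict shaped like the create-config request sample."""
    return {"config": asdict(config), "description": description, "name": name}


# Placeholder values for illustration only.
request = build_prepare_config_request(
    "weekly-publish",
    "Keys 1 and 2 with all categories",
    DatasetPublishConfig(all_categories=True, key_selection=["1", "2"]),
)
print(json.dumps(request, indent=2))
```

The same body shape is used by the update operation, so the builder can be reused there with the `config_id` supplied as the path parameter.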

Delete a Dataset Prepare Config.

Delete a dataset prepare config

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the Dataset Prepare Configuration the user wants to delete.

Responses

Response samples

Content type

application/json

{ }

Get a Dataset Prepare Configuration.

Get a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare configuration the user wants to get.

Responses

Response samples

Content type

application/json

{"config": {"created_date_time": "2019-08-24T14:15:22Z",
"dataset_publish_config": {"all_categories": true,
"all_keys": true,
"category_selection": ["string"
],
"export_column_selection": ["string"
],
"incremental_full_refresh": true,
"key_selection": ["string"
],
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"description": "string",
"id": "string",
"name": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}

Update a Dataset Prepare Configuration.

Update a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to update.

Request Body schema: application/json
required

config	object (Config is the dataset prepare config - can be updated after creation) DatasetPublishConfig configures a prepare task.
description	string Description is a description of the dataset prepare config.
name	string Name is the name assigned to the dataset prepare config.

Responses

Request samples

Payload

Content type

application/json

{"config": {"all_categories": true,
"all_keys": true,
"category_selection": ["string"
],
"export_column_selection": ["string"
],
"incremental_full_refresh": true,
"key_selection": ["string"
],
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"description": "string",
"name": "string"
}

Response samples

Content type

application/json

{ }

Delete categories with a filter from a Dataset Prepare Config.

Delete categories with a filter from a dataset prepare config

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to delete categories from.

query Parameters

filter

string

Filter specifies which subset of categories to delete

Responses

Response samples

Content type

application/json

{ }

List the categories for a Dataset Prepare Configuration.

List the categories for a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to list.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list of sort terms, e.g. 'name,created_time DESC'. Each term has the form '{column} {ASC/DESC}', with column and direction separated by a space. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).
filter	string Filter is a string containing logic that can specify a restricted set of categories to be returned. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).

Responses

Response samples

Content type

application/json

{"categories": ["string"
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Add a category to a Dataset Prepare Configuration.

Add a category to a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to add categories to.

Request Body schema: application/json
required

categories	Array of strings

Responses

Request samples

Payload

Content type

application/json

{"categories": ["string"
]
}

Response samples

Content type

application/json

{ }

Delete export columns with a filter from a Dataset Prepare Config.

Delete export columns with a filter from a dataset prepare config

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to delete export columns from.

query Parameters

filter

string

Filter specifies which subset of export columns to delete

Responses

Response samples

Content type

application/json

{ }

List the export columns for a Dataset Prepare Configuration.

List the export columns for a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to list.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list of sort terms, e.g. 'name,created_time DESC'. Each term has the form '{column} {ASC/DESC}', with column and direction separated by a space. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).
filter	string Filter is a string containing logic that can specify a restricted set of export columns to be returned. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).

Responses

Response samples

Content type

application/json

{"export_columns": ["string"
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Add an export column to a Dataset Prepare Configuration.

Add an export column to a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to add export columns to.

Request Body schema: application/json
required

export_columns

Array of strings (ExportColumns is the list of export column names to be added to the config)

Responses

Request samples

Payload

Content type

application/json

{"export_columns": ["string"
]
}

Response samples

Content type

application/json

{ }

Delete a key from a Dataset Prepare Config.

Delete a key from a dataset prepare config

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to delete keys from.

query Parameters

filter

string

Filter specifies which subset of keys to delete

Responses

Response samples

Content type

application/json

{ }

List the keys for a Dataset Prepare Configuration.

List the keys for a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to list.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list of sort terms, e.g. 'name,created_time DESC'. Each term has the form '{column} {ASC/DESC}', with column and direction separated by a space. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).
filter	string Filter is a string containing logic that can specify a restricted set of keys to be returned. The following values are permitted: name, created_date_time (RFC3339 format), updated_date_time (RFC3339 format).

Responses

Response samples

Content type

application/json

{"keys": ["string"
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Add a key to a Dataset Prepare Configuration.

Add a key to a dataset prepare configuration

Authorizations:

API_Keys

path Parameters

config_id

required

string

ConfigID is the ID of the dataset prepare config the user wants to add keys to.

Request Body schema: application/json
required

keys	Array of strings <int64> (Keys is the list of key IDs to be added to the config) [ items <int64> ]

Responses

Request samples

Payload

Content type

application/json

{"keys": ["string"
]
}

Response samples

Content type

application/json

{ }

Publishing

Service for preparing data using configurations, and publishing the prepared data to a dataset.

InfoSum Support Portal Article

Start a Prepare on normalized data.

Start a Prepare Operation

Authorizations:

API_Keys

Request Body schema: application/json
required

PrepareRequest is used to prepare a dataset.

cloud_vault_id	string CloudVaultId is the ID of cloud vault where source files are found.
config_embedded	object (Publishing.DatasetPublishConfig) DatasetPublishConfig configures a prepare task.
config_id	string
dataset_id	string DatasetId is the ID of the Dataset to prepare.
recordset_id	string RecordsetId determines the set of files to normalise. All files in the cloud vault with the specific prefix will be considered.

Responses

Request samples

Payload

Content type

application/json

{"cloud_vault_id": "string",
"config_embedded": {"all_categories": true,
"all_keys": true,
"category_selection": ["string"
],
"export_column_selection": ["string"
],
"incremental_full_refresh": true,
"key_selection": ["string"
],
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"config_id": "string",
"dataset_id": "string",
"recordset_id": "string"
}

Response samples

Content type

application/json

{"execution_id": "string"
}

List company Prepares.

List company prepares

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is the ordering for returned prepare executions.
filter	string Filter is a string containing logic that can specify a restricted set of prepares to be returned, e.g. `name co 'Test' or name co 'Acme'`.

Responses

Response samples

Content type

application/json

{"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
},
"prepares": [{"cloud_vault_id": "string",
"config_summary": {"all_categories": true,
"all_keys": true,
"category_column_count": 0,
"config_id": "string",
"export_column_count": 0,
"incremental_full_refresh": true,
"key_column_count": 0,
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"dataset_id": "string",
"expiration_date_time": "2019-08-24T14:15:22Z",
"id": "string",
"prepare_status": "NO_STATUS",
"progress": {"completion_percentage": 0.1
},
"recordset_id": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
]
}

Get details on a Prepare operation.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

ExecutionID is the ID of the prepare execution to fetch.

Responses

Response samples

Content type

application/json

{"prepare_details": {"history": [{"date_time": "2019-08-24T14:15:22Z",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"message": "string",
"prepare_status": "NO_STATUS",
"status": "NO_STATUS"
}
],
"prepare": {"cloud_vault_id": "string",
"config": {"all_categories": true,
"all_keys": true,
"category_selection": ["string"
],
"export_column_selection": ["string"
],
"incremental_full_refresh": true,
"key_selection": ["string"
],
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"config_embedded": {"all_categories": true,
"all_keys": true,
"category_selection": ["string"
],
"export_column_selection": ["string"
],
"incremental_full_refresh": true,
"key_selection": ["string"
],
"purge_expired_rows_only": true,
"redaction_threshold": 0,
"rounding_threshold": 0
},
"config_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"dataset_id": "string",
"expiration_date_time": "2019-08-24T14:15:22Z",
"id": "string",
"prepare_status": "NO_STATUS",
"progress": {"completion_percentage": 0.1
},
"recordset_id": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}
}

Cancel a Prepare operation.

Cancel a Prepare operation

Authorizations:

API_Keys

path Parameters

execution_id

required

string

ExecutionID is the ID of the prepare execution to cancel.

Responses

Response samples

Content type

application/json

{ }

Publishes prepared data to a dataset.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

ExecutionID is the ID of the prepare execution to publish to the dataset.

Responses

Response samples

Content type

application/json

{"execution_id": "string"
}
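The Publishing operations chain together: start a prepare, check its details until it is ready, then publish it using the same `execution_id`. The sketch below shows that flow with the three calls injected as plain functions (hypothetical stand-ins for the HTTP requests). Because this section does not list the status values or the scale of `completion_percentage`, readiness is decided by a caller-supplied `is_ready` predicate, and the 0 to 100 scale used in the demo is an assumption.

```python
import time
from typing import Callable


def prepare_and_publish(
    start_prepare: Callable[[dict], dict],
    get_prepare: Callable[[str], dict],
    publish: Callable[[str], dict],
    request: dict,
    is_ready: Callable[[dict], bool],
    poll_seconds: float = 5.0,
    max_polls: int = 120,
) -> str:
    """Start a prepare, poll until is_ready(prepare) is true, then publish."""
    execution_id = start_prepare(request)["execution_id"]
    for _ in range(max_polls):
        prepare = get_prepare(execution_id)["prepare_details"]["prepare"]
        if is_ready(prepare):
            # Publishing returns its own execution_id.
            return publish(execution_id)["execution_id"]
        time.sleep(poll_seconds)
    raise TimeoutError(f"prepare {execution_id} not ready after {max_polls} polls")


# In-memory stand-ins so the sketch runs without network access.
_progress = iter([10.0, 60.0, 100.0])


def fake_start(request: dict) -> dict:
    return {"execution_id": "prep-1"}


def fake_get(execution_id: str) -> dict:
    pct = next(_progress)
    return {"prepare_details": {"prepare": {
        "id": execution_id, "progress": {"completion_percentage": pct}}}}


def fake_publish(execution_id: str) -> dict:
    return {"execution_id": "pub-1"}


result = prepare_and_publish(
    fake_start, fake_get, fake_publish,
    {"cloud_vault_id": "cv-123", "config_id": "pc-1",
     "dataset_id": "ds-1", "recordset_id": "rs-1"},
    # Assumption: completion_percentage runs from 0 to 100.
    is_ready=lambda p: p["progress"]["completion_percentage"] >= 100.0,
    poll_seconds=0,
)
print(result)
```

A production version would also inspect the status and `history` fields for failures instead of relying on the poll limit alone.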

Recordsets

Service for managing recordsets created as a result of normalisation or a recordset task executed using imported files.

InfoSum Support Portal Article

List recordsets

List available recordsets with any metadata

Authorizations:

API_Keys

path Parameters

cloud_vault_id

required

string

The globally unique identifier for the Cloud Vault whose recordsets are to be listed.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string
filter	string Filter is a string containing logic that can specify a restricted set of Recordsets to be returned, e.g. `name co 'Test' or name co 'Acme'`. These are the supported fields for filtering: id, name, record_count, total_size, columns_headers, folder_path, is_publishable, origin_task_id, origin_task_type, origin_task_properties, created_date_time (in RFC3339 format), updated_date_time (in RFC3339 format).

Responses

Response samples

Content type

application/json

{"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
},
"recordsets": [{"attribute_column_count": "string",
"average_fill_rate": 0.1,
"created_date_time": "2019-08-24T14:15:22Z",
"id": "string",
"is_publishable": true,
"key_column_count": "string",
"locks": [{"created_date_time": "2019-08-24T14:15:22Z",
"task_id": "string",
"task_type": "string"
}
],
"name": "string",
"origin": {"properties": {"property1": "string",
"property2": "string"
},
"task_id": "string",
"task_type": "string"
},
"raw_column_count": "string",
"record_count": "string",
"size": "string",
"total_size": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
]
}
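The `filter` parameter for this operation accepts expressions such as `name co 'Test' or name co 'Acme'`. A minimal sketch of generating that form from a list of values and URL-encoding it follows. The helper name `build_contains_filter` is hypothetical, and because the quoting rules for values are not given here, values containing a single quote are rejected instead of escaped.

```python
from typing import Iterable
from urllib.parse import urlencode


def build_contains_filter(field_name: str, values: Iterable[str]) -> str:
    """Build "<field> co '<value>'" clauses joined with "or"."""
    clauses = []
    for value in values:
        # Escaping rules are not documented, so refuse embedded quotes.
        if "'" in value:
            raise ValueError("values containing a single quote are not supported")
        clauses.append(f"{field_name} co '{value}'")
    if not clauses:
        raise ValueError("at least one value is required")
    return " or ".join(clauses)


filter_value = build_contains_filter("name", ["Test", "Acme"])
print(filter_value)

# Encode the filter together with the pagination parameters.
query = urlencode({"filter": filter_value, "pagination.per_page": 50})
print(query)
```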

Delete recordset

Delete the specified recordset

Authorizations:

API_Keys

path Parameters

cloud_vault_id required	string The globally unique identifier for the Cloud Vault that contains the recordset to delete.
recordset_id required	string The ID of the recordset to be deleted.

Responses

Response samples

200
401
403
500
default

Content type

application/json

{ }

Get information about a specific recordset

Gets information about a recordset along with any metadata

Authorizations:

API_Keys

path Parameters

cloud_vault_id required	string The globally unique identifier for the Cloud Vault that contains the recordset to get.
recordset_id required	string The ID of the recordset to get.

Responses

Response samples

Content type

application/json

{"recordset": {"attribute_column_count": "string",
"average_fill_rate": 0.1,
"columns_headers": ["string"
],
"created_date_time": "2019-08-24T14:15:22Z",
"file_count": "string",
"folder_path": "string",
"id": "string",
"is_publishable": true,
"key_column_count": "string",
"locks": [{"created_date_time": "2019-08-24T14:15:22Z",
"task_id": "string",
"task_type": "string"
}
],
"name": "string",
"normalization_details": {"global_schema_version": "string",
"incremental": {"primary_key_column_name": "string",
"ttl_column_name": "string"
}
},
"origin": {"properties": {"property1": "string",
"property2": "string"
},
"task_id": "string",
"task_type": "string"
},
"raw_column_count": "string",
"record_count": "string",
"size": "string",
"total_size": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Renames a recordset.

Renames a recordset on a specific Cloud Vault.

Authorizations:

API_Keys

path Parameters

cloud_vault_id required	string The globally unique identifier for the Cloud Vault that contains the recordset to rename.
recordset_id required	string The ID of the recordset to be renamed.

Request Body schema: application/json
required

new_name

string

The new name of the recordset. Valid characters are alphanumeric, '_', '-', ' '.

Responses

Request samples

Payload

Content type

application/json

{"new_name": "string"
}

Response samples

Content type

application/json

{"recordset": {"attribute_column_count": "string",
"average_fill_rate": 0.1,
"columns_headers": ["string"
],
"created_date_time": "2019-08-24T14:15:22Z",
"file_count": "string",
"folder_path": "string",
"id": "string",
"is_publishable": true,
"key_column_count": "string",
"locks": [{"created_date_time": "2019-08-24T14:15:22Z",
"task_id": "string",
"task_type": "string"
}
],
"name": "string",
"normalization_details": {"global_schema_version": "string",
"incremental": {"primary_key_column_name": "string",
"ttl_column_name": "string"
}
},
"origin": {"properties": {"property1": "string",
"property2": "string"
},
"task_id": "string",
"task_type": "string"
},
"raw_column_count": "string",
"record_count": "string",
"size": "string",
"total_size": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}
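The `new_name` field only accepts alphanumeric characters, '_', '-' and ' '. A sketch of checking a name client-side before sending the rename request follows. The helper names are hypothetical, and the pattern assumes "alphanumeric" means ASCII letters and digits, which this section does not spell out.

```python
import re

# Assumption: "alphanumeric" is read as ASCII letters and digits.
_VALID_NAME = re.compile(r"[A-Za-z0-9_\- ]+")


def is_valid_recordset_name(name: str) -> bool:
    """True if name is non-empty and uses only the documented characters."""
    return _VALID_NAME.fullmatch(name) is not None


def build_rename_request(new_name: str) -> dict:
    """Return the rename request body, rejecting invalid names early."""
    if not is_valid_recordset_name(new_name):
        raise ValueError(f"invalid recordset name: {new_name!r}")
    return {"new_name": new_name}


print(build_rename_request("Q3 customers_v2-final"))
print(is_valid_recordset_name("bad/name"))
```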

List recordset columns

List available recordset columns

Authorizations:

API_Keys

path Parameters

cloud_vault_id required	string The globally unique identifier for the Cloud Vault that contains the recordset.
recordset_id required	string The globally unique identifier for the recordset whose columns will be listed.

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated list of sort terms, e.g. 'name DESC'. Each term has the form '{column} {ASC/DESC}', with column and direction separated by a space. These are the values that are sortable: name, statistics.value_count, statistics.cell_count.
filter	string Filter is a string containing logic that can specify a restricted set of recordset columns to be returned, e.g. `name co 'Test' or name co 'Acme'`. These are the supported fields for filtering: name, value_type, statistics.value_count, statistics.cell_count, pii.

Responses

Response samples

Content type

application/json

{"columns": [{"attribute_column": {"category_id": "string",
"category_name": "string",
"data_type": "DATA_TYPE_INVALID",
"is_custom": true,
"representation_id": "string",
"representation_name": "string"
},
"key_column": {"id": "string",
"is_custom": true,
"name": "string"
},
"name": "string",
"pii": true,
"raw_column": { },
"statistics": {"cell_count": "string",
"invalid_count": "string",
"null_count": "string",
"value_count": "string"
},
"value_type": "VALUE_TYPE_INVALID"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
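In the sample above the `statistics` counts are typed as strings, which is how 64-bit integers are commonly carried in JSON. The sketch below converts them to Python integers and picks out the columns flagged as `pii`. The helper names are hypothetical, and treating a missing count as 0 is an assumption.

```python
from typing import Dict, List

_COUNT_FIELDS = ("cell_count", "invalid_count", "null_count", "value_count")


def parse_statistics(column: dict) -> Dict[str, int]:
    """Convert the string-encoded counts of one column to integers."""
    stats = column.get("statistics") or {}
    # Assumption: a missing count is treated as 0.
    return {name: int(stats.get(name, "0")) for name in _COUNT_FIELDS}


def pii_column_names(response: dict) -> List[str]:
    """Return the names of columns whose pii flag is true."""
    return [c["name"] for c in response.get("columns", []) if c.get("pii")]


# Sample shaped like the documented response; values are placeholders.
sample = {
    "columns": [
        {"name": "email", "pii": True,
         "statistics": {"cell_count": "1000", "invalid_count": "3",
                        "null_count": "10", "value_count": "987"}},
        {"name": "age_band", "pii": False,
         "statistics": {"cell_count": "1000", "null_count": "0",
                        "value_count": "1000"}},
    ]
}

print(parse_statistics(sample["columns"][0]))
print(pii_column_names(sample))
```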

Recordset Configs

Service for managing configurations for creating recordsets from imported files.

InfoSum Support Portal Article

List Recordset Configs.

List all recordset configs.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value to define sorting.
filter	string Filter is a string containing logic that can specify a restricted set of Configs to be returned.

Responses

Response samples

Content type

application/json

{"configs": [{"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"id": "string",
"name": "string",
"primary_delimiter": "string",
"secondary_delimiter": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}

Create a Recordset Config.

Create a recordset config that can be used to create a recordset from files on a Cloud Vault.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreateConfigRequest contains the information required to create a recordset config.

description	string An optional description given to the config.
file_type	string (FileType enumerates the file type detection behaviour when creating a recordset) Default: "AUTO" Enum: "AUTO" "CSV" "PARQUET" Allows the user to dictate the file format used by the recordset creation task to read the input files, disregarding any file extensions.
header_names	Array of strings The names of the headers/columns. Must be provided if the input files do not have a header row, and in the same order as the columns in the input files.
header_row required	boolean Indicates whether or not the input files have a header row.
name required	string The unique name to identify the config.
primary_delimiter required	string The primary delimiter used in the input files.
secondary_delimiter	string The secondary delimiter (if any) used in the input files.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"name": "string",
"primary_delimiter": "string",
"secondary_delimiter": "string"
}

Response samples

Content type

application/json

{"config_id": "string"
}
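The CreateConfigRequest rules above (three required fields, a fixed `file_type` enum, and `header_names` being mandatory when there is no header row) can be checked before the request is sent. A minimal sketch, with a hypothetical helper name and placeholder values:

```python
from typing import List, Optional

FILE_TYPES = ("AUTO", "CSV", "PARQUET")


def build_recordset_config_request(
    name: str,
    primary_delimiter: str,
    header_row: bool,
    header_names: Optional[List[str]] = None,
    file_type: str = "AUTO",
    secondary_delimiter: Optional[str] = None,
    description: Optional[str] = None,
) -> dict:
    """Return a CreateConfigRequest body after basic client-side checks."""
    if file_type not in FILE_TYPES:
        raise ValueError(f"file_type must be one of {FILE_TYPES}")
    # header_names must be provided if the files have no header row.
    if not header_row and not header_names:
        raise ValueError("header_names is required when header_row is false")
    body = {
        "name": name,
        "primary_delimiter": primary_delimiter,
        "header_row": header_row,
        "file_type": file_type,
    }
    # Optional fields are only included when supplied.
    if header_names:
        body["header_names"] = list(header_names)
    if secondary_delimiter is not None:
        body["secondary_delimiter"] = secondary_delimiter
    if description is not None:
        body["description"] = description
    return body


print(build_recordset_config_request(
    "headerless-csv", ",", header_row=False,
    header_names=["email", "postcode"], file_type="CSV",
))
```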

Update a Recordset Config.

Update the details of a specific recordset config.

Authorizations:

API_Keys

path Parameters

config.id

required

string

The ID of the recordset config to update.

Request Body schema: application/json
required

description	string Description is an optional description given to this config.
file_type	string (FileType enumerates the file type detection behaviour when creating a recordset) Default: "AUTO" Enum: "AUTO" "CSV" "PARQUET" Allows the user to dictate the file format used by the recordset creation task to read the input files, disregarding any file extensions.
header_names	Array of strings HeaderNames is a list of the header/column names. To be provided if the files do not contain a header row. Must be in the same order as the columns in the files.
header_row	boolean HeaderRow indicates whether the files have a header row.
name	string Name is a unique name to help identify the config. Valid characters are alphanumeric, '_', '-', ' '.
primary_delimiter	string PrimaryDelimiter is the primary delimiter of the data.
secondary_delimiter	string SecondaryDelimiter is the secondary delimiter of the data, if any.

Responses

Request samples

Payload

Content type

application/json

{"description": "string",
"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"name": "string",
"primary_delimiter": "string",
"secondary_delimiter": "string"
}

Response samples

Content type

application/json

{"config": {"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"id": "string",
"name": "string",
"primary_delimiter": "string",
"secondary_delimiter": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Delete a Recordset Config.

Delete a specific recordset config.

Authorizations:

API_Keys

path Parameters

config_id

required

string

Responses

Response samples

Content type

application/json

{ }

Get a Recordset Config.

Get the details of a specific recordset config.

Authorizations:

API_Keys

path Parameters

config_id

required

string

The ID of the config to get.

Responses

Response samples

Content type

application/json

{"config": {"created_by": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"description": "string",
"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"id": "string",
"name": "string",
"primary_delimiter": "string",
"secondary_delimiter": "string",
"updated_date_time": "2019-08-24T14:15:22Z"
}
}

Recordset Creation

Service for creating recordsets using a stored configuration from imported files.

InfoSum Support Portal Article

Create a Recordset.

Create a recordset from files on a cloud vault. Valid characters for recordset names: alphanumeric, '_', '-', ' '.

Authorizations:

API_Keys

Request Body schema: application/json
required

CreateExecutionRequest contains the information required to create a recordset.

cloud_vault_id required	string The globally unique identifier for the Cloud Vault.
config_id	string The ID of a recordset config that can be used to execute the creation of a recordset.
execution_config	object The required information needed to execute the creation of a recordset.
filepaths	Array of strings A list of input files, each given as the full path from the root of the Cloud Vault, to be used to create the recordset. All files must be in the same format. Must not be set if `folder` has been specified.
recordset_name required	string The name given to the recordset that will be created.

Responses

Request samples

Payload

Content type

application/json

{"cloud_vault_id": "string",
"config_id": "string",
"execution_config": {"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"primary_delimiter": "string",
"secondary_delimiter": "string"
},
"filepaths": ["string"
],
"recordset_name": "string"
}

Response samples

Content type

application/json

{"execution_id": "string"
}
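A CreateExecutionRequest body can be built and checked before submission. This is a minimal sketch under stated assumptions: the helper name `build_create_execution_request` is hypothetical, "alphanumeric" is taken to mean ASCII letters and digits, and optional fields are simply omitted when not supplied. It does not send anything over the network.

```python
import json
import re

# Valid recordset name characters per the description: alphanumeric, '_', '-', ' '.
# Assumption: "alphanumeric" is limited to ASCII letters and digits.
_NAME_RE = re.compile(r"[A-Za-z0-9_\- ]+")


def build_create_execution_request(cloud_vault_id, recordset_name, filepaths,
                                   config_id=None, execution_config=None):
    """Return a dict shaped like the CreateExecutionRequest sample."""
    if not cloud_vault_id:
        raise ValueError("cloud_vault_id is required")
    if not recordset_name or not _NAME_RE.fullmatch(recordset_name):
        raise ValueError(f"invalid recordset_name: {recordset_name!r}")
    body = {
        "cloud_vault_id": cloud_vault_id,
        "recordset_name": recordset_name,
        "filepaths": list(filepaths),
    }
    # Optional fields are only included when the caller supplies them.
    if config_id is not None:
        body["config_id"] = config_id
    if execution_config is not None:
        body["execution_config"] = dict(execution_config)
    return body


request_body = build_create_execution_request(
    cloud_vault_id="vault-123",            # placeholder ID
    recordset_name="customers 2024",
    filepaths=["/imports/customers.csv"],  # placeholder path
    config_id="config-456",                # placeholder ID
)
print(json.dumps(request_body, indent=2))
```

On success the service returns an `execution_id`, which identifies the execution in the list, get, and cancel operations.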

List Executions.

List all recordset executions.

Authorizations:

API_Keys

query Parameters

pagination.page	integer <int32> The requested page of records. Default is 1.
pagination.per_page	integer <int32> The number of records to return per page. Maximum is 500. Default is 100.
order_by	string OrderBy is a comma-separated value that defines the sort order.
filter	string Filter is a string containing logic that can specify a restricted set of Executions to be returned.

Responses

Response samples

Content type

application/json

{"executions": [{"cloud_vault_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"id": "string",
"progress": {"completion_percentage": 0.1,
"lines_processed": "string",
"malformed_lines": "string",
"total_lines": "string"
},
"recordset_id": "string",
"recordset_name": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
],
"pagination": {"next_page": 0,
"results": 0,
"total_results": 0
}
}
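The `pagination` object in the response can drive a loop over all pages. The sketch below assumes that a `next_page` of 0 (or a missing value) means there are no further pages, which the sample suggests but does not state. `fetch_page` is a hypothetical callable supplied by the caller that performs the actual request and returns the parsed JSON body.

```python
def iter_executions(fetch_page, per_page=100):
    """Yield every execution by following pagination.next_page.

    fetch_page(page, per_page) is a hypothetical callable that returns the
    parsed response body for one page.
    """
    # The documented maximum for pagination.per_page is 500.
    if not 1 <= per_page <= 500:
        raise ValueError("per_page must be between 1 and 500")
    page = 1
    while True:
        body = fetch_page(page, per_page)
        yield from body.get("executions", [])
        next_page = body.get("pagination", {}).get("next_page", 0)
        # Assumption: 0 or a missing value marks the last page. The
        # comparison also guards against a page number that never advances.
        if not next_page or next_page <= page:
            break
        page = next_page


# Usage with an in-memory stand-in for the real endpoint.
_FAKE_PAGES = {
    1: {"executions": [{"id": "a"}, {"id": "b"}],
        "pagination": {"next_page": 2, "results": 2, "total_results": 3}},
    2: {"executions": [{"id": "c"}],
        "pagination": {"next_page": 0, "results": 1, "total_results": 3}},
}


def fake_fetch(page, per_page):
    return _FAKE_PAGES[page]


execution_ids = [e["id"] for e in iter_executions(fake_fetch, per_page=2)]
print(execution_ids)
```

Passing the fetcher in as a callable keeps the paging logic independent of the HTTP client and of the endpoint URL, neither of which is specified here.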

Get a Recordset Execution.

Get the details of a specific recordset execution.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

The ID of the execution to get.

Responses

Response samples

Content type

application/json

{"execution_details": {"cloud_vault_id": "string",
"config_id": "string",
"created_date_time": "2019-08-24T14:15:22Z",
"current_status": "NO_STATUS",
"execution_config": {"file_type": "AUTO",
"header_names": ["string"
],
"header_row": true,
"primary_delimiter": "string",
"secondary_delimiter": "string"
},
"filepaths": ["string"
],
"history": [{"date_time": "2019-08-24T14:15:22Z",
"error": {"details": {"code": "string",
"message": "string"
},
"error": {"code": "string",
"message": "string"
},
"help_url": "string",
"request_id": "string",
"time": "2019-08-24T14:15:22Z"
},
"message": "string",
"status": "NO_STATUS"
}
],
"id": "string",
"progress": {"completion_percentage": 0.1,
"lines_processed": "string",
"malformed_lines": "string",
"total_lines": "string"
},
"recordset_id": "string",
"recordset_name": "string",
"updated_date_time": "2019-08-24T14:15:22Z",
"user_id": "string"
}
}
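In the sample, the `progress` counters (`lines_processed`, `malformed_lines`, `total_lines`) are typed as strings, while `completion_percentage` is a number. The following sketch assumes those strings hold decimal integers and shows one way to summarize progress and collect error messages from `history`. The helper names are hypothetical.

```python
def _to_int(value):
    """Convert a string counter to int, treating missing or bad values as 0."""
    try:
        return int(value)
    except (TypeError, ValueError):
        return 0


def summarize_progress(execution_details):
    """Return numeric progress counters plus the share of malformed lines."""
    progress = execution_details.get("progress", {})
    processed = _to_int(progress.get("lines_processed"))
    malformed = _to_int(progress.get("malformed_lines"))
    return {
        "lines_processed": processed,
        "malformed_lines": malformed,
        "total_lines": _to_int(progress.get("total_lines")),
        "malformed_ratio": malformed / processed if processed else 0.0,
        "completion_percentage": progress.get("completion_percentage"),
    }


def history_errors(execution_details):
    """Return (date_time, message) pairs for history entries with an error."""
    errors = []
    for entry in execution_details.get("history", []):
        inner = (entry.get("error") or {}).get("error") or {}
        if inner.get("message"):
            errors.append((entry.get("date_time"), inner["message"]))
    return errors


# Illustrative values only; the field layout follows the response sample.
details = {
    "progress": {"completion_percentage": 0.5, "lines_processed": "1000",
                 "malformed_lines": "25", "total_lines": "2000"},
    "history": [
        {"date_time": "2019-08-24T14:15:22Z", "status": "NO_STATUS",
         "message": "string"},
        {"date_time": "2019-08-24T14:16:00Z", "status": "NO_STATUS",
         "error": {"error": {"code": "string", "message": "bad delimiter"}}},
    ],
}

summary = summarize_progress(details)
print(summary)
print(history_errors(details))
```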

Cancel an Execution

Cancels a specific recordset execution.

Authorizations:

API_Keys

path Parameters

execution_id

required

string

The ID of the execution to cancel.

Responses

Response samples

Content type

application/json

{ }
