Create Dataset
forecastservice_create_dataset | R Documentation |
Creates an Amazon Forecast dataset¶
Description¶
Creates an Amazon Forecast dataset. The information about the dataset that you provide helps Forecast understand how to consume the data for model training. This includes the following:
-
DataFrequency - How frequently your historical time-series data is collected.
-
Domain and DatasetType - Each dataset has an associated dataset domain and a type within the domain. Amazon Forecast provides a list of predefined domains and types within each domain. For each unique dataset domain and type within the domain, Amazon Forecast requires your data to include a minimum set of predefined fields.
-
Schema - A schema specifies the fields in the dataset, including the field name and data type.
After creating a dataset, you import your training data into it and add the dataset to a dataset group. You use the dataset group to create a predictor. For more information, see Importing datasets.
To get a list of all your datasets, use the list_datasets
operation.
For example Forecast datasets, see the Amazon Forecast Sample GitHub repository.
The Status
of a dataset must be ACTIVE
before you can import
training data. Use the describe_dataset
operation to get the status.
Usage¶
forecastservice_create_dataset(DatasetName, Domain, DatasetType,
DataFrequency, Schema, EncryptionConfig, Tags)
Arguments¶
DatasetName |
[required] A name for the dataset. |
Domain |
[required] The domain associated with the dataset. When you add a
dataset to a dataset group, this value and the value specified for the
The |
DatasetType |
[required] The dataset type. Valid values depend on the chosen
|
DataFrequency |
The frequency of data collection. This parameter is required for RELATED_TIME_SERIES datasets. Valid intervals are an integer followed by Y (Year), M (Month), W (Week), D (Day), H (Hour), and min (Minute). For example, "1D" indicates every day and "15min" indicates every 15 minutes. You cannot specify a value that would overlap with the next larger frequency. That means, for example, you cannot specify a frequency of 60 minutes, because that is equivalent to 1 hour. The valid values for each frequency are the following:
Thus, if you want every other week forecasts, specify "2W". Or, if you want quarterly forecasts, you specify "3M". |
Schema |
[required] The schema for the dataset. The schema attributes and
their order must match the fields in your data. The dataset
|
EncryptionConfig |
An Key Management Service (KMS) key and the Identity and Access Management (IAM) role that Amazon Forecast can assume to access the key. |
Tags |
The optional metadata that you apply to the dataset to help you categorize and organize them. Each tag consists of a key and an optional value, both of which you define. The following basic restrictions apply to tags:
|
Value¶
A list with the following syntax:
list(
DatasetArn = "string"
)
Request syntax¶
svc$create_dataset(
DatasetName = "string",
Domain = "RETAIL"|"CUSTOM"|"INVENTORY_PLANNING"|"EC2_CAPACITY"|"WORK_FORCE"|"WEB_TRAFFIC"|"METRICS",
DatasetType = "TARGET_TIME_SERIES"|"RELATED_TIME_SERIES"|"ITEM_METADATA",
DataFrequency = "string",
Schema = list(
Attributes = list(
list(
AttributeName = "string",
AttributeType = "string"|"integer"|"float"|"timestamp"|"geolocation"
)
)
),
EncryptionConfig = list(
RoleArn = "string",
KMSKeyArn = "string"
),
Tags = list(
list(
Key = "string",
Value = "string"
)
)
)