Click the Create button and choose Dataset.
The data source is shown as either Salesforce or CSV.
We can use ETL partners to load data into Wave as well.
When you choose the CSV format, you select the data file and it creates a JSON file.
What are the dimensions and the measures?
Dimension, measure and date are the three data types.
Wave says the dataset will be created in an hour.
Always double check your data for dimensions and measures.
Dimensions are shown in deep blue, measures in purple and dates in light blue.
Before creating the dataset, run the preview so that you can verify this.
You can go to the data monitor, which is in the settings, and look at the dataflow view and the job view. It shows whether your datasets were uploaded or not.
Data is uploaded through a data scheduler, which is why they say it may take time. There is a data queue.
Upload data in the CSV format and make sure measures and dimensions are correctly identified. A dimension with only numeric values might be treated as a measure; change it. The date format is sometimes not recognized properly, so make sure it is recognized properly in Excel first.
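The check described above can be sketched in Python before uploading. This is only a rough illustration, not how Wave itself detects types: the function names, the sample columns and the `%Y-%m-%d` date format are all assumptions made for the example.

```python
import csv
import io
from datetime import datetime


def guess_field_type(values, date_format="%Y-%m-%d"):
    """Guess 'date', 'measure' or 'dimension' from a column's string values."""
    non_empty = [v.strip() for v in values if v and v.strip()]
    if not non_empty:
        return "dimension"

    def is_date(value):
        try:
            datetime.strptime(value, date_format)
            return True
        except ValueError:
            return False

    def is_number(value):
        try:
            float(value)
            return True
        except ValueError:
            return False

    if all(is_date(v) for v in non_empty):
        return "date"
    if all(is_number(v) for v in non_empty):
        return "measure"
    return "dimension"


def guess_csv_types(csv_text, force_dimensions=()):
    """Guess a type per column; columns in force_dimensions are overridden."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    result = {}
    for name in reader.fieldnames or []:
        guessed = guess_field_type([row[name] for row in rows])
        if name in force_dimensions:
            guessed = "dimension"  # manual correction, like changing it in the UI
        result[name] = guessed
    return result


# Hypothetical sample data: AccountNumber is an identifier, not a quantity.
sample = """AccountNumber,Region,Amount,CloseDate
1001,West,250.50,2015-03-01
1002,East,100,2015-03-15
"""

automatic = guess_csv_types(sample)
corrected = guess_csv_types(sample, force_dimensions={"AccountNumber"})
print(automatic)
print(corrected)
```

The first result flags AccountNumber as a measure because every value is numeric, which is the mistake to watch for. The second shows the manual override back to a dimension.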
When pulling data from Salesforce, we use a GUI builder with the custom and standard objects that are accessible through the REST API.
In the dataset builder, you can click an object and choose its fields. Go to Relationships and click Join to add the related objects and their attributes. It looks like a snowflake schema as you join more and more related objects.
These selections create nodes in the dataflow.
You can see the JSON payload as well if you want. This helps you create a denormalized dataset with the relationships.
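To give an idea of what the dataflow nodes can look like, here is a small Python sketch that builds a dataflow-style JSON payload. The node names (`Extract_Opportunity` and so on), the objects and the fields are made up for illustration. The action and parameter names (`sfdcDigest`, `augment`, `sfdcRegister`) follow the Wave dataflow format as I understand it, so check them against the payload the builder actually generates.

```python
import json

# Hypothetical dataflow: extract two objects, join them, register the result.
dataflow = {
    "Extract_Opportunity": {
        "action": "sfdcDigest",
        "parameters": {
            "object": "Opportunity",
            "fields": [{"name": "Id"}, {"name": "Amount"}, {"name": "AccountId"}],
        },
    },
    "Extract_Account": {
        "action": "sfdcDigest",
        "parameters": {
            "object": "Account",
            "fields": [{"name": "Id"}, {"name": "Name"}],
        },
    },
    "Join_Account": {
        "action": "augment",
        "parameters": {
            "left": "Extract_Opportunity",
            "left_key": ["AccountId"],
            "right": "Extract_Account",
            "right_key": ["Id"],
            "relationship": "Account",
            "right_select": ["Name"],
        },
    },
    "Register_Dataset": {
        "action": "sfdcRegister",
        "parameters": {
            "alias": "OpportunityWithAccount",
            "name": "Opportunity With Account",
            "source": "Join_Account",
        },
    },
}

payload = json.dumps(dataflow, indent=2)
print(payload)
```

Each object picked in the builder corresponds to an extract node, each join to an augment node, and the final register node is what produces the denormalized dataset.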
We can schedule the datasets by choosing the schedule time. It looks much like a simplified version of the Informatica Workflow Monitor in Informatica PowerCenter.
After creating the dataset, if you open it and look at the grouping, you will see the related object fields in the grouping as well. They appear in the format relatedobject.fieldname.
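The relatedobject.fieldname naming can be mimicked with a small Python sketch of a denormalizing join. This is an illustration of the idea only, not Wave's implementation; the function name `join_related` and the sample Opportunity and Account rows are assumptions.

```python
def join_related(left_rows, right_rows, left_key, right_key, relationship, right_fields):
    """Copy selected fields from the matching related row onto each left row,
    naming them relationship.fieldname."""
    lookup = {row[right_key]: row for row in right_rows}
    joined = []
    for row in left_rows:
        merged = dict(row)
        match = lookup.get(row.get(left_key))
        for field in right_fields:
            # Rows with no matching related record get None for these fields.
            merged[f"{relationship}.{field}"] = match.get(field) if match else None
        joined.append(merged)
    return joined


# Hypothetical sample rows.
opportunities = [
    {"Id": "006A", "Amount": 500, "AccountId": "001X"},
    {"Id": "006B", "Amount": 900, "AccountId": "001Z"},
]
accounts = [{"Id": "001X", "Name": "Acme", "Industry": "Retail"}]

rows = join_related(opportunities, accounts, "AccountId", "Id", "Account", ["Name", "Industry"])
for row in rows:
    print(row)
```

After the join, each opportunity row carries Account.Name and Account.Industry next to its own fields, which is why the related fields show up in the grouping list.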