Create a Story
- Create an Einstein Analytics dataset and import data into it.
- Use your imported data to create a story.
If you completed the Einstein Discovery Stories module, you already know about Einstein Discovery stories. You use Einstein Discovery stories to analyze data in Einstein Analytics datasets and produce insights into that data. Einstein Discovery performs a comprehensive statistical analysis of the data using AI and machine learning. A story helps you uncover relationships between a business-relevant outcome and the explanatory variables that are potential influencers of that outcome. An outcome, typically a key performance indicator (KPI) for your business, is sometimes referred to as the outcome variable in a story. The insights in a story include the following.
- Why It Happened insights are diagnostic insights that give you a deeper understanding of the complex relationships in your existing data. These insights identify factors and combinations of factors that have a significant impact on the outcome.
- What Could Happen insights include predictive insights, which forecast future outcomes, and prescriptive insights, which recommend ways to improve the predicted outcome. Predictions aren’t a guarantee of future results, but Einstein Discovery can give you a better idea of how things might turn out based on what it has learned from the data you provided.
To get started, let's upload some data and create a story to work with.
Try Einstein Discovery with a Developer Edition Org
Before you work through this Trailhead module, sign up for a free Analytics-enabled Developer Edition org. This org is a safe environment where you can practice the skills you’re learning.
Signed up? Great! Let's jump right in!
Download the Data
Before we can create the story we use in this module, we need the data to analyze. Download the CSV file called AcquiredAccount.csv and save it to your computer.
The CSV file contains the same data that is used in the Einstein Discovery Stories module. We provide it as a shortcut so you can work through this module without having to finish the previous module first.
The CSV file has 11 columns. It contains one row of information for each of the 10,000 different companies that our auto parts manufacturing company does business with. Here is what the first few rows of the CSV file look like.
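If you want to confirm the shape of the file before uploading it, a short script can count the columns and data rows. This is a minimal sketch, not part of the module: it assumes the file was saved as AcquiredAccount.csv in the current directory and that its first row is a header, and `summarize_csv` is a hypothetical helper name.

```python
import csv
import os


def summarize_csv(path):
    """Return (column_names, data_row_count) for a CSV file with a header row."""
    # utf-8-sig reads plain UTF-8 and also strips a byte-order mark if present.
    with open(path, newline="", encoding="utf-8-sig") as f:
        reader = csv.reader(f)
        header = next(reader)               # first row holds the column names
        row_count = sum(1 for _ in reader)  # every remaining row is data
    return header, row_count


if __name__ == "__main__":
    path = "AcquiredAccount.csv"  # assumption: file is in the current directory
    if os.path.exists(path):
        columns, rows = summarize_csv(path)
        print(len(columns), "columns:", columns)
        print(rows, "data rows")
```

If the download is intact, the counts should match the 11 columns and 10,000 rows described above.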
Create and Populate an Einstein Analytics Dataset
The next step is to get the data from the CSV file into an Einstein Analytics dataset.
- In your new Developer Edition org, switch to Lightning Experience (if you have not already done so).
- From the App Launcher, find and select Analytics Studio.
- Click Datasets.
- Click Create and then select Dataset from the dropdown.
- Choose CSV File as the source for your new data.
- In the file-selection window that opens, find and select (or drag and drop) the AcquiredAccount.csv file you downloaded, and then click Next.
- Accept the defaults and click Next.
- Accept the defaults and click Upload File. Einstein Analytics creates a dataset and imports the data from the CSV file.
Create the Story
Now you’re ready to create a story from this dataset. Begin by telling Einstein Discovery which outcome variable to focus on. In this module, we want our story to maximize the CLV variable. If you did the Einstein Discovery Stories module, you are familiar with this variable. Customer lifetime value (CLV) is a metric that predicts the profitability over the entire lifetime of the company’s relationship with a customer. Looking at CLV can help you find the best customers.
To create a story:
- Hover over the dataset, click the dropdown, and click Create Story. Analytics Studio launches the Story Setup wizard.
- In the first screen, for The field, select CLV as the outcome that you want Einstein to analyze.
- Accept all other defaults and click Story Type.
- In the Story Type screen, click Insights & Predictions, then click Setup Options.
- In the Setup Options screen, select Manual and click Data Options. Notice that the selected outcome variable is first in the list. The remaining columns represent explanatory variables. An explanatory variable is a variable that you explore to determine whether, and to what degree, it can influence the outcome variable for your story.
In the Correlation column, Einstein shows you the percentage by which each field is statistically correlated with the outcome. The Division field has the highest correlation. However, Account Id has the second highest correlation. That’s curious. Why? Let’s stop and think for a moment. Does an arbitrarily assigned account ID have any influence on CLV? Probably not. We know that because we know our business. In this case, statistical significance does not translate to real-world significance. Therefore, let's remove this field to speed up the analysis and get clearer results.
- Clear the check box next to Account Id.
- Click Create Story.
When it’s done analyzing your data and discovering insights, Einstein Discovery shows you the results.
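To see why an arbitrary identifier such as Account Id can look strongly related to CLV, consider the following sketch. It does not reproduce how Einstein Discovery computes its Correlation column, which this module does not describe. Instead it uses the correlation ratio (eta squared), a standard measure of how much of a numeric outcome's variance is explained by a categorical field, as a stand-in. The data values and the `correlation_ratio` helper are hypothetical.

```python
from collections import defaultdict


def correlation_ratio(categories, values):
    """Eta squared: share of the variance in `values` explained by group means."""
    overall_mean = sum(values) / len(values)
    groups = defaultdict(list)
    for category, value in zip(categories, values):
        groups[category].append(value)
    # Between-group variation: how far each group's mean sits from the overall mean.
    between = sum(
        len(g) * (sum(g) / len(g) - overall_mean) ** 2 for g in groups.values()
    )
    # Total variation of the outcome around the overall mean.
    total = sum((v - overall_mean) ** 2 for v in values)
    return between / total if total else 0.0


# Hypothetical toy data, not taken from AcquiredAccount.csv.
clv = [100.0, 120.0, 300.0, 280.0, 90.0, 310.0]
division = ["East", "East", "West", "West", "East", "West"]
account_id = ["A1", "A2", "A3", "A4", "A5", "A6"]  # unique per row

print("Division:  ", correlation_ratio(division, clv))
print("Account Id:", correlation_ratio(account_id, clv))
```

Because every account ID is unique, each "group" contains a single row, so the ID trivially explains all of the variation in the toy outcome. A field with this property carries no reusable pattern, which is one reason to rely on business knowledge and exclude it.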