Start tracking your progress
Trailhead Home
Trailhead Home

Understand Why It Happened Insights

Learning Objectives

After completing this unit, you’ll be able to:
  • Navigate to a story's Why it Happened insights and explore them.
  • Understand how combinations of factors affect the outcome.
  • Understand how unrelated factors affect the outcome.

Understand Why It Happened Insights



The instructions in this unit assume that you have successfully created an Einstein Discovery story according to the steps in "Create a Story," the first unit in this Trailhead module.

Why It Happened insights help you take a deeper look into the exact factors that led to an outcome.


The why in Why It Happened refers to a high correlation - not necessarily a causal relationship.

Use Why It Happened insights to drill deeper into the various factors that contributed to your story’s goal. These insights are based on a statistical analysis of your dataset. Einstein Discovery uses waterfall charts to help you visualize Why It Happened insights.

Your Story’s Outcome Variable and Goal

When you configured the story, you told Einstein Discovery to maximize the CLV variable in AcquiredAccount. CLV is the outcome variable in your story, and maximizing CLV is your goal. All the insights in this story show you how different variables and combinations of variables help explain variations in CLV. The top insights in the list reflect the most statistically significant variations in the outcome variable.

Select the Why it Happened Insight Type

On the Insight Navigation bar, click Why It Happened.

Why it Happened insight type on the Story navigation bar.

In Search story insights, click the down arrow and select Division - Naval.

In the Search story insights drop-down list, select Division - Naval.

Einstein Discovery refreshes the insights list.

Drivers of CLV when Division is Naval.



Don’t worry if the images here differ slightly from the screens you see in Einstein Discovery. The interface elements are usually the same, but some of the details—including the data they show—can differ slightly.

We’re looking at a waterfall chart of CLV values that can help explain why customers who are part of the Naval division are different from the average customer.
  • Global Outcome represents the mean value for CLV across all divisions (including Naval).
  • Division is Naval (Outcome) represents the mean value of CLV for the Naval division.
Wow, the Naval division in our company has a much higher CLV than the mean! Are Naval customers intrinsically better? Maybe they are basically the same as other customers but there are underlying correlations that increase CLV. Maybe it’s a little of both. Let’s find out.

Hover over the Global Outcome bar to get more information.

Global Outcome details

The Global Outcome has a mean value of 20,136 and a count of 10,000. What does this data tell us? That the average CLV across all divisions (including Naval) is $20,136, and there are 10,000 records.

Hover over the Division is Naval (Outcome) bar to see more information.

Division is Naval (Outcome) details

Division is Naval (Outcome) has a mean value of 20,488 and a count of 328. What does this data tell us? That the average CLV for the Naval division is $20,488, and there are 328 customers. The mean CLV for Naval customers is $352 higher than CLV for all customers! Now let’s find out why.

Understand the Division is Naval Insight

Hover over the Division is Naval bar to see more information.

Division is Naval details

We can learn a lot about Naval customers from this information. Let’s look at the numbers in the following order so that we can understand the building blocks first.

  • Global Frequency is 3.3%. Customers in our Naval division make up only 3.3% of customers overall. How unfortunate, because our Naval customers have a higher than average CLV. Perhaps it's time to try to acquire potential Naval customers? Or perhaps we realize that the Naval market is small and we focus on other divisions?
  • Conditional Frequency is 1 (100%). In our case, 100% of the records in the category Division is Naval are in the Naval division. Perhaps this information seems obvious.
  • Coefficient is 87. What does this tell us? That the CLV for Naval division would be $87 higher than the mean, if there were no other factors involved. This number tells you that the simple fact that division is Naval influences the CLV for the Naval division.


    Here’s one way you can use this number. If the number is high, like 1,000, then the effect of being a Naval customer, with no other factors considered, is that the CLV is $1,000 higher than average. Woo hoo! But you see that the observed outcome is much less than $1,000. This information indicates that Naval customers have the potential to be valuable, but that something else is dragging this number down.

  • Precluded Sum is 410. The impact for the average customer includes the impact of customers who are in the Naval division and the impact of those customers who are not. Einstein Discovery calculates the impact that customers who are not in the Naval division have on the CLV of customers who are in the Naval division. In our case, the impact of removing all the effects for divisions that are not Naval is to increase CLV by $410.00.
  • Impact is 495. This number summarizes all of the other previous numbers. Impact considers the effect of simply being a Naval customer and the percentage of overall customers that are Naval. Impact also adds in the impact of other customers that are not Naval. In our case, it's telling us that Naval customers would have a CLV of $495 more than average if it weren’t for other factors in the Related to and Unrelated categories. That’s a significant number! Why aren’t we realizing that potential? In the next sections, we find out.

We are done with the first-order analysis in the Division is Naval category. Now look at the next category, Related to Division is Naval.

Understanding the Unrelated Categories

We got some useful information from categories that are related to the Naval division. Now let’s look at the Unrelated category. But why do we look at information that is unrelated? Good question. The information in this section is “unrelated,” meaning that it is not specific to customers in the Naval division. This section shows us factors that have positive or negative effects on all customers. This section also accounts for how frequently a factor occurs for Naval customers, relative to customers in general. Let’s get more specific.

  • If a good thing happens more frequently for Naval customers than it does for all customers, the effect is positive.
  • If a good thing happens less frequently for Naval customers than it does for all customers, the effect is negative.
  • If a bad thing happens more frequently for Naval customers than it does for all customers, the effect is negative.
  • If a bad thing happens less frequently for Naval customer than it does for all customers, the effect is positive.

In other words, Einstein Discovery is sophisticated enough to show how bad things happening less often have a positive impact. To see more examples, let's look at our chart.

Hover over the Unrelated Small Contributors bar to display details.

Unrelated Small Contributors

You can see that all the other factors (a total of 3,803 small contributors) together account for an additional $465 in CLV.

You quickly realize the power of the Unrelated section. It gives you deeper information about why something happened. In other words, it gives you more power to credit (or give constructive feedback to) the right people.

We’re done looking at the Unrelated factors. Now let's move on to the Unexplained section.

Understanding the Unexplained Section

Looking at Unexplained phenomena sounds mysterious. Really, it's just the difference between:
  • the prediction that Einstein Discovery would make if it knew all the factors, and
  • the observed outcome of what actually happened in the dataset
Sometimes there's no bar in this category, which means that the model is making an accurate prediction. If the Unexplained bar is small, it means that Einstein Discovery is building a predictable model that identifies the factors explaining the observed outcome. Other insights that are drawn using the same model yield results that are consistent with the previous insights.

Hover over the Unexplained bar to display details.


In this case, the difference between the actual CLV (calculated from the dataset), and the predicted CLV (from Einstein Discovery’s data model), is $54.