Skip to main content

Hi everyone,

 

I have three tables

 

first table has 300 million rows, second table has 30K rows and third table has around 100k rows.

 

Now my final dataset is a left join of first table with second table and third table retaining 300 million rows of data.

 

Two approaches :

 

  1. Make these joins in prep and connect with prep. Enable incremental refresh using load date time in prep.
  2. Do all the joins in tableau itself and create extract that is 300 million rows and put a full refresh every day. My first extract took one hour.

 

Which is better among these two approaches? I want to go with prep but I am unable to explain to my supervisor why prep is better in this case. I want to know the fundamentals. This is a simple join and prep can be easily replaced but data is heavy. I do not want the tableau dashboard to show no data at any point of the time.

2 réponses
  1. 10 août 2025, 22:13

    Hi @Vaishnavi Kilambi​ 

    The bigger decision between using a full refresh or incremental is whether the historic records change. If the historic records change, then you have to use a full refresh. If the historic records don't change, then you can use an incremental, you also need a date field for the incremental refresh.

0/9000