Skip to main content
Group

DataMigration

This group will be dedicated to knowledge wrapped around migrating data into Salesforce.com from legacy systems and/or external data sets.

Hi Trailblazers,

We have a use case where customer is using third party data migration tool called 'Alteryx' for migrating Contact data to Salesforce, where they could face potential duplicates which they want to understand (first thing) then if possible they want to merge as well (based on single match etc.).

 

They are reluctant writing those same duplicate rules at 'Alteryx' side rather they want to understand if there is any mechanism to invoke Salesforce duplicate API. 

 

So I'm trying to understand the various options here and ways to do that.

 

Is it something we have to write a class with duplicate logic and expose that as an API from Salesforce ?

Is that a recommended way?

or are there any other ways to do that ?

 

Really appreciate your valuable inputs here.

 

Thank you so much.

@Architect Trailblazers  @* Salesforce Developers * @DataMigration @Salesforce Integration

5 answers
  1. Mar 4, 2024, 10:22 PM

    @Aditya Kumar Gupta     Yes, your approach is correct. You need to export that method and other logic through SOAP WSC. You may need to incorporate additional methods to store and merge records if a single match is found. If not, then store it in some sort of skinny table or flag them for further review by your business or data owner team.

    However, you need to validate the limitations and content of this approach, as the input range of this method is limited to 50 elements (records).

     

    Second Approach:

    1. Assume that Alteryx data is clean from duplicates.
    2. Perform data de-duplication outside during the first sync to reduce system overhead (using Data Analytical tool or Excel).
    3. Update SFDC records with the external ID of Alteryx's unique identifier for future synchronization.
    4. For insert: 

    a.       Implement Apex coding: Create an insert trigger to handle duplications where external IDs are missing on contacts and merge logic for the Integration User only.

    • Concern: Optimize the number (<10k) of records to avoid Apex limitations.

    b.    Implement Apex coding: Execute a nightly batch job for the Integration User, finding duplicates where external IDs are missing on contacts and merging them for a single match.

    • Concern: Operations involve create, merge, and delete, possibly hitting limitations.

    c.    Insert into a separate table and run a trigger/batch job to find matching records where external IDs are missing. Update a single-matched record and delete others. Update multiple records with multiple SFDC IDs for further review.

    • Concern: Complexity in coding and maintenance. 

    d.    Use Snowflake and compare both systems, insert/update records based on business logic.

    Concern: Requires a middle layer to avoid heavy coding and performance issues 

0/9000

Preparing for a data migration?? It's important to make sure you data model is ready to go in your Salesforce org!

Welcome to our NEW Data Modeling 101 webinar! Are you a new Salesforce Admin and struggling to create a new data model? Do you want to learn more about junction objects and how to create this powerful 'object glue' in your org? Look no further and sign up TODAY! Register here for the Data Modeling 101: https://sfdc.co/bRr7zI Every other Thursday, 9 AM PST Register here for our other webinars: http://bit.ly/GSWebinarRegister

0/9000
Have you ever needed to migrate data off of a Lotus Domino platform to Salesforce? It is a daunting task to try that by writing custom code and I don't recommend it but there is an amazing tool that can help called Notrix. #Lotus #LotusDomino #Importing Data #DataMigration #LotusNotes
0/9000