Skip to main content

Secure Agents with the Einstein Trust Layer

Learning Objectives

After completing this unit, you’ll be able to:

  • Describe how the Einstein Trust Layer secures conversational agents.
  • Explain how guardrails prevent harmful or off-brand content.
  • Identify the mechanisms used to protect personally identifiable information (PII).

The Trust Imperative in AI

As you’ve learned, turning one-way broadcasts into two-way conversations across email, SMS, and WhatsApp drives incredible engagement. However, handing the keys of customer communication over to an agent can feel daunting.

What if the agent hallucinates a fake discount? What if a customer tries to trick the agent into using inappropriate language? What happens to the sensitive data the agent accesses?

Einstein Trust Layer with its guard rails.

At Salesforce, trust is our number one value. Marketing Cloud Next mitigates these risks using the Einstein Trust Layer, a secure AI architecture built natively into the platform. It helps you to deploy conversational marketing with confidence, knowing your brand reputation and customer data are protected.

Establish Guardrails for Brand Safety

An autonomous agent representing your brand must behave professionally. The Einstein Trust Layer acts as a vigilant manager, overseeing the agent's inputs and outputs to ensure safety and accuracy. Let’s review some of the guardrails that make sure agents behave responsibly.

Custom Guardrails: Brand Guidelines, FAQs, and Tone

To ensure the agent truly feels like an extension of your brand, you have complete control over its personality and knowledge base. You can ground the agent by providing specific brand guidelines, FAQs, tone of voice instructions, and other custom parameters. Whether you want the agent to be friendly and conversational or formal and professional, your instructions dictate how the AI interacts. By explicitly defining these parameters and rooting answers in your approved FAQs, you prevent the agent from going off-script and guarantee a consistent, brand-aligned customer experience.

Toxicity and Harmful Content Detection

Before an agent sends a response back to a customer, the text passes through a toxicity scorer. If the AI generates content that is hateful, abusive, profane, or indecent, the Trust Layer blocks it. Additionally, if a customer sends an abusive message or attempts a prompt injection, such as trying to trick the AI into ignoring its instructions, the Trust Layer identifies the malicious intent and safely deflects the conversation or ends it.

Prevent Hallucinations

LLMs are creative, which is great for writing poetry but bad for factual customer service. To prevent hallucinations (the AI making things up), agents in Marketing Cloud Next rely on dynamic grounding. The agent does not guess answers based on the public internet; its answers are strictly grounded in your secure Data 360 environment. Furthermore, by strictly defining subagents and actions, you build a fence around the agent. If a customer asks about a competitor’s product, the agent knows this is out of bounds and politely steers the conversation back to your approved subagents.

Protecting Customer Data and PII

Marketers deal with highly sensitive information—names, purchase histories, and contact details. Exposing this data to external AI models is a major security risk. Here’s how conversational marketing keeps data locked down.

Data Masking

When a customer sends a message that requires the agent to process data through a large language model (LLM), the Einstein Trust Layer steps in first. It automatically detects personally identifiable information (PII) such as credit card numbers, phone numbers, or addresses. It replaces this sensitive data with anonymized placeholders (masking) before the prompt is sent to the LLM. Once the LLM returns the generated response, the Trust Layer unmasks the data so the final message sent to the customer reads naturally.

Zero Data Retention

One of the biggest fears surrounding AI is that public models can use your proprietary data to train themselves. Salesforce has strict zero-retention agreements with LLM providers. When a prompt is sent to generate a response for your agent, the LLM provider does not retain the prompt, the customer data, or the generated response. Your data is never used to train outside models.

Encryption in Transit and at Rest

Every piece of data involved in a conversational interaction is encrypted. When a customer replies via SMS or WhatsApp, the data traveling to Salesforce is encrypted in transit using industry-standard protocols (like TLS). Once stored in Data 360, the conversation logs and customer profiles are encrypted at rest.

Auditing and Monitoring

Security isn't just about automated blocks; it is also about visibility. The Einstein Trust Layer provides a comprehensive audit trail. Every interaction your agent has, including the data that was masked, the toxicity score of the response, and the exact grounding data used, is logged. This helps your security and compliance teams to monitor agent performance, audit interactions, and continually refine instructions to ensure the agent remains compliant with industry regulations.

Human in the Loop and Escalation Flows

While autonomous agents handle the bulk of inquiries, keeping a human in the loop is a critical safety net. You can configure specific escalation flows to seamlessly transfer a conversation from the AI to a live human representative. For example, if the system detects that an end user is becoming frustrated, or if a problem becomes too complex for the AI to resolve, the conversation is automatically escalated to a human rep. This ensures that your customers always receive the right level of empathy and support when they need it most.

Deploying AI doesn't mean compromising on security or the human touch. With the Einstein Trust Layer, Agentforce is built on a foundation of trust. You can deliver the personalized, two-way dialogues your customers crave. At the same time, you can maintain the rigorous security standards your business demands. This balance is achieved through strict guardrails and intelligent escalation paths. Furthermore, features like automatic PII data masking, zero data retention policies, and robust encryption ensure your data is always protected.

Wrap Up

Congratulations! You’ve completed the Conversational Marketing Fundamentals badge. You learned how to ditch the one-way broadcast model and embrace two-way conversational marketing across email, SMS, and WhatsApp. You learned about the ecosystem of flows, Data 360, and Agentforce, and saw how these agents drive engagement and productivity—from rescuing abandoned carts to automating routine scheduling. Finally, you discovered how the Einstein Trust Layer secures every interaction by protecting PII, maintaining zero data retention, and enforcing strict brand guardrails.

Now, look at your own marketing strategies: Where are you reaching a “no-reply” road block? With Agentforce and the Einstein Trust Layer, you have the tools to confidently turn those friction points into secure, personalized opportunities for growth.

Resources

在 Salesforce 帮助中分享 Trailhead 反馈

我们很想听听您使用 Trailhead 的经验——您现在可以随时从 Salesforce 帮助网站访问新的反馈表单。

了解更多 继续分享反馈