Snowflake Data Clean Rooms

Introduction: What is a Data Clean Room?

 

In this article, I will explain what a Snowflake Data Clean Room is on the Snowflake Data Cloud.
Data clean rooms on Snowflake are a set of data-related technologies that facilitate double-blind joins of data. These technologies include Data Shares, Row Access Policies, and Secure User Defined Functions. The underlying Data Sharing technology is based on Micro-Partitions, which provide features like Data Sharing and Data Cloning.

Although the original concept of data clean rooms was developed for data exchanges in advertising, I believe the concept can be applied to many other areas where “controlled” and “governed” double-blind joins of data sets can create significant value. This approach enables companies and their partners to share data at an aggregated double-blind join level, without sharing personally identifiable information (PII).
On Snowflake, sharing data through secure views and tables using their Data Share technology is already straightforward. You can share double-blind join previously agreed upon identifiers.

 

Part 1: Data Clean Room Example Use Cases

We helped Snowflake pioneer this new offering a couple of years ago with our client VideoAmp which we brought over to the Snowflake Data Cloud. Our original article back in July 2020 shows how to analyze PII and PHI Data using the earlier Data Clean Room concepts. Fast forward 2 years and now Snowflake has dramatically improved the initial version and scope that we put together. These are just a few examples; there are many other potential use cases for Snowflake Data Clean Rooms.

 

Media/Advertising:

  • Addressing the challenge of the “end of cookies” in a meaningful way, Snowflake’s Data Clean Rooms enable Advertisers to merge their first-party data and their publisher(s)’ viewership/exposure data, delivering more value for their marketing spend.
  • Collaborative Promotions. Conducting customer segment overlap analysis with a co-branding/co-marketing partner can reveal areas where customer segments and audiences are aligned.
  • Joint loyalty offerings and/or upsells can also be developed in partnership with aligned customer “interests”.

 

Healthcare and Life Sciences:

  • There are some extremely valuable use cases where we can securely share patient data and patient outcomes across government, healthcare, and life sciences to hopefully make some huge leaps forward in healthcare and life.

 

Financial Services:

  • Combining data from multiple financial institutions to identify fraud or money laundering activities without sharing sensitive customer information.

 

Retail:

  • Combining customer data from different sources to create targeted marketing campaigns and promotions.

 

Government:

  • Sharing data across different government agencies to improve public services while protecting individual privacy.

 

Part 2: Looking for more information about Data Clean Rooms?

Here are some additional resources to help you learn more about Data Clean rooms and Data Collaboration.

 

Lastly, here’s an interview I provided on my view of the opportunities around Data Clean Rooms on Snowflake. I shared some insights gained from decades of experience working in data, including thoughts about the transformational impact that cloud-based data sharing, data collaboration, data marketplaces, and data clean rooms are having on companies and industries.

What’s Next in Data Collaboration & Why Data Clean Rooms Are Exciting: Insights From Frank Bell

 

Are you interested in how you can use a Snowflake Data Clean Room for your business? Contact Us Today.

Analyzing PHI & PII with Snowflake’s Data Clean Rooms

Introduction – Data Clean Rooms:

 

Sharing data can be tough. Organizations struggle to effectively manage their data internally. This problem only increases in magnitude for data sharing across multiple organizations. To make matters worse, regulations on sensitive data (PII and PHI) further complicate the process and this type of analysis is decided to be too troublesome to engage upon.

However, that’s a huge loss of opportunity. Since the business intelligence insights gained from analyzing personal health and sales datasets can be transformative to your business direction and decisions. Luckily, there is now a way to gain access to and analyze this data while abiding by security laws.

 

Data Clean Room:

 

A data clean room is a place to perform joint analysis on sensitive data while abiding by regulations. The clean room can be set up by anyone but it will most likely be the provider. A clean room can have multiple data providers. Each provider can control:

  • Incoming Data
  • How their data can be joined with other data
  • Types of analytics that can be performed on their data
  • Outgoing data

This type of data hosting and analysis is made possible by several key Snowflake features: secure data shares, the data marketplace platform,  secure functions, and secure join capabilities.

A consumer in the data clean room can access the provider’s data through defined functions, joins, and queries the provider specifies. Data masking (hashing) can provide an extra layer of security so that no naked identifiable data is ever transferred between provider and consumer.

 

 

Case Analysis – Advertising to the NBA:

 

I am the lead marketer for Weight Loss Champions. We sell weight loss pills. We want to launch an advertising campaign targeting NBA All-Stars to use and endorse our product.

Company: Weight Loss Champions

Product: Weight Loss Pills

Marketing Hypothesis: Quarantine has forced a lot of people indoors. Gyms are closed. Many people are drinking and not exercising and have put on weight. We see this as a great opportunity for our business. We also realize that the NBA 2020 Season is set to restart in the Orlando bubble at the end of July.

Objective: Our goal is to get an NBA All-Star to use our weight loss pills. Then try to get them to endorse our product.

Strategy: We have identified several places that these athletes will frequent, such as shopping centers, and restaurants. After doing a little digging, we discovered walkways between these areas that offer ad placements. We must decide what height on these walls to place our ads.

 

Let’s take a look at how we can accomplish this. The first step is establishing our data clean room.

How to Build Your Own Data Clean Room

The approximate time to create is 8 hours. It took me 2 hours to submit everything to Snowflake and then 6 hours working with them to get the room up. With this guide, I imagine you can do it even faster.

Make a Snowflake Account

Each participant will need a Snowflake account. Existing Snowflake customers can provide a secure sub-account. You can sign up for a 30-day free trial with $400 worth of credit here.

Submit a Support Case 

To submit a support case you will need to create an account on the Snowflake community and link it to your Snowflake account. Once you create your account, follow these instructions to submit a case. Here’s what the case should look like.

 

Data Clean Rooms 2

 

After submitting your case, a Snowflake representative will contact you to go through the details and set up the exchange. This could take a couple of hours.

 

Create a Secure Share

A listing on your private data exchange comes from a secure share. When creating the secure share, you will specify what databases, schemas, and functions. A secure share can be made through the share tab, toggling to the outbound option, and then selecting create.

 

Data Clean Room 2

Read the full guide to Secure Shares here.

 

Create a New Listing

Navigate to the Data Marketplace and open your private exchange. On the left side menu, it will be under Data -> Manage. From there you can create a new Listing by clicking the button on the upper right and selecting the Secure Share from before.

 

Data Clean Room 2

 

The provider should describe their data set here and provide documentation on how the data can be analyzed. This can be an ongoing process between the provider and the consumer. Where the consumer requests a specific type of analysis and the provider then creates a secure function or share to fulfill that type of analysis.

 

Configure Roles and Access:

 

Navigate to the Data Marketplace and click on the Admin tab on the left-hand side. Click on your Private Data Exchange to configure the roles. Here you can add members and specify which accounts can be providers, consumers, and Administrators.

 

Data Clean Room 3

Analyze your Data:

 

From here, the consumer can log in to their account. They will be able to access the share and all the available schemas, tables, and functions provisioned by the provider. Here you can take a look at the share I created for a client and the functions made available. We’ll be using them later in the analysis.

 

Data Clean Room 4

To learn this and other features, feel free to check our blog for more information.