Note: The original feature called Shared Buckets is now integrated into our Data Catalog.
The data catalog represents an overview of data shared to and from the project. There are generally numerous options to share data (ranging from CSV exports to writing to a dedicated database). The data catalog allows you to share data in a very efficient, controlled and auditable way.
The Data Catalog now offers a list view instead of the previous grid view, providing better organization and visibility of shared data:
These enhancements make it easier to manage shared data and provide better governance capabilities, especially for organizations with extensive data sharing needs.
Sharing data via the data catalog is useful in numerous ways – for example:
The following terminology is used:
All changes and events in a shared bucket are propagated to its linked buckets; any tables you create in the shared bucket become immediately available in the destination projects. Data size and rows are counted only in the source project. Metadata changes, such as bucket color and description, are also propagated to linked buckets.
There are some constraints to how shared buckets can be used:
There are several options how you can share data:
To share data outside of your project, go to Catalog and click SHARE BUCKET.
Next you can choose whether you want to share an existing bucket or create a new one. Both options are technically the same, the choice depends purely on whether you prepared the data in advance (an existing bucket) or not (create a new bucket).
Choose the Share existing bucket option:
Then, select the bucket and a sharing type. You can also assign a Datashare Owner who will be responsible for the bucket. Optionally, provide a description of the bucket and select a custom color so others know what to expect.
The bucket is shared and instantly available to other projects in the organization depending on the sharing type you have chosen.
To change sharing settings, go to the bucket detail page and click the Edit Sharing button:
To extend an existing shared bucket with additional tables, you don’t need to delete the bucket and recreate it. Instead, create an alias of the table and set the shared bucket as the destination. Here is how to do so:
Go to Storage → Tables and locate the table you want to add.
Click Create Alias.
Select the shared bucket as the destination. Ensure the alias has no filter and automatic synchronization of the columns enabled.
Once created in the shared bucket, the alias will immediately appear in all linked projects.
Choose the Select tables option:
Select the tables and/or buckets you want to share:
Enter the bucket name and a sharing type. Assign a Datashare Owner, and optionally customize the bucket color and enter a description:
A new bucket will be created for you, the selected tables will be linked to it using aliases.
To change sharing settings, share the newly created bucket again. The newly selected settings will overwrite the old ones.
You can link a shared bucket into your project from the Shared with you tab in the Data Catalog:
Enter a name of the bucket as you’d like to see it in the current project and Link the bucket:
You’ll see that the bucket is available in your project:
You can see the detail of the shared bucket with a link to the source project (if you have access to it). The bucket will inherit the color and description from the source bucket.
You can edit the metadata of shared buckets directly from the Data Catalog:
These metadata changes will be propagated to all linked buckets, ensuring consistency across projects.
The new list view in the Data Catalog provides enhanced filtering and sorting capabilities:
These features help you quickly locate and manage your shared data, especially in environments with many shared buckets.
To stop sharing a bucket, go to Data Catalog and click Disable sharing: