Tables

Your project table storage is available in the Tables tab in the Storage section. All data tables are organized into buckets that can also be used to share tables between projects.

The actual data tables and the buckets are created primarily by KBC components (extractors, transformations and applications), or they are imported from CSV files. In case you want to import data to an already existing table, the imported table must have all the columns of the old, existing table, even if the old table is empty. If some columns are missing, you will receive a message like this:

Some columns are missing in the csv file. Missing columns: lat,long. Expected columns: lat,long.
Please check if the expected "," delimiter is used in the csv file.

Also note that the imported file may contain additional columns not present in the existing table. In that case, the columns from the imported table will be added to the existing table.

Aliases

Apart from actual tables, it is also possible to create aliases. They are internally implemented as database views and inherit their basic properties.

An alias does not contain any actual data; it is simply a link to some already existing data. Hence an alias cannot be written to, and its size does not count to your project quota.

In addition, if you create an alias from a table, the table cannot be deleted without the alias being deleted as well. If you attempt to do so, you will receive an error message similar to this one:

The blog-data table cannot be deleted. Please delete its aliases first: in.c-tutorial.blog-data,in.c-my-bucket.blog-data.

See an example use of an alias in our tutorial.

Screenshot - Create alias

If you select any table from any bucket in Storage, detailed information about the table will be displayed on the right side of your screen. This is what we refer to as the Table detail throughout our documentation.

Aliases cannot be chained and can be applied only between buckets with the same backend. An alias table can be filtered by a simple condition.

Screenshot - Create Simple alias

There are the following limitations:

  • Filtering is enabled only on indexed columns.
  • When an alias is created, the index on the filtered column of the source table cannot be removed.
  • Alias columns are automatically synchronized, by default, with the source table. Columns added to the source table will be added to the alias automatically. You can prevent this by disabling Synchronize columns with source table.

Copying Tables / Table Snapshots

If you want to physically copy a table, use the table snapshot feature. A copy of the table contents at the time of creating the snapshot will be made. It can be used immediately to make a physical copy of the table, or later to revert the table into its previous state.

Table Snapshots are useful when experimenting with extractors or transformations, or generally when refactoring your project: you can create a copy of your output table, experiment a little, and then compare the new output table with the original one to make sure your output remained the same. They can also be used as a workaround to renaming tables; however, this feature is not available yet.