Working with tables in branch

In the previous sections you prepared table and file manipulating configurations in production. In this section you’ll manipulate the configurations, run them and learn about how tables behave in branches.

Let’s say that you have a second look at the previously created configurations, you realized that you need top 10 values instead of top 5 and you need to convert the amounts of bitcoin to dollar amounts based on the value of bitcoin on the given day.

You don’t want to change the current transformation yet though. Perhaps it is a critical transformation in production, perhaps you want to check the results first. In either case this is where branches come in, giving you the option to test the changes safely before merging them to production.

Create a new branch

To create a new branch, click on your project’s name at the top of the screen. Then click on the green icon New displayed next to your project’s name.

Screenshot - Create Development Branch

Name your new development branch Sample branch, and click Create Development Branch to open it.

Screenshot - Name Development Branch

Your new branch will appear right below the name of your production project.

Screenshot - Created Development Branch

Change the transformation

Let’s start with the easier task - we need to change the transformation so that it returns top 10 rows instead of top 5. In your branch, navigate to Transformations. You can see the previously created Bitcoin transformation, it is however a copy living in the branch. You can verify that you are working with the branch copy by checking whether the page header has yellow accent and shows the branch name.

Screenshot - Transformation in branch

When you are in a branch, change the query limit from LIMIT 5 to LIMIT 10 in the transformation code and save it.

CREATE TABLE "top_prices" AS SELECT * FROM "bitcoin_price" ORDER BY PRICE DESC LIMIT 10;

ALTER TABLE "top_prices" DROP COLUMN "_timestamp";

Screenshot - Change to code

Feel free to check the original transformation did not change. You can do so by switching back to the project in the top menu. The production transformation still contains LIMIT 5. Now that you verified that branch transformation can be changed independently, switch again to the branch you created (Sample branch).

Navigate to the transformation and run it. Examine the Mapping section in the job detail.

Mapping in branch

Mapping in branch

Notice how in the Input section even though you did not run the extractor in this branch, the data are still loaded from the bitcoin_price table. What happens is that Keboola Connection checks whether a branch version of the table exists and if it does not, it falls back to read from production version.

Also notice that Output shows small yellow branch icon, and the name of the bucket is prefixed with a number – 1835-bitcoin. When you run a component in a branch and the component wants to write to Storage, the outputs do not overwrite production data. Instead a duplicate bucket prefixed with branch ID is automatically created. Examine the table and check that it indeed has 10 values as it should.

Note: The number in bucket name will be different for you as you have different branch ID.

This part of your task is done. Let’s get to the second part.

Download the transactions

Second objective is to produce a list of transactions showing the amount in both BTC and in USD. So you need to extract the account data first. Use the prepared CSV file with sample transactions.

In the branch navigate to your existing HTTP extractor configuration and add a new table named bitcoin_transactions and fill Path with /tutorial/branches/bitcoin_transactions.csv. Save and run the extractor.

Extractor for transactions

Extractor for transactions

Examine the job outputs. There are tables with branch icon in a bucket prefixed with a number. As we’ve already shown, this signifies that the output was stored in branch context, keeping your production data intact.

Extractor output

If you navigate to Storage Explorer in the branch you’ll see that the created buckets are shown there with an icon as well.

Branch storage

Also, when you switch back to production and navigate to the Storage Explorer you’ll see a switch to show the branch buckets.

Storage explorer in production

Storage explorer in production

Note: There is only one Storage in your project, branch buckets are created with a prefixed name, but are stored along normal buckets and are visible in production as well as in branch. You can see all buckets from all development branches in production.

Now switch back to the Sample branch. Navigate to the Bitcoin transformation and run it again. Check the input mapping and verify that the bitcoin_prices table from the branch was used as input.

Input mapping from branch data

What happened is that Keboola Connection checked whether a branch version of the bucket existed and since it did (you ran the branch version of the HTTP extractor), it was used in the transformation.

Extend the transformation

You’ll be loading the data from bitcoin_transactions table, so you need to add the table to input mapping.

Input mapping from branch data

Notice the branch icon next to the table name. You’re referring to the branch version of the table. Don’t be alarmed by the UI saying that the table does not exist – it exists only in the branch so far. The UI will be improved in future versions.

Missing table in input mapping

Now add a second code to Block 1 named Dollar values of transactions and insert the following SQL

Add a new code

CREATE TABLE "dollar_btc_transactions" AS 
    "bitcoin_transactions" AS BT
    "bitcoin_price" AS BP
        ON BP.DATE = BT.DATE;

Add a new code

For the data to make it out of the transformation you need to add the created dollar_btc_transactions table to output mapping as well.

Output mapping in branch

Changed transformation overview

Run the updated transformation and examine the results.

Changed transformation output

As you can see, this run of the transformation only accessed branch buckets. It loaded the data from HTTP extractor from branch bucket as input and stored the output in two tables in another branch bucket. You can also examine the data in the tables to see that you indeed created a list of transactions with their dollar amounts.

This means that the second part of your task is done as well. You changed the table-manipulating production configurations in a branch. You verified that the none of the changes affected neither the original project configurations nor the project production data. In the following section you’ll try running the file-manipulating configuration in branch and examine how files in branch work.