Write

Avo custom fields

2024-04-20T10:14:57+00:00

Avo is a content management system for Ruby on Rails applications that has a lot of features out of the box. It is designed to be easy to use and extendable, allowing developers to move faster and save development time.

Avo has a notion of panels to define the layout and behavior of the specific resources. For instance, it is easy to define a panel for the User resource that includes the id, email, and name fields.

class Avo::Resources::User < Avo::BaseResource
  def fields
    field :id, as: :id, link_to_record: true
    field :email, as: :text, required: true
    field :name, as: :text, required: true
  end
end

Once the resource is defined, Avo automatically generates the user interface to manage the User resource. It creates a table view to list all users, a form to create and edit users, and a detail view to show the user details.

The key role here is played by the fields method, which defines the fields that are displayed in the user interface. In many cases, the default field types provided by Avo are sufficient to cover the needs of the application. However, there are cases where custom fields are required to meet specific requirements. In this post, we will explore how to create custom fields in Avo. Out example is going to be quite simple, but it will give you a good understanding of how to create custom fields in Avo.

Copyable text field

Let’s implement an example of a copyable text field. This field type is useful when you want to display a text value that can be easily copied to the clipboard. It must looks like a default text field but with a copy button next to it. Obviously, the button should be displayed only on “Show” view since it doesn’t make sense to copy the value in the “Edit”/”Update” views.

In order to create a custom field in Avo we can use a scaffold generator that will create the necessary files for us.

$ rails generate avo:field copyable_text --field_template text
Avo pro 3.6.1
      create  app/components/avo/fields/copyable_text_field
      create  app/components/avo/fields/copyable_text_field/edit_component.html.erb
      create  app/components/avo/fields/copyable_text_field/edit_component.rb
      create  app/components/avo/fields/copyable_text_field/index_component.html.erb
      create  app/components/avo/fields/copyable_text_field/index_component.rb
      create  app/components/avo/fields/copyable_text_field/show_component.html.erb
      create  app/components/avo/fields/copyable_text_field/show_component.rb
      create  app/avo/fields/copyable_text_field.rb

Since we would like to reuse the existing text field template, we can pass the text field template as an argument to the generator. In this example, we generate a new field called copyable_text using text as a base template. The generator creates the necessary files for the new field type, including the edit, index, and show components. Since the only change is the addition of the copy button, we only need to modify the show_component.html.erb file.

<%= field_wrapper(**field_wrapper_args) do %>
  
    data-controller='clipboard'
    class='flex flex-row items-center gap-2'>

     data-clipboard-target='source'> <%= @field.value %> 

      class='hidden'
      data-action='click->clipboard#copy'
      data-clipboard-target='copy'>
      <%= helpers.svg 'heroicons/outline/document-duplicate', class: 'h-5 my-1' %>
    
     data-controller='tippy'>
      
        class='hidden'
        data-tippy-target='source'
        data-clipboard-target='done'>
        <%= helpers.svg('heroicons/outline/check', class: 'h-5 my-1') %>
      
       class='hidden' data-tippy-target='content'>Copied!


  
<% end %>

Here we display the actual value of the field and add a button that copies the value to the clipboard. The button is actually just a SVG icon from Heroicons collection that looks like a copy icon. As an addition, we use the tooltip to show the “Copied!” message when the value is copied and also change the icon to a checkmark. The tippy controller is defined by the Avo itself so we don’t need to add it manually. However, the clipboard is a custom Stimulus controller that we need to define in the JavaScript.

// app/javascript/controllers/clipboard_controller.js

import { Controller } from "@hotwired/stimulus";

const TOGGLE_TARGET_TIME = 3000;

export default class extends Controller {
  static targets = ["source", "copy", "done"];

  connect() {
    // show the copy target only if the clipboard API is supported
    if ("clipboard" in navigator) {
      this.showCopyTarget();
    }
  }

  copy(event) {
    event.preventDefault();
    navigator.clipboard.writeText(this.sourceTarget.innerText);

    this.showDoneTarget();

    setTimeout(() => this.showCopyTarget(), TOGGLE_TARGET_TIME);
  }

  showDoneTarget() {
    this.copyTarget.classList.add("hidden");
    this.doneTarget.classList.remove("hidden");
  }

  showCopyTarget() {
    this.doneTarget.classList.add("hidden");
    this.copyTarget.classList.remove("hidden");
  }
}

Here is a brief explanation of the code:

We show the copy button only if the clipboard API is supported.
When the copy button is clicked, we write source (the value of the field) to the clipboard.
We show the done target (the “Copied!” message) and hide the copy target (the copy button).
After 3 seconds, we show the copy target again.

That’s pretty much it, we just need to change the type of the field in the resource definition to use it:

def fields
  field :id, as: :id, link_to_record: true
- field :email, as: :text, required: true
+ field :email, as: :copyable_text, required: true
  field :name, as: :text, required: true
end

Now we are ready to check the results in the Avo interface:

Wrap-up

In this post, we have explored how to create custom fields in Avo. The example we have implemented is quite simple, but it demonstrates the process of creating a custom field in Avo. Avo allows you to create custom fields with a high flexibility, so the fields can be much more complex than the one we have implemented.

Follow the official guide to learn more about creating custom fields in Avo: Custom fields.

Cross account Amazon ECR images

2024-04-07T10:14:57+00:00

Sharing Amazon ECR images across accounts is a common requirement, especially in scenarios where a CI/CD pipeline resides in one account and the production environment operates in another. In this post, we’ll explore various methods to achieve this goal, highlighting the pros and cons of each approach.

Method	Description	Pros	Cons
Public ECR Repository	Making the ECR repository public allows easy access to images across accounts. However, this approach is not recommended for production workloads due to security concerns.	Easy implementation	Security risk; not suitable for production environments
Cross-account IAM Role	Creating a cross-account IAM role enables controlled access for the production account to pull images from the CI/CD account. This ensures security and fine-grained access control.	Secure access control	Requires careful IAM role setup and management
Resource-based Policy	Utilizing a resource-based policy grants the production account the necessary permissions to pull images from the CI/CD account. Offers flexibility and security through policy enforcement.	Granular control over access permissions	Policy management overhead; complexity in setup and maintenance
Pushing to Multiple ECR Repositories	Pushing the same image to multiple ECR repositories during the CI/CD pipeline ensures availability across accounts. This method simplifies access but requires synchronization and management of multiple repositories.	Simplified access management	Potential synchronization issues; increased repository management overhead
Private Image Replication	Replicating images from private repositories across regions or accounts provides redundancy and availability. This method ensures consistency and reliability of image distribution.	High reliability and consistency	Requires additional setup and configuration; potential data transfer costs

Each approach offers distinct advantages and considerations. The choice depends on factors such as security requirements, ease of implementation, and the specific needs of your infrastructure. In this post we will explore how to implement the resource based policy approach.

ECS resource-based policy

The resource-based policy approach involves creating a policy that grants the production account the necessary permissions to pull images from the CI/CD account. Let’s assume that the production account needs to pull ECR images from the CI/CD account to start the ECS task. That means the CI/CD account needs to grant the production account the necessary permissions to pull images from the ECR repository. The RepositoryPolicyText property of the AWS::ECR::Repository resource in CloudFormation allows you to define a resource-based policy for the ECR repository:

Resources:
  MyECRRepository:
    Type: AWS::ECR::Repository
    Properties:
      RepositoryName: my-ecr-repository
      RepositoryPolicyText:
        Version: "2012-10-17"
        Statement:
          - Sid: AllowPull
            Effect: Allow
            Principal:
              AWS: !Sub "arn:aws:iam::${ProductionAccountId}:role/ecs-task-role"
            Action:
              - ecr:BatchGetImage
              - ecr:GetDownloadUrlForLayer
              - ecr:BatchCheckLayerAvailability

Lambda function resource-based policy

If you need to pull the ECR to production account to deploy the Lambda function, you can use the following policy:

Resources:
  MyECRRepository:
    Type: AWS::ECR::Repository
    Properties:
      RepositoryName: my-ecr-repository
      RepositoryPolicyText:
        Version: "2012-10-17"
        Statement:
          - Sid: AllowPull
            Effect: Allow
            Principal:
              Service: lambda.amazonaws.com
            Action:
              - ecr:BatchGetImage
              - ecr:GetDownloadUrlForLayer
              - ecr:BatchCheckLayerAvailability
              - ecr:GetRepositoryPolicy
              - ecr:SetRepositoryPolicy
            Condition:
              StringEquals:
                aws:sourceArn: !Sub "arn:aws:lambda:${AWS::Region}:${AWS::AccountId}:function:my-lambda-function"

The difference between the two policies is the Principal field and the list of actions. Image-based lambdas can add permissions only if the principal calling Lambda has ecr:getRepositoryPolicy and ecr:setRepositoryPolicy permissions so they are required in the policy. The condition field is used to restrict the access to the ECR repository only for the specific Lambda function.

Root account resource-based policy

The root based policy is the most permissive policy. It allows any account service to pull the images from the ECR repository.

Resources:
  MyECRRepository:
    Type: AWS::ECR::Repository
    Properties:
      RepositoryName: my-ecr-repository
      RepositoryPolicyText:
        Version: "2012-10-17"
        Statement:
          - Sid: AllowPull
            Effect: Allow
            Principal:
              AWS: !Sub "arn:aws:iam::${ProductionAccountId}:root"
            Action:
              - ecr:BatchGetImage
              - ecr:GetDownloadUrlForLayer
              - ecr:BatchCheckLayerAvailability

However, using the root account in a resource-based policy grants the highest level of permissions within AWS. While this approach may seem convenient for allowing any account service to pull images from the ECR repository, it comes with significant drawbacks and security concerns.

Wrap up

In this post, we explored various methods to share Amazon ECR images across accounts. We discussed the pros and cons of each approach and provided examples of how to implement a resource-based policy for ECR repositories.

Resources:

Chartkick and turbo frames - elevating rails visuals

2024-02-26T10:14:57+00:00

In today’s digital landscape, delivering dynamic and interactive content is essential for engaging user experiences. Rails developers often leverage powerful tools like Chartkick to visualize data seamlessly within their applications.

Concurrently, Turbo Frames offer a streamlined way to update parts of a webpage without a full reload, enhancing responsiveness and user experience. In this blog post, we’ll explore the synergy between Chartkick charts and Turbo Frames in Rails applications, empowering developers to create rich, real-time data visualizations within a fluid user interface.

Make a dashboard

To begin implementing our dashboard with its accompanying charts, we first need to create a controller to manage these features.

$ rails g controller MyDashboardController

This command generates a new controller named MyDashboardController, which will handle the logic for rendering the dashboard and fetching data for the charts.

Inside our MyDashboardController, we define three actions:

class MyDashboardController < ApplicationController
  def index
    # Action to render the main dashboard view
  end

  def users_chart
    # Action to fetch data for the users chart
  end

  def orders_chart
    # Action to fetch data for the orders chart
  end
end

Here’s a breakdown of each action:

index: this action is responsible for rendering the main dashboard view. It serves as the entry point for users accessing the dashboard interface.
users_chart and orders_chart: these actions serve as endpoints to fetch data for the respective charts displayed on the dashboard. They will handle any necessary data processing and formatting required for visualization.

Now that we have our controller set up, we need to define routes to access these actions:

# config/routes.rb

get 'my_dashboard', to: 'my_dashboard#index'
get 'my_dashboard/users', to: 'my_dashboard#users_chart'
get 'my_dashboard/orders', to: 'my_dashboard#orders_chart'

These route definitions map specific URLs to the corresponding actions in the MyDashboardController. Users can access the dashboard and retrieve data for the charts by navigating to these URLs. The index action renders the main dashboard view, while the users_chart and orders_chart actions fetch data for the respective charts.

With our controller and routes in place, we’re now ready to build out the dashboard interface and integrate the charts seamlessly.

Add turbo frames

After implementing the index view, we can finally start visualizing our dashboard. Let’s take a look at the initial structure of our dashboard page:

   id='header' class='flex'>
     class='bg-white p-3 w-full rounded text-center uppercase text-lg font-bold'>
      My Dashboard
    
   id='charts' class='flex flex-col justify-between mt-2 gap-2'>

In this snippet, we’ve created a simple layout with two sections: a header and a container for charts. To expedite styling, we’ve utilized Tailwind CSS.

As the next step, we’ll integrate Turbo Frames to load our charts simultaneously. Turbo Frames provide a seamless way to fetch and display content without full page reloads. Let’s add Turbo Frames for each chart:

+    <% %i[users_chart orders_chart].each do |action| %>
+      <%= turbo_frame_tag action, src: url_for(action: action), loading: :lazy, class: 'w-full'  %>
+    <% end %>
  

In this code snippet, we iterate through an array containing the actions (users_chart and orders_chart) for our charts. For each action, we create a Turbo Frame with a unique identifier (action) and specify the endpoint using url_for(action: action). Additionally, we include the loading: :lazy parameter to enable lazy-loading, ensuring that the frames are fetched lazily.

Elevate the visual with a chartkick

To visualize the data with Chartkick, we first need to retrieve it from the database using ActiveRecord and then pass it to our views. Let’s walk through the implementation of the users_chart action in our controller:

class MyDashboardController < ApplicationController
+  FRAME_CHART_PARTIAL = 'my_dashboard/frame_chart'

  def users_chart
+    data = User.group_by_day(:created_at).count
+    opts = {
+      title: 'Users created',
+      colors: %w(#3b82f6)
+    }
+
+    render partial: FRAME_CHART_PARTIAL,
+           locals: { id: :users_chart, data: data, opts: opts }
  end
end

In the users_chart action, we group users by the created_at field and count the number of users created on each day. Then we defined the options for our chartkick helpers and render the partial that will use the defined variables.

Note: the group_by_day method is just a high level DSL defined by groupdate gem.

Next we can add a second action orders_chart:

class MyDashboardController < ApplicationController
+  def orders_chart
+    data = Order.group(:order_type).group_by_day(:created_at).sum(:quantity)
+    opts = {
+      title: 'Orders created',
+      colors: %w(#3b82f6 #22c55e)
+    }
+
+    render partial: FRAME_CHART_PARTIAL,
+           locals: { id: :orders_chart, data: data, opts: opts }
+  end
end

Similarly, in the orders_chart action, we group orders by both order_type and created_at, summing the quantity of orders for each day and type.

Finally, we can define the partial view that will utilize the provided data and visualize it:

<%= turbo_frame_tag(id) do %>
   class="bg-white p-3 rounded">
    <%= line_chart data, id: "#{id}-chart", **opts %>
  

<% end %>

In this partial, we use turbo_frame_tag to wrap our chart and ensure it correctly replaces the designated area in the index template. Within the frame, we utilize the line_chart helper provided by Chartkick to render the chart based on the provided data and options.

Wrap-up

In this blog post, we’ve successfully implemented dynamic charts in our Rails application’s dashboard using Chartkick and Turbo Frames. By fetching and visualizing data with ActiveRecord and Chartkick helpers, and seamlessly integrating them into the interface with Turbo Frames.

Configuring MFA delete on S3 bucket

2024-02-15T10:14:57+00:00

MFA Delete adds an additional layer of security to your S3 buckets by requiring authentication via MFA before allowing the permanent deletion of objects. This means that even if an unauthorized user gains access to your AWS credentials, they cannot delete objects from your S3 bucket without providing a valid MFA code.

Without MFA Delete enabled, a compromised set of credentials could lead to irreversible data loss or tampering. Malicious actors or inadvertent actions could result in the accidental deletion of critical files, causing financial losses, compliance violations, and reputational damage. MFA Delete mitigates these risks by introducing an extra verification step, significantly reducing the likelihood of unauthorized deletions.

Configure MFA Device in AWS Console

Before proceeding to enable MFA Delete for your S3 bucket, it’s crucial to set up a Multi-Factor Authentication (MFA) device for your IAM user account. AWS offers various MFA methods, including FIDO security keys, virtual authenticator apps, hardware TOTP tokens, and specialized options for the AWS GovCloud (US) Regions. For a comprehensive overview of MFA options, you can refer to the Multi-Factor Authentication (MFA) for IAM blog post.

In this guide, we’ll walk through setting up a virtual authenticator app, a popular and convenient choice:

Sign in to AWS Console.
Access IAM User Settings. Navigate to the Identity and Access Management (IAM) dashboard by selecting IAM from the services menu.
Select User and Enable MFA. Locate and select the IAM user account for which you want to enable MFA. Under the “Security credentials” tab, find the “Multi-factor authentication (MFA)” section and click on “Manage MFA device.”
Choose Virtual MFA Device. Select the option for a virtual MFA device.
Scan QR Code or Enter Secret Key. Using your preferred authenticator app (such as Google Authenticator or Authy), scan the QR code displayed on the screen or manually enter the provided secret key.
Verify MFA Configuration. After adding the MFA device to your authenticator app, you’ll be prompted to enter a code generated by the app to verify the setup. Enter the code to complete the configuration process.
Confirmation. Once verified, MFA will be enabled for your IAM user account, adding an extra layer of security to your AWS access.

Enable MFA delete

AWS provides flexibility in enabling MFA delete functionality either through a REST API call or via the AWS Command Line Interface (CLI). Utilizing the CLI offers a straightforward approach to configure MFA delete, as demonstrated below:

aws s3api put-bucket-versioning --bucket BUCKET_NAME \
  --versioning-configuration Status=Enabled,MFADelete=Enabled \
  --mfa "SERIAL 123456"

Here’s a breakdown of the parameters:

BUCKET_NAME - replace this with the name of the bucket for which you want to enable MFA delete. SERIAL - represents the ARN (Amazon Resource Name) of the authenticator app associated with your IAM user account, which can be found in the AWS Management Console. 123456 - this is the PIN generated by your authenticator app for the MFA operation.

For example:

aws s3api put-bucket-versioning --bucket my_unique_bucket \
  --versioning-configuration Status=Enabled,MFADelete=Disabled \
  --mfa "arn:aws:iam::1231234322:mfa/GoogleAuthenticator 352818"

In this example, my_unique_bucket is the name of the S3 bucket, arn:aws:iam::1231234322:mfa/GoogleAuthenticator represents the ARN of the authenticator app associated with the IAM user, and 352818 is the generated PIN for MFA authentication.

By executing this command, we enable versioning and MFA delete for the specified S3 bucket, ensuring an added layer of security to prevent unauthorized deletions of objects. This streamlined approach via the AWS CLI simplifies the process of configuring MFA delete, enhancing the protection of your data stored in Amazon S3.

How MySQL gap lock can lead to deadlock

2023-02-15T10:14:57+00:00

MySQL implements gap locks as a locking mechanism to control access to a table. A gap lock can be used in a SELECT statement with the FOR UPDATE or LOCK IN SHARE MODE clause, to lock the gap and prevent other transactions from inserting a new row with a duplicate key value in the gap. This can be useful for enforcing unique constraints or ensuring consistency when processing a series of related transactions.

However, a gap lock can lead to a deadlock when two or more transactions try to lock the same gap simultaneously, and each transaction is waiting for the other to release the lock. This results in a circular wait, where each transaction is waiting for the lock held by the other, and neither transaction can proceed.

Let’s find out how gap locking works and what actually triggers a deadlock.

Table structure

Suppose we have a table named products with column code, and we want the values inserted to this column to be unique. The most straightforward way to achieve this is to add a unique index to the code column. The table structure can look like this:

+------------+--------------+------+-----+---------+----------------+
| Field      | Type         | Null | Key | Default | Extra          |
+------------+--------------+------+-----+---------+----------------+
| id         | bigint(20)   | NO   | PRI | NULL    | auto_increment |
| code       | varchar(255) | NO   | UNI | NULL    |                |
+------------+--------------+------+-----+---------+----------------+

The following products table has only two columns:

id column: A bigint(20) data type column which is the primary key of the table, with auto-increment enabled.
code column: A varchar(255) data type column which is set to not allow null values and has a unique constraint. This column is used to store the code of the product and ensures that each product code is unique.

Concurrent inserts and gap lock

Gap locks can result in a situation called a gap lock wait, where one transaction waits for another transaction holding a gap lock to release it. This is needed in order to guarantee data consistency and not break the unique constraint.

Such a lock can be easily demonstrated using two concurrent transactions (running using two different connections).

Transaction 1:

START TRANSACTION; -- T1

INSERT INTO `products` (`code`) VALUES ('112');

Transaction 2:

START TRANSACTION; -- T2

INSERT INTO `products` (`code`) VALUES ('112');

If we execute T1, it will start a new transaction and insert a new row to products table. Which obviously will not be visible until the transaction is committed. However, if we start another transaction T2 using a separate connection and try to insert a product with the same code, it will be locked. MySQL understands that there is a separate transaction that can be committed or rolled back and waits for that transaction to finish.

Analyzing the deadlock

In the example above, both transactions try to insert a row with the same code value ‘112’, but the gap lock acquired by T1 prevents T2 from inserting a row with the same value. As a result, T2 has to wait for T1 to release the lock on the gap, which can lead to a potential deadlock if T1 is also waiting for a lock held by T2.

The simplest way to trigger a deadlock is to expand T1 and insert one more row which will span the gap lock on T2:

START TRANSACTION; -- T1

INSERT INTO `products` (`code`) VALUES ('112');

INSERT INTO `products` (`code`) VALUES ('111'); -- triggers deadlock

Fortunately, MySQL is smart enough to detect a deadlock and mitigate the issue. If we check the latest innodb status by running the command show engine innodb status;, we will find extra information about the latest detected deadlock:

------------------------
LATEST DETECTED DEADLOCK
------------------------
2023-02-15 18:51:25 0x170577000
*** (1) TRANSACTION:
TRANSACTION 1255170, ACTIVE 4 sec inserting
mysql tables in use 1, locked 1
LOCK WAIT 2 lock struct(s), heap size 1136, 1 row lock(s), undo log entries 1
MySQL thread id 234, OS thread handle 6180401152, query id 1768461 localhost root update
INSERT INTO `products` (`code`) VALUES ('112')
*** (1) WAITING FOR THIS LOCK TO BE GRANTED:
RECORD LOCKS space id 134040 page no 4 n bits 72 index index_products_on_code of table `products` trx id 1255170 lock mode S waiting
Record lock, heap no 2 PHYSICAL RECORD: n_fields 2; compact format; info bits 0
 0: len 3; hex 313132; asc 112;;
 1: len 8; hex 800000000000000a; asc         ;;

*** (2) TRANSACTION:
TRANSACTION 1255169, ACTIVE 11 sec inserting
mysql tables in use 1, locked 1
3 lock struct(s), heap size 1136, 2 row lock(s), undo log entries 2
MySQL thread id 233, OS thread handle 6179745792, query id 1768462 localhost root update
INSERT INTO `products` (`code`) VALUES ('111')
*** (2) HOLDS THE LOCK(S):
RECORD LOCKS space id 134040 page no 4 n bits 72 index index_products_on_code of table `products` trx id 1255169 lock_mode X locks rec but not gap
Record lock, heap no 2 PHYSICAL RECORD: n_fields 2; compact format; info bits 0
 0: len 3; hex 313132; asc 112;;
 1: len 8; hex 800000000000000a; asc         ;;

*** (2) WAITING FOR THIS LOCK TO BE GRANTED:
RECORD LOCKS space id 134040 page no 4 n bits 72 index index_products_on_code of table `products` trx id 1255169 lock_mode X locks gap before rec insert intention waiting
Record lock, heap no 2 PHYSICAL RECORD: n_fields 2; compact format; info bits 0
 0: len 3; hex 313132; asc 112;;
 1: len 8; hex 800000000000000a; asc         ;;

*** WE ROLL BACK TRANSACTION (1)

Let’s go through the report:

There are two transactions (TRANSACTION 1255170 and TRANSACTION 1255169) that are inserting a new product into the products table. Both transactions have acquired locks on the same index page for the code column using different lock modes.
Transaction 1255169 is holding an exclusive (X) lock on a row with a code value of ‘112’, while transaction 1255170 is waiting for a shared (S) lock on the same row. At the same time, transaction 1255170 is holding a shared (S) lock on the index page, while transaction 1255169 is waiting for an exclusive (X) lock on the gap before the ‘112’ row.
Because of this circular wait, neither transaction can proceed, and a deadlock is detected. To resolve the deadlock, the database automatically chooses one of the transactions to roll back (in this case, transaction 1255170). The other transaction (transaction 1255169) is allowed to proceed and complete its insert operation.

Wrap up

Databases like any software are not ideal and can’t prevent you from having issues. Deadlock is one of the issues which could be easily happened and developers should know how to understand and fix the problem.

Obviously, to prevent deadlocks, it’s important to design your database schema and queries in a way that minimizes the chances of two or more transactions conflicting with each other. In addition, you can use features like row-level locking, transaction isolation levels, and query optimization techniques to reduce the likelihood of deadlocks in your application.

AWS S3 replication across different accounts

2023-02-05T13:14:57+00:00

AWS S3 Bucket replication is an incredibly powerful and cost-effective way to ensure your data remains safe, secure, and accessible in the cloud. With S3 bucket replication, you can replicate entire buckets of data across multiple AWS accounts and even regions for added redundancy and reliability. This ensures that if one region experiences a disruption or outage, your data will still be available from another region.

Replicating an S3 bucket is also very simple to do with just a few clicks in the AWS Console or through the use of APIs. There is an official article on AWS docs explaining how to configure such a replication. However, making these changes in the infrastructure-as-code is not that straightforward. In this article, we are going to replicate s3 bucket across different accounts using CloudFormation and yaml notation.

What, where, and how to replicate?

Let’s assume we have defined two S3 buckets:

SourceBucket:
  Type: AWS::S3::Bucket
  Properties:
    BucketName: my-source-bucket

DestinationBucket:
  Type: AWS::S3::Bucket
  Properties:
    BucketName: my-destination-bucket

As the naming implies we need to replicate from SourceBucket to DestinationBucket. Since buckets are owned by different AWS accounts we can’t replicate in a usual way just defining the replication configuration on the SourceBucket. In order to replicate to a bucket in a different AWS account, the SourceBucket must be allowed to replicate objects to the destination. It can be done following these steps:

Create replication configuration on the source bucket
Create policy on the source bucket to allow replicating to the destination bucket
Create policy on the destination bucket to allow the owner of the source bucket to replicate objects

Source bucket replication configuration

At the very beginning, we need a role, which will be attached to the bucket and to the policies:

SourceBucketReplicationRole:
  Type: 'AWS::IAM::Role'
  Properties:
    AssumeRolePolicyDocument:
      Statement:
        - Action:
            - 'sts:AssumeRole'
          Effect: Allow
          Principal:
            Service:
              - s3.amazonaws.com

In the next step, we need to attach a newly created role to the SourceBucket creating bucket replication configuration. In the example below, basically, it just defines the role to be attached and the destination to replicate:

SourceBucket:
  Type: AWS::S3::Bucket
  Properties:
    BucketName: my-source-bucket
+   ReplicationConfiguration:
+     Role: !GetAtt 'SourceBucketReplicationRole.Arn'
+     Rules:
+       - Destination:
+           Bucket: !GetAtt 'DestinationBucket.Arn'
+         Status: Enable

Source bucket replication policy

The most important part is to properly define the policy and attach it to our newly created role. In the snippet below we create IAM policy with the following statements:

Allow performing s3:GetReplicationConfiguration and s3:ListBucket on SourceBucket resource. This is obviously needed to read the replication configuration and list the bucket in order to replicate it somewhere.
Allow performing s3:GetObjectVersion and s3:GetObjectVersionAcl. Replication requires versioning to be enabled.
Allow performing s3:ReplicateObject and s3:ReplicateDelete on DestinationBucket. Obviously, SourceBucket must be allowed to create and delete objects during replication on DestinationBucket.

SourceBucketReplicationPolicy:
  Type: 'AWS::IAM::Policy'
  Properties:
    PolicyName: SourceBucketReplicationPolicy
    Roles:
      - !Ref SourceBucketReplicationRole
    PolicyDocument:
      Statement:
        - Action:
            - 's3:GetReplicationConfiguration'
            - 's3:ListBucket'
          Effect: Allow
          Resource: !GetAtt 'SourceBucket.Arn'
        - Action:
            - 's3:GetObjectVersion'
            - 's3:GetObjectVersionAcl'
          Effect: Allow
          Resource: !Sub '${SourceBucket.Arn}/*'
        - Action:
            - 's3:ReplicateObject'
            - 's3:ReplicateDelete'
          Effect: Allow
          Resource: !Sub '${DestinationBucket.Arn}/*'

Destination bucket policy

The destination bucket policy looks similar to the one we defined above but operates on the source bucket replication role. So here we create IAM policy with the following statements:

Allow s3:ReplicateObject and s3:ReplicateDelete on DestinationBucket. This allows resources with SourceBucketReplicationRole role to replicate objects to the destination bucket.
Allow s:List*, s3:GetBucketVersioning and s3:PutBucketVersioning on SourceBucketReplicationRole.

DestinationBucketPolicy:
  Type: AWS::S3::BucketPolicy
  Properties:
    Bucket: !Ref 'DestinationBucket'
    PolicyDocument:
      Version: "2012-10-17"
      Statement:
        - Sid: Set permissions for objects
          Effect: Allow
          Principal:
            AWS: !Ref SourceBucketReplicationRole
          Action:
            - s3:ReplicateObject
            - s3:ReplicateDelete
          Resource: !Sub '${DestinationBucket}/*'
        - Sid: Set permissions on bucket
          Effect: Allow
          Principal:
            AWS: !Ref SourceBucketReplicationRole
          Action:
            - s3:List*
            - s3:GetBucketVersioning
            - s3:PutBucketVersioning
          Resource: !Ref SourceBucketReplicationRole

Change replica owner

In replication, the owner of the source object also owns the replica by default. When the source and destination buckets are owned by different AWS accounts and you want to change replica ownership to the AWS account that owns the destination buckets, you can add optional configuration settings to change replica ownership to the AWS account that owns the destination buckets.

A few tweaks are needed to change the replica owner using our snippets above.

At first, we need to add the owner override option to the replication configuration. Important to note, this option must be added only when the source and destination buckets are owned by different AWS accounts.

SourceBucket:
  Type: AWS::S3::Bucket
  Properties:
    BucketName: my-source-bucket
    ReplicationConfiguration:
      Role: !GetAtt 'SourceBucketReplicationRole.Arn'
      Rules:
        - Destination:
            Bucket: !GetAtt 'DestinationBucket.Arn'
+           Account: 
+           AccessControlTranslation:
+               Owner: 'Destination'
          Status: Enable

Granting Amazon S3 permission to change replica ownership. This is the IAM role that you specified in the replication configuration that allows Amazon S3 to assume and replicate objects on your behalf.

SourceBucketReplicationPolicy:
  Type: 'AWS::IAM::Policy'
  Properties:
    PolicyName: SourceBucketReplicationPolicy
    Roles:
      - !Ref SourceBucketReplicationRole
    PolicyDocument:
      Statement:
       ....
        - Action:
            - 's3:ReplicateObject'
            - 's3:ReplicateDelete'
+           - 's3:ObjectOwnerOverrideToBucketOwner'
          Effect: Allow
          Resource: !Sub '${DestinationBucket.Arn}/*'

Adding permission in the destination bucket policy to allow changing replica ownership. The owner of the destination bucket must grant the owner of the source bucket permission to change replica ownership. This allows the destination bucket owner to accept ownership of the object replicas.

DestinationBucketPolicy:
  Type: AWS::S3::BucketPolicy
  Properties:
    Bucket: !Ref 'DestinationBucket'
    PolicyDocument:
      Version: "2012-10-17"
      Statement:
        - Sid: Set permissions for objects
          Effect: Allow
          Principal:
            AWS: !Ref SourceBucketReplicationRole
          Action:
            - s3:ReplicateObject
            - s3:ReplicateDelete
+           - s3:ObjectOwnerOverrideToBucketOwner
          Resource: !Sub '${DestinationBucket}/*'
          ...

Wrap up

AWS S3 Bucket Replication across multiple AWS accounts is an incredibly powerful tool for businesses that need to ensure their data is secure and always available. The best part about using the AWS S3 bucket replication feature is how easy it makes sharing files between different accounts without having to manually transfer them over each time. However, the process of configuring such a replication using CloudFormation is not that straightforward. But still, if you understand the idea it connects the dots. Just to wrap up, it can be done in a few steps:

Create the replication configuration
Create the policy on the source bucket to write to the destination
Create the policy on the destination bucket to allow replicating from the source
Change replica owner upon replication

Splitting Rails migration into smaller pieces

2020-10-09T13:14:57+00:00

ActiveRecord migration is a great abstraction over the database schema manipulation. It looks understandable and works pretty well, however, it can take a while to migrate tables with billions of records and the developer would need to have some extra control over it.

Often the ActiveRecord migration produces several SQL queries in a single command. It can alter table multiple times which will lead to multiple long SQL queries on huge tables.

If the table is huge and such a migration takes a lot of time, it sounds natural to split such a migration into smaller pieces and run them without locking the table (if that is possible).

In this article we are going to explore how to:

log the SQL queries executed during the ActiveRecord migration
split such a migration into smaller atomic SQL queries
run these queries step by step through the rake task

Log ActiveRecord SQL queries

Let’s say we would like to migrate our very big table payments and add a reference to the user. We can pre-generate the migration:

$ rails g migration add_user_id_to_payments user:references

This will scaffold the ActiveRecord migration which looks like this:

class AddUserIdToPayments < ActiveRecord::Migration[5.2]
  def change
    add_reference :payments, :user, foreign_key: true
  end
end

But if we run the migration locally, we will see that we there is no SQL output, just the migration logs:

$ bundle exec rake db:migrate

== 20201019174359 AddUserIdToPayments: migrating ==============================
-- add_reference(:payments, :user, {:foreign_key=>true})
== 20201019174359 AddUserIdToPayments: migrated (101.1673s) =====================

To be able to see the SQL queries executed during this migration process, we can make a slight change to the migration itself and redirect the ActiveRecord log to stdout:

class AddUserIdToPayments < ActiveRecord::Migration[5.2]
+ ActiveRecord::Base.logger = Logger.new(STDOUT)

  def change
    add_reference :payments, :user, foreign_key: true
  end
end

If we rollback the migration and run it again, the SQL queries will be printed to the stdout:

$ bundle exec rake db:migrate

== 20201019174359 AddUserIdToPayments: migrating ==============================
-- add_reference(:payments, :user, {:foreign_key=>true})
D, [2020-10-19T20:44:34.434933 #75984] DEBUG -- :    ALTER TABLE `payments` ADD `user_id` bigint
D, [2020-10-19T20:44:34.502763 #75984] DEBUG -- :    CREATE  INDEX `index_payments_on_user_id`  ON `payments` (`user_id`)
D, [2020-10-19T20:44:35.199157 #75984] DEBUG -- :    ALTER TABLE `payments` ADD CONSTRAINT `fk_rails_39823123`
FOREIGN KEY (`user_id`)
  REFERENCES `users` (`id`)
== 20201019174359 AddUserIdToPayments: migrated (101.1673s) =====================

As we can notice, the migration performed three SQL queries:

Alter table to add a new column user_id
Create an index on the just added column
Alter table to add a foreign key constraint

Split migration

We know we need to add a reference to the payments table and it will take a lot of time on the production database because of the table size. However, as we can see we can split this migration into three parts, which are backward compatible and safely can be run as Online DDL without downtime.

Obviously, each query will take less time than running all of them. So our plan could be to run each SQL query manually as a non-blocking step on the live server. Often the migration step is blocking during the deployment, so we could delegate this work to the rake task:

namespace :payments do
  def execute(sql)
    ActiveRecord::Base.connection.execute(sql)
  end
  
  desc 'Add a user reference to the payments'
  task reference_user: :environment do
    ActiveRecord::Base.logger = Logger.new(STDOUT)
    
    case ENV['STEP']
    when '1'
      execute('ALTER TABLE `payments` ADD `user_id` bigint')
    when '2'
      execute('CREATE INDEX `index_payments_on_user_id` ON `payments` (`user_id`)')
    when '3'
      execute('ALTER TABLE `payments` ADD CONSTRAINT `fk_rails_39823123`')
    else
      puts 'nothing to do. Pass a STEP you would like to run: 1,2 or 3'
  end
end

Run the migration

Now we can deploy the rake task and migrate the database by hands, before the dependent code change is deployed. This will require some DevOps attention but it gives more control on how and when to perform the long-running migration:

$ STEP=1 bundle exec rake payments:reference_user
$ STEP=2 bundle exec rake payments:reference_user
$ STEP=3 bundle exec rake payments:reference_user

Also, it is not required to run all the steps at once thus the dev team can have some window for the deployment and migration.

Wrap up

In this article, we explored how to split long running migration into smaller parts and run them manually before deploying the dependent code changes. This can be useful when performing migrations on very big tables without downtime.

The alternative solution could be to:

create a new table with the needed structure
copy all the records from old table to the new one
delete the old table and rename the new one

This is well described in How to migrate large database tables without a headache. However, I find this approach much more complicated which could lead to data loss in case of simple mistakes.

Joining polymorphic associations in ActiveRecord

2020-09-30T13:14:57+00:00

Polymorphic associations in ActiveRecord allow to belong to more than one model on a single associations. The mechanism is very powerful because helps to DRY the code and make the database schema clean. Let’s have a quick example:

class Payment < ApplicationRecord
  belongs_to :subject, polymorphic: true
end

class User < ApplicationRecord
  has_many :payments, as: :subject
end

class Artist < ApplicationRecord
  has_many :payments, as: :subject
end

So the payment can belong to multiple type of models marked as subject and to distinguish them by subject_id and subject_type columns at the database level. In our simple example, we have two different entities who can create multiple payments: User and Artist.

Keep in mind, that this is not the payment between User and Artist, these are payments created by users or by artists. And such records are going to be stored in a single table payments. The first question which comes to the mind, what would happen if we try to join the polymorphic association to the Payment?

Join polymorphic association

> Payment.joins(:subject).last

ActiveRecord::EagerLoadPolymorphicError: Cannot eagerly load the polymorphic association :subject

The error says it is not able to eagerly load the polymorphic association. And that is reasonable because subject is a general name for our association and ActiveRecord doesn’t know what table to join on. If we try to construct a SQL query by hand we can end up by something like this:

SELECT payments.* FROM payments
  INNER JOIN users ON users.id = payments.subject_id

but here we join on a specific table users and payment actually can belong to artists so we miss some data we want. The solution might be to do multiple queries to join on multiple tables the payment belongs to.

Include polymorphic association

And that is what the ActiveRecord’s includes method does. It performs multiple queries to fetch the data. In the example below, if we change joins to includes the error is gone, however if we look closely to the explanation, we can see that ActiveRecord does as many extra queries to the database as the number of different types of models the polymorphic association has. In our case 2 queries: to table users and to table artists.

> Payment.includes(:subject).last #=> 
> Payment.includes(:subject).map(&:subject_id) #=> [2, 1, 2]

> Payment.includes(:subject).explain
=> EXPLAIN for: SELECT "payments".* FROM "payments"
2|0|0|SCAN TABLE payments

EXPLAIN for: SELECT "users".* FROM "users" WHERE "users"."id" = ? [["id", 2]]
2|0|0|SEARCH TABLE users USING INTEGER PRIMARY KEY (rowid=?)

EXPLAIN for: SELECT "artists".* FROM "artists" WHERE "artists"."id" IN (?, ?) [["id", 1], ["id", 2]]
2|0|0|SEARCH TABLE artists USING INTEGER PRIMARY KEY (rowid=?)

Define the association with a scope

What if our polymorphic association belongs to too many different type of models and we want to efficiently query by single association? A solution might be to define an extra association for this specific type of model:

class Payment < ApplicationRecord
  belongs_to :subject, polymorphic: true
+ belongs_to :user, -> { where(payments: { subject_type: 'User' }) }, foreign_key: 'subject_id'
end

In this example we defined a user association with an extra scope on it, so ActiveRecord can properly made the join and filter the associated records by subject_type:

> Payment.joins(:user).explain

=> EXPLAIN for: SELECT "payments".* FROM "payments"
   INNER JOIN "users" ON "users"."id" = "payments"."subject_id"
   AND "payments"."subject_type" = ? [["subject_type", "User"]]

4|0|0|SEARCH TABLE payments USING INDEX index_payments_on_subject_type_and_subject_id (subject_type=?)
11|0|0|SEARCH TABLE users USING INTEGER PRIMARY KEY (rowid=?)

Define the association through the self ref

There is another possible way to let the join work, but i find it a bit tricky:

class Payment < ApplicationRecord
  belongs_to :subject, polymorphic: true
+
+ has_one :self_ref, class_name: 'Payment', foreign_key: :id
+ has_one :user, through: :self_ref, source: :subject, source_type: 'User'
end

Here we define a self_ref association to have a relationship to self and then define the needed association to the user through self. Looks a bit hacky, right? But anyway it still works, even if it has one extra join to the self table:

> Payment.joins(:user).explain

=> EXPLAIN for: SELECT "payments".* FROM "payments"
   INNER JOIN "payments" "self_refs_payments_join" ON "self_refs_payments_join"."id" = "payments"."id"
   AND "self_refs_payments_join"."subject_type" = ?
   INNER JOIN "users" ON "users"."id" = "self_refs_payments_join"."subject_id" [["subject_type", "User"]]

4|0|0|SEARCH TABLE payments AS self_refs_payments_join USING COVERING INDEX index_payments_on_subject_type_and_subject_id (subject_type=?)
10|0|0|SEARCH TABLE payments USING INTEGER PRIMARY KEY (rowid=?)
13|0|0|SEARCH TABLE users USING INTEGER PRIMARY KEY (rowid=?)

Wrap up

In this article we discussed 3 possible solutions to deal with the polymorphic associations in ActiveRecord:

using includes
defining a new association with a scope
defining a new association with a self ref

I think the developer should take the one which fits the most of his needs or maybe even overthink if polymorphic association is required for that particular case.

Tracking PaperTrail versions while saving in batches

2020-09-16T13:14:57+00:00

Auditing model changes is a common task in modern software development. Whether it is a feature request or just some debugging purpose in mind when the complexity of the system grows it is natural to add the ability to quickly see the changes made by someone.

There are plenty of tools which allow to quickly add versions to the app and most of them have very nice DSL on top, which implies interaction with auditing to be very efficient.

However, it is often easy to solve the regular daily tasks, but it can be much harder to deal with more complicated issues.

Let’s say, we would like to efficiently track a group of model versions using PaperTrail which were created during a specific POST/PATCH request. This can be useful when the app has some heavy endpoint which doesn’t just create/update a single record in a database, but performs a batch of save operations on different kinds of models.

Tracking save requests

At first, we would like to have a mechanism to track save requests in the app. To solve that, we can just create a SaveRequest model with a few extra columns for debugging purposes.

A Rails migration could look like this:

create_table :save_requests do |t|
  t.datetime :created_at, null: false, default: -> { 'CURRENT_TIMESTAMP' }
  t.belongs_to :user, null: false
end

And the model would just be as simple as this:

class SaveRequest < ApplicationRecord
  belongs_to :user
end

Now, in the controller of our heavy endpoint we are going to create a new SaveRequest record on each create/update HTTP request:

class BatchCategoriesController < ApplicationController
  prepend_before_action :track_save_request, only: %i[update create]

  private

    def track_save_request
      @save_request ||= SaveRequest.create!(user: current_user)
    end
end

So we have an ability to track the save requests performed by the client using our endpoint. However, how can we track the changes made during this call?

Log versions created during save request

To be able to solve this, the first step will be to add a reference between SaveRequest and PaperTrail::Version models so the db schema would look like this:

A new migration just adds a new reference to versions table:

def change
  add_reference :versions, :save_request, foreign_key: true, index: true
end

and it looks obvious to add a has_many relation to the SaveRequest model now:

class SaveRequest < ApplicationRecord
  belongs_to :user
+ has_many :request_versions, class_name: 'PaperTrail::Version', foreign_key: :save_request_id
end

At this point we have a relationship between the save request and the versions, but how do we actually associate these records properly? The solution is not really straightforward and depends on the PaperTrail’s metadata feature.

PaperTrail allows passing some extra information to the versions by overriding the info_for_paper_trail method in the controller. So all the created versions in this endpoint will have that information.

That way we can attach the specific save request to the each of the created version:

class BatchCategoriesController < ApplicationController
+ attr_reader :save_request

  prepend_before_action :track_save_request, only: %i[update create]

+ # Store metadata for PaperTrail::Version
+ def info_for_paper_trail
+   { save_request_id: save_request.id }
+ end

  private

    def track_save_request
-     @save_request ||= SaveRequest.create!(user: current_user)
+     @save_request ||= PaperTrail.request(enabled: false) do
+       SaveRequest.create!(user: current_user)
+     end
    end
end

That’s actually it, let’s give it a try.

Demo time

If we create (or update) multiple categories using our BatchCategoriesController we will see that a new save request is created and there are PaperTrail versions associated with it:

> request = SaveRequest.last # =>
  # 

> request.request_versions.limit(2).map(&:changeset) # =>
  # [
  #   {
  #     "parent_category_id": [
  #       null,
  #       671
  #     ],
  #     "updated_at": [
  #       "2019-08-19 01:58:15 UTC",
  #       "2020-09-24 23:58:35 UTC"
  #     ]
  #   },
  #   {
  #     "name": [
  #       "Old Category Name",
  #       "New Category Name"
  #     ],
  #     "parent_category_id": [
  #       null,
  #       673
  #     ],
  #     "updated_at": [
  #       "2019-08-19 01:58:15 UTC",
  #       "2020-09-16 23:58:35 UTC"
  #     ]
  #   }
  # ]

Wrap Up

In this article, we described how to track paper trail versions on per save request basis using metadata to store information about the request at PaperTrail versions table. Such an approach will give an ability to quickly find the version changes made in a specific request.

There are some improvements to think about:

if the app cleans up stale versions, it would probably have to clean the orphan SaveRequest records as well
if the endpoint fails to process a save request, we would probably have to destroy the created SaveRequest record or wrap it into transaction and rollback it automatically
migrating versions table can be challenging if it is very big. Other techniques can be applied in order to speed up the process (i.e. creating a new table and copying the data)
delayed execution support can be added in different ways depending on the requirements (current save request can be passed to the job or a new record can be created to group versions created at the job level)

Tracking All Paper Trail Version From A Single Request With Correlation UUIDs

Writing Slack bot in Crystal programming language

2020-06-08T13:14:57+00:00

Crystal is a young statically typed programming language which is intended to be very fast (because of compile-time evaluation) and which has a very readable syntax similar to Ruby. Crystal is still not production ready and often introduce breaking changes during new releases. That means it can become painful to maintain a big codebase written in Crystal.

However, I personally think Crystal can be very suitable for small microservices. Just because the language is very fast and the microservice utilizes quite a small piece of code.

One of such simple examples we use in our company is a Slack bot, which helps people to find some project related information directly in Slack without bothering colleagues.

In this article you will find how to write such a bot in Crystal, deploy it and install to your slack workspace.

Bot definition

To extend some interactivity, Slack introduced slash commands. Basically it acts as a shortcut for some specific action directly in Slack. There are built-in commands and custom ones are allowed too.

Basically, if we would like to create a new slash command, we would need to have a standalone microservice available on the internet, which could handle the HTTP requests from Slack when users execute such a slash command.

So let’s give it a try. We will create /prince slash command which accepts some arguments and prints project related information to on-board our newcomers.

Start a new project

From the very beginning we would need to generate a new Crystal application:

$ crystal init app prince-slack_bot

It creates a new project skeleton for our app with a couple of important files/folders:

$ tree prince-slack_bot

prince-slack_bot
├── LICENSE
├── README.md
├── shard.yml
├── spec
│   ├── prince-slack_bot_spec.cr
│   └── spec_helper.cr
└── src
    └── prince-slack_bot.cr

2 directories, 6 files

shard.yml - this is where we will define our project specific settings, like a version of Crystal to run the app on, the version of our app, dependencies etc
src/ - a folder that holds our sources and the target to run
spec/ - tests for the sources

Let it serve

Our slack bot will have to be running as a standalone server, accept HTTP requests and respond to them. In order to do that we could use the HTTP Server available in the stdlib. However, it is quite minimalistic and lacks a couple of important features.

A more advanced alternative is Kemal, a defacto fast and effective web framework which perfectly matches our requirements (to build a microservice). We can easily add it to shard.yml as a project dependency:

dependencies:
  kemal:
    github: kemalcr/kemal

And install through the shards install command.

At this point, we are ready to create a serveable app, which could respond to HTTP requests. Let’s create src/app.cr file (which is will become a starting point for our app) and implement a server using Kemal:

require "kemal"

get "/" do
  "Prince Slack Bot is alive"
end

post "/command" do |env|
  env.response.content_type = "application/json"

  # TODO: process env.params and respond
  ({} of String => String).to_json
end

port = ENV["PORT"]?.try(&.to_i) || 3000
Kemal.run(port)

Here we defined 2 endpoints:

GET / - shows that our app is alive. Will be helpful to ensure our app is running once deployed.
POST /command - an actual endpoint to process a command from the slack app. Will accept JSON params and respond with JSON content.

We can easily try running our dummy app:

$ crystal src/app.cr
[development] Kemal is ready to lead at http://0.0.0.0:3000
2020-03-14 20:18:00 UTC 200 GET / 49.78µs

And see whether it works in a browser:

Process Slack requests

We ran a simple HTTP server which is able to accept commands through the /command endpoint. This is a time to add an ability to process them and return some results.

For slash commands, Slack uses text as a request body parameter to pass everything was typed after the command. For example, if user types /prince hello world in Slack, our server will be hitted by the HTTP request having hello world in text body param.

So we would just need to take the text param and parse it into something we can run:

module Prince::SlackBot
  def self.process(request)
    text = request.body["text"]
    parse(text).run
  end

  def self.parse(text)
    # TODO: parse command args into the runnable commands
  end
end

And at this point we can change our handler to process /command endpoint in src/app.cr file and use just defined high level command processor:

# src/app.cr

post "/command" do |env|
  env.response.content_type = "application/json"

-  # TODO: process env.params and respond
-  ({} of String => String).to_json
+  Prince::SlackBot.process(env.request).to_json
end

Define commands

We would like to define a notion of command, e. g. a slack user will have to type /prince cmd args, where cmd is a predefined command by our bot, and it accepts some arguments args.

In order to do so, we can just split our text HTTP parameter, extract command (the first word) the its arguments (the rest) and instantiate such a command:

def self.parse(text)
-  # TODO: parse command args into the processable commands
+  words = (text || "").split(' ', remove_empty: true)
+  cmd, args = words[0]?, words[1..-1]?

+  case cmd
+  when "help"
+    Command::Help.new args
+  when "status"
+    Command::Status.new args
+  when # a bag of other commands go here
+  else
+    Command::Help.new args
+  end
end

Command on other hand can be just a class, which accepts the arguments during initialization and responds to the #run method:

module Prince::SlackBot::Command
  class Help
    def initialize(@args = [] of String)
    end

    def run
      { "text" => help }
    end

    private def help
      <<-TEXT
      *Usage*: `/prince cmd args`
      *Available commands:*
       `help`   - prints this help
       `status` - prints the status of the prince app (Up/Down)
       # ...
      TEXT
    end
  end
end

Similar to requests, Slack expects JSON response with the text attribute inside. For the help command above we just send the help information in the text attribute.

Similar to the Help we can define other commands (to show the status of our app, to print links to GitHub repositories etc.)

Deploy

We need our app to be open to the world in order handle Slack requests. So we need to deploy it somewhere. The simplest way to deploy Crystal apps is using Heroku.

There is a great article which explains how to deploy Crystal apps using Crystal Heroku Buildpack. At some point we just need to push our code to heroku origin:

$ git push heroku master
Counting objects: 8, done.
Delta compression using up to 8 threads.
Compressing objects: 100% (17/17), done.
Writing objects: 100% (8/8), 0.94 KiB | 0 bytes/s, done.
Total 8 (delta 8), reused 0 (delta 0)
remote: Compressing source files... done.
remote: Building source:
remote:
remote: -----> Fetching set buildpack https://github.com/crystal-lang/heroku-buildpack-crystal.git... done
remote: -----> Crystal app detected
remote: -----> Installing Crystal (0.31.0 due to latest release at https://github.com/crystal-lang/crystal)
remote: -----> Installing Dependencies
remote: -----> Compiling src/app.cr (auto-detected from shard.yml)
remote:
remote: -----> Discovering process types
remote:        Procfile declares types     -> (none)
remote:        Default types for buildpack -> web
remote:
remote: -----> Compressing...
remote:        Done: 289.4K
remote: -----> Launching...
remote:        Released v3
remote:        https://prince-slack-bot.herokuapp.com deployed to Heroku
remote:
remote: Verifying deploy.... done.
To https://prince-slack-bot.herokuapp.com.git
 * [new branch]      master -> master

As we can see, it was successfully deployed and we can check our app availability at https://prince-slack-bot.herokuapp.com.

Configure Slack APP

Our bot is ready to handle Slack requests. However, Slack should know about it. There are a couple of steps to do here.

Create a new Slack app in the Slack workspace:

Define a Slack slash command filling a command name, request URL (the URL our bot is available at), description and some help information which will be shown to users:

Now, when the app is activated and becomes available as a slash command in our workspace, we can try typing /prince, hit enter and see the results.

Wrap up

In this article we showed how to create a Slack bot written in Crystal programming language, deployed it to Heroku and configured Slack application to interact with it.

In next articles we will talk about how to properly sign off Slack requests and write tests for our app.

Write

Avo custom fields

Copyable text field

Wrap-up

Cross account Amazon ECR images

ECS resource-based policy

Lambda function resource-based policy

Root account resource-based policy

Wrap up

Chartkick and turbo frames - elevating rails visuals

Make a dashboard

Add turbo frames

Elevate the visual with a chartkick

Wrap-up

Configuring MFA delete on S3 bucket

Configure MFA Device in AWS Console

Enable MFA delete

How MySQL gap lock can lead to deadlock

Table structure

Concurrent inserts and gap lock

Analyzing the deadlock

Wrap up

AWS S3 replication across different accounts

What, where, and how to replicate?

Source bucket replication configuration

Source bucket replication policy

Destination bucket policy

Change replica owner

Wrap up

Splitting Rails migration into smaller pieces

Log ActiveRecord SQL queries

Split migration

Run the migration

Wrap up

Joining polymorphic associations in ActiveRecord

Join polymorphic association

Include polymorphic association

Define the association with a scope

Define the association through the self ref

Wrap up

Tracking PaperTrail versions while saving in batches

Tracking save requests

Log versions created during save request

Demo time

Wrap Up

Related posts

Writing Slack bot in Crystal programming language

Bot definition

Start a new project

Let it serve

Process Slack requests

Define commands

Deploy

Configure Slack APP

Wrap up