<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:media="http://search.yahoo.com/mrss/"><channel><title><![CDATA[Random Musings]]></title><description><![CDATA[Thoughts, stories and ideas.]]></description><link>https://chengl.com/</link><image><url>https://chengl.com/favicon.png</url><title>Random Musings</title><link>https://chengl.com/</link></image><generator>Ghost 3.22</generator><lastBuildDate>Tue, 03 Mar 2026 12:05:18 GMT</lastBuildDate><atom:link href="https://chengl.com/rss/" rel="self" type="application/rss+xml"/><ttl>60</ttl><item><title><![CDATA[Let's Encrypt Intranet]]></title><description><![CDATA[How to use Let's Encrypt to get certificates for internal websites?]]></description><link>https://chengl.com/lets-encrypt-intranet/</link><guid isPermaLink="false">5f01557e38333a60592f2653</guid><category><![CDATA[Let's Encrypt]]></category><category><![CDATA[Intranet]]></category><category><![CDATA[HTTPS]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sun, 03 Sep 2017 14:33:24 GMT</pubDate><media:content url="https://chengl.com/content/images/2020/07/le-logo-wide-1.png" medium="image"/><content:encoded><![CDATA[<!--kg-card-begin: markdown--><img src="https://chengl.com/content/images/2020/07/le-logo-wide-1.png" alt="Let's Encrypt Intranet"><p><a href="https://letsencrypt.org/">Let's Encrypt</a> (LE) has been a popular choice for getting certs for public websites because it's free and automated. But how do you get certs for <strong>private</strong> websites, which are common in a company's intranet?</p>
<h2 id="problem">Problem</h2>
<ul>
<li>There's a web app in your company's intranet.</li>
<li>The web app has a fully qualified domain name (FQDN), e.g. <strong>foo.example.com</strong>, not an internal one like <strong>foo.internal</strong>.</li>
<li>It only resolves to a private IP behind VPN. Therefore, it's inaccessible without a valid VPN.</li>
<li>You want to add an extra layer of security by enabling HTTPS.</li>
</ul>
<p>How to get a cert for it? And how to automate it and get it for free?</p>
<p>You may be wondering why not use the company's wildcard cert (assuming it has one), i.e. <strong>*.example.com</strong>. That's because a wildcard cert only matches one level of subdomain. If you have a cert for <strong>*.example.com</strong>, it can be used for <strong>foo.example.com</strong>. But what about <strong>foo.bar.example.com</strong>? The problem remains.</p>
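<p>This one-level rule is easy to see with Go's <code>crypto/x509</code> hostname matching. A small sketch (the cert below is a bare in-memory object carrying only the wildcard SAN entry, just to exercise the matching logic):</p>
<pre><code class="language-go">package main

import (
	"crypto/x509"
	"fmt"
)

func main() {
	// A bare certificate whose only SAN entry is the wildcard name.
	cert := x509.Certificate{DNSNames: []string{"*.example.com"}}

	// A wildcard matches exactly one subdomain label.
	fmt.Println(cert.VerifyHostname("foo.example.com") == nil)     // true
	fmt.Println(cert.VerifyHostname("foo.bar.example.com") == nil) // false
}
</code></pre>
<p>The same rule applies in every TLS client, not just Go; it comes from the certificate name-matching rules that restrict a wildcard to the left-most label.</p>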
<h2 id="solution">Solution</h2>
<p>One quick and dirty way is to generate a self-signed cert. But browsers will show a cert warning because it's not trusted by any public CA. You don't want that.</p>
<p>How can you use LE when the site isn't publicly accessible? There's a simple and elegant solution.</p>
<p>LE relies on the <a href="https://ietf-wg-acme.github.io/acme/draft-ietf-acme-acme.html">ACME protocol</a> to verify domain ownership. There are a few types of <a href="https://ietf-wg-acme.github.io/acme/draft-ietf-acme-acme.html#rfc.section.8">challenges</a> that ACME uses for this. The most common one is the <a href="https://ietf-wg-acme.github.io/acme/draft-ietf-acme-acme.html#rfc.section.8.3">HTTP Challenge</a>, but it's not applicable in this case because the intranet site's port 80 isn't publicly accessible. Similarly, the <a href="https://ietf-wg-acme.github.io/acme/draft-ietf-acme-acme.html#rfc.section.8.4">TLS SNI Challenge</a> can't be used because port 443 isn't publicly accessible either. The <a href="https://ietf-wg-acme.github.io/acme/draft-ietf-acme-acme.html#out-of-band-challenge">Out-of-Band Challenge</a> isn't really automated by definition. The only option left is the <a href="https://ietf-wg-acme.github.io/acme/draft-ietf-acme-acme.html#dns-challenge">DNS Challenge</a>, which works by provisioning a TXT record containing a designated value for the domain. Put simply,</p>
<ol>
<li>A designated value is generated by LE server</li>
<li>A TXT record containing the designated value has to be present on the domain</li>
<li>LE server queries the domain's TXT records</li>
<li>LE server verifies that the contents of one of the TXT records matches the designated value</li>
</ol>
<p>If all of the above steps succeed, the validation is successful and LE issues the cert.</p>
<p>The advantage of this challenge is that it only requires provisioning a TXT record and doesn't require a server to be publicly accessible on either port 80 or 443, which fits the intranet site case perfectly.</p>
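<p>Concretely, the designated value is well defined by the ACME spec: the TXT record placed at <code>_acme-challenge.&lt;domain&gt;</code> holds the base64url-encoded SHA-256 digest of the key authorization, i.e. the challenge token joined to the account key thumbprint with a dot. A Go sketch with made-up token and thumbprint values:</p>
<pre><code class="language-go">package main

import (
	"crypto/sha256"
	"encoding/base64"
	"fmt"
)

func main() {
	// Placeholder values: the real token comes from the LE server and the
	// thumbprint is derived from your ACME account key.
	token := "evaGxfADs6pSRb2LAv9IZf17Dt3juxGJ-PCt92wr-oA"
	thumbprint := "9jg46WB3rR_AHD-EBXdN7cBkH1WOu0tA3M9fm21mqTI"

	keyAuth := token + "." + thumbprint
	digest := sha256.Sum256([]byte(keyAuth))

	// This is the value the LE server expects to find in the TXT record.
	fmt.Println(base64.RawURLEncoding.EncodeToString(digest[:]))
}
</code></pre>
<p>ACME clients compute and provision this record for you; it's shown here only to demystify the handshake.</p>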
<h2 id="implementation">Implementation</h2>
<p>How to automate the DNS Challenge and get a cert depends on your existing infra automation (you're not manually creating your infra, are you?). I use <a href="https://www.terraform.io/">Terraform</a> with the AWS provider. Below is an example that gets a cert for <strong>foo.example.com</strong> using the DNS Challenge and loads it into an ELB. Credits to the <a href="https://github.com/paybyphone/terraform-provider-acme">Terraform ACME provider</a>.</p>
<pre><code># code for acme registration and private key creation is omitted

resource &quot;acme_certificate&quot; &quot;certificate&quot; {
  server_url               = &quot;${var.acme_server_url}&quot;
  account_key_pem          = &quot;${tls_private_key.private_key.private_key_pem}&quot;
  common_name              = &quot;foo.example.com&quot;
  must_staple              = true

  dns_challenge {
    provider = &quot;route53&quot;
  }

  registration_url = &quot;${acme_registration.reg.id}&quot;
}

resource &quot;aws_iam_server_certificate&quot; &quot;your_cert&quot; {
  name_prefix       = &quot;foo-example-cert&quot;
  certificate_body  = &quot;${acme_certificate.certificate.certificate_pem}&quot;
  certificate_chain = &quot;${acme_certificate.certificate.issuer_pem}&quot;
  private_key       = &quot;${acme_certificate.certificate.private_key_pem}&quot;

  lifecycle {
    create_before_destroy = true
  }
}

resource &quot;aws_elb&quot; &quot;foo&quot; {
  # other configs for the elb are omitted

  listener {
    instance_port      = 80
    instance_protocol  = &quot;http&quot;
    lb_port            = 443
    lb_protocol        = &quot;https&quot;
    ssl_certificate_id = &quot;${aws_iam_server_certificate.your_cert.arn}&quot;
  }
}
</code></pre>
<p>Of course, this assumes that the domain <strong>foo.example.com</strong> is managed by Route 53.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Multirepo vs Monorepo]]></title><description><![CDATA[How to choose between multirepo and monorepo? What to consider when choosing one over the other?]]></description><link>https://chengl.com/multirepo-vs-monorepo/</link><guid isPermaLink="false">5f01557e38333a60592f264f</guid><category><![CDATA[monorepo]]></category><category><![CDATA[multirepo]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Thu, 20 Jul 2017 09:32:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>Here's a conversation I keep hearing recently:</p>
<blockquote>
<p>A: Let's put all our projects in one repo.<br>
B: Why?<br>
A: Because Google and Facebook do monorepo.</p>
</blockquote>
<p>Whenever I hear this, I'm very tempted to show this picture.</p>
<p><img src="https://chengl.com/content/images/2017/07/JohnFrumCrossTanna1967-1.jpg" alt="JohnFrumCrossTanna1967-1"></p>
<p>Finding the origin of this picture is left as an exercise for the readers. On a more serious note, I want to write down my thoughts on multirepo vs monorepo.</p>
<h2 id="whatismultirepo">What is multirepo?</h2>
<p>One project, one repository. Each project is an independent working unit: a mobile app, frontend app, backend service or standalone CLI app.</p>
<ul>
<li>Each project has full autonomy to manage its evolution and deployment. There should be little to no coupling between projects. If projects depend on each other, the coupling between projects is API contracts, nothing else.</li>
<li>Each project manages its dependencies on its own. Each common library lives in a repo of its own, and projects that depend on it can use whichever version of the library they deem fit. It can be argued that sharing code also introduces coupling, and it may result in a long tail of maintenance for old library versions. Anyway, <a href="https://medium.com/netflix-techblog/towards-true-continuous-integration-distributed-repositories-and-dependencies-2a2e3108c051">managing dependencies is hard.</a></li>
<li>Engineering teams are decoupled and can work on different projects in parallel without stepping on each other's toes.</li>
<li>A <a href="https://martinfowler.com/bliki/DeploymentPipeline.html">Deployment Pipeline</a> can be easily set up for each project.</li>
<li>Access control can be applied at project level.</li>
</ul>
<p>This repo structure is how most open source projects are run. And it's also probably what most developers are familiar with. Besides, it plays nicely with <a href="https://martinfowler.com/articles/microservices.html">microservices architecture</a>.</p>
<h2 id="whatismonorepo">What is monorepo?</h2>
<p>One monolithic repo that contains everything. Literally, everything.</p>
<ul>
<li>All projects (related or not) and their dependent libraries, including 3rd-party code written by neither you nor your colleagues, live in one single repo.</li>
<li>There is one and only one version of each dependency in the entire repo, which is the latest (<em>HEAD</em> in git terminology). Whenever a dependency needs to be updated, the update should be done for all projects that depend on it, making sure they all still work. So the repo should always be in a consistent state: at any commit, all projects should work.</li>
<li><a href="http://danluu.com/monorepo/">Cross-project changes are easier.</a> Large scale refactoring is easier and can be done in one single atomic commit.</li>
<li>Extensive code sharing.</li>
<li>Everyone can see all code.</li>
</ul>
<h2 id="howtochoose">How to choose?</h2>
<p>If you are in a two-man startup, close this page now and keep working on your monolith. The choice between multirepo and monorepo is irrelevant to you. This question is only relevant when your company is operating at scale, i.e. &gt;100 developers.</p>
<p>Given the distinct characteristics of multirepo and monorepo, how do you choose one over the other? I think there are two main factors to consider: tooling and culture.</p>
<h3 id="tooling">Tooling</h3>
<p>In monorepo, running builds is not as trivial as in multirepo. You probably don't want to run tests and builds for all projects since that just wastes time and computing resources unnecessarily. So the first thing to figure out is, given a change with one or more commits, which project(s) should build and which tests should run. And to figure this out, it's necessary to have a directed acyclic graph (DAG) of dependencies for all projects. When a change is submitted, it's checked against this DAG to see which projects are affected. Any affected project could break, so tests are run only for the affected projects and their transitive dependents. The good news is that Google has open sourced its build tool <a href="https://bazel.build/">bazel</a> and Facebook has something similar called <a href="https://buckbuild.com/">buck</a>. In multirepo, this problem doesn't exist because there is no need to figure out which project to build: whenever a change happens to a project, that project's <a href="https://martinfowler.com/bliki/DeploymentPipeline.html">deployment pipeline</a> is triggered.</p>
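<p>The selection step can be sketched in a few lines of Go: reverse the dependency edges, then walk the transitive dependents of whatever changed (the project names are hypothetical):</p>
<pre><code class="language-go">package main

import (
	"fmt"
	"sort"
)

// affected returns the changed project plus every transitive dependent,
// i.e. everything whose builds and tests must run.
func affected(deps map[string][]string, changed string) []string {
	// Reverse the edges: if a depends on b, a change in b affects a.
	rev := map[string][]string{}
	for proj, ds := range deps {
		for _, d := range ds {
			rev[d] = append(rev[d], proj)
		}
	}

	// Breadth-first walk over the reversed graph.
	seen := map[string]bool{changed: true}
	queue := []string{changed}
	out := []string{}
	for len(queue) != 0 {
		cur := queue[0]
		queue = queue[1:]
		out = append(out, cur)
		for _, dep := range rev[cur] {
			if !seen[dep] {
				seen[dep] = true
				queue = append(queue, dep)
			}
		}
	}
	sort.Strings(out)
	return out
}

func main() {
	// Hypothetical repo: app and api depend on libcore, web depends on api.
	deps := map[string][]string{
		"app": {"libcore"},
		"api": {"libcore"},
		"web": {"api"},
	}
	fmt.Println(affected(deps, "libcore")) // [api app libcore web]
}
</code></pre>
<p>Build tools like bazel do essentially this, just against a much richer target graph.</p>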
<p>Source code version control is another tooling challenge imposed by monorepo. It's well known that git is bad at scaling. So is mercurial. <a href="http://git.net/ml/git/2009-05/msg00051.html">Quoting Linus Torvalds</a>:</p>
<blockquote>
<p>Git fundamentally never really looks at less than the whole repo. Even if you limit things a bit (ie check out just a portion, or have the history go back just a bit), git ends up still always caring about the whole thing, and carrying the knowledge around.<br>
So git scales really badly if you force it to look at everything as one <em>huge</em> repository. I don't think that part is really fixable, although we can probably improve on it.</p>
</blockquote>
<p>Although <a href="https://schacon.github.io/git/git-read-tree.html#_sparse_checkout">sparse checkout</a> and <a href="https://schacon.github.io/git/git-clone">shallow clone</a> may alleviate the scaling problem, they are not a sustainable solution for large organizations. <a href="http://permalink.gmane.org/gmane.comp.version-control.git/189776">Anecdotal evidence</a> suggests that the practical limit of git is a 15GB <code>.git</code> directory. This is probably why <a href="https://blogs.msdn.microsoft.com/bharry/2017/02/03/scaling-git-and-some-back-story/">Microsoft invented GVFS</a>, <a href="https://code.facebook.com/posts/218678814984400/scaling-mercurial-at-facebook/">Facebook chose to patch Mercurial</a> and <a href="https://cacm.acm.org/magazines/2016/7/204032-why-google-stores-billions-of-lines-of-code-in-a-single-repository/fulltext">Google built Piper</a>. The point is, if your organization decides to go monorepo, think carefully about what version control to use.</p>
<p>Large scale refactoring in monorepo doesn't come for free. It requires <a href="http://www.hyrumwright.org/papers/icsm2013.pdf">dedicated</a> <a href="https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/41876.pdf">tooling</a> support.</p>
<p>Setting up a deployment pipeline is complicated in monorepo. In multirepo, it's straightforward to have one pipeline per project. But in monorepo, one possible way is to have a first stage that figures out the relevant projects and then triggers child pipelines for each of them. And each child pipeline may trigger other pipelines according to the dependency DAG. From what I see, the only off-the-shelf Continuous Delivery (CD) tool in the market that supports pipeline <a href="https://www.gocd.org/getting-started/part-3/#fan_out_fan_in">fan-in and fan-out</a> is <a href="https://www.gocd.org/">GoCD</a>. Other CD solutions in the market have very simple pipeline modeling. They are designed for multirepo, not monorepo. For example, there still isn't an elegant solution for <a href="https://gitlab.com/gitlab-org/gitlab-ce/issues/18157">monorepo in GitLab</a> after one year, nor in <a href="https://github.com/travis-ci/travis-ci/issues/3540">Travis CI</a>.</p>
<p>In short, for multirepo to work, open source and commercial tools in the market are most likely sufficient. But for monorepo, depending on the scale, it may require high tooling investment.</p>
<h3 id="culture">Culture</h3>
<p>Multirepo and monorepo not only have different tooling requirements, but also different engineering cultures and philosophies. Multirepo values decoupling and engineering velocity, while monorepo favours standardization and consistency. It's all trade-offs. Whichever approach a company takes is a reflection of the company's culture. Netflix favours <a href="https://www.slideshare.net/reed2001/culture-1798664/2-Netflix_CultureFreedom_Responsibility2">Freedom &amp; Responsibility</a> so it prefers multirepo. And Google values consistency and code quality so it prefers monorepo. What's important here is to pick the approach that fits your organization's engineering culture, rather than fitting your organization's engineering culture to a certain repo structure.</p>
<h2 id="conclusion">Conclusion</h2>
<p>Choosing multirepo or monorepo is not trivial. There is no single absolute right or wrong answer. Companies like Amazon and Netflix are living evidence that multirepo at large scale works. On the other hand, companies like Google and Facebook are living evidence that monorepo at large scale also works. Each approach has its own set of principles and practices to follow. Each approach also has its own challenges. Deciding between the two boils down to tooling and culture. Whichever approach an organization takes should be backed up by a list of solid reasons why one is preferred over the other in that organization, not <em>Because Google and Facebook do monorepo.</em> That's cargo cult engineering. And <a href="https://blog.bradfieldcs.com/you-are-not-google-84912cf44afb">You Are Not Google</a>.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Be wary of http/client.go]]></title><description><![CDATA[Does the Go HTTP client copy headers on redirect?]]></description><link>https://chengl.com/be-wary-of-go-http-client/</link><guid isPermaLink="false">5f01557e38333a60592f264e</guid><category><![CDATA[HTTP]]></category><category><![CDATA[Go]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sat, 25 Mar 2017 09:30:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>Recently, I ran into an interesting problem in Go. The problem can be reduced to a simple client request to an HTTP server.</p>
<p>Suppose we have an HTTP server, which serves only one rooted path <code>/foo/</code>.</p>
<pre><code class="language-go">package main

import (
	&quot;io&quot;
	&quot;log&quot;
	&quot;net/http&quot;
	&quot;net/http/httputil&quot;
)

func handleFoo(w http.ResponseWriter, req *http.Request) {
	// request details
	dump, _ := httputil.DumpRequest(req, true)
	log.Println(string(dump))

	if auth := req.Header.Get(&quot;Authorization&quot;); auth != &quot;Bearer GoodToken&quot; {
		http.Error(w, &quot;401 Unauthorized&quot;, http.StatusUnauthorized)
		return
	}

	io.WriteString(w, &quot;Hello World!&quot;)
}

func main() {
	http.HandleFunc(&quot;/foo/&quot;, handleFoo)
	log.Fatal(http.ListenAndServe(&quot;:12345&quot;, nil))
}
</code></pre>
<p><code>handleFoo</code> simply verifies that the correct token is sent in the <code>Authorization</code> header. Otherwise, it returns <code>401 Unauthorized</code>.</p>
<p>The client sends a request with the correct token to the server.</p>
<pre><code class="language-go">package main

import (
	&quot;io&quot;
	&quot;log&quot;
	&quot;net/http&quot;
	&quot;os&quot;
)

func main() {
	req, _ := http.NewRequest(http.MethodGet, &quot;http://localhost:12345/foo&quot;, nil)

	req.Header.Set(&quot;Authorization&quot;, &quot;Bearer GoodToken&quot;)

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		log.Fatal(err)
	}

	defer resp.Body.Close()
	if _, err := io.Copy(os.Stdout, resp.Body); err != nil {
		log.Fatal(err)
	}
}
</code></pre>
<p>What do you think the response is? <code>401 Unauthorized</code> or <code>Hello World!</code>?</p>
<p>The answer is <em>it depends</em>. It depends on the version of Go that the <strong>client</strong> code is running. If the client code is running on Go <strong>&lt;1.8.0</strong>, the response is <code>401 Unauthorized</code>. Otherwise, it's <code>Hello World!</code>.</p>
<p>But why?</p>
<p>It's not trivial to see at first glance. Let me walk through what happens step by step.</p>
<ol>
<li>
<p>Client sends</p>
<pre><code>     GET /foo HTTP/1.1
     Host: localhost:12345
     Authorization: Bearer GoodToken
     ...
</code></pre>
</li>
<li>
<p>Server receives the request. <code>ServeMux</code> determines that the requested path <code>/foo</code> matches the registered rooted path <code>/foo/</code> and decides to send a redirect (<a href="https://github.com/golang/go/commit/aaa0bc1043883390e052ec6f6775cbf0395dceb1">doc</a>).</p>
</li>
<li>
<p>Server responds with header</p>
<pre><code>     HTTP/1.1 301 Moved Permanently
     Location: /foo/
     ...
</code></pre>
</li>
<li>
<p>Client receives the response and follows the redirect by sending another request to the server.</p>
</li>
<li>
<p>Server receives the 2nd request and lets <code>handleFoo</code> handle it.</p>
</li>
</ol>
<p>Both <code>1.8.0</code> and versions &lt;<code>1.8.0</code> follow the same steps when processing the request. The difference lies in Step #4.</p>
<p>Prior to <code>1.8.0</code>, following a redirect in Go would <strong>NOT</strong> copy the original headers, even for the same domain. This wasn't changed until <a href="https://github.com/golang/go/commit/6e87082d41f0267b39e6a1854d655b1d1c2f7541">this commit</a>. What happened in Go &lt;<code>1.8.0</code> was that the <code>Authorization</code> header wasn't copied upon redirect. Therefore, <code>handleFoo</code> returns <code>401 Unauthorized</code>.</p>
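<p>If you're stuck on a pre-<code>1.8.0</code> client, you can restore the header yourself with <code>http.Client</code>'s <code>CheckRedirect</code> hook (<code>via</code> holds the requests already made, oldest first). A self-contained sketch of the same scenario using <code>httptest</code>:</p>
<pre><code class="language-go">package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
)

func main() {
	// Same server as above, shrunk: /foo/ requires the Authorization header.
	mux := http.NewServeMux()
	mux.HandleFunc("/foo/", func(w http.ResponseWriter, req *http.Request) {
		if req.Header.Get("Authorization") != "Bearer GoodToken" {
			http.Error(w, "401 Unauthorized", http.StatusUnauthorized)
			return
		}
		io.WriteString(w, "Hello World!")
	})
	srv := httptest.NewServer(mux)
	defer srv.Close()

	client := http.Client{
		// Called before each redirected request; via[0] is the original one.
		CheckRedirect: func(req *http.Request, via []*http.Request) error {
			req.Header.Set("Authorization", via[0].Header.Get("Authorization"))
			return nil
		},
	}

	req, err := http.NewRequest(http.MethodGet, srv.URL+"/foo", nil)
	if err != nil {
		panic(err)
	}
	req.Header.Set("Authorization", "Bearer GoodToken")

	resp, err := client.Do(req)
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	body, err := io.ReadAll(resp.Body)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(body)) // Hello World!
}
</code></pre>
<p>Be careful with a blanket hook like this in real code: unlike the built-in behaviour added in Go 1.8, it re-sends the credential on every redirect, including redirects to other hosts.</p>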
<p>Initially, I was quite surprised by the behaviour. After reading through <a href="https://github.com/golang/go/issues/4800">this issue</a> and the <a href="https://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html#sec10.3">HTTP spec</a>, I realized that <code>http/client.go</code> didn't actually do anything wrong, because the HTTP spec doesn't say that a client following a redirect should copy the original headers. It's just that the well-known HTTP clients, e.g. <a href="https://linux.die.net/man/1/curl">curl</a>, <a href="https://github.com/jakubroztocil/httpie">httpie</a>, <a href="https://github.com/rest-client/rest-client">rest-client</a>, etc., have established that convention.</p>
<p>Be wary of <code>http/client.go</code>.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Keeping configurations sane for multiple projects on Google Container Engine]]></title><description><![CDATA[<!--kg-card-begin: markdown--><p>In my <a href="https://chengl.com/kubectl-authentication-made-simple/">previous post</a>, I presented the easiest and most secure way to get <code>kubectl</code> working for one project. But what about multiple projects? Juggling multiple projects on Google Container Engine (GKE) can be hard, especially when its configurations are admittedly <a href="https://github.com/kubernetes/kubernetes/issues/20605">quirky</a>. This post describes the best practice, in my</p>]]></description><link>https://chengl.com/working-with-multiple-projects-on-gke/</link><guid isPermaLink="false">5f01557e38333a60592f264d</guid><category><![CDATA[Kubernetes]]></category><category><![CDATA[GKE]]></category><category><![CDATA[Docker]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sat, 18 Feb 2017 09:27:00 GMT</pubDate><media:content url="https://chengl.com/content/images/2017/07/gke.png" medium="image"/><content:encoded><![CDATA[<!--kg-card-begin: markdown--><img src="https://chengl.com/content/images/2017/07/gke.png" alt="Keeping configurations sane for multiple projects on Google Container Engine"><p>In my <a href="https://chengl.com/kubectl-authentication-made-simple/">previous post</a>, I presented the easiest and most secure way to get <code>kubectl</code> working for one project. But what about multiple projects? Juggling multiple projects on Google Container Engine (GKE) can be hard, especially when its configurations are admittedly <a href="https://github.com/kubernetes/kubernetes/issues/20605">quirky</a>. This post describes the best practice, in my opinion, to keep configurations sane and easy to switch.</p>
<h2 id="problem">Problem</h2>
<p>Suppose you have an awesome app that runs on GKE. You probably want two different environments, <code>staging</code> and <code>production</code>, and the environments should be completely isolated. So you create two projects on GKE, <code>awesome-app-staging</code> and <code>awesome-app-production</code>, and provision resources for each. Now the question is how to effectively switch between the two projects on the command line without repeating <a href="https://github.com/kubernetes/kubernetes/issues/20605#issuecomment-218322105">these commands</a> over and over again.</p>
<h2 id="solution">Solution</h2>
<p>Assuming <code>gcloud</code> and <code>kubectl</code> are installed, but not configured,</p>
<h4 id="1createaconfigurationforeachproject">1. Create a configuration for each project</h4>
<p>Create an empty configuration. Don't use <code>default</code>.</p>
<pre><code class="language-bash">gcloud config configurations create awesome-app-staging
</code></pre>
<p><a href="https://cloud.google.com/sdk/gcloud/reference/auth/activate-service-account">Activate service account</a></p>
<pre><code class="language-bash">gcloud auth activate-service-account --key-file /path/to/your/key.json
</code></pre>
<p>Set project</p>
<pre><code class="language-bash">gcloud config set project awesome-app-staging
</code></pre>
<p>It's good to set <code>DEFAULT_ZONE</code> and <code>DEFAULT_REGION</code> too.</p>
<pre><code class="language-bash">gcloud config set compute/region ${REGION}
gcloud config set compute/zone ${ZONE}
</code></pre>
<p>Verify that your newly-created configuration has correct values</p>
<pre><code class="language-bash">gcloud config configurations describe awesome-app-staging
</code></pre>
<p><code>gcloud</code> is ready.</p>
<p>Get <code>kubectl</code> ready by getting GKE credentials for the project</p>
<pre><code class="language-bash">gcloud container clusters get-credentials ${CLUSTER} --zone ${ZONE} --project awesome-app-staging  
</code></pre>
<p>This will insert auth data and project info in <code>~/.kube/config</code>. Verify your context is correct</p>
<pre><code class="language-bash">kubectl config current-context
</code></pre>
<p>It should return a string which consists of project, zone and cluster.</p>
<p>Repeat the above process for each project.</p>
<h4 id="2switchprojects">2. Switch projects</h4>
<p>Once configurations are created for all projects, switching is easy.</p>
<p>List all contexts</p>
<pre><code class="language-bash">kubectl config get-contexts
</code></pre>
<p>Switch to a context</p>
<pre><code class="language-bash">kubectl config use-context ${CONTEXT}
</code></pre>
<p>See current context</p>
<pre><code class="language-bash">kubectl config current-context
</code></pre>
<p>Please note that switching context in <code>kubectl</code> does <em>NOT</em> automatically switch the corresponding <code>gcloud</code> configuration. This means that unless you instruct <code>gcloud</code> and <code>kubectl</code> to work on the same project, they can work on completely different projects. Therefore, as a good practice, remember to switch the <code>gcloud</code> configuration whenever you switch the <code>kubectl</code> context and vice versa, unless you know what you're doing.</p>
<p>Activate configuration</p>
<pre><code class="language-bash">gcloud config configurations activate ${CONFIGURATION}
</code></pre>
<h2 id="summary">Summary</h2>
<p>This is a very simple and elegant solution to manage multiple projects on GKE. If you have better ideas, please let me know.</p>
<p>Happy switching!</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[kubectl Authentication Made Simple]]></title><description><![CDATA[Simplest way to get kubectl working]]></description><link>https://chengl.com/kubectl-authentication-made-simple/</link><guid isPermaLink="false">5f01557e38333a60592f264c</guid><category><![CDATA[Kubernetes]]></category><category><![CDATA[kubectl]]></category><category><![CDATA[Continuous Delivery]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Mon, 30 Jan 2017 09:25:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>While working on a continuous delivery pipeline to automate deployment to Google Container Engine (GKE), I found that getting <code>kubectl</code> to work is very <a href="https://kubernetes.io/docs/user-guide/sharing-clusters/#manually-generating-kubeconfig">complex</a> <a href="http://stackoverflow.com/questions/40426071/kubectl-access-to-google-cloud-container-engins-fails">and</a> <a href="http://stackoverflow.com/questions/40408321/whats-the-cli-authentication-process-as-of-google-container-engine-kubernetes-1">convoluted</a>, especially when it needs to be <a href="https://circleci.com/docs/continuous-deployment-with-google-container-engine/">noninteractive</a>. So I want to find out the easiest way to get <code>kubectl</code> working noninteractively.</p>
<p>Assuming that <code>gcloud</code> and <code>kubectl</code> are already installed but not necessarily set up, <strong>ONLY</strong> two commands are needed to get <code>kubectl</code> working noninteractively (verified with Google Cloud SDK 141.0.0 and kubectl 1.5.2)</p>
<pre><code class="language-bash">gcloud auth activate-service-account --key-file ${PATH_TO_KEY}

gcloud container clusters get-credentials ${CLUSTER} --zone ${ZONE} --project ${PROJECT}
</code></pre>
<p>The first command <a href="https://cloud.google.com/sdk/gcloud/reference/auth/activate-service-account">gcloud auth activate-service-account</a> is to authorize access to Google Cloud Platform using a <a href="https://cloud.google.com/compute/docs/access/service-accounts">service account</a>. <code>PATH_TO_KEY</code> is the path to the private key of the service account. The idea is very similar to IAM in AWS. One service account is roughly equivalent to an IAM group in AWS. And the private key of the service account is like Access key ID and Secret access key. You can create a service account and generate its private key <a href="https://console.cloud.google.com/iam-admin/serviceaccounts">here</a>. If you only need to deploy to GKE, <code>Container Engine Developer</code> is enough for the role.</p>
<p>The second command <a href="https://cloud.google.com/sdk/gcloud/reference/container/clusters/get-credentials">gcloud container clusters get-credentials</a> fetches cluster credentials and saves it in <code>~/.kube/config</code>. The environment variables in the command are self-explanatory.</p>
<p>You can now use <code>kubectl</code> to deploy to GKE. Probably this?</p>
<pre><code class="language-bash">kubectl set image deployment/${DEPLOYMENT} ${CONTAINER_NAME}=${IMAGE}:${IMAGE_VERSION}
</code></pre>
<p>In summary, this solution is simple, noninteractive (great for CI/CD) and secure (fine-grained permissions defined by service account). If you have a better solution, please let me know.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Speed Up SSH]]></title><description><![CDATA[Speed up SSH connection with simple config]]></description><link>https://chengl.com/speed-up-ssh/</link><guid isPermaLink="false">5f01557e38333a60592f264b</guid><category><![CDATA[ssh]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sun, 04 Dec 2016 09:23:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>Recently I worked on a project where executing remote commands was very slow to start. This is expected because an SSH connection has to be established first.</p>
<pre><code class="language-bash">$ time ssh server exit
ssh server exit  0.04s user 0.01s system 0% cpu 7.901 total
</code></pre>
<p>It took nearly <strong>8</strong> seconds to ssh into <code>server</code>.</p>
<p>That's slow!</p>
<p>After a bit of googling, there is an elegant way to speed this up. Simply put this at the bottom of your <code>~/.ssh/config</code>.</p>
<pre><code>Host *
  Compression yes
  ControlMaster auto
  ControlPath ~/.ssh/sockets/%r@%h:%p
  ControlPersist 4h
  ServerAliveInterval 60
</code></pre>
<p>What this does is <em>try</em> to share a master connection, via a socket, among multiple ssh sessions. It falls back to creating a new connection if one does not already exist. The socket file is defined by <code>ControlPath</code>. <code>ControlPersist 4h</code> keeps the master connection in the background for four hours after the client connection is closed. <code>ServerAliveInterval 60</code> makes the client send a keep-alive message to the server if no data has been received for 60 seconds.</p>
<p>Create sockets directory</p>
<pre><code class="language-bash">$ mkdir ~/.ssh/sockets
</code></pre>
<p>Running <code>time ssh server exit</code> again won't show any difference. But look into <code>~/.ssh/sockets</code>: a socket file has been created.</p>
<pre><code class="language-bash">srw-------  1 user group    0 Dec  4 22:12 user@***.***.***.***:22
</code></pre>
<p>Try again</p>
<pre><code class="language-bash">$ time ssh server exit
ssh server exit  0.01s user 0.00s system 40% cpu 0.027 total
</code></pre>
<p>Now it takes less than <strong>0.5%</strong> of the original <strong>8</strong> seconds (0.027s vs 7.901s).</p>
<p>What's interesting is that this config speeds up not only plain SSH connections, but also anything built on SSH, such as <code>git</code> when you authenticate over SSH.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Ruby and Java Stack Level]]></title><description><![CDATA[What limits Ruby and Java's stack level? And how to change the limits?]]></description><link>https://chengl.com/ruby-and-java-stack-level/</link><guid isPermaLink="false">5f01557e38333a60592f264a</guid><category><![CDATA[Ruby]]></category><category><![CDATA[Java]]></category><category><![CDATA[recursion]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sat, 05 Nov 2016 09:17:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>While coding for an algorithmic problem, I discovered that Ruby's stack is much shallower than Java's. A recursive DFS solution written in Ruby failed with <code>stack level too deep (SystemStackError)</code>, while the same code written in Java passed. Whether recursion or tail recursion should be used is not the point of this post. The point is to find out what the max stack level is and what limits it.</p>
<h2 id="ruby">Ruby</h2>
<p>Consider the following code</p>
<pre><code class="language-ruby">def recurse(n)
  return 1 if n == 1
  1 + recurse(n-1)
end

def binary_search
  answer, a, b = 0, 0, 1_000_000_000

  while a&lt;=b
    mid = (a+b)/2
    begin
      recurse(mid)
      answer = mid
      a = mid + 1
    rescue SystemStackError
      b = mid - 1
    end
  end

  answer
end

puts &quot;Max Stack Level: #{binary_search}&quot;
</code></pre>
<p>What do you think <code>Max Stack Level</code> is?</p>
<p>Running the code reports that the <code>Max Stack Level</code> is <strong>10080</strong> on my MBP. This is because the stack size of the default Ruby 2.3.1 VM (MRI) is limited to <strong>1MB</strong>.</p>
<p>This is very limiting for even a medium-sized dataset. To increase it, one can <a href="http://magazine.rubyist.net/?Ruby200SpecialEn-note#l16">specify the VM stack size</a> for Ruby &gt;= 2.0.0. To check the current defaults, open <code>irb</code></p>
<pre><code class="language-bash">irb(main):001:0&gt; RubyVM::DEFAULT_PARAMS
{
         :thread_vm_stack_size =&gt; 1048576,
    :thread_machine_stack_size =&gt; 1048576,
          :fiber_vm_stack_size =&gt; 131072,
     :fiber_machine_stack_size =&gt; 524288
}
</code></pre>
<p>Let's set <code>RUBY_THREAD_VM_STACK_SIZE</code> to 2MB</p>
<pre><code class="language-bash">export RUBY_THREAD_VM_STACK_SIZE=2097152
</code></pre>
<p>Running the same code again returns <code>Max Stack Level: 20162</code>, which is roughly 2 * <strong>10080</strong>. Setting <code>RUBY_THREAD_VM_STACK_SIZE</code> to 3MB increases <code>Max Stack Level</code> to <strong>30243</strong>. This shows that <code>Max Stack Level</code> is linearly proportional to <code>RUBY_THREAD_VM_STACK_SIZE</code> in Ruby.</p>
<h2 id="java">Java</h2>
<pre><code class="language-java">public class Solution {
    public static void main(String[] args) {
        System.out.println(&quot;Max Stack Level: &quot; + binarySearch());
    }

    private static long binarySearch() {
        long a = 0, b = Long.MAX_VALUE, answer = 0;
        while (a &lt;= b) {
            long mid = (a + b) / 2;
            try {
                recurse(mid);
                answer = mid;
                a = mid + 1;
            } catch (StackOverflowError e) {
                b = mid - 1;
            }
        }
        return answer;
    }

    private static long recurse(long n) {
        if (n == 1) return 1;
        return 1 + recurse(n - 1);
    }
}
</code></pre>
<p>Running this code with</p>
<pre><code class="language-bash">java Solution
</code></pre>
<p>returns <code>Max Stack Level: 49150</code>. This is because the thread stack size on my MBP defaults to <strong>1MB</strong>. The defaults are platform-specific, see <a href="http://docs.oracle.com/cd/E13150_01/jrockit_jvm/jrockit/jrdocs/refman/optionX.html#wp1024112">here</a>.</p>
<p>The JVM has an option <a href="https://docs.oracle.com/cd/E13150_01/jrockit_jvm/jrockit/jrdocs/refman/optionX.html#wp999540">-Xss</a> for setting the thread stack size. Running the code with a thread stack size of 2MB</p>
<pre><code class="language-bash">java -Xss2M Solution
</code></pre>
<p>returns <code>Max Stack Level: 98302</code>, which roughly doubles the stack level. Running it with 4MB gives <strong>196606</strong>, which matches the pattern. Therefore, in Java, <code>Max Stack Level</code> is linearly proportional to the stack size set via <code>-Xss</code>.</p>
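<p>A back-of-the-envelope calculation using the measurements from this post shows why Java recurses so much deeper than Ruby on the same 1MB stack: its frames are far smaller. The exact per-frame sizes are machine- and version-specific; these are just the averages implied by my runs.</p>
<pre><code class="language-bash"># Average stack bytes consumed per recursive call, implied by the numbers above
$ echo $((1048576 / 10080))   # Ruby: ~104 bytes per frame
104
$ echo $((1048576 / 49150))   # Java: ~21 bytes per frame
21
</code></pre>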
<h2 id="conclusion">Conclusion</h2>
<p>Both Ruby and Java have a clearly defined max stack size, which limits how deeply code can recurse. In Ruby, it's set by the environment variable <code>RUBY_THREAD_VM_STACK_SIZE</code>; in Java, by the JVM option <code>-Xss</code>. A clear observation is that, given the same max stack size and the same code, Java can recurse much deeper than Ruby. This is not a surprise, because a Java stack frame is much smaller than a Ruby MRI frame: Java is compiled to compact JVM bytecode, while Ruby MRI interprets the code and keeps much more state per call.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Docker Workflow]]></title><description><![CDATA[What's the workflow to develop Docker image on local machine, build and test it on CI, and deploy to production?]]></description><link>https://chengl.com/docker-workflow/</link><guid isPermaLink="false">5f01557e38333a60592f2649</guid><category><![CDATA[Docker]]></category><category><![CDATA[Workflow]]></category><category><![CDATA[Continuous Integration]]></category><category><![CDATA[Continuous Delivery]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Wed, 25 May 2016 09:07:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>In my previous posts, I demoed how to orchestrate Docker with <a href="https://chengl.com/orchestrating-docker-using-swarm/">Swarm</a> and <a href="https://chengl.com/orchestrating-docker-with-kubernetes/">Kubernetes</a>. They all assume the Docker image <a href="https://hub.docker.com/r/chenglong/simple-node/">chenglong/simple-node</a> is already there and ready to be deployed. But how to develop that image in the first place? How to streamline and automate the process of developing it on local machine, building and testing it on Continuous Integration (CI) server, and finally deploying to production?</p>
<p>This post introduces one possible Docker workflow.</p>
<p>I recommend reading <a href="https://chengl.com/orchestrating-docker-with-kubernetes/">my previous post</a> before this to understand the app architecture and Kubernetes.</p>
<h3 id="workflow">Workflow</h3>
<p><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-05-25-at-1-35-53-PM.png" alt="Screen-Shot-2016-05-25-at-1-35-53-PM"></p>
<p>The flow is summarized as follows:</p>
<ol>
<li>Code on local machine. Using Docker for local development is highly recommended. Find out more from my post <a href="https://chengl.com/using-docker-for-rails-development/">Using Docker for Rails Development</a></li>
<li>Push commits to Github</li>
<li>Travis is notified, builds a new Docker image, and runs tests against the newly-built image.</li>
<li>If all goes well, Travis pushes the new image to Docker Hub.</li>
<li>Travis deploys the new image to Google Container Engine (GKE). Prerequisite: you need to set up a container cluster on GKE first and get the app up and running. Find out more from my post <a href="https://chengl.com/orchestrating-docker-with-kubernetes/">Orchestrating Docker with Kubernetes</a>.</li>
<li>Kubernetes starts <a href="http://kubernetes.io/docs/user-guide/rolling-updates/">rolling update</a>.</li>
</ol>
<p>This flow works pretty well, except that some hacks are necessary for Travis to deploy to GKE. I will explain the hacks and why they are necessary in detail in a later section.</p>
<p>Please note that the tools used in this flow can be easily replaced with their respective equivalents. For example, Github can be replaced with Gitlab, Bitbucket or whatever you like. Travis can be replaced by <a href="https://circleci.com/">CircleCI</a>, <a href="https://app.shippable.com/">Shippable</a>, or any CI that <strong>supports Docker</strong>. Docker Hub can be replaced by your private registry. GKE can be replaced by <a href="https://aws.amazon.com/ecs/">Amazon EC2 Container Service</a> or any <a href="https://blog.docker.com/2016/02/containers-as-a-service-caas/">Containers as a Service (CaaS)</a>.</p>
<p>All source code can be found in <a href="https://github.com/ChengLong/docker-nodejs-redis">my repo</a>.</p>
<h3 id="thedevilisinthedetail">The devil is in the detail</h3>
<pre><code>sudo: required
language: node_js
node_js:
  - '5'

services:
  - docker

env:
  global:
    - secure: AoSvVfpX77AtMBpXKyHH67wTKWCN9xdXoMWroU2TwewBRToebQmLUesT+6gP2rCVoQ282IpESkZkIUumj38rvCGWpL3fLU67Fb1Fa/SqKQ++OYri3NNmoOLltkHMCHHz2bl7B8/72KJQ8e+sl8KCmKBTaLp1g2/36+DZwL9KZjFOqsQg2pwv3zjOUkZte6v6Igsl7lV1pbLkg9Fq1KEvOZl8D6bHgtNuHJGYZZrHCUDHMXxXeQ8/wL0v7GpUkZAe85Ve+fkPoX7AXYumbi5SsxbwVYG64j4zdU2ydlhmMRfOrT4BIbOHUO1kmP5uSf4/xX+fGJZuGDlVT1P5RTZyvu1dA+X10a0t2pOYHAjTjJp6CNRIKuJ4izBvxCZaLAUd42FfU9BAFyUPlViaDzNZAW3DCoapfnS0xM+UY1U8Z7bs+lIXycqIE6OR4KZXfKtpoKakmh8d0eY8LpUNu81VS9Z2mxfRg0xsYhp/1E/X+LZM/YCzPpOhQCc0ZGJEtbqj1/Qebp/GRBIx9oabBH+JxldqrZApIF0MsGiaIT8Q0LJ8wNVVc7QQqhjwdtvP1Wh2pxOvnzEyAdB4AeKrYv0SMwa3OxpIDtJEP1AvXRTZWXPObSSgAhK2VPnOK9Y++N2BlSKJh9k1nlnzn0FQsHlPUV/r9Ui62tmQbzOyL69xltQ=
    - secure: KLweub+lnStvWmKY/XoDy8nXo4v5lYTHapXtewfDFrnWPL6UQV//kklUAswzGEWp4id9pOolZB4RNfucCBzB1Kb1KgSp23wz9mj+HWY+qUQRsLyIxYwsZf/o9H/MMkesG8jOF/vU35Ne3e7MdSmpNuqEqErIqc25691rBMLSDxi5YS3jcB+vGkv38xtlcDilnUrSneKuNEzuI5Q7MKr6vqIVaW3vbwEs5hEh2oPSAgVno67s9NyfUi2Dl4sIDDwzV+iXc9FNk4Ww6Vkm8uzZvIXhZDuyVnFWSJhgfNgElUna7XfqnWOyPdZQ0rh2iOtKzO9uZQiOoNq8JuT9VstFcFlhsfAtlynB5Xgp/EH9NUIZvUiueRhs0bOH9SKRUijIeTQ/OimuX0y8EZdz7TKSigZkc7iVqD0g2E2kLtv3avTCodwts54V8bX7//r8Y2FE7DFkZRGL7Khf5LPjRj7xVEb9OhEkuSYfs0oEhqyXCSMUNyov3BDFbZ/AWh0smQmKYv36U2JvMfPghNlgAE+b/Xc4F/sJMmNSDdLBmGwoJDJtVA5iB3RCfkZfEeqhMoj2C8uu8s8IASzxh2HnaaO5IQKy7kGwAzzRfazTq6JhvLPfteOKStvRYrLLBll7l2DPazLJK8ctKZ6CTFxoycU+R73IliwwpqLCAhasD9N7Q8w=
    - secure: htu5A6fjnQvJTvinumWI1u1unoDMgY5FleGM8JwVLBQl6Srr1GU/2ulii1LSbypG/JimwMzT6BIRMfNgCzAeBc6aWM8xbZUsLl7p+WFyYNxOW26mVm8Z5R+2JZXm3vZoevjHD35gWIBMReqVySIRLXanZ9SOVRC1IcX2Om1uOoQHAqwrFh4KERWnzepAXylfUtFatqmRmCRczH4m9MKF7OgbZD/7xKHHoJxXLHjmWaCQrLmucb01/e/vl3C/LwOvc+A6fwXhhMFdL5UUn9iVWLBhTYknULjOKDF2AgvahvStxGUEAcscaW4qATW32Ylamn3K4l5X+3ic+fLKQHOR+oCaVmaqcpExbxuBfLHGQkV7DiUePQsS1D5NLfAHBdbd8MJCwxQRlSIt/z/aemkx6uWEBu053c47yuuKltFhAm6B7Z52UkHC8fpFfuy03xgorjNTayghPlYDeFZRzItkibTEY1EH4Sh9yVkAIhv5hqomMQMAkxaEFgmBkK7/8+UHB+sMlBupMiejTqIgZOzIhj1uqDGkokqI2UqlNx1wuqzGJ/KnQWzsgFsnoU0Q1zIzw0i1RAiDu8qj5vSnioqw8pvZWuBDuRltE5U4fup529ZtFSNx4Ceb3ECathc6+3Em8OvtqvBOr44xNBW8XCFbi82Ys3Dd/SEHGwlcwYoT4/A=
    - secure: W1CzoMk6zUyhKhx+Q4mX/O1rh7GKFTrMNpkJ+QqN4WN56K3v49t6kotABXbFU4K8ZP84DhUkQYtm6DbEgCkxeNaT0k0gzKyG+CZo5sI6yqx15CwmR/qCkFruDQqUG/2hZNTP5ZP/7hMQwNCxATxuzu2GOcnwk1YuIJsJ5by9+RdWwKBYYhSG/BokNq6JaYxZx8NeMTa5W/CJXtu/5qFsMDQ07Syg4MgAnRHbRB9uCxpUOtJ44HDhZFOOY48qIDVpYhVpTg0XVzw5VkFNgOuczIYWw/SaWyOyoKgt8FtLhUPDsHtvueFqSJffYlyoh81Uxu+LNAtZoact5wFOwkDM+mzqRe/lzm2hcYY0nc4Dxc1ito43GedQmFnadZSozZfKAEmRV0m+1W6iE6fi+DpHXm3JCmgvolB9khLiYVTHkgaSzbfZDfdBmoXPEcaDuKnY6KTB410sGnadO6IT3Z5x8FtMQ/In/dB7XQr/G1aXve3KjUz8/oLZbS+IWEH8eQEOAR8V6w6HpvQIZE6ccwEzmLILlygne7TSAnOPQX1hiEDdUYYOtoyVEv/NUzd/OpGqNl6edYsXZB1bcY7FWy2YhmJD4mqe5h6mp4yRXYyuFbdC3OIBGpr1vnNaF9uUdayZ59pZ4kIojCQL2SKZKqOYdezyV/h0ohP93FIboi/nL7Q=
    - secure: jxY+/q6svho30zk1RMtnwHhHkYMw+NqQS4n7dOILpsI6zKwC4n2c+xjD0MoOYjVsd/SiJucXeeNR4BcS4+OglsVxja68rBWqV4F3xloGwtMZebqMhNGSV4kitgAkdDNktQid2fQAwUsoCtEfq3A6ijMdOgoehZwxXqFg5+5cRwjI58vVOfdFDGPUO6KXTDhwwXIgTi9zN8nXSNCgHUZfrIGw6jTdFvL2NGkejbX4AqutethTlXlA7lVeE/O1SUW4H8MRl59MyKUwWm/j7qkmM/TX+PUaXSwXgoR0Nzbt2DKTbY56qff1olHaUMuV5c8A5gYlThvVYbMnGRTK7JYOzmchyvRsqc1inxIwU7bi/J1DnQpCX4qF06NBU+OLbx8+qDc9E6l9XGdQ09HFVPOctlNrkfcPDgGVfRYwomU8+gb8MykgBxtUqLIpJN0pr9sNZN4L4jGuv58ThkCfoaH6YgiLKTITJ7cLAE2tNmHj5Kw+vHNvJcUltRdMOSaDI6TyAGkVpUSHaTEOvdvsj5y5wFS3TdIHnTQOWMpBiYSHY7D4trvv6KRagDjoB7DKaAohYPwataJdu3Wu+A0UIsukYPG6uVn4RPJmQHBMT8vTtf9q9eSpoRU68zSnhAL5aXtq1vqtfy4859QQDzpH7JMA87WtJqCQ4Fp8MHOg3uvb+ZM=
    - secure: FXEj1gwSi+xjpc4GFFS4ehxy6Q4wpmmPq/rlfWijOAYsNwukfkL2gCmnzjTT6ZZVHM47bNtHlTdqTM8bmBjWVeVEPKwvULq6i2neiGI7OI0vh2dsbCMIZ7bCMuRL0/WRq/A8YPsXIK76YIlMg8sSzdGdAmvZpjTbLywSl9KoSlH3S56e2Nz1kb8DhP4emeFcxdXul463JUdOBpCZa8oJLopOaX5+PypOk6XOjOyWkxN938F4w5/4MzhgghcrTRwtOQCPHyJR2zg08MWm5COKi3L2EmgPyctXGGr31YVeRTC9Gg3SJ+Mt8OfsX8f7TnUzhnfA3IszJpxw4cF1snpVmXZED2J1Aa6Gi7mSIhb1CdYNUxUKx6BVnaUGDRfmcxtGOt8Uy0A2sRl3FGhVVH6xQaaTmG3nzU0fm44cO1RwxqXfP4kXqUNj+LlCUC+3hIRrWRMfxjIWwGA0LZIqnVlou2TRLkxdM3IHx46cFqEfpD6Do/NXOCUZYRcE95FqvK6ZrobsZapkI0WkmoO4YdzCkuW9nSc8Y121OWlP9urkzaU4X7tO7FbLshKmjYNL4vLqyJ0gkN21yVJTIJWE6l/TGGi+RxwFF2jzN8oRjOJrWESmUSPa5WhNvuIOu1t0RObwRup0yjArRuUIXxuQt99c1tkfsn52fZWZih3t4bM0bxE=
    - CLOUDSDK_CORE_DISABLE_PROMPTS=1
    - COMMIT=${TRAVIS_COMMIT::8}
    - IMAGE=chenglong/simple-node
    - IMAGE_VERSION=$IMAGE:$COMMIT
    - DEPLOYMENT=frontend
    - CONTAINER_NAME=nodejs

before_script:
  - docker build -t $IMAGE:$COMMIT app
  - docker tag $IMAGE:$COMMIT $IMAGE:latest
  - docker tag $IMAGE:$COMMIT $IMAGE:travis-$TRAVIS_BUILD_NUMBER

script:
  - docker version
  - docker-compose version
  - docker-compose -f docker-compose.ci.yml run test

after_success:
  - docker login -e=&quot;$DOCKER_EMAIL&quot; -u=&quot;$DOCKER_USERNAME&quot; -p=&quot;$DOCKER_PASSWORD&quot;
  - docker push $IMAGE
  - curl https://sdk.cloud.google.com | bash
  - source /home/travis/.bashrc
  - gcloud components update kubectl
  - kubectl config set-credentials default --username=${GKE_USERNAME} --password=${GKE_PASSWORD}
  - kubectl config set-cluster default --server=${GKE_SERVER} --insecure-skip-tls-verify=true
  - kubectl config set-context default --cluster=default --user=default
  - kubectl config use-context default
  - kubectl patch deployment $DEPLOYMENT -p '{&quot;spec&quot;:{&quot;template&quot;:{&quot;spec&quot;:{&quot;containers&quot;:[{&quot;name&quot;:&quot;'&quot;$CONTAINER_NAME&quot;'&quot;,&quot;image&quot;:&quot;'&quot;$IMAGE_VERSION&quot;'&quot;}]}}}}'
</code></pre>
<p>This is the <code>.travis.yml</code> that instructs Travis to</p>
<ul>
<li>build the Docker image <code>chenglong/simple-node</code> with 3 tags: <code>latest</code>, <code>${TRAVIS_COMMIT::8}</code> and <code>travis-$TRAVIS_BUILD_NUMBER</code></li>
<li>run tests against the newly-built image <code>chenglong/simple-node:latest</code>. Note that the tests are run in a container named <code>test</code>. The tests basically verify that the app gives the expected response. See the test code <a href="https://github.com/ChengLong/docker-nodejs-redis/blob/master/test/test.js">here</a>.</li>
<li>push the image to <a href="https://hub.docker.com/r/chenglong/simple-node/tags/">Docker Hub</a> if tests pass</li>
<li>install the latest Google Cloud SDK because the one on Travis is <a href="https://github.com/travis-ci/travis-ci/issues/5530">too old</a></li>
<li>install <code>kubectl</code></li>
<li>config <code>kubectl</code> to connect to a pre-existing cluster on GKE</li>
<li>rolling update deployment <code>frontend</code> with the newly-built image <code>chenglong/simple-node</code></li>
</ul>
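<p>The tagging scheme can be reproduced with plain shell. The commit and build number below are hypothetical stand-ins for the values Travis injects via <code>TRAVIS_COMMIT</code> and <code>TRAVIS_BUILD_NUMBER</code>:</p>
<pre><code class="language-bash"># hypothetical values standing in for Travis's environment
TRAVIS_COMMIT=0123456789abcdef
TRAVIS_BUILD_NUMBER=42
IMAGE=chenglong/simple-node
COMMIT=$(echo $TRAVIS_COMMIT | cut -c1-8)   # same result as bash's ${TRAVIS_COMMIT::8}
echo $IMAGE:$COMMIT $IMAGE:latest $IMAGE:travis-$TRAVIS_BUILD_NUMBER
# chenglong/simple-node:01234567 chenglong/simple-node:latest chenglong/simple-node:travis-42
</code></pre>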
<p>A few key points:</p>
<ul>
<li><code>chenglong/simple-node</code> is <em>ONLY</em> built once. It's the same image that is tested against and deployed to GKE. See the logs for a build <a href="https://travis-ci.org/ChengLong/docker-nodejs-redis/builds/132763728">here</a>.</li>
<li>Tests are run in a container named <code>test</code>, which simply starts <code>app</code> and <code>redis</code>, fires a few requests and verifies the response. To run the tests, I used <code>docker-compose -f docker-compose.ci.yml run test</code>. See <a href="https://github.com/ChengLong/docker-nodejs-redis/blob/master/docker-compose.ci.yml">docker-compose.ci.yml</a></li>
<li>At the time of writing, Travis CI only has <code>docker-compose</code> <a href="https://docs.travis-ci.com/user/docker/">1.4.2</a>, which <em>does not</em> support compose file <a href="https://docs.docker.com/compose/compose-file/#upgrading">version 2</a>. This requires the compose file to be in version 1, unless you want to install <code>docker-compose</code> 1.6+.</li>
<li>Travis doesn't support deploying to GKE yet. The hack is to install the latest Google Cloud SDK and <code>kubectl</code> manually, so that I can run <code>kubectl patch deployment</code>.</li>
</ul>
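<p>For reference, a version 1 compose file for the CI run looks roughly like this sketch. The service names match the post; the build path is an assumption, so check the linked <code>docker-compose.ci.yml</code> for the real definition:</p>
<pre><code># docker-compose v1 format: services at the top level, no version key
test:
  build: ./test        # assumed location of the test container's Dockerfile
  links:
    - app
app:
  image: chenglong/simple-node:latest
  links:
    - redis
redis:
  image: redis
</code></pre>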
<h3 id="conclusion">Conclusion</h3>
<p>I hope this simple demo gives you a better idea of how Docker can help in your day to day work. A few obvious advantages of this Docker workflow:</p>
<ul>
<li>It significantly improves how we ship software. The artifact from CI is no longer a tar ball, jar, war, or ear file; it's simply a Docker image. A Docker image has not only the source code baked in, but also a complete runtime environment. Therefore, it removes the need to provision a server before deploying source code. Do you see Chef, Puppet or Ansible in the demo? In the age of containers, all you need is a server that runs Docker.</li>
<li>Since the same image that is tested in CI gets deployed to production (or staging), it's guaranteed to behave in exactly the same way.</li>
<li>By containerizing the app, each component of the app is run as a container. And each container can be individually updated and deployed. Does this ring a bell? If not, read <a href="http://martinfowler.com/microservices/">Microservices</a>.</li>
</ul>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Orchestrating Docker with Kubernetes]]></title><description><![CDATA[How to orchestrate Docker with Kubernetes?]]></description><link>https://chengl.com/orchestrating-docker-with-kubernetes/</link><guid isPermaLink="false">5f01557e38333a60592f2648</guid><category><![CDATA[Docker]]></category><category><![CDATA[Orchestration]]></category><category><![CDATA[Kubernetes]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sun, 15 May 2016 09:04:00 GMT</pubDate><media:content url="https://chengl.com/content/images/2017/07/Kubernetes_logo.jpg" medium="image"/><content:encoded><![CDATA[<!--kg-card-begin: markdown--><img src="https://chengl.com/content/images/2017/07/Kubernetes_logo.jpg" alt="Orchestrating Docker with Kubernetes"><p>This is my second post on Docker orchestration. In <a href="https://chengl.com/orchestrating-docker-using-swarm/">the first post</a>, I demonstrated orchestrating Docker with Swarm, Machine, Compose and Consul. This post is to demonstrate orchestrating the same app with Kubernetes and draw comparisons between them. I recommend reading <a href="https://chengl.com/orchestrating-docker-using-swarm/">that post</a> before this.</p>
<h3 id="introductiontokubernetes">Introduction to Kubernetes</h3>
<blockquote>
<p>Kubernetes is an open-source system for automating deployment, operations, and scaling of containerized applications.</p>
</blockquote>
<blockquote>
<p>It groups containers that make up an application into logical units for easy management and discovery. Kubernetes builds upon <a href="http://queue.acm.org/detail.cfm?id=2898444">a decade and a half of experience of running production workloads at Google</a>, combined with best-of-breed ideas and practices from the community.</p>
</blockquote>
<p>Find out more <a href="http://kubernetes.io/">here</a>.</p>
<p>Kubernetes has a number of interesting concepts:</p>
<ul>
<li><a href="http://kubernetes.io/docs/user-guide/pods/">Pods</a></li>
<li><a href="http://kubernetes.io/docs/user-guide/labels/">Labels</a></li>
<li><a href="http://kubernetes.io/docs/user-guide/replication-controller/">Replication Controller</a></li>
<li><a href="http://kubernetes.io/docs/user-guide/services/">Services</a></li>
<li><a href="http://kubernetes.io/docs/user-guide/deployments/">Deployments</a></li>
</ul>
<p>Kubernetes is definitely not a trivial tool. Luckily, <a href="http://kubernetes.io/docs/">the official guides</a> do a great job of explaining things. I highly recommend going through them to learn Kubernetes.</p>
<h3 id="architecture">Architecture</h3>
<p>This is a much simplified version of the architecture.<br>
<img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-05-15-at-10-16-08-PM.png" alt="Orchestrating Docker with Kubernetes"></p>
<p>Neither load balancer nor service discovery is in the diagram because they are both handled by Kubernetes internally. See Kubernetes' architecture <a href="https://github.com/kubernetes/kubernetes/blob/release-1.2/docs/design/architecture.md">here</a>.</p>
<p>For this demo, I will use <a href="https://cloud.google.com/container-engine/">Google Container Engine</a> as a hosted solution. You can run Kubernetes on <a href="http://kubernetes.io/docs/getting-started-guides/#cloud">various platforms</a>, including local machine, Cloud IaaS providers, bare metals, etc. By the way, <a href="https://cloud.google.com/free-trial/">Google Cloud Platform</a> gives $300 in the 60-day free trial. Try it out!</p>
<h3 id="0prerequisites">0. Prerequisites</h3>
<p>This example requires a running Kubernetes cluster. If you want to use Google Container Engine, follow <a href="https://cloud.google.com/container-engine/docs/quickstart">this</a>. Or let <code>gcloud init</code> guide you.</p>
<p>Verify <code>kubectl</code> is configured and the cluster is ready</p>
<pre><code>kubectl cluster-info
</code></pre>
<h3 id="1runbackend">1. Run backend</h3>
<p>According to <a href="http://kubernetes.io/docs/user-guide/config-best-practices/">the best practices</a>, <em>create a Service before corresponding Deployments so that the scheduler can spread the pods comprising the Service</em>. So the redis service is created before its deployment; both are defined in <a href="https://github.com/ChengLong/kubernetes-demo/blob/master/backend.yaml">backend.yaml</a>.</p>
<pre><code>kubectl create -f backend.yaml
</code></pre>
<p>Verify the service is created</p>
<pre><code>$ kubectl get services redis
NAME      CLUSTER-IP     EXTERNAL-IP   PORT(S)    AGE
redis     10.7.246.226   &lt;none&gt;        6379/TCP   7m
</code></pre>
<p>Verify the deployment is created:</p>
<pre><code>$ kubectl get deployments
NAME      DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
redis     1         1         1            1           53s
</code></pre>
<p>Verify one redis pod is created</p>
<pre><code>$ kubectl get pods
NAME                     READY     STATUS    RESTARTS   AGE
redis-3180978658-4y13o   1/1       Running   0          3m
</code></pre>
<p>See the pod's logs</p>
<pre><code>kubectl logs redis-3180978658-4y13o
</code></pre>
<h3 id="2runfrontend">2. Run frontend</h3>
<p>Create <a href="https://github.com/ChengLong/kubernetes-demo/blob/master/frontend.yaml">frontend</a> by</p>
<pre><code>kubectl create -f frontend.yaml
</code></pre>
<p>Note that I specify <code>type: LoadBalancer</code> because I want this service to be accessible by the public. By default, services and pods are <em>only</em> accessible inside the internal Kubernetes network. Also, the Nodejs deployment will maintain 3 pods, each running one container based on this Docker image <a href="https://hub.docker.com/r/chenglong/simple-node/">chenglong/simple-node:v1</a>.</p>
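<p>For reference, the service part of <a href="https://github.com/ChengLong/kubernetes-demo/blob/master/frontend.yaml">frontend.yaml</a> looks roughly like this sketch; the labels follow the ones mentioned later in this post, and the repo has the real manifest, which also contains the deployment:</p>
<pre><code>apiVersion: v1
kind: Service
metadata:
  name: frontend
spec:
  type: LoadBalancer   # provisions an external load balancer with a public IP
  ports:
  - port: 80           # the port exposed on the external IP
  selector:            # routes traffic to pods carrying these labels
    app: nodejs
    tier: frontend
</code></pre>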
<p>You can verify that the service and deployment for Nodejs are created correctly in the same way as backend. If everything goes well, the app should be up and running. If you do <code>kubectl get pods</code>, there should be 3 frontend pods and 1 backend pod.</p>
<p>Get the external IP</p>
<pre><code>$ kubectl get service frontend
NAME       CLUSTER-IP     EXTERNAL-IP       PORT(S)   AGE
frontend   10.7.246.128   104.155.202.254   80/TCP    8m
</code></pre>
<p>You need to wait a while for the external IP to be available. Repeat <code>curl &lt;EXTERNAL-IP&gt;</code> a few times to verify that both load balancing and page counting work. You can also see the page in browser.</p>
<p><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-05-15-at-5-45-44-PM.png" alt="Orchestrating Docker with Kubernetes"></p>
<h3 id="3selfhealing">3. Self-healing</h3>
<p>One important feature Kubernetes offers out of the box is self-healing. Put simply, Kubernetes ensures that the specified number of replicas is running. If pods or nodes fail, Kubernetes creates new ones to replace them.</p>
<p>To test this feature, I delete one frontend pod and immediately list all pods.</p>
<pre><code>$ kubectl delete pod frontend-2747139405-bk4ul; kubectl get pods
NAME                        READY     STATUS              RESTARTS   AGE
backend-3180978658-i1ipl    1/1       Running             0          13m
frontend-2747139405-bk4ul   1/1       Terminating         0          8m
frontend-2747139405-hjnb9   1/1       Running             0          39s
frontend-2747139405-luukn   0/1       ContainerCreating   0          3s
frontend-2747139405-mfhky   1/1       Running             0          8m
</code></pre>
<p>From the pod statuses, <code>frontend-2747139405-bk4ul</code> is terminating. But notice that a new pod, <code>frontend-2747139405-luukn</code>, is automatically being created.</p>
<h3 id="4scaling">4. Scaling</h3>
<p>Scaling the frontend is as simple as</p>
<pre><code>$ kubectl scale deployment/frontend --replicas=6; kubectl get pods
deployment &quot;frontend&quot; scaled
NAME                        READY     STATUS              RESTARTS   AGE
backend-3180978658-i1ipl    1/1       Running             0          41m
frontend-2747139405-4xhlz   0/1       ContainerCreating   0          4s
frontend-2747139405-autox   0/1       ContainerCreating   0          4s
frontend-2747139405-hjnb9   1/1       Running             0          28m
frontend-2747139405-luukn   1/1       Running             0          27m
frontend-2747139405-mfhky   1/1       Running             0          36m
frontend-2747139405-r8ayi   0/1       ContainerCreating   0          4s
</code></pre>
<p>As seen from the above, Kubernetes immediately starts creating new replicas to match the desired state.</p>
<p>Scaling down is similar</p>
<pre><code>kubectl scale deployment/frontend --replicas=3
</code></pre>
<p>Depending on the nature of the app, it's probably more useful to define <a href="http://kubernetes.io/docs/user-guide/horizontal-pod-autoscaling/">Horizontal Pod Autoscaler</a> to do autoscaling.</p>
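<p>An autoscaler can be created either imperatively with <code>kubectl autoscale deployment frontend --min=3 --max=10 --cpu-percent=80</code>, or declaratively with a manifest like the following sketch. The API versions match the Kubernetes 1.2-era setup used in this post, and the 80% CPU target is an arbitrary example:</p>
<pre><code>apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
  name: frontend
spec:
  scaleTargetRef:      # the deployment whose replica count is managed
    apiVersion: extensions/v1beta1
    kind: Deployment
    name: frontend
  minReplicas: 3
  maxReplicas: 10
  targetCPUUtilizationPercentage: 80   # add pods when average CPU exceeds 80%
</code></pre>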
<h3 id="5rollingupdate">5. Rolling update</h3>
<p>Suppose we need to update the frontend app to <a href="https://hub.docker.com/r/chenglong/simple-node/tags/">chenglong/simple-node:v2</a>. Right now it's v1. How to roll out this release without service disruption? Kubernetes supports this natively and makes it simple.</p>
<p>To reduce risk, I want to do a <a href="http://martinfowler.com/bliki/CanaryRelease.html">Canary Release</a> first. Create the <a href="https://github.com/ChengLong/kubernetes-demo/blob/master/frontend-canary.yaml">canary deployment</a></p>
<pre><code>kubectl create -f frontend-canary.yaml
</code></pre>
<p>Note that I set <code>replicas: 1</code> so that the ratio of <code>stable</code> pods to <code>canary</code> pod is 3:1. And since the <code>canary</code> has labels <code>app: nodejs</code> and <code>tier: frontend</code>, it will be automatically load balanced by the frontend service.</p>
<p>List all pods with label <code>track</code></p>
<pre><code>$ kubectl get pods -L track
NAME                               READY     STATUS    RESTARTS   AGE       TRACK
backend-3180978658-23la7           1/1       Running   0          11m       &lt;none&gt;
frontend-1287392616-0lfxx          1/1       Running   0          9m        stable
frontend-1287392616-95b6q          1/1       Running   0          7m        stable
frontend-1287392616-bpstl          1/1       Running   0          9m        stable
frontend-canary-1722551660-6aaon   1/1       Running   0          5m        canary
</code></pre>
<p>Verify the deployment works</p>
<pre><code>$ kubectl get deployments frontend-canary
NAME              DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
frontend-canary   1         1         1            1           3m
</code></pre>
<p>Hit the frontend service a few times to verify that only one pod is updated to <a href="https://hub.docker.com/r/chenglong/simple-node/tags/">chenglong/simple-node:v2</a></p>
<pre><code>$ curl &lt;EXTERNAL-IP&gt;
This request is served by frontend-canary-1722551660-o4qcu
You have viewed this page 20 times!
Server Time: 2016-05-15T11:30:08.649Z

$ curl &lt;EXTERNAL-IP&gt;
This request is served by frontend-1287392616-wgl39. You have viewed this page 22 times!
</code></pre>
<p>Since the Canary Release is working fine, I want to roll out to all pods.</p>
<pre><code>$ vim frontend.yaml # update simple-node:v1 to simple-node:v2
$ kubectl apply -f frontend.yaml
</code></pre>
<p>Kubernetes will progressively kill old pods and create new pods. <em>It does not kill old Pods until a sufficient number of new Pods have come up, and does not create new Pods until a sufficient number of old Pods have been killed.</em>  Find out more <a href="http://kubernetes.io/docs/user-guide/deployments/#updating-a-deployment">here</a>.</p>
<p>Find out the details of one of the non-canary pods. Note that the image is <code>chenglong/simple-node:v2</code>, not <code>v1</code>.</p>
<pre><code>$ kubectl describe pods frontend-1389432169-aszzd
Name:		frontend-1389432169-aszzd
Namespace:	default
...
Containers:
  nodejs:
    Container ID: docker://faf884e2da293f6de66e275614d...
    Image:		chenglong/simple-node:v2
</code></pre>
<p>Delete Canary deployment</p>
<pre><code>$ kubectl delete deployments frontend-canary
deployment &quot;frontend-canary&quot; deleted
</code></pre>
<p>If this release is not ideal, we could easily roll back</p>
<pre><code>$ kubectl rollout undo deployment/frontend
deployment &quot;frontend&quot; rolled back
</code></pre>
<h3 id="kubernetesvsswarm">Kubernetes vs Swarm</h3>
<p>Based on <a href="https://chengl.com/orchestrating-docker-using-swarm/">the previous post</a> and this one, there are quite a few prominent differences between Kubernetes and Swarm:</p>
<ul>
<li>Kubernetes is a more mature and powerful orchestration tool than Swarm. Swarm provides basic and essential native clustering capabilities, but Kubernetes has built-in self-healing, service discovery (etcd), load balancing, automated rollouts and rollbacks, etc. Building all these functions on Swarm is not trivial. However, this may or may not be a good thing depending on your use case. If you need all the features that Kubernetes provides and don't intend to do any customization, Kubernetes is perfect for you. Otherwise, its complexity might become a burden, because it requires more effort to adopt and support.</li>
<li>Different philosophies. Kubernetes has clearly taken an all-in-one approach, while Swarm is <a href="https://blog.docker.com/2016/03/swarmweek-container-orchestration-docker-swarm/">batteries included but swappable</a>. So if I want to use Consul as the service discovery backend, I can easily do that in Swarm. But Kubernetes uses etcd by default and <a href="https://github.com/kubernetes/kubernetes/issues/1957">it's still not supported after more than one year</a>.</li>
<li>Kubernetes is primarily based on Google's experience on managing containers. So it's opinionated by definition. I'm not saying being opinionated is necessarily bad. But if you do decide to use it, you probably have to live with its choices. <a href="https://github.com/kubernetes/kubernetes/issues/1957">Consul is just one example.</a></li>
<li>Command line. Unlike Swarm, Kubernetes is not native to Docker. It has its own set of commands. See the differences <a href="http://kubernetes.io/docs/user-guide/docker-cli-to-kubectl/">here</a>. In general, though, <code>kubectl</code> is quite similar to <code>docker-cli</code>.</li>
<li><a href="https://medium.com/on-docker/evaluating-container-platforms-at-scale-5e7b44d93f2c#.kio1oocz5">Swarm performs better than Kubernetes.</a> I think this only matters when you are running hundreds or even thousands of nodes and containers. At small to medium scale, other factors (e.g. the points above) play a more important part in deciding which one to use.</li>
</ul>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Orchestrating Docker with Swarm, Machine, Compose and Consul]]></title><description><![CDATA[How to orchestrate Docker containers with Swarm, Machine, Compose and Consul]]></description><link>https://chengl.com/orchestrating-docker-using-swarm/</link><guid isPermaLink="false">5f01557e38333a60592f2647</guid><category><![CDATA[Docker]]></category><category><![CDATA[Orchestration]]></category><category><![CDATA[Swarm]]></category><category><![CDATA[Consul]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Fri, 15 Apr 2016 09:01:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>With <a href="https://blog.docker.com/2015/11/docker-multi-host-networking-ga/">multi-host networking ready for production</a> and <a href="https://blog.docker.com/2015/11/swarm-1-0/">the announcement of Swarm 1.0</a>, I think it's time to give Docker a serious try. This post details the steps I took to orchestrate a multi-host and multi-container app.</p>
<h3 id="architecture">Architecture</h3>
<p><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-04-15-at-12-57-55-AM.png" alt="Screen-Shot-2016-04-15-at-12-57-55-AM"></p>
<p>This is a simple Node app that uses Redis as its database, load balanced by Nginx. Each blue box is a Docker host, which runs several containers. All hosts talk to Consul for service discovery. The cluster has 5 nodes, each serving a specific purpose. We want to easily scale the number of <code>app</code> containers up and down.</p>
<p>To achieve this, we will use the following stack</p>
<ul>
<li><a href="https://docs.docker.com/compose/">Docker Compose</a> &gt;= 1.6</li>
<li><a href="https://docs.docker.com/machine/">Docker Machine</a> &gt;= 0.6</li>
<li><a href="https://docs.docker.com/swarm/">Docker Swarm</a> &gt;= 1.0</li>
<li><a href="https://hub.docker.com/r/progrium/consul/">Consul</a></li>
<li><a href="https://hub.docker.com/r/gliderlabs/registrator/">Registrator</a></li>
</ul>
<p>All scripts and the compose file for this demo are available <a href="https://github.com/ChengLong/docker-orchestration-swarm-demo">here</a>.</p>
<h4 id="1createandrunconsul">1. Create and Run Consul</h4>
<p><a href="https://www.consul.io/">Consul</a> is an excellent tool for service discovery. It works great with Docker. You could also use <a href="https://github.com/coreos/etcd">etcd</a>.</p>
<p>Let's create a Docker host for Consul</p>
<pre><code>docker-machine create -d virtualbox consul
</code></pre>
<p>This will create a new Docker host named <code>consul</code> in my local VirtualBox. It's certainly possible to create a Docker host on a supported cloud provider, e.g. AWS, Digital Ocean, Google Compute Engine, etc. The only difference is the driver. Check out <a href="https://docs.docker.com/machine/drivers/os-base/">driver options and arguments</a>.</p>
<p>Once <code>consul</code> is created, connect to it</p>
<pre><code>eval $(docker-machine env consul)
</code></pre>
<p>Run <a href="https://hub.docker.com/r/gliderlabs/consul-server/">gliderlabs/consul-server</a> (the successor to <a href="https://hub.docker.com/r/progrium/consul/">progrium/consul</a>) in the background.</p>
<pre><code>docker run -d -p 8500:8500 -h consul --restart always gliderlabs/consul-server -bootstrap
</code></pre>
<p>Verify Consul is working</p>
<pre><code>curl $(docker-machine ip consul):8500/v1/catalog/services
</code></pre>
<p>You can also browse Consul's web UI at <code>http://$(docker-machine ip consul):8500/ui</code></p>
<h4 id="2createtheswarm">2. Create The Swarm</h4>
<p>Create the Swarm master</p>
<pre><code>docker-machine create \
    -d virtualbox \
    --swarm \
    --swarm-master \
    --swarm-discovery=&quot;consul://$(docker-machine ip consul):8500&quot;\
    --engine-opt=&quot;cluster-store=consul://$(docker-machine ip consul):8500&quot; \
    --engine-opt=&quot;cluster-advertise=eth1:2376&quot; \
    swarm-master
</code></pre>
<p>This is a very long command, so let's break it down.</p>
<ul>
<li><code>-d virtualbox</code> indicates the driver is virtualbox</li>
<li><code>--swarm</code> configures the newly created machine with Swarm</li>
<li><code>--swarm-master</code> makes the newly created machine the Swarm master</li>
<li><code>--swarm-discovery=&quot;consul://$(docker-machine ip consul):8500&quot;</code> designates Consul as the discovery service</li>
<li><code>--engine-opt=&quot;cluster-store=consul://$(docker-machine ip consul):8500&quot;</code> designates Consul as the distributed KV storage backend for the cluster</li>
<li><code>--engine-opt=&quot;cluster-advertise=eth1:2376&quot;</code> advertises the machine on the network</li>
</ul>
<p>Create a node for load balancer</p>
<pre><code>docker-machine create \
    -d virtualbox \
    --swarm \
    --swarm-discovery=&quot;consul://$(docker-machine ip consul):8500&quot;\
    --engine-opt=&quot;cluster-store=consul://$(docker-machine ip consul):8500&quot; \
    --engine-opt=&quot;cluster-advertise=eth1:2376&quot; \
    --engine-label host=load-balancer \
    load-balancer
</code></pre>
<p>Note that we give this machine a label <code>host</code> with value <code>load-balancer</code>, which will be used for scheduling later.</p>
<p>Create app server 1</p>
<pre><code>docker-machine create \
    -d virtualbox \
    --swarm \
    --swarm-discovery=&quot;consul://$(docker-machine ip consul):8500&quot;\
    --engine-opt=&quot;cluster-store=consul://$(docker-machine ip consul):8500&quot; \
    --engine-opt=&quot;cluster-advertise=eth1:2376&quot; \
    --engine-label host=app-server \
    --virtualbox-cpu-count &quot;2&quot; \
    --virtualbox-memory &quot;2048&quot; \
    app-server-1
</code></pre>
<p>Note the two added parameters <code>--virtualbox-cpu-count &quot;2&quot;</code> and <code>--virtualbox-memory &quot;2048&quot;</code>, which give the app server more resources. Adjust these values according to your needs.</p>
<p>Create app server 2</p>
<pre><code>docker-machine create \
    -d virtualbox \
    --swarm \
    --swarm-discovery=&quot;consul://$(docker-machine ip consul):8500&quot;\
    --engine-opt=&quot;cluster-store=consul://$(docker-machine ip consul):8500&quot; \
    --engine-opt=&quot;cluster-advertise=eth1:2376&quot; \
    --engine-label host=app-server \
    --virtualbox-cpu-count &quot;2&quot; \
    --virtualbox-memory &quot;2048&quot; \
    app-server-2
</code></pre>
<p>It's best practice to put your app servers in different availability zones, or even different cloud providers, to achieve high availability. Check out <a href="https://docs.docker.com/machine/drivers/">the drivers reference</a> for options.</p>
<p>Create database node</p>
<pre><code>docker-machine create \
    -d virtualbox \
    --swarm \
    --swarm-discovery=&quot;consul://$(docker-machine ip consul):8500&quot;\
    --engine-opt=&quot;cluster-store=consul://$(docker-machine ip consul):8500&quot; \
    --engine-opt=&quot;cluster-advertise=eth1:2376&quot; \
    --engine-label host=database \
    --virtualbox-disk-size &quot;40000&quot; \
    database
</code></pre>
<p>Note that we specify <code>--virtualbox-disk-size &quot;40000&quot;</code> because this node will run the database. Adjust this according to your needs.</p>
<p>Connect to the Swarm</p>
<pre><code>eval $(docker-machine env -swarm swarm-master)
</code></pre>
<p>Check cluster info</p>
<pre><code>docker info
</code></pre>
<p>You should see 5 nodes in the cluster, namely <code>swarm-master</code>, <code>load-balancer</code>, <code>database</code>, <code>app-server-1</code> and <code>app-server-2</code>.</p>
<p>You can also run</p>
<pre><code>docker run --rm swarm list consul://$(docker-machine ip consul):8500
</code></pre>
<p>to find out all nodes in the cluster.</p>
<h4 id="3runregistratorineachhost">3. Run registrator in each host</h4>
<p>To automatically register and deregister services for all Docker containers on each host, we need to run <a href="https://hub.docker.com/r/gliderlabs/registrator/">gliderlabs/registrator</a> on each node of the cluster. Registrator will also use Consul as the KV store.</p>
<p>Run the following script</p>
<script src="https://gist-it.appspot.com/github/ChengLong/docker-orchestration-swarm-demo/blob/master/run_registrator_in_all_hosts">
</script>
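<p>In case the embedded gist doesn't load, here is a hedged sketch of what such a script does (the node names are from this demo; the registrator invocation follows its documented usage, but treat the exact flags as assumptions). With <code>DRY_RUN=1</code>, the default here, it only prints each command instead of executing it, so the loop can be inspected without a live cluster:</p>

```shell
#!/bin/sh
# Sketch only: start gliderlabs/registrator on every node of the cluster.
# DRY_RUN=1 prints each command instead of executing it.
: "${DRY_RUN:=1}"
run() { if [ "$DRY_RUN" = "1" ]; then echo "$@"; else "$@"; fi; }

# assumption: in real use, CONSUL_IP=$(docker-machine ip consul)
CONSUL_IP="${CONSUL_IP:-192.168.99.100}"
NODES="swarm-master load-balancer app-server-1 app-server-2 database"

for node in $NODES; do
  # in real use, first point the Docker client at the node:
  #   eval "$(docker-machine env $node)"
  run docker run -d --name registrator \
    -v /var/run/docker.sock:/tmp/docker.sock \
    gliderlabs/registrator "consul://${CONSUL_IP}:8500"
done
```

<p>Set <code>DRY_RUN=0</code> (after restoring the <code>eval</code> line) to actually start a registrator container on each node.</p>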
<p>After this, the infrastructure is ready. It's time to deploy.</p>
<h4 id="4dockercompose">4. Docker Compose</h4>
<p>We will use the following compose file to start the app</p>
<script src="https://gist-it.appspot.com/github/ChengLong/docker-orchestration-swarm-demo/blob/master/docker-compose.yml"></script>
<p>A few important points:</p>
<ul>
<li><a href="https://hub.docker.com/r/chenglong/nginx-consul-template/">chenglong/nginx-consul-template</a> is a simple image that uses NGINX and <a href="https://github.com/hashicorp/consul-template">consul-template</a> to load balance any service as instructed. In this case, it's the service <code>myapp</code>. It requires a running Consul for service discovery. <em>The environment variable <code>CONSUL_URL</code> must be set</em>.</li>
<li><a href="https://hub.docker.com/r/chenglong/simple-node/">chenglong/simple-node</a> is a simple Node image that uses Redis as its database. All it does is display the hostname and the number of times the page has been visited.</li>
<li><code>constraint:host==load-balancer</code>, <code>constraint:host==app-server</code> and <code>constraint:host==database</code> are <a href="https://docs.docker.com/swarm/scheduler/filter/">node filters</a> which tell the Swarm master to schedule the corresponding containers on matching hosts. So in this case, the <code>load-balancer</code> container will be scheduled to run on Docker host <code>load-balancer</code>. All <code>app</code> containers will be scheduled to run on either <code>app-server-1</code> or <code>app-server-2</code>.</li>
<li>There are two <a href="https://docs.docker.com/engine/userguide/networking/get-started-overlay/">overlay networks</a> <code>frontend</code> and <code>backend</code>. This is what makes multi-host networking possible, i.e. <code>load-balancer</code> can talk to <code>app</code> and <code>app</code> can talk to <code>redis</code>.</li>
</ul>
<p>Set <code>CONSUL_URL</code></p>
<pre><code>export CONSUL_URL=$(docker-machine ip consul):8500
</code></pre>
<p>Start the app</p>
<pre><code>docker-compose up -d
</code></pre>
<p>Verify that all containers are running</p>
<pre><code>docker-compose ps
</code></pre>
<p>Verify that the app works by running <code>curl $(docker-machine ip load-balancer)</code> a few times. You should see the counter increasing.</p>
<h4 id="5scaling">5. Scaling</h4>
<p>Now you can easily scale the number of <code>app</code> containers</p>
<pre><code>docker-compose scale app=4
</code></pre>
<p>This creates 3 more <code>app</code> containers.</p>
<p>By default, Swarm uses the <a href="https://docs.docker.com/swarm/scheduler/strategy/">spread</a> scheduling strategy, so the <code>app</code> containers will be spread evenly across <code>app-server-1</code> and <code>app-server-2</code>.</p>
<p>Verify that load balancing works by running <code>curl $(docker-machine ip load-balancer)</code> a few times. You should see the hostname cycle through the 4 containers and the counter increasing.</p>
<p>Scaling down is easy too</p>
<pre><code>docker-compose scale app=2
</code></pre>
<h3 id="summary">Summary</h3>
<p>With Docker Swarm, Machine, Compose and Consul, it's not hard to scale and schedule Docker containers across a cluster of nodes. Although I demoed with VirtualBox, you could do the same with Digital Ocean, AWS or any supported cloud provider to deploy Swarm clusters to production.</p>
<p>Since Swarm is native to Docker and follows the &quot;batteries included but swappable&quot; principle, I find it much more natural and easier to use than Kubernetes. I will try to write a post about Swarm vs Kubernetes in the near future.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Gmail Sending Limit]]></title><description><![CDATA[Gmail has daily sending limits. So does Google Apps for Work. The limits are much lower during the trial period. Find out how to deal with it.]]></description><link>https://chengl.com/gmail-sending-limit/</link><guid isPermaLink="false">5f01557e38333a60592f2646</guid><category><![CDATA[Gmail]]></category><category><![CDATA[Limit]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Fri, 08 Apr 2016 08:58:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>Many people use Gmail for personal email. Companies use <a href="https://apps.google.com/">Google Apps for Work</a>. Everyone is happy using it, until they hit its limits.</p>
<p>I recently learnt <a href="https://support.google.com/a/answer/166852?hl=en">its limits</a> (shown below) the hard way.</p>
<p><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-04-08-at-3-33-16-PM.png" alt="Screen-Shot-2016-04-08-at-3-33-16-PM"></p>
<p>I will highlight the most restrictive constraints from the above:</p>
<ul>
<li>You can send max 2000 emails per day from one account, <strong>500</strong> for trial accounts</li>
<li>3000 unique recipients per day, <strong>500</strong> for trial accounts</li>
</ul>
<p>From what I understand, normal Gmail accounts have the same limits as trial accounts. What are trial accounts? When you sign up for <a href="https://apps.google.com/">Google Apps for Work</a>, all users are in a trial period for 30 days, free of charge. After the trial, you have to pay ~$5/user/month.</p>
<p>The catch is: <strong>what if you want to send &gt; 500 emails during the trial period?</strong> This is the problem I had.</p>
<p>According to <a href="https://support.google.com/a/answer/166852?hl=en">Google</a>,</p>
<blockquote>
<p>At the end of your free trial period, your sending limits will be automatically increased when your domain is cumulatively billed for at least $30 USD (or the same amount in your currency). To expedite this process, you can manually prepay this amount as suggested here. It will take up to 48 hours to upgrade your sending limits after you submit the manual payment. Note: while you're still in your trial period, your sending limits will not be increased.</p>
</blockquote>
<p>So I signed up for Google Apps, set up email with my custom domain and made a manual payment of $31. After ~550 emails, sending stopped with the following error</p>
<pre><code>Daily user sending quota exceeded. s197sm16092990pfs.62 - gsmtp
</code></pre>
<p>I contacted online chat support and, surprisingly, got a reply immediately. It turned out that even though I had made the payment of $31, it <strong>DOES NOT</strong> automatically end the trial period, so the limit of 500 emails still applied. The solution: the Google support staff sent me a new Google Apps contract via email. Upon agreeing to its T&amp;C, it effectively ended my trial period and changed my account to a paid subscription. I was told that it may take 48 hours for the new limit of 2000 emails/day to take effect.</p>
<p>So far, contacting Google support seems to be the only way to lift the trial limits within the trial period. I haven't found any setting that lets me do it on my own, which is weird. Customers who don't want the free month should be able to opt out of it.</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Life is Short. Run Tests in Parallel]]></title><description><![CDATA[How to run RSpec and Cucumber in parallel]]></description><link>https://chengl.com/life-is-short-run-tests-in-parallel/</link><guid isPermaLink="false">5f01557e38333a60592f2645</guid><category><![CDATA[Testing]]></category><category><![CDATA[Parallel]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sun, 20 Mar 2016 08:56:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><p>If you are doing Rails, chances are that you have <a href="http://rspec.info/">RSpec</a> and <a href="https://cucumber.io/">Cucumber</a> tests. They are great tools that give you confidence that your software works as expected. But as the project grows, you may find your tests getting slower and slower, especially the Cucumber tests. That slows down not only your local development but also your Continuous Integration pipeline (testing is probably the first stage of your CI). We don't want to waste time waiting for tests to finish, locally or in CI. We want to develop and integrate faster. Here is how.</p>
<h3 id="1installparallel_tests">1. Install parallel_tests</h3>
<p><a href="https://github.com/grosser/parallel_tests">parallel_tests</a> is a great gem to run Test::Unit, RSpec, Cucumber and Spinach tests in parallel across multiple CPU cores.</p>
<p>Update your Gemfile</p>
<pre><code>group :development do
  gem &quot;parallel_tests&quot;
  # other gems...
end
</code></pre>
<p>Update config/database.yml</p>
<pre><code>test:
  database: db/test&lt;%= ENV['TEST_ENV_NUMBER'] %&gt;.sqlite3
</code></pre>
<p>If you are not using sqlite3 for the test db, update it accordingly.<br>
Each CPU core will run its share of tests using its own test db. So if you have 8 CPU cores, you will see 8 sqlite3 files generated in <code>db</code> when running tests.</p>
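<p>For example, a Postgres setup (the database name and credentials here are hypothetical) would parameterize the database name the same way:</p>

```yaml
# config/database.yml -- hypothetical Postgres variant.
# TEST_ENV_NUMBER is "" for the first process and 2, 3, ... for the rest,
# so each process gets its own database.
test:
  adapter: postgresql
  database: myapp_test<%= ENV['TEST_ENV_NUMBER'] %>
  username: postgres
  password: postgres
```

<p>parallel_tests also ships rake tasks (e.g. <code>rake parallel:create</code>) to create these per-process databases.</p>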
<p>Then</p>
<pre><code>bin/bundle install
</code></pre>
<h3 id="2usebinstubstorunparalleltests">2. Use Binstubs to Run Parallel Tests</h3>
<p>If you don't know what binstubs are and why you should use them for all commands, take a look at <a href="https://robots.thoughtbot.com/use-bundlers-binstubs">this</a> and <a href="https://github.com/rbenv/rbenv/wiki/Understanding-binstubs">this</a>.</p>
<p>Generate binstubs only for <code>parallel_tests</code></p>
<pre><code>bin/bundle binstubs parallel_tests
</code></pre>
<p>It will generate 4 binstubs in <code>bin</code></p>
<ul>
<li>parallel_cucumber</li>
<li>parallel_rspec</li>
<li>parallel_spinach</li>
<li>parallel_test</li>
</ul>
<p>You don't need to keep all of them; only keep the ones you need. In my case, I only use RSpec and Cucumber, so I deleted <code>parallel_spinach</code> and <code>parallel_test</code>. Add the rest to version control.</p>
<h3 id="3runtestsinparallel">3. Run Tests in Parallel</h3>
<p>Run all specs in parallel</p>
<pre><code>bin/parallel_rspec spec
</code></pre>
<p>Run all features in parallel</p>
<pre><code>bin/parallel_cucumber features
</code></pre>
<p>This will launch N browsers, where N is the number of your CPU cores.</p>
<p>Be amazed by how quickly all tests finish now. Theoretically, the running time can be reduced to <code>T/N</code>, where <code>T</code> is the serial running time and <code>N</code> is the number of CPU cores. That's not achievable in practice, because the tests may not be split evenly among the processes, and even when they are, the shares may not take equal time to run. Besides, there is overhead in coordinating the processes and collating results. However, a 20% ~ 50% reduction in running time is easy and common to achieve, which can save 5 ~ 10 minutes in a medium-sized project. In the long run, that's a huge time saving.</p>
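<p>To make the estimate concrete, here is a bit of illustrative shell arithmetic (the numbers are made up):</p>

```shell
# Illustrative only: a 600-second serial suite on 8 cores.
T=600; N=8
ideal=$((T / N))         # perfect split, zero overhead
low=$((T * 80 / 100))    # a 20% reduction
high=$((T * 50 / 100))   # a 50% reduction
echo "ideal: ${ideal}s, realistic: ${high}s to ${low}s"
```

<p>Even the pessimistic end saves minutes on every run, which compounds quickly across a team.</p>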
<h3 id="4tips">4. Tips</h3>
<p>Not all tests are equal. Some tests are for must-have features and some are for nice-to-have features. Some tests fail more often than others. To get feedback faster, it's a good practice to group tests into categories and run them in order, e.g.</p>
<pre><code>bin/parallel_cucumber features -o '-t @smoke'
bin/parallel_cucumber features -o '-t @flaky'
bin/parallel_cucumber features -o '-t ~@smoke,~@flaky'
</code></pre>
<p>The above example will</p>
<ol>
<li>Run all smoke features. More on <a href="https://en.wikipedia.org/wiki/Smoke_testing_(software)">smoke tests</a></li>
<li>Run all flaky features</li>
<li>Run the rest</li>
</ol>
<p>This way, it avoids running the smoke tests and flaky tests at the very end.</p>
<h3 id="summary">Summary</h3>
<p>It's quite easy to set up parallel tests. Doing so speeds up not only your local development but also your CI pipeline. And the time saving is tremendous in the long run.</p>
<p>Life is Short. Run Tests in Parallel!</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[HTTP/2]]></title><description><![CDATA[What is HTTP/2 and how to upgrade]]></description><link>https://chengl.com/http2/</link><guid isPermaLink="false">5f01557e38333a60592f2644</guid><category><![CDATA[HTTP]]></category><category><![CDATA[HTTP/2]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Mon, 14 Mar 2016 08:53:00 GMT</pubDate><content:encoded><![CDATA[<!--kg-card-begin: markdown--><h3 id="whatishttp2andwhy">What is HTTP/2 and Why</h3>
<p><a href="https://tools.ietf.org/html/rfc2068">HTTP/1.1</a> has been serving most of the Web since 1997. As websites get more and more sophisticated and resource-intensive, it is starting to show its limitations, e.g. one outstanding request per TCP connection. So its next generation emerged: <a href="http://http2.github.io/">HTTP/2</a>.</p>
<p><a href="http://http2.github.io/faq/">HTTP/2 FAQ</a> does a great job explaining the background and specifications. Highly recommended. Here is an executive summary. HTTP/2:</p>
<ul>
<li>is specifically designed to improve performance</li>
<li>is based on <a href="http://tools.ietf.org/html/draft-mbelshe-httpbis-spdy-00">SPDY</a></li>
<li>is binary, instead of textual</li>
<li>is fully multiplexed, instead of ordered and blocking</li>
<li>can therefore use one connection for parallelism</li>
<li>uses header compression to reduce overhead</li>
<li>allows servers to push responses proactively into client caches</li>
<li>is backward-compatible, designed to be drop-in replacement for HTTP/1.1</li>
<li>is <a href="http://caniuse.com/#search=http%2F2">supported by most browsers over TLS</a></li>
</ul>
<p>HttpWatch <a href="https://blog.httpwatch.com/2015/01/16/a-simple-performance-comparison-of-https-spdy-and-http2/">reported</a> a good performance improvement from using HTTP/2.</p>
<h3 id="howtoupgrade">How to Upgrade</h3>
<p>Upgrading from HTTP/1.1 to HTTP/2 is quite easy. You just need to make sure that your web server supports HTTP/2 and &quot;turn it on&quot;. I will use NGINX as an example.</p>
<h4 id="installnginx195">Install NGINX 1.9.5+</h4>
<p>In the case of NGINX, only <a href="https://www.nginx.com/blog/nginx-1-9-5/">1.9.5+</a> supports HTTP/2.</p>
<p>Check your NGINX version</p>
<pre><code>nginx -V
</code></pre>
<p>If it's lower than 1.9.5, you need to upgrade NGINX first. Otherwise, head over to <a href="#turn-on-http2">Turn On HTTP/2</a>.<br>
At the time of writing, the latest stable release of NGINX is 1.8.1, which is lower than 1.9.5, so you need to install the NGINX mainline version. Don't worry that the mainline version is not labeled stable. It's actually <a href="https://www.nginx.com/blog/nginx-1-6-1-7-released/">better than the stable version</a> because it has the latest bug fixes.</p>
<p>On Ubuntu,</p>
<p>Install NGINX signing key</p>
<pre><code>wget http://nginx.org/keys/nginx_signing.key
sudo apt-key add nginx_signing.key
</code></pre>
<p>Add the following in <code>/etc/apt/sources.list</code></p>
<pre><code>deb http://nginx.org/packages/mainline/ubuntu/ &lt;codename&gt; nginx
deb-src http://nginx.org/packages/mainline/ubuntu/ &lt;codename&gt; nginx
</code></pre>
<p>Note that <code>&lt;codename&gt;</code> should be the result of <code>lsb_release -c | cut -f2</code>.</p>
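<p>If you'd rather not fill in <code>&lt;codename&gt;</code> by hand, a small script can generate both lines (the fallback codename used when <code>lsb_release</code> is unavailable is an arbitrary assumption):</p>

```shell
# Emit the two apt source lines with the codename filled in automatically.
# Falls back to "trusty" (an assumption) if lsb_release is not installed.
codename="$( (lsb_release -cs) 2>/dev/null || echo trusty )"
sources="deb http://nginx.org/packages/mainline/ubuntu/ ${codename} nginx
deb-src http://nginx.org/packages/mainline/ubuntu/ ${codename} nginx"
printf '%s\n' "$sources"
```

<p>Append the output to <code>/etc/apt/sources.list</code>.</p>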
<p>Then install</p>
<pre><code>sudo apt-get update
sudo apt-get install nginx
</code></pre>
<h4 id="compilenginxfromsourcetosupportalpnoptional">Compile NGINX from Source to Support ALPN (Optional)</h4>
<p>Depending on the OpenSSL version that your NGINX is built with, it may not support <a href="https://en.wikipedia.org/wiki/Application-Layer_Protocol_Negotiation">Application Layer Protocol Negotiation</a> (ALPN). More details <a href="http://serverfault.com/questions/732474/nginx-configured-with-http2-doesnt-deliver-http-2">here</a>. Besides, starting from May 15th 2016, <a href="http://blog.chromium.org/2016/02/transitioning-from-spdy-to-http2.html">Chrome will ONLY support ALPN</a>, which means supporting ALPN is necessary for HTTP/2 to work fully.</p>
<p>Find out NGINX build details</p>
<pre><code>nginx -V
</code></pre>
<p>If it says <code>built with OpenSSL 1.0.2f  28 Jan 2016</code>, you don't need to compile NGINX from source. Jump to <a href="#turn-on-http2">Turn On HTTP/2</a>.</p>
<p>If it's built with OpenSSL lower than <code>1.0.2f</code>, e.g. <code>1.0.1f</code>. You need to compile NGINX from source with OpenSSL 1.0.2f. The detailed steps can be found <a href="https://www.clay.fail/posts/ubuntu-http2-in-mere-hours/">here</a>.</p>
<h4 id="anameturnonhttp2aturnonhttp2"><a name="turn-on-http2"></a> Turn on HTTP/2</h4>
<p>In <code>site.conf</code>,</p>
<pre><code>server {
    listen 443 ssl http2;
    ...
}
</code></pre>
<p>That's all you need to turn on HTTP/2!</p>
<p>Please note that although HTTP/2 doesn't require HTTPS, <a href="http://caniuse.com/#search=http%2F2">most web browsers only support HTTP/2 via TLS</a>. So you do need to serve your site via HTTPS in order to use HTTP/2. If your site isn't using HTTPS yet, check out <a href="https://chengl.com/lets-encrypt-nginx/">my post</a> on how to use <a href="https://letsencrypt.org/">Let's Encrypt</a> to make it HTTPS.</p>
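<p>Putting it together, a minimal TLS-enabled server block could look like this (the domain, certificate paths and upstream are placeholders, not this site's actual config):</p>

```nginx
server {
    listen 443 ssl http2;
    server_name example.com;

    # With Let's Encrypt, certificates typically live under
    # /etc/letsencrypt/live/<domain>/
    ssl_certificate     /etc/letsencrypt/live/example.com/fullchain.pem;
    ssl_certificate_key /etc/letsencrypt/live/example.com/privkey.pem;

    location / {
        proxy_pass http://127.0.0.1:8080;  # assumed application upstream
    }
}
```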
<p>Reload NGINX</p>
<pre><code>sudo nginx -s reload
</code></pre>
<p>Refresh your site and inspect the page; you should see that the assets from your site are loaded via the <code>HTTP/2 (h2)</code> protocol.</p>
<p><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-03-14-at-12-56-51-AM.png" alt="Screen-Shot-2016-03-14-at-12-56-51-AM"></p>
<p>This blog is using HTTP/2. You can inspect this page to see <code>HTTP/2</code> in action.</p>
<p>Another way is to let <a href="https://tools.keycdn.com/http2-test">KeyCDN</a> do the test.<br>
<img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-06-06-at-3-38-49-PM.png" alt="Screen-Shot-2016-06-06-at-3-38-49-PM"></p>
<p>If you prefer command line</p>
<pre><code>echo | openssl s_client -alpn h2 -connect yourserver.com:443 | grep ALPN
</code></pre>
<p>You should see <code>ALPN protocol: h2</code>.</p>
<h3 id="summary">Summary</h3>
<p>Although HTTP/2 has only been out for about a year, it has seen very <a href="https://www.keycdn.com/blog/http2-statistics/">good adoption</a> thanks to its backward compatibility, ease of upgrading and performance benefits. I'm convinced that <a href="https://http2.akamai.com/">HTTP/2 is the future of the Web</a>.</p>
<p>Wait no more, upgrade!</p>
<h3 id="reference">Reference</h3>
<ul>
<li><a href="https://http2.github.io/">HTTP/2 Home Page</a></li>
<li>HTTP/2 Specifications <a href="http://httpwg.org/specs/rfc7540.html">RFC7540</a> and <a href="http://httpwg.org/specs/rfc7541.html">RFC7541</a></li>
<li><a href="https://bagder.gitbooks.io/http2-explained/content/en/index.html">http2 explained</a></li>
</ul>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Using Docker for Rails Development]]></title><description><![CDATA[How to use Docker for Rails Development]]></description><link>https://chengl.com/using-docker-for-rails-development/</link><guid isPermaLink="false">5f01557e38333a60592f2643</guid><category><![CDATA[Docker]]></category><category><![CDATA[Rails]]></category><category><![CDATA[Development]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sun, 14 Feb 2016 08:49:00 GMT</pubDate><media:content url="https://chengl.com/content/images/2017/07/docker_logo.png" medium="image"/><content:encoded><![CDATA[<!--kg-card-begin: markdown--><h3 id="why">Why</h3>
<img src="https://chengl.com/content/images/2017/07/docker_logo.png" alt="Using Docker for Rails Development"><p>There are many use cases for Docker. I see people primarily using it for Continuous Integration and deployment, but Docker is also good for development. The obvious advantages of using Docker for development are:</p>
<ul>
<li>No need to install app dependencies on dev machines. App dependencies are built into Docker images, so dev machines are not cluttered with them. The only dependency needed on a dev machine is Docker itself, nothing else.</li>
<li>Have a consistent development environment for all developers. No more excuse like &quot;<strong>It works on my machine</strong>&quot;!</li>
<li>Onboard new developers quickly. No need to spend hours setting up and configuring a new dev machine. Just run <code>docker-compose up</code> and start coding.</li>
</ul>
<h3 id="prerequisites">Prerequisites</h3>
<p>This post will show you how to set up a Ruby on Rails development environment using Docker. My dev machine has</p>
<ul>
<li>OS X El Capitan</li>
<li>Docker version 1.10.1</li>
<li>docker-compose version 1.6.0</li>
</ul>
<p>Please note my Docker and docker-compose versions. If yours are older, the example here will <em>NOT</em> work for you. In particular, I'm using docker-compose file format <a href="https://docs.docker.com/compose/compose-file/#version-2">version 2</a>, which is supported by Compose 1.6.0+ and requires Docker Engine 1.10.0+.</p>
<h3 id="architecture">Architecture</h3>
<p>The architecture of the app looks like this<br>
<img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-02-13-at-9-47-14-PM.png" alt="Using Docker for Rails Development"></p>
<p>We have two containers: one for the app and one for the db. The web container has Nginx, Passenger and the Rails app; the db container only has Postgres. This architecture is probably common for many Rails apps running in production. As a good practice, you should make your development environment as close to production as possible, so that potential production issues are exposed in development, not when you receive calls from unhappy customers.</p>
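<p>As a preview, this two-container layout can be sketched as a version 2 compose file along these lines (the service names, image tag, ports and mount paths here are assumptions, not the project's actual file):</p>

```yaml
version: '2'
services:
  web:
    build:
      context: .
      args:
        LOCAL_USER_ID: "1000"   # placeholder; pass $(id -u) at build time
    ports:
      - "80:80"
    volumes:
      - .:/home/app/rails_on_docker_for_dev   # live-edit the source
    depends_on:
      - db
  db:
    image: postgres:9.4   # version is an assumption
```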
<h3 id="buildandruncontainerweb">Build and Run Container Web</h3>
<h5 id="step1createrailsproject">Step 1. Create Rails Project</h5>
<p>My dev machine only has Docker installed; it doesn't even have Ruby. So how do I create a Rails project? Use Docker.</p>
<pre><code>docker run -it --rm --user &quot;$(id -u):$(id -g)&quot; -v &quot;$PWD&quot;:/apps -w /apps rails:4.2.5 rails new rails_on_docker_for_dev --skip-bundle
</code></pre>
<p>The above command (btw, it's one line) will pull the image <a href="https://hub.docker.com/_/rails/"><strong>rails:4.2.5</strong></a> from Docker Hub and run a one-off container based on the image with the command <code>rails new rails_on_docker_for_dev --skip-bundle</code>. The result is that it creates a Rails 4.2.5 project called <code>rails_on_docker_for_dev</code> in your current directory. Take a look at the newly created project <code>rails_on_docker_for_dev</code>.</p>
<h5 id="step2dockerizetheapp">Step 2. Dockerize the app</h5>
<p>Create Dockerfile in the project</p>
<pre><code>FROM phusion/passenger-ruby22:0.9.18
MAINTAINER Cheng Long &lt;me@chengl.com&gt;

# Hack to get around the volume mount issue on Mac
# https://github.com/boot2docker/boot2docker/issues/581
# https://github.com/docker/docker/issues/7198#issuecomment-159736577
ARG LOCAL_USER_ID
RUN usermod -u ${LOCAL_USER_ID} app

ENV APP_NAME rails_on_docker_for_dev

# Enable Nginx and add config
RUN rm -f /etc/service/nginx/down
RUN rm /etc/nginx/sites-enabled/default
COPY $APP_NAME.conf /etc/nginx/sites-enabled/$APP_NAME.conf

# Create project root and change owner
ENV APP_PATH /home/app/$APP_NAME
RUN mkdir -p $APP_PATH
RUN chown -R app:app $APP_PATH

WORKDIR $APP_PATH

RUN apt-get clean &amp;&amp; rm -rf /var/lib/apt/lists/* /tmp/* /var/tmp/*

# Use baseimage-docker's init process.
CMD [&quot;/sbin/my_init&quot;]
</code></pre>
<p>This is a standard Dockerfile. It's based on <a href="https://hub.docker.com/r/phusion/passenger-ruby22">phusion/passenger-ruby22:0.9.18</a>, which includes <strong>Nginx 1.8.0</strong>, <strong>Passenger 5.0.22</strong> and <strong>Ruby 2.2</strong>. It basically has all the dependencies needed to run the web container. You probably notice that the image doesn't include the Rails app. That's because we want to <a href="https://docs.docker.com/engine/userguide/containers/dockervolumes/">mount the Rails app as a volume</a> so that when we edit the app's source code we don't have to rebuild the image. This is much needed in development to shorten the feedback loop. But in production, you should include the app in the image. We will see how it's mounted in <code>docker-compose.yml</code>.</p>
<p>Two lines in the Dockerfile need a bit of explanation</p>
<pre><code>ARG LOCAL_USER_ID
RUN usermod -u ${LOCAL_USER_ID} app
</code></pre>
<p>First, <a href="https://github.com/phusion/passenger-docker#app_user">Passenger recommends</a> running the app as user <code>app</code> because it's a good security practice. So user <code>app</code> needs <code>rw</code> access to the mounted volume.</p>
<p>Second, in order to give <code>app</code> <code>rw</code> access without changing the ownership of the files (because we want to edit the source code on the local machine, where the files are owned by the local user), we change <code>app</code>'s uid to be the same as the local user's. This is <a href="https://github.com/docker/docker/issues/7198#issuecomment-159736577">a trick</a> to get around <a href="https://github.com/boot2docker/boot2docker/issues/581">the permission issue</a> of mounted volumes on OS X.</p>
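<p>Concretely, the host side of the trick is just capturing your numeric uid; the container side is the <code>usermod</code> line in the Dockerfile above. A minimal sketch:</p>

```shell
# Sketch, host side: capture your numeric uid. This value is passed as the
# build arg LOCAL_USER_ID and consumed by `usermod -u ${LOCAL_USER_ID} app`.
# File permissions care only about the numeric uid, not the user name, so
# files written by `app` in the container stay editable by you on the host.
LOCAL_USER_ID=$(id -u)
echo "user app inside the container will get uid $LOCAL_USER_ID"
```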
<p>Create <code>rails_on_docker_for_dev.conf</code></p>
<pre><code>server {
    listen 80;
    server_name 0.0.0.0;
    root /home/app/rails_on_docker_for_dev/public;

    passenger_enabled on;
    passenger_user app;
    passenger_ruby /usr/bin/ruby;
    passenger_app_env development;
}
</code></pre>
<p>Note <code>passenger_user app;</code>, the app is running as <code>app</code> not <code>root</code>.</p>
<p>Create docker-compose.yml</p>
<pre><code>version: '2'
services:
  web:
    build:
      context: .
      args:
        - LOCAL_USER_ID=${LOCAL_USER_ID}
    ports:
      - &quot;80:80&quot;
    volumes:
      - &quot;.:/home/app/rails_on_docker_for_dev&quot;
</code></pre>
<p>Note the <code>volumes</code> part. We mount the project directory in the local dev machine as a volume to <code>/home/app/rails_on_docker_for_dev</code> inside the container so that the container has the source code and Rails auto-reloading still works.</p>
<h5 id="step3buildandrun">Step 3. Build and Run</h5>
<p>Build the image</p>
<pre><code>LOCAL_USER_ID=$(id -u) docker-compose build
</code></pre>
<p>We need <code>LOCAL_USER_ID=$(id -u)</code> because <code>docker-compose.yml</code> expects an environment variable <code>LOCAL_USER_ID</code> so that it can change user <code>app</code>'s uid to the local user's uid.</p>
<p>Run the image</p>
<pre><code>docker-compose up
</code></pre>
<p>You will see some logs that Passenger prints out in the console. When it's ready, point your browser to your docker machine IP. To find out your docker machine IP, run <code>docker-machine ip default</code>, if you are using the machine named default.</p>
<p><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-02-13-at-11-47-14-PM.png" alt="Using Docker for Rails Development"></p>
<p>You should see an error where Passenger complains that <code>bundle install</code> has not been run yet. That's expected. The app doesn't have any gems yet.</p>
<p>Run the first <code>bundle install</code></p>
<pre><code>docker-compose run --rm --user $(id -u):$(id -g) web bundle install --path=vendor/bundle
</code></pre>
<p>We want to install gems in <code>./vendor/bundle</code> because the gems will persist there regardless of the lifecycle of the container. When we update the Gemfile and run <code>bundle install</code> again, it will only install the newly added gems, not everything again.</p>
<p><code>--user $(id -u):$(id -g)</code> is to make sure that newly created files are owned by the local user. Without it, they will be owned by <code>root</code>.</p>
<p><code>--rm</code> is to remove the container after it's done.</p>
<p>Remove the running container</p>
<pre><code>docker rm -f $(docker ps -ql)
</code></pre>
<p>Run the image again</p>
<pre><code>docker-compose up
</code></pre>
<p>If you refresh your browser, you should see a familiar welcome page!</p>
<h5 id="step4developusingdocker">Step 4. Develop using Docker</h5>
<p>Having only a welcome page is not very exciting. Let's add a <code>Post</code> model.</p>
<pre><code>docker-compose run --rm --user $(id -u):$(id -g) web bundle exec rails g scaffold post title:string body:text
</code></pre>
<p>DB migrate</p>
<pre><code>docker-compose run --rm --user &quot;$(id -u):$(id -g)&quot; web bundle exec rake db:migrate
</code></pre>
<p>Now you can CRUD posts. Everything works!</p>
<h3 id="buildandruncontainerdb">Build and Run Container DB</h3>
<h5 id="step5replacesqlite3withpg">Step 5. Replace sqlite3 with pg</h5>
<p>Replace sqlite3 with pg in Gemfile</p>
<pre><code>gem 'pg', '~&gt; 0.18.4'
</code></pre>
<p>bundle install</p>
<pre><code>docker-compose run --rm --user $(id -u):$(id -g) web bundle install --path=vendor/bundle
</code></pre>
<h5 id="step6updatewebimage">Step 6. Update Web Image</h5>
<p>Add Postgres dependency in Dockerfile</p>
<pre><code>RUN apt-get update &amp;&amp; apt-get install -qq -y libpq-dev --fix-missing --no-install-recommends
</code></pre>
<p>Update docker-compose.yml so that web and db are linked</p>
<pre><code>version: '2'
services:
  web:
    build:
      context: .
      args:
        - LOCAL_USER_ID=${LOCAL_USER_ID}
    ports:
      - &quot;80:80&quot;
    volumes:
      - &quot;.:/home/app/rails_on_docker_for_dev&quot;
    links:
      - db

  db:
    image: postgres:9.4.5
    ports:
      - '5432:5432'
</code></pre>
<p>Update database.yml to use Postgres</p>
<pre><code>development: &amp;default
  adapter: postgresql
  encoding: utf8
  database: myapp_dev
  pool: 5
  username: postgres
  host: db

test:
  &lt;&lt;: *default
  database: myapp_test
</code></pre>
<h5 id="step7rebuildimagesandrun">Step 7. Rebuild images and Run</h5>
<pre><code>LOCAL_USER_ID=$(id -u) docker-compose build
</code></pre>
<p>Run</p>
<pre><code>docker-compose up
</code></pre>
<p>You should see logs from both web and db printed in the console.</p>
<p>Of course, you should set up the database first</p>
<pre><code>docker-compose run --rm --user &quot;$(id -u):$(id -g)&quot; web bundle exec rake db:setup
</code></pre>
<p>Everything works as before!</p>
<p>All source code is <a href="https://github.com/ChengLong/rails_on_docker_for_dev">here</a>.</p>
<h3 id="summary">Summary</h3>
<p>As you can see, it's not that hard to use Docker for Rails development. Essentially, we just need a <code>Dockerfile</code> and a <code>docker-compose.yml</code>, and to link the containers properly. The development workflow with Docker is pretty much the same as without Docker. But the advantages that Docker brings to development are invaluable. It abstracts away the differences between local dev machines and lets every developer in your team have a consistent development environment. Isn't that awesome?</p>
<p>Happy Dockering!</p>
<!--kg-card-end: markdown-->]]></content:encoded></item><item><title><![CDATA[Let's Encrypt Nginx]]></title><description><![CDATA[How to setup Let's Encrypt on Nginx and setup auto renewal]]></description><link>https://chengl.com/lets-encrypt-nginx/</link><guid isPermaLink="false">5f01557e38333a60592f2642</guid><category><![CDATA[Nginx]]></category><category><![CDATA[Let's Encrypt]]></category><dc:creator><![CDATA[Cheng Long]]></dc:creator><pubDate>Sun, 31 Jan 2016 08:41:00 GMT</pubDate><media:content url="https://chengl.com/content/images/2017/07/Screen-Shot-2016-02-01-at-12-06-30-AM.png" medium="image"/><content:encoded><![CDATA[<!--kg-card-begin: markdown--><img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-02-01-at-12-06-30-AM.png" alt="Let's Encrypt Nginx"><p><em>Update [2017 Aug 5]: Certbot has been developed by EFF and others as an easy-to-use automatic client that fetches and deploys SSL/TLS certificates. I would recommend using <a href="https://certbot.eff.org/">it</a>.</em></p>
<h2 id="why">Why</h2>
<p>Since you are here, you probably know what <a href="https://letsencrypt.org">Let's Encrypt</a> is and why it exists. If not, below is an executive summary (copied from <a href="https://letsencrypt.org/howitworks/">here</a>):</p>
<blockquote>
<p>Anyone who has gone through the trouble of setting up a secure website knows what a hassle getting and maintaining a certificate can be. Let’s Encrypt automates away the pain and lets site operators turn on and manage HTTPS with simple commands.<br>
No validation emails, no complicated configuration editing, no expired certificates breaking your website. And of course, because Let’s Encrypt provides certificates for free, no need to arrange payment.</p>
</blockquote>
<p>At the time of writing, the Nginx plugin is <a href="https://github.com/letsencrypt/letsencrypt#current-features">highly experimental and not included in letsencrypt-auto</a>, so I didn't use it. This post documents how to upgrade your site from <em>http</em> to <em>https</em> and set up auto renewal. If your site is already using <em>https</em> and you just want to set up auto renewal, you can skip to <a href="#auto-renew">Auto Renew</a>.</p>
<h2 id="prerequisites">Prerequisites</h2>
<ul>
<li>Nginx</li>
<li>Debian-based distros, e.g. Ubuntu</li>
<li>Git</li>
<li>openssl</li>
</ul>
<h2 id="setupletsencrypt">Setup Let's Encrypt</h2>
<h4 id="1installletsencryptclient">1. Install Let's Encrypt client</h4>
<pre><code>git clone https://github.com/letsencrypt/letsencrypt ~/letsencrypt
cd ~/letsencrypt
</code></pre>
<h4 id="2freeport80and443">2. Free port 80 and 443</h4>
<p>With the <em>standalone</em> plugin, Let's Encrypt needs to use ports 80 and 443 to issue a cert for you. So you need to make sure that ports 80 and 443 are <em>NOT</em> in use.<br>
Check if ports 80 and 443 are in use</p>
<pre><code>  netstat -na | grep ':80 .*LISTEN'
  netstat -na | grep ':443 .*LISTEN'
</code></pre>
<p>If nothing shows up, you are good to go. Otherwise, you need to temporarily stop whatever process is using them.</p>
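<p>If <code>netstat</code> isn't available, <code>ss</code> from iproute2 does the same check (a sketch; the filter syntax is <code>ss</code>'s own):</p>

```shell
# List TCP listeners on ports 80 or 443; empty output (below the header)
# means both ports are free.
ss -ltn '( sport = :80 or sport = :443 )'
```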
<h4 id="3generatecertificate">3. Generate certificate</h4>
<pre><code>./letsencrypt-auto certonly --standalone
</code></pre>
<p>You will be prompted to enter your email and domain. Just follow the instructions.<br>
After it's done, you should see something like:</p>
<blockquote>
<p>Congratulations! Your certificate and chain have been saved at<br>
/etc/letsencrypt/live/your.domain.com/fullchain.pem. Your cert<br>
will expire on YYYY-MM-DD. To obtain a new version of the<br>
certificate in the future, simply run Let's Encrypt again.</p>
</blockquote>
<p>That means a cert has been issued to you. You can check it out in the directory <code>/etc/letsencrypt/live/your.domain.com</code>. It should contain <code>cert.pem</code>, <code>chain.pem</code>, <code>fullchain.pem</code> and <code>privkey.pem</code>.<br>
Now we need to configure Nginx to use it.</p>
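<p>If you want to sanity-check what's inside those PEM files, <code>openssl</code> can print the subject and validity window. A sketch, using a throwaway self-signed cert as a stand-in (the real files only exist on your server):</p>

```shell
# Stand-in for /etc/letsencrypt/live/your.domain.com/cert.pem: generate a
# throwaway self-signed cert so this sketch runs anywhere.
openssl req -x509 -newkey rsa:2048 -keyout /tmp/key.pem -out /tmp/cert.pem \
  -days 90 -nodes -subj "/CN=your.domain.com" 2>/dev/null

# The same inspection works on the real cert.
openssl x509 -in /tmp/cert.pem -noout -subject -dates
```

<p>On the real cert, <code>notAfter</code> is the expiry date that auto renewal keeps pushing forward.</p>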
<h4 id="4confignginx">4. Configure Nginx</h4>
<p>In the server block for https, you should have the following lines:</p>
<pre><code>  listen 443 ssl;
  server_name your.domain.com;
  ssl on;
  ssl_certificate /etc/letsencrypt/live/your.domain.com/fullchain.pem;
  ssl_certificate_key /etc/letsencrypt/live/your.domain.com/privkey.pem;
  ssl_protocols TLSv1.2;
  ssl_prefer_server_ciphers on;
  ssl_ciphers &quot;EECDH+AESGCM:EDH+AESGCM:AES256+EECDH:AES256+EDH&quot;;
  ssl_ecdh_curve secp384r1;
</code></pre>
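<p>If you're curious which cipher suites that <code>ssl_ciphers</code> string actually selects, you can expand it with <code>openssl</code> (the exact list depends on your OpenSSL version):</p>

```shell
# Expand the Nginx cipher string into the concrete suites it matches.
# EECDH = ephemeral ECDH (ECDHE), EDH = ephemeral DH (DHE).
openssl ciphers -v 'EECDH+AESGCM:EDH+AESGCM:AES256+EECDH:AES256+EDH'
```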
<h5 id="anamestrongdha41usestrongdhgroup"><a name="strong-dh"></a>4.1 Use strong DH group</h5>
<p>For security reasons, I would strongly recommend using a strong DH group (<a href="https://weakdh.org/sysadmin.html">Here is why</a>).</p>
<p>To generate a strong DH group, 4096 bits</p>
<pre><code>openssl dhparam -out /path/to/dhparams.pem 4096
</code></pre>
<p>It will take some time. After it's done, add this line in the same https server block</p>
<pre><code>ssl_dhparam /path/to/dhparams.pem;
</code></pre>
<h5 id="anamehstsa42usehttpstricttransportsecurity"><a name="hsts"></a>4.2 Use HTTP Strict Transport Security</h5>
<p>I would also recommend using <a href="https://en.wikipedia.org/wiki/HTTP_Strict_Transport_Security">HTTP Strict Transport Security</a>. You just need to add this line in the https block</p>
<pre><code>add_header Strict-Transport-Security &quot;max-age=63072000; includeSubdomains; preload&quot;;
</code></pre>
<p>The complete Nginx config looks like</p>
<script src="https://gist.github.com/ChengLong/47a9ceff0480af0ada2e.js"></script>
<h4 id="5reloadnginx">5. Reload Nginx</h4>
<pre><code>sudo nginx -s reload
</code></pre>
<h4 id="6redirecthttptohttps">6. Redirect Http to Https</h4>
<p>This step is optional. But I think it's very useful. If you want to redirect your users who are still using <code>http</code> to <code>https</code> automatically, add the following http block</p>
<pre><code>  server {
    listen 80;
    server_name your.domain.com;
    return 301 https://$host$request_uri;
  }
</code></pre>
<p>If everything goes right, your site should now be serving https, and redirecting http to https (if you did Step 6). Point your browser to <code>https://your.domain.com</code>. You should see a green lock in the location bar, and you can view the certificate information. It looks like this<br>
<img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-01-31-at-9-57-49-PM.png" alt="Let's Encrypt Nginx"></p>
<p>If you are a bit skeptical about the quality of your cert or your configuration, head over to <a href="https://www.ssllabs.com/ssltest/index.html">SSL Labs</a> to do a test. If you followed my recommendations to <a href="#strong-dh">use a strong DH group</a> and <a href="#hsts">HTTP Strict Transport Security</a>, your site should receive grade <strong>A+</strong>.<br>
<img src="https://chengl.com/content/images/2017/07/Screen-Shot-2016-01-31-at-11-47-48-PM.png" alt="Let's Encrypt Nginx"></p>
<h2 id="anameautorenewaautorenew"><a name="auto-renew"></a>Auto Renew</h2>
<p>By default, Let's Encrypt certs expire in 90 days. Right now, there is no built-in way to renew them automatically. But it's not hard to write a shell script and a cronjob to do the job. You only need three steps</p>
<h4 id="1createwebrootconfigfile">1. Create webroot config file</h4>
<p>To renew, we need to use the <code>webroot</code> plugin.<br>
Create <code>/usr/local/etc/le-renew-webroot.ini</code> with the following content</p>
<pre><code>rsa-key-size = 4096
email = youremail@example.com
domains = your.domain.com
authenticator = webroot
webroot-path = /usr/share/nginx/html
</code></pre>
<p>This file will be used as a config file when running <code>webroot</code>.<br>
Btw, you should have the <code>root</code> directive and a <code>/.well-known</code> location defined in the https server block. They are needed by <code>webroot</code> for the <a href="https://github.com/ietf-wg-acme/acme/blob/master/draft-ietf-acme-acme.md">ACME</a> challenge.</p>
<pre><code>root /usr/share/nginx/html;
location ~ /.well-known {
	allow all;
}
</code></pre>
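<p>To convince yourself the webroot path is wired up correctly, you can simulate the challenge by hand: Let's Encrypt writes a token file under <code>.well-known/acme-challenge/</code> in the webroot and then fetches it over HTTP. A sketch (<code>/tmp/webroot</code> is a stand-in for your real webroot):</p>

```shell
# Stand-in for /usr/share/nginx/html so the sketch runs anywhere.
WEBROOT=/tmp/webroot
mkdir -p "$WEBROOT/.well-known/acme-challenge"
echo "token-contents" > "$WEBROOT/.well-known/acme-challenge/test-token"

# With the root directive and location block above in place, this URL must
# return the file:
#   curl http://your.domain.com/.well-known/acme-challenge/test-token
cat "$WEBROOT/.well-known/acme-challenge/test-token"
```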
<p>Then you can run this to get a new cert</p>
<pre><code>./letsencrypt-auto certonly -a webroot --agree-tos --renew-by-default --config /usr/local/etc/le-renew-webroot.ini
</code></pre>
<h4 id="2createrenewalscript">2. Create renewal script</h4>
<pre><code>sudo curl -L -o /usr/local/sbin/le-renew-webroot https://gist.githubusercontent.com/thisismitch/e1b603165523df66d5cc/raw/fbffbf358e96110d5566f13677d9bd5f4f65794c/le-renew-webroot
sudo chmod +x /usr/local/sbin/le-renew-webroot
</code></pre>
<p>You probably want to take a look at the script <code>/usr/local/sbin/le-renew-webroot</code> and customize it, especially <code>le_path</code> and <code>exp_limit</code>.</p>
<p>You can also run the script manually. If your cert is too new to renew, depending on your configuration in the script, you will see how many days are left. Otherwise, it will renew.</p>
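<p>The heart of that script is a simple "days left" calculation. A minimal sketch (assumes GNU <code>date</code>; the hard-coded <code>exp_date</code> is a placeholder for what <code>openssl x509 -noout -enddate</code> reports on your real cert):</p>

```shell
# Placeholder expiry date; the real script reads it from the cert, e.g.
#   openssl x509 -noout -enddate -in /etc/letsencrypt/live/your.domain.com/cert.pem
exp_date="2099-01-01"
exp_limit=30   # renew when fewer than this many days remain

days_left=$(( ( $(date -d "$exp_date" +%s) - $(date +%s) ) / 86400 ))
if [ "$days_left" -lt "$exp_limit" ]; then
  echo "Renewing: only $days_left days left"
else
  echo "Skipping: $days_left days left"
fi
```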
<h4 id="3cronjob">3. Cronjob</h4>
<p>The last step is to create a cronjob to automatically run <code>/usr/local/sbin/le-renew-webroot</code>, so that you can sleep well and not worry about your cert expiring.</p>
<pre><code>sudo crontab -e
</code></pre>
<p>And add in</p>
<pre><code>30 2 * * 6 /usr/local/sbin/le-renew-webroot &gt;&gt; /var/log/le-renewal.log 2&gt;&amp;1
</code></pre>
<p>This cronjob will attempt to renew the Let's Encrypt cert at 2:30 AM every Saturday.</p>
<h2 id="summary">Summary</h2>
<p>As you can see, setting up Let's Encrypt and making it auto renew is not that hard. It used to be much more painful due to the cost of SSL certs and the effort to create and maintain them, which gave many people (including myself) an excuse not to use <em>https</em>. Now I can't find any excuse not to use <em>https</em>.</p>
<p>Let's Encrypt!</p>
<!--kg-card-end: markdown-->]]></content:encoded></item></channel></rss>