6 Oct 2021 ยท Software Engineering

    What is Bazel – Tutorial, Examples, and Advantages

    12 min read
    Contents

    Bazel is an open-source build tool developed by Google to automate build processes for large-scale software. Companies such as Pinterest, Adobe, SpaceX, Nvidia, and LinkedIn use it, amongst others. In this tutorial, you’ll understand what Bazel is, how it works, and its important benefits. You’ll also learn how you can generate Bazel builds for your monorepo project.

    Why should you use Bazel?

    Bazel works along the lines of other build tools like Make, Maven, or Gradle. However, unlike other tools which are framework-specific, Bazel is tailor-fit for projects with multi-language dependencies.

    For instance, you could have a Rust or Go server, a mobile client on flutter, and a web client on Angular. In this case, if you were to manually write your own build files that cater to each language’s ecosystem, it could be a daunting task. Luckily, Bazel does all the heavy lifting for you.

    bazel tutorial, bazel build

    One of the most intriguing features of Bazel is you can easily hook it up with your project’s CI/CD. This helps you boost the productivity of your teams as you can produce more reliable builds to test your software regularly and rigorously. Consequently, you’ll also be able to ship and release more robust builds easily.

    However, there’s more to Bazel than just handling multi-language dependencies. Let’s explore some of the benefits that make it so powerful.

    What are the benefits of Bazel?

    Here are the key advantages that make Bazel an outstanding build tool:

    Reproducibility: Bazel produces pure functional builds where your output files are strictly dependent on your inputs. This gives two important characteristics to your builds. First, your builds are hermetic, which means only the inputs that you explicitly mention are readable by your build steps. Second, your builds are idempotent or in other words, reproducible. If you’re using a constant set of inputs, Bazel will produce the same build each time. Reproducible builds make debugging your build operations more convenient.

    Functional Builds in Bazel

    Integration with Docker and Kubernetes: Attributed to its versatility, Bazel complements modern platforms and systems like Docker and Kubernetes. For instance, you can have a handy Docker container for your monorepo containing a web client and a bunch of microservices. You can then use it to kickstart a testing environment that resembles your production environment. Thus you’ll be able to test your Bazel builds through your containers. Furthermore, you can enable incremental builds and manage your deployment using an orchestration engine like AWS Fargate or Kubernetes

    Deploying Bazel Build using Docker and Kubernetes

    Scalability: Bazel’s invention arose from one of Google’s internal build tools, Blaze. Within Google itself, Bazel handles builds for projects containing more than 100K source files. In other words, Bazel is perfect for large projects with enormous codebases. While it advocates the monorepo pattern, it handles microservices architecture just as easily. Dropbox used Bazel to scale their CI/CD pipelines to mitigate the number of tests executed on their commits. Uber also adopted Bazel to scale their Go monorepo. They leveraged Bazel’s hermetic builds to support incremental build generation that supports their distributed infrastructure.

    Declarative programming: Writing build configurations for projects should be as simple as possible. Bazel is built using Starlark, a high-level language derived from Python. Thus it gives developers a more convenient way to write build configurations and properties that are easily readable. Additionally, the abstraction in Bazel saves developers from dealing with complicated stuff like compilers and linkers.

    Parallelism and caching: Build tools at scale have to be high performant. Bazel speeds up your builds using a caching mechanism. It intelligently compares your subsequent builds with your previous cached builds and only builds those files which the developers updated. This ensures that Bazel only spends your CPU resources on building those pieces of your project that need to be re-built. Bazel also allows you to generate concurrent builds in a parallel fashion to save time across your distributed codebase. You can generate parallel builds on both a single machine as well as across multiple machines remotely.

    Large, and growing open-source community: Bazel’s first version came out almost 2 years ago in 2019. Since then, Google has pushed more than 100 releases in a span of two years in an attempt to constantly evolve the tool for developers. Bazel’s Github repo has 17.4k stars and has more than 2K tagged questions and more than 6K search results on Stack Overflow. These small yet growing numbers are an indication of the fact that it’s gaining traction quite rapidly amongst the community.

    Bazel terms and basics explained

    Now, let’s look at some important terminology used with Bazel.

    Workspace

    The workspace is typically the directory where Bazel constructs your project’s build files from your source code. It contains various source files in a nested hierarchical fashion. At the highest or root level of your workspace, your project might also have a dedicated WORKSPACE text file. It contains references to all the external dependencies your project requires to generate your builds. Precisely, the workspace is where your inputs are extracted and converted to outputs to generate the desired build file.

    Packages

    Packages are simply directories located below the top-level directory in a workspace. They contain your build files that may be named BUILD or BUILD.bazel , along with other related files and specified dependencies. One package may be nested in another since your source code is organized in a hierarchical fashion. Consider the following working directory:

      src/application/BUILD
      src/application/main/render.txt
      src/application/staging/BUILD


    The application is a package as it contains its own BUILD file. Similarly, staging is another package but since it falls directly under application, it can be considered as a subpackage. However, the main is not a subpackage but only a regular directory inside your application package since it doesn’t have its own BUILD file.

    Targets

    Everything inside your packages can be considered targets. It includes source files that developers of your team wrote and added to your project or generated files that Bazel constructs based on your build configurations. Other than files, targets often involve rules that govern the relationship between your input files and output files. In other words, target rules specify how Bazel is going to construct your build files, the intermediary steps it’s going to take along with every executable operation it will need to perform. Your input files for target rules could be either source files or generated files. Although, your output files are always generated files, since it’s an outcome of the build tool itself. You may specify your target rules to chain your inputs and outputs for consecutive build operations. For instance, you could use a generated file from a previous step as an input file for another step in the future.

    Labels

    The nomenclature of a target is known as a label. It’s just a way to recognize different targets falling under a package and distinguishing them from other targets in the same or different packages.
    Consider the following label:

      @repo//application:application_source

    The above label describes the target application_source inside a package application. Similarly, in the below example main_binary describes a target inside a subpackage main under the package application.

      @repo//application/main:main_binary

    Both targets have different labels and you can expect each label to uniquely identify a target.

    Dependencies

    When you build your specified targets, one or more target may depend on another for the build process. The latter is a dependency. To explain further, consider two targets Target1 and Target2. Let’s say Target1 needs Target2 at build or execution time. We can express this relationship as a Directed Acyclic Graph that demonstrates how Target1 is dependent on `Target12 in this process. Bazel calls this a dependency graph and is used to differentiate between actual and declared dependencies.

    Build Files

    Earlier, we said that under a WORKSPACE every package contains its own BUILD file. Build files contain the program that enables Bazel to generate the desired builds for your project. It’s evaluated using Starlark and contains a list of sequential statements that are executed by Bazel. The build files contain your declared rules, functions that execute these rules, and variables required in a simple syntax form.

    We know that Bazel caches previous builds for speeding up the build process. Whenever your source files change, it is the BUILD file that Bazel references for understanding the underlying changes.

    Commands

    Bazel offers a set of commands that allow you to execute certain operations. The simplest command you can run is to check if Bazel is successfully installed in your system:

      $ bazel --version
    
      bazel 4.0.0

    Other important commands are bazel build, bazel run and bazel test. The build command as the name suggests builds your output files for the specified inputs or targets.

      $ bazel build //foo
    
      INFO: Analyzed target //foo:foo (14 packages loaded, 48 targets configured).
      INFO: Found 1 target...
      Target //foo:foo up-to-date:
        bazel-bin/foo/foo
      INFO: Elapsed time: 9.905s, Critical Path: 3.25s
      INFO: Build completed successfully, 6 total actions

    The run command runs your output files and the test command is equivalent to running both the build and run commands together.

    Bazel Terminologies in a Build Workflow

    Building/compiling code with Bazel

    Now that we’re aware of common Bazel terminology, let’s understand we can build or compile our code with Bazel using the following steps:

    1. The .bazelrc file

    The first step that we do is write our build options in a dedicated .bazelrc file. These options determine the settings that are taken into account every time Bazel builds your project. Here’s how a typical .bazelrc file looks like if you’re building a JavaScript project:

    build --disk_cache=~/.cache/bazel-disk-cache
    
    build --enable_platform_specific_config
    
    build --symlink_prefix=dist/
    
    test --test_output=errors
    
    test:debug --test_output=streamed --test_strategy=exclusive --test_timeout=9999 --nocache_test_results --define=VERBOSE_LOGS=1
    
    run:debug --define=VERBOSE_LOGS=1 -- --node_options=--inspect-brk
    
    build:debug --compilation_mode=dbg
    
    build --nolegacy_external_runfiles
    
    build --incompatible_strict_action_env
    run --incompatible_strict_action_env
    
    coverage --instrument_test_targets
    
    try-import %workspace%/.bazelrc.user

    2. Add the buildifier dependency

    In order to ensure that all your build files are formatted in a similar fashion, Bazel uses a dependency called buildifier. You can install it as a development-time dependency for your project by running:

    npm install --save-dev @bazel/buildifier

    It makes sure that all your builds look the same so as to maintain consistency across various build steps in your project. This additionally aids Bazel when it’s comparing your current build with previous builds using its cached artifacts to speed up your build process.

    3. Create targets

    This is where you write all your source code that Bazel needs to build. Remember how Bazel builds output files for your specified targets? That’s basically the application-level code pertaining to your project. Once you define and declare your targets, Bazel knows which inputs to use while constructing the desired outputs.

    4. Add build rules

    When we’re through with our build options, we need to use some rules to build our project. In this step, you can either chose auto-generated rules or define your own custom rules. It is prudent to utilize auto-generated rules, and bear in mind that there’s a handful of them on Bazel’s official repository. We define these rules inside our build file.

    5. Build/compile project

    After you have specified your build rules, Bazel typically receives your build file as an input and loads it to the analyzer. Next, it produces an action graph based on your targets and dependencies, and excuses build actions on your declared inputs. Finally, it produces the build outputs and stores them in cache artifacts for the subsequent build.

    Bazel Build Process

    Let’s extend this theory in action by seeing step by step how this process allows us to build a JavaScript monorepo with Bazel.

    Building a monorepo project with Bazel

    To get started, we can use @bazel to directly setup Bazel in a new NodeJS project:

    npm init @bazel bazel_js_monorepo

    Here’s what you should get back on the terminal when the above command runs successfully:

    New Bazel Project

    You will notice that now you have buildifier in your project, along with other required files such as the package.json for managing your JavaScript dependencies, your project’s WORKSPACE, and most importantly, the BUILD.bazel file.

    New Bazel Project Workspace

    You’ll also need to install Babel and other related dependencies to transpile your JavaScript code.

    npm install @babel/core @babel/cli @babel/preset-env

    If you head over to the WORKSPACE file, you’ll see how it articulates the dependencies Bazel needs to fetch when it builds your project. You need to explicitly tell Bazel about the rules you’ll use to generate your builds. For brevity, we’ll use auto-generated rules.

    Go to the BUILD.bazel file present in the root of your project and add the following line:

    load("@npm//@babel/cli:index.bzl", "babel")

    Consequently, you will also need to configure Babel for your project. Create a new file called es5.babelrc file in the root of your project. Recall that the purpose of a .babelrc file as we saw earlier is to describe our build options. Add the following code inside it:

    {
        "sourceMaps": "inline",
        "presets": [
          [
            "@babel/preset-env",
            {
              "modules": "systemjs"
            }
          ]
        ]
      }

    The above configuration is similar to how webpack adds Babel to your regular JavaScript project.

    Before you add your build rules, you need to provide your input or source file to Bazel. Essentially, these inputs are what Bazel sees as targets to construct your builds. Let’s create a file called app.js inside the root directory. We’ll create a simple JavaScript class called User with a function that converts the name of the user in uppercase inside it.

    class User{
        constructor(name){
            this.name=name
        }
        uppercaseName(){
            this.name=this.name.toUpperCase();
            return this.name;
        }
    }
    
    const user=new User("FuzzySid")
    user.uppercaseName();

    Next, you need to declare this file as input for your Bazel build. Previously, we loaded the build rules already inside BUILD.bazel file. Let’s use these rules to declare the inputs and outputs for our build inside the BUILD.bazel file:

    babel(
        name = "compile",
        data = [
            "app.js",
            "es5.babelrc",
            "@npm//@babel/preset-env",
        ],
        outs = ["app.es5.js"],
        args = [
            "app.js",
            "--config-file",
            "./$(execpath es5.babelrc)",
            "--out-file",
            "$(execpath app.es5.js)",
        ],
    )

    Finally, you can run the build command to let Bazel do the magic for you:

    npm run build

    You should see the following output on terminal:

    Bazel Build Successful

    Inside your project, you should now have a bazel-out and a dist directory.

    Bazel Build Output directories

    Your transpiled JavaScript code should be present inside dist/bin/app.es5.js. Kudos, you’ve successfully compiled a JavaScript monorepo using Bazel!

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    Avatar
    Writen by:
    Full-stack JavaScript developer who gets his kicks from solving complex problems and crafting pixel-perfect interfaces. Loves startups and geeking over the latest tech trends.