Groups in ZeroTier Rules

An in-depth how-to on partitioning your ZeroTier network using rules.

The Problem We’re Solving

With ZeroTier the recommended way to partition nodes (that shouldn’t talk to each other) is to put them on separate ZeroTier networks¹. It would look like this:

This approach is foolproof because it doesn’t require any rules, and is absolute in locking down all traffic (since different ZeroTier networks cannot communicate with each other).

It’s possible to have one node on several networks. For example you can have a hub-and-spoke topology where one node talks to all other nodes, but all other nodes cannot talk to each other:

There are two downsides to this approach:

Joining the network has to be done at the node’s ZeroTier client, so you have to go back and forth between all your nodes and ZeroTier Central.
In the example above, every time you want to extend the network by adding a new node, you have to also create a new network. You have to join that network from both the new node and the hub of the network. That’s really annoying and inflexible.

Using ZeroTier rules it is possible to have the same kind of topology with a single ZeroTier network that has the following properties:

Instead of networks we’ll have groups.
Nodes can belong to multiple groups.
Nodes that share a group will be able to talk to each.
Nodes that share no common groups will NOT be able to talk to each other.
Management of all traffic will just be a matter of editing rules and group membership in ZeroTier Central. No editing at the clients is required.
Everything will be zero-trust²: new nodes will be isolated from the network until you grant them group membership.

Prerequisites

A ZeroTier network, w/ access on ZeroTier Central
Several client nodes joined to the network
A basic understanding of ZeroTier’s Rule Engine

Understanding Tags and the ZeroTier Central Rule Editor

In ZeroTier Central you’ll see a section called “Flow Rules.” On the left you see the rules in ZeroTier’s Domain Specific Language and on the right you see a preview of the JSON representation (which you can ignore):

Flow Rules

At the time of writing the default rules are:

1
#
2
# Allow only IPv4, IPv4 ARP, and IPv6 Ethernet frames.
3
#
4
drop
5
  not ethertype ipv4
6
  and not ethertype arp
7
  and not ethertype ipv6
8
;
9

10
# Accept anything else. This is required since default is 'drop'.
11
accept;

Erase everything and start with a blank text area³. We’ll start by defining a group tag:

1
# Create a tag for group membership
2
tag group
3
  id 1000 # All tags must have a unique id.
4
  default 0 # Default = No group membership. Zero trust.
5

6
  # These flags are what you edit. The names can be anything.
7
  flag 0 productivity
8
  flag 1 media
9
  flag 2 gaming
10
  flag 3 infrastructure
11
;

Tags are 32-bit unsigned integers associated to your node (or, according to the docs, “32-bit numeric key-value pair credentials”). Everything above helps ZeroTier Central render a GUI for editing the value. For example, with the rules above we get this in ZeroTier Central’s Tags Matrix:

Rules and Resulting Tag Matrix

And this flag editor which you can see by clicking the wrench icon next to any node in the Members section:

You can use any id you want as long as it is unique across all tags in your rules.

The id and default lines are relevant to the rules engine but flag and enum fields are actually invisible to the rules engine and only serve to help render the GUI editor for the fields. In the end, each node gets an integer value for the group tag:

What is used in the Tags Matrix

Now when you check a flag, the tag value will change:

And if you check TWO flags, the tag value will capture that:

Checked Flags

The tag value is conceptually NOT 2 + 4 = 6 but rather a bitwise OR: 2 | 4 = 6. What does that mean? Well, we take the decimal values of the flags and convert to binary:

 ┌───────┐
 │0|0|1|0│ 2 (media)
 └───────┘
 ┌───────┐
 │0|1|0|0│ 4 (productivity)
 └───────┘

Then line up the columns. Any column that has a 1 in it in either position keeps the 1. Every other column is a 0:

 ┌─┬─┬─┬─┐
 │0│0│1│0│ 2 (media)
 │0│1│0│0│ 4 (productivity)
 └─┴─┴─┴─┘
  ▼ ▼ ▼ ▼  2 | 4
 ┌─┬─┬─┬─┐
 │0│1│1│0│ 6
 └─┴─┴─┴─┘

0110 in decimal is 6, which is our tag value. This is called a bitwise OR.

It just so happens that ADDing bit flag values produces the same result as ORing them, but we need to get in the bitwise operation mindset for later.

Rule Setup for Groups

The flag lines are what you will edit. Each flag represents one group. When adding a group make sure you increment the flag value (e.g. we would use flag 4 <group-name> to add a group to the rules above). With my setup you are limited to 31 flags (ZeroTier’s rule engine limits you to 32 flags, normally⁴). So flag 30 is the highest value you should use (flags are 0-indexed):

1
flag 0 productivity
2
flag 1 media
3
flag 2 gaming
4
flag 3 infrastructure
5
flag 4 security
6
...
7
flag 30 highestFlagYouCanUse

Next we can leave the default drop rule:

1
#
2
# Allow only IPv4, IPv4 ARP, and IPv6 Ethernet frames.
3
#
4
drop
5
  not ethertype ipv4
6
  and not ethertype arp
7
  and not ethertype ipv6
8
;

Next we create the rule that does the magic:

1
# Drop any traffic between nodes that don't share at least one group
2
break
3
  tand group 0
4
;

This looks up the value for the group tag for each side of the traffic (sender and receiver) and bitwise ANDs them together. If the resulting value is 0, which will only happen when the sender and receiver share no groups, we break.

break is defined as:

Terminate evaluation of this rule set but continue evaluating capabilities.

It’s like drop but can be overridden by a capability.

tand is defined as:

Tags ANDed together equal value

An AND operation is like the OR we did above, except that we only keep 1 when both values in the column are 1. For example, if Alice is in the media and gaming groups and Bob is in the productivity group:

 ┌─┬─┬─┬─┐
 │0│1│1│0│ Alice = 6 (media | gaming)
 │0│0│0│1│ Bob = 1 (productivity)
 └─┴─┴─┴─┘
  ▼ ▼ ▼ ▼  6 & 1
 ┌─┬─┬─┬─┐
 │0│0│0│0│ 0
 └─┴─┴─┴─┘

The result of 6 & 1 = 0 would cause us to break and block the traffic. But if we add Bob to the media group:

 ┌─┬─┬─┬─┐
 │0│1│1│0│ Alice = 6 (media | gaming)
 │0│0│1│1│ Bob = 3 (media | productivity)
 └─┴─┴─┴─┘
  ▼ ▼ ▼ ▼  6 & 3
 ┌─┬─┬─┬─┐
 │0│0│1│0│ 2
 └─┴─┴─┴─┘

The result of 6 & 3 = 2 does NOT break. Instead we flow into our last rule: an unconditional accept.

1
# default to accept
2
accept;

Putting that altogether, our rules should now read:

1
# Create a tag for group membership
2
tag group
3
  id 1000 # All tags must have a unique id.
4
  default 0 # Default = No group membership. Zero trust.
5

6
  # These flags are what you edit
7
  flag 0 productivity
8
  flag 1 media
9
  flag 2 gaming
10
  flag 3 infrastructure
11
;
12

13
#
14
# Allow only IPv4, IPv4 ARP, and IPv6 Ethernet frames.
15
#
16
drop
17
  not ethertype ipv4
18
  and not ethertype arp
19
  and not ethertype ipv6
20
;
21

22
# Drop any traffic between nodes that don't share at least one group
23
break
24
  tand group 0
25
;
26

27
# default to accept
28
accept;

“All Group” Nodes

If you want a node to belong to all groups, we could just check all the group flags we’ve defined. That would work… for now. But if you add a flag you have to remember to go through all the all-group nodes and check the new flag you added. An allgroups enum solves this.

enum, unlike flag, gives you a dropdown to choose from:

enum example

Any enum value that is selected will totally overwrite any flags that are checked. The result is that we can create an enum dropdown value that means “all flags I could possibly define.” In bitwise logic speak that would be:

1
all group flags = flag 0 | flag 1 | flag 2 | flag 3 | ... | flag 30

Which is:

┌────────────────────────────────────────────┐
│00000000000000000000000000000000000000000001│
│00000000000000000000000000000000000000000010│
│00000000000000000000000000000000000000000100│
│00000000000000000000000000000000000000001000│
│00000000000000000000000000000000000000010000│
│                    ...                     │
│00001000000000000000000000000000000000000000│
│00010000000000000000000000000000000000000000│
│00100000000000000000000000000000000000000000│
│01000000000000000000000000000000000000000000│
│10000000000000000000000000000000000000000000│
└────────────────────────────────────────────┘
 ▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼▼
┌────────────────────────────────────────────┐
│11111111111111111111111111111111111111111111│
└────────────────────────────────────────────┘

That large number is 2^31 or 2,147,483,648 so we use that as our enum value. If selected, it would equivalent to selecting all possible flag values. So we add the enum:

1
# Create a tag for group membership
2
tag group
3
  id 1000 # All tags must have a unique id.
4
  default 0 # Default = No group membership. Zero trust.
5

6
  # These flags are what you edit
7
  flag 0 productivity
8
  flag 1 media
9
  flag 2 gaming
10
  flag 3 infrastructure
11

12
  # This special value means "access to all groups"
13
  enum 2147483647 allgroups # <=========================== add this
14
;

And the UI for making an all-groups node looks like this:

All Groups Enum Example

Now, even if new flags are added in the future, Alice will continue to be able to communicate with any group.

An Alternative: Capabilities

I prefer my allgroups enum for its simplicity but an alternative would be to create a superuser capability. You would add a capability like so:

1
# Create a capability called "superuser" that lets its holders override all but "drop"
2
cap superuser
3
  id 1000 # arbitrary, but must be unique
4
  accept; # allow with no match conditions means allow anything and everything
5
;

Capabilities override break rules but NOT drop rules, so you’d have to pay attention to that when crafting and placing any drop rule. Our group rule used break already so it is ready to be overridden by such a capability.

One difference between this superuser capability and the allgroups enum is that the enum still doesn’t let a node communicate with nodes that have NO group memberships at all. The enum just makes a node belong to all groups, while this capability would allow a superuser to bypass the group break rule entirely. For my arrangement I actually prefer to treat no-group nodes as completely isolated, but feel free to use the capability instead if it suits your setup better.

Final Rules

1
# Create a tag for group membership
2
tag group
3
  id 1000 # All tags must have a unique id.
4
  default 0 # Default = No group membership. Zero trust.
5

6
  # These flags are what you edit
7
  flag 0 productivity
8
  flag 1 media
9
  flag 2 gaming
10
  flag 3 infrastructure
11

12
  # This special value means "access to all groups"
13
  enum 2147483647 allgroups
14
;
15

16
#
17
# Allow only IPv4, IPv4 ARP, and IPv6 Ethernet frames.
18
#
19
drop
20
  not ethertype ipv4
21
  and not ethertype arp
22
  and not ethertype ipv6
23
;
24

25
# Drop any traffic between computers that don't share at least one group
26
break
27
  tand group 0
28
;
29

30
# default to accept
31
accept;

This is in stark contrast to Tailscale where you can only connect to one network with one account and then everything has to be managed with ACLs. Tailscale’s approach has its upsides (e.g. ACLs are way easier than ZeroTier rules) and it’s downsides (e.g. you can’t use work and personal Tailscale accounts on the same node, ACLs can’t cross organizations, etc). ↩
https://en.wikipedia.org/wiki/Zero_trust_security_model ↩
I recommend editing your rules in a version-controlled repository elsewhere and pasting it into the ZeroTier Central UI to make changes. This gives you a backup of your rules in addition to the versioning. ↩
You might be wondering: “why use 2^31 instead of 2^32 since the latter would allow us to have 32 flags instead of 31?” The answer to that is simple: ZeroNS cannot parse enum values greater than the max 32-bit integer value because it cannot understand that enum values are UNSIGNED integers: https://github.com/zerotier/zeronsd/issues/126. By dropping the enum value to 2^31 we gain ZeroNS compatibility. ↩