Peering/federating between headscale instances #486

Closed
opened 2025-12-29 02:18:57 +01:00 by adam · 1 comment
Owner

Originally created by @infogulch on GitHub (Apr 21, 2023).

In [Multi Server Support #588](https://github.com/juanfont/headscale/issues/588), kradalby said:

There are scenarios where multiple servers would make sense and allowing them to connect would also make sense:

  • Multiple headscale servers, allowing two companies/owners to share nodes between them

This is the use case that I am interested in. If this is solved correctly then it could also solve part of the redundancy/partition-tolerance use case.

Design

Based on general knowledge about how Tailscale and Headscale work, I suspect this design could solve the "sharing nodes" use case and also be reasonably implementable:

  1. IPv6-only (wait, hear me out)
  2. Each headscale server allocates all of its addresses out of a 20-bit block (i.e. it owns a /108 prefix), about ~1 million addresses total
  3. The remaining 108 bits are derived from a fingerprint of the control server's private key (see the sketch after this list)
  4. Independent headscale servers can opt in to peer/federate with each other.
    • This would come down to accepting each other's private keys and domains, set by a config file or something.
  5. New ACLs so that both control servers can manage access to and from their respectively owned nodes
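
To make steps 2–3 concrete, here is a minimal sketch in Go (headscale's own language) of how a server might derive its /108 prefix from a key fingerprint and number its nodes within the remaining 20 bits. Everything here — the function names, the SHA-256 fingerprint, the example key bytes — is illustrative, not headscale's actual code:

```go
package main

import (
	"crypto/sha256"
	"encoding/binary"
	"fmt"
	"net/netip"
)

// serverPrefix derives a per-server /108 prefix by hashing the control
// server's key material and keeping the first 108 bits of the digest.
// (The post says "private key"; a fingerprint of the public key would
// work the same way — the choice here is an assumption.)
func serverPrefix(controlKey []byte) netip.Prefix {
	digest := sha256.Sum256(controlKey)
	var addr [16]byte
	copy(addr[:], digest[:14]) // 14 bytes = 112 bits copied...
	addr[13] &= 0xF0           // ...then zero bits 108-111 to keep exactly 108
	addr[14], addr[15] = 0, 0  // zero the remaining host bytes
	return netip.PrefixFrom(netip.AddrFrom16(addr), 108)
}

// nodeAddr places a 20-bit node index (idx < 2^20) into the low bits
// of the server's prefix, yielding that node's stable address.
func nodeAddr(prefix netip.Prefix, idx uint32) netip.Addr {
	a := prefix.Addr().As16()
	a[13] |= byte((idx >> 16) & 0x0F) // top 4 bits of the index
	binary.BigEndian.PutUint16(a[14:], uint16(idx))
	return netip.AddrFrom16(a)
}

func main() {
	p := serverPrefix([]byte("example-control-key"))
	fmt.Println(p, nodeAddr(p, 1))
}
```

With this scheme, two peered servers never have to coordinate address allocation at all: each hands out addresses only inside its own fingerprint-derived prefix, so federation (step 4) reduces to exchanging fingerprints and node lists.
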

There are no race conditions because servers allocate from different IP pools. 108 bits is enough to make benign collisions virtually impossible, with a clear workaround: one control server 'just' changes its private key and prefixes (I said the solution is clear, not easy). I'm guessing that on an actively updated network, 1M nodes is probably near the useful limit for a single headscale instance anyway.
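
For a sense of scale, a back-of-the-envelope birthday bound (assuming the 108-bit prefixes behave like uniform random hashes):

$$
P[\text{collision}] \;\approx\; \binom{n}{2}\,2^{-108} \;\approx\; \frac{n^2}{2^{109}},
\qquad n = 10^6 \;\Rightarrow\; P \approx \frac{10^{12}}{6.5 \times 10^{32}} \approx 1.5 \times 10^{-21}.
$$

So even a million federated servers would sit more than twenty orders of magnitude below even odds of a single prefix collision.
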

Use cases

  • For me, I'd like it to be easy to communicate privately and directly between my devices and my friends' devices without either of us giving up agency over our own headscale control servers. For example, if we meet at one of our homes or at a coffee shop, we could securely taildrop a file over the local Wi-Fi.
  • For someone like 0n1cOn3 from the original issue, they could set up two separate headscale instances, one in each locale, and peer them. This will make their system tolerant to either side being down or the internet being partitioned, though it doesn't fix the local outage aspect of crashes.

Inspiration

This design is inspired by [Yggdrasil Network](https://yggdrasil-network.github.io/). In [contrast](https://yggdrasil-network.github.io/2019/03/25/peering.html) to Yggdrasil, this design retains more control over which nodes peer with each other, and completely sidesteps the entire Globally Synced Node Database problem and its associated [collision vulnerabilities](https://yggdrasil-network.github.io/2018/07/28/addressing.html#addressing-in-yggdrasil). And then there are practical matters like performance... I will stop short of proposing that this could eventually grow into a global mesh network at the same scale as Yggdrasil, not because it's necessarily impossible but because these simple use cases are more relevant to the project at this time.

Thoughts?

adam added the enhancement and out of scope labels 2025-12-29 02:18:57 +01:00
adam closed this issue 2025-12-29 02:18:57 +01:00
Author
Owner

@kradalby commented on GitHub (May 10, 2023):

While we appreciate the suggestion, it is out of scope for this project and not something we will work on for now.
