[Bug] Sometimes the advertised route is not set as primary route #1147

Closed
opened 2025-12-29 02:28:34 +01:00 by adam · 7 comments

Originally created by @YouSysAdmin on GitHub (Nov 15, 2025).

Is this a support request?

- [x] This is not a support request

Is there an existing issue for this?

- [x] I have searched the existing issues

Current Behavior

Sometimes, for a new node with an automatically approved route, the route is not set as primary, which breaks access for users to the network behind that node.
This is fixed by restarting Headscale; after the restart, the routes are set correctly.

| ID | Hostname | Approved | Available | Serving (Primary) |
|----|----------|----------|-----------|--------------------|
| 81 | bastion | 10.4.0.0/16 | 10.4.0.0/16 | |
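
For context, these tables look like the output of the headscale CLI's route listing. A minimal sketch of the check, assuming headscale v0.26 or later, where route status moved under the `nodes` subcommand:

```bash
# Run on the Headscale host. Assumption: headscale >= 0.26, where the
# old "headscale routes" command was replaced by "nodes list-routes".
headscale nodes list-routes
# A healthy subnet router shows the prefix in all three route columns:
# Approved, Available, and Serving (Primary).
```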

This looks like a regression in v0.27.*; I haven't seen this behavior before.

This is a fairly rare occurrence; I've seen it happen a few times while testing installation and client logout via EC2 User-Data.
Roughly 1 in 10 attempts.
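
For reproduction, a hedged sketch of a loop that exercises the roughly 1-in-10 failure; `$AUTH_KEY`, `$HS_HOST`, the route, and the sleep interval are placeholders, not values from this report:

```bash
# Hypothetical repro loop, run on the subnet-router node ("bastion").
# Assumes a reusable pre-auth key in $AUTH_KEY and a host $HS_HOST that
# can run the headscale CLI (for example the Headscale server over ssh).
for i in $(seq 1 10); do
  tailscale up --advertise-routes="10.4.0.0/16" \
    --auth-key="$AUTH_KEY" --login-server="https://example.com"
  sleep 10  # give Headscale time to elect a primary route
  ssh "$HS_HOST" headscale nodes list-routes | grep bastion
  tailscale logout
done
```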

P.S. I haven't checked what's in the database at that moment; I'll try to check that.

Expected Behavior

| ID | Hostname | Approved | Available | Serving (Primary) |
|----|----------|----------|-----------|--------------------|
| 81 | bastion | 10.4.0.0/16 | 10.4.0.0/16 | 10.4.0.0/16 |

Steps To Reproduce

Add a new node with an advertised route
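
The commands quoted later in this thread suggest a minimal reproduction along these lines (the route, auth key, and login server are placeholders):

```bash
# On the prospective subnet router; values are placeholders.
tailscale up --advertise-routes="10.4.0.0/16" \
  --auth-key="secret" --login-server="https://example.com"
# With route auto-approval configured, the route should then appear
# under Serving (Primary) on the server, but occasionally does not.
```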

Environment

- OS:
- Headscale version:
- Tailscale version:

Runtime environment

- [ ] Headscale is behind a (reverse) proxy
- [ ] Headscale runs in a container

Debug information

The same for both cases.

adam added the bug and regression labels 2025-12-29 02:28:34 +01:00
adam closed this issue 2025-12-29 02:28:34 +01:00

@tobi-dub commented on GitHub (Nov 17, 2025):

Same behavior on my network.
When a node with an already approved route restarts, the route is not served again.

Restarting headscale solves the issue. It takes a couple of seconds until the route gets served automatically.
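
For anyone verifying this, one way to watch the route status around a restart; this assumes Headscale runs under systemd, so adjust for containers:

```bash
sudo systemctl restart headscale
# The route should reappear as Serving (Primary) after a few seconds.
watch -n 2 headscale nodes list-routes
```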


@kradalby commented on GitHub (Nov 25, 2025):

@tobi-dub I've pushed some more changes to https://github.com/juanfont/headscale/pull/2890, can you try again?


@YouSysAdmin commented on GitHub (Nov 26, 2025):

Hi @kradalby
I built headscale from your branch and tested it:

```
u@ip:~/headscale$ git branch
* kradalby/2888-oidc-pol
```

Looks like the behavior persists.

I got the same result: the route was applied normally once, but not the second time.

```
tailscale up --advertise-routes="10.4.0.0/16" --auth-key="secret" --login-server="example.com"
...
tailscale logout
```

Attempt 1:

| ID | Hostname | Approved | Available | Serving (Primary) |
|----|----------|----------|-----------|--------------------|
| 98 | bastion | 10.4.0.0/16 | 10.4.0.0/16 | 10.4.0.0/16 |

Attempt 2:

| ID | Hostname | Approved | Available | Serving (Primary) |
|----|----------|----------|-----------|--------------------|
| 99 | bastion | 10.4.0.0/16 | 10.4.0.0/16 | |

I'll try to provide anonymized logs later; unfortunately, I don't have time to deal with logs right now.


@kradalby commented on GitHub (Nov 30, 2025):

I've made an [rc.1 release for 0.27.2](https://github.com/juanfont/headscale/releases/tag/v0.27.2-rc.1) with fixes; it would be great if you can test this and then close this issue (or give feedback so I can).


@tobi-dub commented on GitHub (Nov 30, 2025):

> I've made an [rc.1 release for 0.27.2](https://github.com/juanfont/headscale/releases/tag/v0.27.2-rc.1) with fixes; it would be great if you can test this and then close this issue (or give feedback so I can).

I tested the rc.1 in my setup successfully. The subnet router is switched automatically to the backup node when the primary one is unavailable.

I don't know why, but it now also worked for v0.27.1. @YouSysAdmin, can you also confirm this?


@kradalby commented on GitHub (Dec 1, 2025):

> I don't know why, but it now also worked for v0.27.1. @YouSysAdmin, can you also confirm this?

Hmm, sounds like there might be something flaky somewhere. But happy to hear it works.


@YouSysAdmin commented on GitHub (Dec 9, 2025):

Hi @kradalby
I tested v0.27.2-rc.1; new nodes work fine, but there are problems with ephemeral nodes.
(Perhaps this is not specific to ephemeral nodes and also applies to ordinary ones; sorry, I haven't checked and won't be able to in the near future.)

  1. Connect a new "ephemeral" client.
  2. Check that all routes are automatically approved.
  3. Disconnect a client (logout).
  4. Verify that a node and route have been deleted.
  5. Connect a new client with the same routes.
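
A minimal sketch of these steps, assuming an ephemeral, reusable pre-auth key; `$HS_HOST`, the user name, and the login server are placeholders, and depending on the headscale version `--user` may expect a numeric ID:

```bash
# 1-2. Connect an ephemeral client and check route approval.
KEY=$(ssh "$HS_HOST" headscale preauthkeys create \
  --user bastion-user --ephemeral --reusable)
tailscale up --advertise-routes="10.4.0.0/16" \
  --auth-key="$KEY" --login-server="https://example.com"
ssh "$HS_HOST" headscale nodes list-routes

# 3-4. Log out; the ephemeral node and its route should be deleted.
tailscale logout
ssh "$HS_HOST" headscale nodes list

# 5. Reconnect with the same routes; this is where Serving (Primary)
#    stays empty until Headscale is restarted.
tailscale up --advertise-routes="10.4.0.0/16" \
  --auth-key="$KEY" --login-server="https://example.com"
```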

After client logout, the log shows this error:

{"level":"error","error":"generating map response for node 102: generating map response for nodeID 102: multiple errors:\n\tnode not found\n\tnode not found\n\tnode not found\n\tnode not found\n\tnode not found\n\tnode not found","worker.id":1,"node.id":0,"change":"Full","time":1765269026,"message":"failed to apply change"}

After reconnecting a client, the behavior is the same: Serving (Primary) is not set until Headscale is restarted.
If I restart Headscale before connecting a client, everything works as expected.

{"level":"info","node.id":102,"node.name":"bastion","time":1765269025,"message":"Deleting ephemeral node during logout"}
{"level":"error","error":"generating map response for node 102: generating map response for nodeID 102: multiple errors:\n\tnode not found\n\tnode not found\n\tnode not found\n\tnode not found\n\tnode not found\n\tnode not found","worker.id":1,"node.id":0,"change":"Full","time":1765269026,"message":"failed to apply change"}
{"level":"error","caller":"/home/runner/work/headscale/headscale/hscontrol/poll.go:401","omitPeers":false,"stream":true,"node.id":102,"node.name":"bastion","error":"node not found: 102","time":1765269034,"message":"Failed to disconnect node bastion"}
{"level":"info","caller":"/home/runner/work/headscale/headscale/hscontrol/poll.go:383","omitPeers":false,"stream":true,"node.id":102,"node.name":"bastion","time":1765269034,"message":"node has disconnected, mapSession: 0xc0003a4f00, chan: 0xc0003f4770"}
{"level":"error","error":"generating map response for node 102: generating map response for nodeID 102: multiple errors:\n\tnode not found\n\tnode not found","worker.id":1,"node.id":24,"change":"NodeNewOrUpdate","time":1765269049,"message":"failed to apply change"}