When already-expired node is set to "Never Expire" (expiry is NULL), it does not go back to logged-in status. #681


adam · 2025-12-29T02:21:59+01:00


Originally created by @benmehlman on GitHub (Mar 28, 2024).

Bug description

Using Tailscale's control server: when a node expires, it remains connected to the control server, although it no longer passes tailnet traffic. The node can be restored to operation by selecting "Disable key expiry" in the Tailscale admin UI.
It will start passing traffic again, without having to re-authenticate or take any other action on the node machine itself.

This does not work on headscale.

Environment

OS: Debian 12.4
Headscale version: v0.23.0-alpha5
Tailscale version: 1.62.0

[X] Headscale is behind a (reverse) proxy
Yes, nginx. I have to in order to host the UI. I've been in the Discord and nobody really knows much about expiration behavior; I seem to be the only person active there who is really interested in it right now.

The reverse proxy seems to be working fine, as everything else related to node-server communication is working perfectly and it's been very stable.

[ ] Headscale runs in a container

To Reproduce

These steps assume OIDC is in use...

1. In config.yaml, set the OIDC expiry to a short time so that expiration can be easily observed (e.g. "5m"), and restart the service.
2. Run "tailscale up" on the node with the appropriate parameters to connect to the headscale instance.
3. Complete OIDC login.
4. Observe that the node is connected to the tailnet as normal.

On the headscale server:

5. Wait for the node to expire.
6. Observe that `headscale nodes list` indicates that the node is connected but expired, as expected.
7. Test the node connectivity to confirm that it has stopped passing traffic, as expected.
8. Set the node to "Disable key expiry" by using `sqlite3` to execute: `UPDATE node SET expiry = NULL WHERE id = the_node_id;`
9. Observe that `headscale nodes list` indicates that the node is "online" and expired is "no".
10. Observe that, even after some time is allowed for polling (if necessary), the node does not resume passing traffic, and `tailscale status` on the node remains "Logged out".
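
The database edit described in the reproduction steps can be tried in isolation. The following is a minimal sketch, assuming an in-memory SQLite database and a deliberately simplified, hypothetical `node` table with only `id` and `expiry` columns (the real headscale schema has more columns and may differ between versions). It shows only that the statement clears `expiry` for a single row; nothing in it talks to a running headscale process.

```python
import sqlite3


def disable_key_expiry(conn, node_id):
    """Run the UPDATE from the report and return the number of rows changed."""
    cur = conn.execute("UPDATE node SET expiry = NULL WHERE id = ?", (node_id,))
    conn.commit()
    return cur.rowcount


# Hypothetical, simplified schema used only for this illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE node (id INTEGER PRIMARY KEY, expiry TEXT)")
conn.executemany(
    "INSERT INTO node (id, expiry) VALUES (?, ?)",
    [(1, "2024-03-28 12:00:00"), (2, "2024-03-28 12:05:00")],
)

changed = disable_key_expiry(conn, 1)
rows = conn.execute("SELECT id, expiry FROM node ORDER BY id").fetchall()
print(changed, rows)
```

Using a bound parameter for the node id avoids pasting the id into the SQL string by hand, which is easy to get wrong when editing a live database.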

Logs and attachments

netmap_recover_after_expiry.json (https://github.com/juanfont/headscale/files/14792940/netmap_recover_after_expiry.json)

adam added the stale bug labels 2025-12-29 02:21:59 +01:00

adam closed this issue

2025-12-29 02:21:59 +01:00


@kradalby commented on GitHub (May 1, 2024):

So this will not really work, since changing the database will not trigger any of the mechanisms that update the clients. I would think that if you change the database and restart headscale it might work.

I think essentially what we need is a new command, set-expiry or something similar, which sets a new expiry and makes sure the nodes are appropriately updated.

I'm going to remove this from 0.23.0. It is important, but it is not a regression and should be tackled after.
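
To illustrate the point about update mechanisms, here is a toy model, not headscale's code. The class `ToyControlServer`, its `db` and `sent` attributes, and the `set_expiry` method are all hypothetical names invented for this sketch. It assumes a server that only pushes updates to clients when a change goes through its own code path, which is the behavior the comment describes.

```python
from datetime import datetime, timedelta, timezone


class ToyControlServer:
    """Hypothetical stand-in for a control server; not headscale's code."""

    def __init__(self):
        self.db = {}    # node_id -> expiry (None means "never expire")
        self.sent = []  # updates pushed to connected clients

    def register(self, node_id, expiry):
        self.db[node_id] = expiry

    def set_expiry(self, node_id, expiry):
        """Supported path: store the new value, then notify clients."""
        self.db[node_id] = expiry
        self.sent.append((node_id, expiry))


server = ToyControlServer()
expired_at = datetime.now(timezone.utc) - timedelta(minutes=5)
server.register("node-1", expired_at)

# Direct database edit: storage changes, but no update is pushed.
server.db["node-1"] = None
updates_after_db_edit = list(server.sent)

# Going through the command: storage changes and clients are told.
server.set_expiry("node-1", None)
updates_after_command = list(server.sent)

print(updates_after_db_edit, updates_after_command)
```

In this model the stored value is identical after both operations; only the second one produces a notification, which is why a dedicated command is proposed rather than a database edit.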



@benmehlman commented on GitHub (May 7, 2024):

I did try restarting headscale; it didn't cause the node to come back online. So there is some other detail that is not quite right.

May I suggest that, rather than adding a separate API for set-expiry, you implement PATCH, so that as new columns are added in the future it would be easy to add them to the API without adding more endpoints?

I also suggest adding a separate boolean column for "never_expire". This removes the ambiguity when a node which has never authenticated has expiry = null.
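
A rough sketch of what these two suggestions could look like together, under stated assumptions: the `Node` dataclass, the `PATCHABLE_FIELDS` set, and the `patch_node` and `describe` helpers are hypothetical and do not correspond to headscale's actual API or schema. The sketch shows a PATCH-style partial update and how a separate `never_expire` flag distinguishes "never expires" from "never authenticated" when `expiry` is null in both cases.

```python
from dataclasses import dataclass, replace
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class Node:
    """Hypothetical node record with the proposed never_expire column."""
    id: int
    expiry: Optional[datetime] = None
    never_expire: bool = False


# Fields a PATCH request may change; new columns would be added here.
PATCHABLE_FIELDS = {"expiry", "never_expire"}


def patch_node(node, changes):
    """Apply a partial update, rejecting fields that are not patchable."""
    unknown = set(changes) - PATCHABLE_FIELDS
    if unknown:
        raise ValueError(f"unknown or read-only fields: {sorted(unknown)}")
    return replace(node, **changes)


def describe(node):
    """Interpret expiry and never_expire without ambiguity."""
    if node.never_expire:
        return "never expires"
    if node.expiry is None:
        return "never authenticated"
    return "expires at " + node.expiry.isoformat()


fresh = Node(id=1)
pinned = patch_node(fresh, {"never_expire": True})
print(describe(fresh), "|", describe(pinned))
```

Both `fresh` and `pinned` have a null `expiry`, yet they are described differently, which is the ambiguity the boolean column is meant to remove.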



@github-actions[bot] commented on GitHub (Aug 6, 2024):

This issue is stale because it has been open for 90 days with no activity.



@github-actions[bot] commented on GitHub (Aug 13, 2024):

This issue was closed because it has been inactive for 14 days since being marked as stale.



@HarukaMa commented on GitHub (Aug 13, 2024):

It's only been 7 days, and I think this is still a valid issue (I might run into this in a couple months to be exact).


adam referenced this issue

2025-12-29 02:30:37 +01:00

[PR #684] [MERGED] Fix API router #1569
