[Bug] Headscale embedded Derper Server speed slow #1070

Closed
opened 2025-12-29 02:28:06 +01:00 by adam · 3 comments

Originally created by @kocy33 on GitHub (Jul 22, 2025).

Is this a support request?

  • This is not a support request

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

Hello,

I am running Headscale with the embedded Derp Server on a VPS with docker compose.
The iperf3 results from the VPS show fast speeds, and monitoring with htop I only see about 10% CPU utilization.
But I can only get approx. 1-2 MB/s throughput.

I have also tried public DERP servers, but that results in much worse latency and speed (700 KB/s).
I run through 5G with 464XLAT, and my local upload speed is 100 Mbit/s (so approx. 12 MB/s).
Is that the expected speed for Headscale?
Or did I misconfigure something?

My idea was to maybe run a WireGuard tunnel from the 5G WAN at home to the VPS.
(Because I can't open ports, since the 5G WAN sits behind CGNAT.)
Would that be useful?
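A tunnel like that could be sketched as follows. This is a minimal, hypothetical WireGuard config; every key, address, and endpoint here is a placeholder, not taken from this setup:

```ini
# Home side (behind CGNAT): initiates the tunnel outward to the VPS,
# so no inbound ports need to be opened on the 5G WAN.
[Interface]
PrivateKey = <home-private-key>
Address = 10.10.0.2/24

[Peer]
# VPS side, reachable on a public IP.
PublicKey = <vps-public-key>
Endpoint = vps.example.com:51820
AllowedIPs = 10.10.0.0/24
# Keep the CGNAT mapping alive so the VPS can still reach back in.
PersistentKeepalive = 25
```

The `PersistentKeepalive` setting is what makes this work behind CGNAT: the home side keeps the outbound NAT mapping open so traffic can flow both ways.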

My infrastructure currently looks like this:
client announcing IPv4 routes > pfSense > 5G WAN > VPS running the control server with embedded DERP.
(IPv6 is disabled everywhere.)

I also tested upload directly with iperf3 from the client to the VPS, and that shows no issue.

I tried accessing it from multiple outside clients, all with slow speed.

Does the embedded DERP server have bad performance? Is it advised to run a separate Docker container for the DERP server?
If so, could someone recommend one?
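For reference, the standalone relay Tailscale ships is the `derper` binary; a minimal way to build and run it could look like the sketch below. The hostname is a placeholder, and it assumes TCP 443 (plus UDP 3478 for STUN) is reachable on the host:

```shell
# Build the derper binary from the Tailscale repo (requires Go).
go install tailscale.com/cmd/derper@latest

# Run it with automatic Let's Encrypt certificates for the given hostname.
~/go/bin/derper -hostname derp.example.com -certmode letsencrypt
```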

The latency of the embedded DERP server seems to be good, since I can run 6 camera streams in real time.
(On public DERP servers that is not possible.)

iperf3 from VPS result:

[ 5] 5.00-6.00 sec 119 MBytes 997 Mbits/sec

[ 7] 5.00-6.00 sec 132 MBytes 1.11 Gbits/sec

[ 9] 5.00-6.00 sec 101 MBytes 846 Mbits/sec

[ 11] 5.00-6.00 sec 124 MBytes 1.04 Gbits/sec

[SUM] 5.00-6.00 sec 476 MBytes 4.00 Gbits/sec

Expected Behavior

Little overhead, but fast speed.

Steps To Reproduce

Use the embedded DERP server.
Observe slow speeds.

Environment

- OS: Debian / Windows
- Headscale version: 0.26.1
- Tailscale version:

Runtime environment

  • Headscale is behind a (reverse) proxy
  • Headscale runs in a container

Debug information

Relay derp used because no direct connection possible through 5gwan

adam added the bug label 2025-12-29 02:28:06 +01:00
adam closed this issue 2025-12-29 02:28:06 +01:00

@kradalby commented on GitHub (Jul 22, 2025):

Does the embedded DERP server have bad performance? Is it advised to run a separate Docker container for the DERP server?
If so, could someone recommend one?

I have no particular expectations about the speed; it shares part of the web server with Headscale. If you're after performance, I would try running the separate `derper` binary from Tailscale.

I've never speed-tested the embedded DERP, nor have we done anything to ensure it's fast. I would view it as purely for convenience.
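If one did switch to a dedicated relay, the Headscale side would roughly amount to disabling the embedded server and serving a custom DERP map instead. A sketch of the relevant `config.yaml` fragment follows; the file path is a placeholder:

```yaml
derp:
  server:
    # Disable the embedded DERP server.
    enabled: false
  # Serve nodes a custom DERP map from a local file
  # that points at the dedicated derper instance.
  paths:
    - /etc/headscale/derp.yaml
```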


@kocy33 commented on GitHub (Jul 23, 2025):

Aye!
I am pretty happy with headscale!
Getting it up and running was smooth, pretty much drop-in. :)
Since this was the first time setting it up, I just wasn't sure what speeds to expect.

Since writing this I have done some more tests.
I set up the sending side with open ports and then tested the other side once relayed and once with open ports.
The directly connected side got full speed, and the relayed side got approx. half of it.
That made sense.

Do you reckon running a separate derper server would generally gain more speed compared to the embedded one?
Would running a Tailscale client on the VPS that also runs the control server / embedded derper be beneficial?

I'll close this, since it's expected behavior rather than a bug, I guess.


@kradalby commented on GitHub (Jul 23, 2025):

The main reason to run tailscale and derper on the same host is that derper supports a flag to verify clients, so that only nodes from your own server can use the DERP, I believe, but I have never used it.

Otherwise I would expect better performance with a dedicated DERP server.
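The flag in question appears to be derper's `-verify-clients` option, which checks connecting clients against a tailscaled instance running on the same machine. A sketch, with the hostname as a placeholder:

```shell
# tailscaled must be running on this host and logged in to the tailnet;
# derper then only relays traffic for nodes that tailscaled knows about.
derper -hostname derp.example.com -verify-clients
```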


Reference: starred/headscale#1070