Add benchmarks that systematically test node counts from 100 to 50,000 to identify scaling limits and validate performance under load.