# Performance Data
# Observe the Delay Added by the Gateway from the Response Header
As shown in the figure below, the response header `x-kong-proxy-latency` (highlighted in the red box) is the processing delay added by the gateway, which is 1 ms in this case. A value of 0 means the delay was less than 1 ms.
The response header `x-kong-upstream-latency` is the time from sending the request to the back-end service until the complete response is received from it, which is 6 ms in this case.
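
The two headers can also be read programmatically rather than from a browser's network panel. The following is a minimal sketch, assuming the response headers are already available as a name-to-value mapping (as most HTTP clients provide); the function name `gateway_latency_ms` is hypothetical and used only for illustration.

```python
from typing import Mapping, Optional, Tuple


def gateway_latency_ms(headers: Mapping[str, str]) -> Tuple[Optional[int], Optional[int]]:
    """Return (proxy_ms, upstream_ms); an entry is None if its header is absent."""
    # HTTP header names are case-insensitive, so normalize them before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}

    def read(name: str) -> Optional[int]:
        value = lowered.get(name)
        return int(value) if value is not None else None

    return read("x-kong-proxy-latency"), read("x-kong-upstream-latency")


# Sample values taken from the example described above (1 ms and 6 ms).
sample = {"X-Kong-Proxy-Latency": "1", "X-Kong-Upstream-Latency": "6"}
proxy_ms, upstream_ms = gateway_latency_ms(sample)
print(f"gateway added {proxy_ms} ms, backend took {upstream_ms} ms")
```

Returning `None` for a missing header keeps "header absent" distinct from a reported value of 0, which means less than 1 ms.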
# Load Test Data
- Number of instances: 1
- CPU: 2 cores
- Memory: 256 MB

| Test Method | Concurrent Clients | Requests per Second | Average Latency | CPU Usage | Memory Usage |
|---|---|---|---|---|---|
| Short Connection | 10 | 4393 | 2.2ms | 100% | 106 MB |
| Short Connection | 30 | 4699 | 6.3ms | 100% | 106 MB |
| Long Connection | 10 | 5928 | 1.6ms | 100% | 106 MB |
| Long Connection | 30 | 5800 | 5ms | 100% | 106 MB |
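
As a sanity check on the table, the throughput and latency columns can be cross-checked with Little's Law (throughput ≈ concurrency / average latency). The sketch below assumes each client sends its next request as soon as the previous response arrives, which the source does not state; the helper name `estimated_rps` is hypothetical.

```python
# (method, concurrent clients, measured requests/s, average latency in ms), copied from the table.
ROWS = [
    ("Short Connection", 10, 4393, 2.2),
    ("Short Connection", 30, 4699, 6.3),
    ("Long Connection", 10, 5928, 1.6),
    ("Long Connection", 30, 5800, 5.0),
]


def estimated_rps(clients: int, avg_latency_ms: float) -> float:
    """Little's Law: throughput = concurrency / latency (latency converted to seconds)."""
    return clients / (avg_latency_ms / 1000.0)


for method, clients, measured, latency_ms in ROWS:
    estimate = estimated_rps(clients, latency_ms)
    print(f"{method:<16} clients={clients:>2} measured={measured} estimated={estimate:.0f}")
```

Under that assumption the estimates land within about 10% of the measured values, which suggests the columns are mutually consistent. It also shows that tripling the number of clients mostly increases latency rather than throughput, as expected when the CPU is already at 100%.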

Based on these results, a gateway node with 1 instance and 2 CPU cores can process roughly 5,000 requests per second (QPS). The platform's default configuration starts 2 instances, for a combined capacity of about 10,000 QPS. If that still does not meet business needs, scale out horizontally by adding instances.
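
The sizing rule above can be written as a small calculation. This is a rough sketch, assuming capacity scales linearly with the number of instances; the function `instances_needed` and its `headroom` parameter (a fraction of capacity held in reserve) are hypothetical and not part of the platform.

```python
import math

PER_INSTANCE_QPS = 5000  # approximate capacity of one 2-core instance, from the text above


def instances_needed(target_qps: float,
                     per_instance_qps: float = PER_INSTANCE_QPS,
                     headroom: float = 0.0) -> int:
    """Estimate how many gateway instances are needed to serve target_qps.

    headroom is the fraction of each instance's capacity held in reserve,
    e.g. 0.2 plans for each instance running at no more than 80% of its limit.
    """
    if not 0.0 <= headroom < 1.0:
        raise ValueError("headroom must be in [0, 1)")
    usable_qps = per_instance_qps * (1.0 - headroom)
    # Always run at least one instance, even for very small targets.
    return max(1, math.ceil(target_qps / usable_qps))


print(instances_needed(10_000))                # default sizing from the text
print(instances_needed(10_000, headroom=0.2))  # same target with 20% kept in reserve
```

Because the measured CPU usage was already 100% at about 5,000 QPS, leaving some headroom is a reasonable precaution when sizing for peak traffic.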