# Performance Data

# Observe the Delay Added by the Gateway from the Response Header

As shown in the figure below, the response header x-kong-proxy-latency in the red box is the processing delay added by the gateway, which is 1ms in this case. If 0 is displayed, it is less than 1ms.

The response header x-kong-upstream-latency is the delay from sending the request to the back-end service to receiving the complete response from the backend, which is 6ms here.

# Pressure Test Data

Number of instances: 1
CPU: 2 cores
Memory: 256 MB

Pressure Test Method	Number of Concurrent Clients	Requests Per Second	Average Delay	CPU Usage	Memory Usage
Short Connection	10	4393	2.2ms	100%	106 MB
Short Connection	30	4699	6.3ms	100%	106 MB
Long Connection	10	5928	1.6ms	100%	106 MB
Long Connection	30	5800	5ms	100%	106 MB

The request processing capacity of a gateway node with 1 instance and 2 cores of CPU is about 5,000 QPS. According to the default configuration of the platform, 2 instances will be started to achieve the processing capacity of 10,000 QPS. If still unable to meet the business needs, carry out horizontal scaling.

← API Authentication Instructions for Special Status Code →