Contents
Traffic and Capacity Terminology
This section defines and describes the relationships between these capacity and traffic measurement metrics in a GVP system:
Capacity Metrics and Formulas
- Call Arrivals Per Second (CAPS)
- Port Density (PD), also known as Peak Ports
Performance Metrics and Formulas
- Call Duration (CD) and Peak Capacity (PC)
- Call Setup Latency (CSL)
- Caller Perceived Latency (CPL), also known as Response Time Latency
- Call Passrate (PR)
System capacity is defined as a function of the maximum number of ports (PD) or maximum call-arrival rate (CAPS) at which GVP can maximize its use of hardware resources while maintaining all of the criteria within a predefined threshold.
This section also provides the formulas used to calculate capacity and performance.
Capacity Metrics and Formulas
Two units of measure are used for capacity planning: Call Arrivals Per Second and Port Density.
Call Arrivals Per Second (CAPS)
CAPS measures traffic within the system. For example, 10 CAPS means that GVP is receiving 10 calls every second, which is considered busy traffic. CAPS is similar to Busy Hour Call Attempts (BHCA) or Centum Call Seconds (CCS), which is the legacy engineering term for telephony traffic.
Use the following formula to calculate CAPS in terms of CCS:
CAPS = CCS/36
CAPS measures can be applied to components which handle messages or data associated with a call. For example, the reporting server will have a CAPS value based on the number of call records written to it, which will often relate one-to-one with a completed call.
Throughout this chapter, including the tables, the capacity of a function and/or component is defined by its Peak CAPS (the maximum number of calls per second that the system can handle for that function without suffering from latency). See the Capacity Criteria section below for more information.
Port Density (PD) and Average Port Density
PD is the maximum number of simultaneous calls that can be served by GVP at any given time. In the tables of this document, PD is called “Peak Ports” because it also specifies the number of ports that are required to handle the call traffic.
Use the following formula to calculate Port Density:
PD = CAPS x Avg(CD)
...where Avg = Average.
Average Port Density (APD) is the average number of simultaneous calls that are being served by GVP at any given time. The formula to calculate APD is:
APD = CAPS x Average(Call Duration)
Due to PD being a random variable, it can be higher than its average value APD. Use the following formula to calculate Port Density:
PD = APD + 3*SQRT(APD)
...where SQRT(x) is the square root of x.
Performance Metrics and Formulas
Four units of measure are used to assess performance—Call Duration, Call Setup Latency, Caller Perceived Latency, and Call Passrate.
Call Duration (CD) and Peak Capacity
CD is the length of time that a call stays in the GVP system. Use CAPS and CD to calculate the port density required for handling such traffic.
Instead of measuring individual Caller Perceived Latencies within an application under test, you can use data collected by GVP to measure the increase in the total call duration to determine system loading.
For a single call, the assumption is that the normal call duration (where the hang-up is performed by the application) is CD1. When the load increases on the system, the call duration is expected to increase due to an increase of latencies within the application. Assume that for x simultaneous calls in the system, the measured average call duration is Avg(CDx) and the measured 95th percentile call duration is 95%tile(CDx). The capacity measurement goal is
Avg(CDx) / CD1 <= 110%
95%tile(CDx) / CD1 <= 120%
When the 110% (and 120%) threshold is reached, the call volume x is considered to be the Peak Capacity (PC) for this criterion.
Call Setup Latency (CSL)
CSL is the delay between the initial SIP INVITE message and the first audible RTP packet sent from GVP. One example is the dialogue in a typical call flow shown in the figure Typical SIP Call Flow #1.
Figure: Typical SIP Call Flow #1
CSL consists of the following requests and responses:
- User SIP INVITE request received > SIP 200 OK response sent.
- SIP 200 OK response sent > User SIP ACK request sent.
- User SIP ACK request sent > First audible media response sent.
Figure: Typical SIP Call Flow #2
CSL is the delay between when the initial SIP INVITE (top line from the typical call flow shown in the figure Typical SIP Call Flow #2) is received to the time that the first audible packet (bottom line from the diagram above) is sent out by GVP.
Other than call setup latency, SIP INFO response latency (from SIP INFO with MSML embedded to 200 OK response from MCP for SIP INFO request) is also an important factor. It should be measured and reported separately, although this duration is already part of call setup latency.
Caller Perceived Latency (CPL)
CPL, also known as Response Time Latency (RTL) is the time between the last user input (speech or DTMF) and the next prompt. In the Figure Caller Perceived Latency (below), the time between T1 and T2 is the period of CPL.
Use the following formula to calculate CPL:
CPL = T2 - T1
CPL is impacted by the following factors:
- Recognition engines.
- End of speech or a DTMF time out.
- Application pages and prompts.
- Grammars caching and fetching mechanisms.
- The size of application pages.
- Call traffic, including call arrival rate and call duration.
- Speech density—during a call, the percentage of time that speech or DTMF detection is on and the caller can provide input.
- The size of speech recognition grammars and the how often they are used in an application.
- Back end operation—the length of time required to obtain information from the business layer (such as, database queries and CRM findings) and return the results to the caller.
Call Passrate (CP)
CP is the number of calls that finished the predefined call flow successfully during a performance load test.
Use the following formula to calculate the CP:
(Assume the number of calls is 1000 and 5 calls did not finish the predefined call flow successfully.)
(1000-5)/1000 = 99.5%
The capacity measurement formulas are:
ASR/TTS-dependent application |
Passrate >= 99.95% |
Error rate <= 0.05% |
DTMF-only application |
Passrate >= 99.99% |
Error rate <= 0.01% |