It is always good to test the performance of distributed applications to ensure it meets the target, and today we’ll learn what is a model and how to build a model to achieve this.
What is a model
A model
is a representation of a system that could be used to track the key characteristics and analyze the behaviour of a system.
Such performance targets could be:
- Response time
- Throughput (work per unit of time)
- Resource Utilisation (think of a cloud provider)
- Energy Consumption
- Relates to CPU frequency, application profile
- Minimize cost (when using cloud resources)
- Minimize number of failures (fault tolerance)
- Trade Computation with Communication
There are two approches two build a model:
- Analytical Model
- Simulation
Analytical Model
Here are some examples for analytical model:
-
Minimum Possible HTTP Transaction Time
$$t = RTT + t_{request} + t_{siteprocessing} + t_{reply}$$ In this equation, we have t for time and RTT for round-trip-time, $t_{request}$ is request size divided by bandwidth, $t_{siteprocessing}$ is the time for processing and $t_{reply}$ is reply size / bandwidth. -
Parallel Computation
$$t_p=t_s*(1-x+x/p)$$ The above equation is called theAmdahl’s Law
, which calculates the total running time $t_p$. It defines a program running in time $t_s$ with only 1 CPU, try run this program with p CPUs, theinherently sequential part
is $1-x$, and the rest of them are the parts could be run parallel. The idea of this law is, the whole system is retricted to the inherently sequential part when we are doing optimization. By considering the derivation of this equation, we could calculatespeedup
by the following equation: $$Speedup = 1/(1-x)$$ For example, if we have a distributed application with 10% inherently sequential part and the rest 90% could be run in parallel, then we have a speedup of $1/(1-0.9) = 10$
Querying Theory
The Querying theory is a much more complicated problem considering the service counters to which customers arrive and join a waiting line (queue), few parameters that could influence the results are: arrvial rate, number of servers, processing time, etc.
The liitle's law
is a useful result for it. Look at the equation $$Avg\_no\_of\_customers\_in = arrival\_rate × service\_time$$
In a steady state
, the above equation holds, which means in the queue, there won’t be stacked requests.
Simulation
Simulation is useful after a large number of samples tested, we simulate what samples and check the result.
We could use Monte-Carlo
to implement simulation which contains two steps:
- Generate random input
- Simulate what follows and check the result
Discrite Event Simulation
In a discrite event simulation, we randomly generate events with some notion of time, and record all results with each events. It is common to use this kind of simulation in such computer games.