Advanced Distributed System - Performance Modelling

Performance Modelling

Posted by UNKNOWN SPACE on Tuesday, December 12, 2023

It is always good to test the performance of distributed applications to ensure it meets the target, and today we’ll learn what is a model and how to build a model to achieve this.

What is a model

A model is a representation of a system that could be used to track the key characteristics and analyze the behaviour of a system. Such performance targets could be:

  1. Response time
  2. Throughput (work per unit of time)
  3. Resource Utilisation (think of a cloud provider)
  4. Energy Consumption
  5. Relates to CPU frequency, application profile
  6. Minimize cost (when using cloud resources)
  7. Minimize number of failures (fault tolerance)
  8. Trade Computation with Communication

There are two approches two build a model:

  1. Analytical Model
  2. Simulation

Analytical Model

Here are some examples for analytical model:

  1. Minimum Possible HTTP Transaction Time
    $$t = RTT + t_{request} + t_{siteprocessing} + t_{reply}$$ In this equation, we have t for time and RTT for round-trip-time, $t_{request}$ is request size divided by bandwidth, $t_{siteprocessing}$ is the time for processing and $t_{reply}$ is reply size / bandwidth.

  2. Parallel Computation
    $$t_p=t_s*(1-x+x/p)$$ The above equation is called the Amdahl’s Law, which calculates the total running time $t_p$. It defines a program running in time $t_s$ with only 1 CPU, try run this program with p CPUs, the inherently sequential part is $1-x$, and the rest of them are the parts could be run parallel. The idea of this law is, the whole system is retricted to the inherently sequential part when we are doing optimization. By considering the derivation of this equation, we could calculate speedup by the following equation: $$Speedup = 1/(1-x)$$ For example, if we have a distributed application with 10% inherently sequential part and the rest 90% could be run in parallel, then we have a speedup of $1/(1-0.9) = 10$

Querying Theory

The Querying theory is a much more complicated problem considering the service counters to which customers arrive and join a waiting line (queue), few parameters that could influence the results are: arrvial rate, number of servers, processing time, etc.

The liitle's law is a useful result for it. Look at the equation $$Avg\_no\_of\_customers\_in = arrival\_rate × service\_time$$ In a steady state, the above equation holds, which means in the queue, there won’t be stacked requests.

Simulation

Simulation is useful after a large number of samples tested, we simulate what samples and check the result.

We could use Monte-Carlo to implement simulation which contains two steps:

  1. Generate random input
  2. Simulate what follows and check the result

Discrite Event Simulation

In a discrite event simulation, we randomly generate events with some notion of time, and record all results with each events. It is common to use this kind of simulation in such computer games.