6.21.3.1.2.6. Boot and delete VM with disabling management network on one of controllers

This report is generated on results collected by execution of the following Rally scenario:

---
  NovaServers.boot_and_delete_server:
    -
      args:
        flavor:
            name: "m1.micro"
        image:
            name: "(^cirros.*uec$|TestVM)"
        force_delete: false
      runner:
        type: "constant_for_duration"
        duration: 600
        concurrency: 4
      context:
        users:
          tenants: 1
          users_per_tenant: 1
      hooks:
        -
          name: fault_injection
          args:
            action: disconnect management network on one node with nova-scheduler service
          trigger:
            name: event
            args:
              unit: iteration
              at: [50]

6.21.3.1.2.6.1. Summary

In this scenario we disable management network interface on one of controllers (in Fuel architecture controller runs DB, MQ, API services, scheduler). This emulates the case with networking outage (network port failure on machine or switch).

The outage causes all services to become unreachable from outside. Moreover, the cluster remains broken even 10 minutes after the fault.

Service downtime, s

MTTR, s

Absolute performance degradation, s

Relative performance degradation, ratio

358.0 ±2.7

149.0 ±2.1

24 ±17

5.7 ±3.4

Metrics:
  • Service downtime is the time interval between the first and the last errors.

  • MTTR is the mean time to recover service performance after the fault.

  • Absolute performance degradation is an absolute difference between the mean of operation duration during recovery period and the baseline’s.

  • Relative performance degradation is the ratio between the mean of operation duration during recovery period and the baseline’s.

6.21.3.1.2.6.2. Details

This section contains individual data for particular scenario runs.

6.21.3.1.2.6.2.1. Run #1

../../../../../../_images/plot_118.svg

6.21.3.1.2.6.2.1.1. Baseline

Baseline samples are collected before the start of fault injection. They are used to estimate service performance degradation after the fault.

Samples

Median, s

Mean, s

Std dev

95% percentile, s

36

5.5

5.2

0.6

6

6.21.3.1.2.6.2.1.2. Service downtime

The tested service is not available during the following time period(s).

#

Downtime, s

1

126.32 ±0.82

2

231.7 ±6.5

6.21.3.1.2.6.2.1.3. Service performance degradation

The tested service has measurable performance degradation during the following time period(s).

#

Time to recover, s

Absolute degradation, s

Relative degradation

1

149.0 ±4.6

24 ±17

5.7 ±3.4