Added latency injection functionality #2230

gareth-johnston · 2024-10-13T23:27:13Z

Sometimes you may wish to configure machines with varying latencies between them. For example, you may need to simulate multiple DCs, or a cluster with a latency profile specific to a use case.

This commit adds the ability to easily configure latencies between different nodes and clients created in your testing environment, eliminating the need to do this manually per machine.

There are two methods to leverage this functionality. In both cases you must edit the generated inventory.yaml and add a group: groupId property to relevant hosts.

i.e.

loadgenerators:
  hosts:
    3.121.207.133:
      ansible_ssh_private_key_file: key
      ansible_user: ec2-user
      private_ip: 10.0.55.38
      group: group1

nodes:
  hosts:
    3.122.199.101:
      ansible_ssh_private_key_file: key
      ansible_user: ec2-user
      private_ip: 10.0.44.25
      group: group2

Option 1: For a simple configuration when you just want a steady latency between two or more nodes, use command

inventory inject_latencies --latency <l> --interface <ni>

add --rtt if you wish for the latency to be the total round trip time between groups.

Option 2: For a more nuanced set up with group -> group having different latencies, create a yaml file defining the relationships as follows

latency_profiles:
  relationships:
    - source: group1
      target: group2
      latency: 5
    - source: group2
      target: group1
      latency: 2
  default_latency: 0

In this example, we have two hosts, each belonging to a different group (group1 and group2). Latencies will only be applied between different groups.
If a host is not assigned a group, no direction of latency will be applied to that host.

Note that the latency is applied to the outbound traffic from a source group to the target group.

Add this file to the working directory of your project, then run

inventory inject_latencies --interface <ni> --profiles my_profile.yaml

This will apply the defined latencies between groups. Note that latency values are defaulted to ms so there is no need to specify the unit when defining the profile.yaml or running the command.

Adds documentation to readme for more information and usage.

k-jamroz · 2024-10-14T13:56:50Z

src/apply_latencies.py

+        print(stderr.read().decode())
+
+    # apply the filter command for each IP individually
+    for ip in target_ips:


this should be easier and cleaner with ansible (similar to inventory tune). why not use it?

Didn't consider it, wasn't really aware of the option if I am honest : )

Not much of a functional difference, definitely cleaner but they both achieve the same result and at the moment I think in the current form this commit adds value by reducing setup time/ human error enough that it is worth it to go ahead and migrate it later. If you think it wouldn't take too much time to change it to use ansible I am more than happy to look into it during a cool down (along with your other comment)

using Ansible will make this more consistent with rest of simulator, more robust and easier to maintain. Ansible has some learning curve, but IMO it is worth learning.
with ansible you will have some challenges with translating custom yaml to node-specific configs/scripts, but Ansible has some extensive templating and programming capabilities.

dependency installation should be very easy with ansible (we already have playbooks that do it).
the configuration commands are a simple loop over some config: should be rather easy in ansible

Sounds good. Let's leave this here for now, I will look into ansible and translate it across, thank you foir the valuable input 👍

k-jamroz · 2024-10-14T14:00:46Z

README.md

+Once your hosts are grouped, you can inject latencies by running the following command:
+
+```bash
+inventory inject_latencies --latency 10 --interface eth0 --rtt


there are 2 possible approaches to defining the latencies: globally (independent on test) and as part of test setup (in tests.yaml / test case definition). Both have pros and cons. I am curious why did you choose this approach?

Mainly because it was the simplest approach, in most cases if we are testing a use case with latencies applied I feel like they are going to be fairly static arrangements. If you want to test with different latencies you can just run another test suite with machines having different latencies applied.

That said, I thought about the benefit of having it per test case definition, it is a superior implementation if we can do both - but don't have the time to allocate to this as it isn't scheduled work. It is definitely a possible improvement though.

simplicity of implementation and limited time allocated are perfectly good explanation :)

I was wondering which way is more convenient in practice. Definition in test case makes it easily repeatable, but will we actually use it, or will that become something we only copy-paste over and over again?

Perhaps we can extend it in the future if we such value? In practice I see myself using this per test class, not case. But fore example it might be more useful in the future with the automated performance tests we that work commences.

JamesHazelcast

Great work on this @gareth-johnston, this is really useful to have built into Simulator. One minor question, and I agree with @k-jamroz regarding Ansible usage, but I'll leave him to check over that when it's ready so you can have my approval in advance ✅

JamesHazelcast · 2024-10-15T15:35:22Z

README.md

+In this example, we have two groups (group1 and group2) with different latencies between them. 
+Communication from group1 to group2 will have a 5ms latency, while communication from group2 to group1 will have a 2ms latency.
+
+The default_latency serves as a fallback value in case no specific latency is defined between two groups.


Is default_latency required, or can it be optionally omitted?

k-jamroz · 2024-10-15T17:20:14Z

README.md

+Once your hosts are grouped, you can inject latencies by running the following command:
+
+```bash
+inventory inject_latencies --latency 10 --interface eth0 --rtt


how can I disable the latency injection when no longer needed without recreating the environment? inventory inject_latencies --latency 0 or something more complicated? Maybe it is worth adding dedicated command or option?

Yes, you can just do as you say. Same to overwrite existing form. But I agree, a command would be useful to just wipe the state.

gareth-johnston and others added 4 commits October 9, 2024 11:13

Make reference selection deterministic

fe74879

introduce latency configuration

d22a9c4

update readme

5bc22b8

Merge branch 'master' into CORE-165/cp-boundaries-tests

6c9364e

gareth-johnston requested review from k-jamroz, JamesHazelcast and gbarnett-hz October 14, 2024 08:14

k-jamroz reviewed Oct 14, 2024

View reviewed changes

JamesHazelcast approved these changes Oct 15, 2024

View reviewed changes

k-jamroz reviewed Oct 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added latency injection functionality #2230

Added latency injection functionality #2230

gareth-johnston commented Oct 13, 2024 •

edited

Loading

k-jamroz Oct 14, 2024

gareth-johnston Oct 14, 2024

gareth-johnston Oct 14, 2024

k-jamroz Oct 14, 2024

k-jamroz Oct 14, 2024

gareth-johnston Oct 14, 2024

k-jamroz Oct 14, 2024

gareth-johnston Oct 14, 2024 •

edited

Loading

k-jamroz Oct 14, 2024

gareth-johnston Oct 14, 2024

JamesHazelcast left a comment

JamesHazelcast Oct 15, 2024

k-jamroz Oct 15, 2024

gareth-johnston Oct 15, 2024

Added latency injection functionality #2230

Are you sure you want to change the base?

Added latency injection functionality #2230

Conversation

gareth-johnston commented Oct 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gareth-johnston Oct 14, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JamesHazelcast left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gareth-johnston commented Oct 13, 2024 •

edited

Loading

gareth-johnston Oct 14, 2024 •

edited

Loading