Add script to parse and obtain stats from cluster CSVs #2032

Selutario · 2021-10-13T12:47:12Z

Related issue
Closes #1938

Description

This PR adds a new tool inside wazuh_testing package that can be used to load data from CSVs and calculate some stats from it.

There are two similar tools:

ClusterCSVTasksParser
ClusterCSVResourcesParser

The objective of both tools is to parse the CSVs of all the nodes in a cluster environment, calculate statistics for each of them (mean and regression coefficient), select and return the highest values: ClusterCSVTasksParser is used to obtain this information from the cluster tasks (agent-info sync, Integrity check and Integrity Sync), while ClusterCSVResourcesParser can be used to obtain information of the resource-usage of the wazuh-clusterd process (RAM, CPU, File descriptors by default).

Configuration options

artifacts_path (ClusterCSVTasksParser, ClusterCSVResourcesParser): Specifies the path where the CSVs of the cluster nodes are located. It must follow this format:

.
├── master
│   ├── *
│   │   ├── *
│   │   │   ├── wazuh-clusterd.csv
│   │   ├── *
│   │   │   ├── agent-info_sync.csv
│   │   │   ├── integrity_check.csv
│   │   │   └── integrity_sync.csv
├── worker_x
│   ├── *
│   │   ├── *
│   │   │   ├── wazuh-clusterd.csv
│   │   ├── *
│   │   │   ├── agent-info_sync.csv
│   │   │   ├── integrity_check.csv
│   │   │   └── integrity_sync.csv
└── ...

columns (ClusterCSVResourcesParser): Columns of the CSV on which statistics should be extracted.

Output example

TASKS

{
    "setup_phase": {
        "integrity_check": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_4", 2.882842105263158),
                    "max": ("worker_10", 12.238),
                },
                "master": {
                    "mean": ("master", 1.7251713780918725),
                    "max": ("master", 10.025),
                },
            }
        },
        "integrity_sync": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_3", 0.8801250000000002),
                    "max": ("worker_7", 6.87),
                },
                "master": {
                    "mean": ("master", 3.7857918149466196),
                    "max": ("master", 23.085),
                },
            }
        },
        "agent-info_sync": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_4", 2.0509978833333333),
                    "max": ("worker_3", 9.55),
                },
                "master": {
                    "mean": ("master", 0.4447335640138408),
                    "max": ("master", 3.676),
                },
            }
        },
    },
    "stable_phase": {
        "integrity_check": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_8", 1.6634466019417473),
                    "max": ("worker_8", 3.108),
                },
                "master": {
                    "mean": ("master", 0.726184287099903),
                    "max": ("master", 2.026),
                },
            }
        },
        "agent-info_sync": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_8", 0.7170288461538461),
                    "max": ("worker_8", 1.881),
                },
                "master": {
                    "mean": ("master", 0.2620988372093024),
                    "max": ("master", 1.107),
                },
            }
        },
    },
}

RESOURCES

{
    "setup_phase": {
        "wazuh-clusterd": {
            "USS(KB)": {
                "workers": {
                    "mean": ("worker_9", 75079.43661971831),
                    "max": ("worker_10", 149044.0),
                    "reg_cof": ("worker_9", 489.7276436479312),
                },
                "master": {
                    "mean": ("master", 163284.44755244756),
                    "max": ("master", 347204.0),
                    "reg_cof": ("master", 1759.8759644111753),
                },
            },
            "CPU(%)": {
                "workers": {
                    "mean": ("worker_8", 8.525352112676055),
                    "max": ("worker_10", 38.3),
                    "reg_cof": ("worker_5", 0.04679342234032074),
                },
                "master": {
                    "mean": ("master", 52.511888111888105),
                    "max": ("master", 85.1),
                    "reg_cof": ("master", 0.28662464296267104),
                },
            },
            "FD": {
                "workers": {
                    "mean": ("worker_6", 14.195804195804195),
                    "max": ("worker_5", 16),
                    "reg_cof": ("worker_3", 0.0024418070192718387),
                },
                "master": {
                    "mean": ("master", 42.61538461538461),
                    "max": ("master", 86),
                    "reg_cof": ("master", -0.03370924849798087),
                },
            },
        }
    },
    "stable_phase": {
        "wazuh-clusterd": {
            "USS(KB)": {
                "workers": {
                    "mean": ("worker_9", 106824.18433179724),
                    "max": ("worker_7", 172632.0),
                    "reg_cof": ("worker_1", 9.019405572231445),
                },
                "master": {
                    "mean": ("master", 218785.16129032258),
                    "max": ("master", 281268.0),
                    "reg_cof": ("master", -70.75089840612243),
                },
            },
            "CPU(%)": {
                "workers": {
                    "mean": ("worker_8", 10.96589861751152),
                    "max": ("worker_2", 25.8),
                    "reg_cof": ("worker_4", 0.0013146089056121396),
                },
                "master": {
                    "mean": ("master", 41.93041474654378),
                    "max": ("master", 61.3),
                    "reg_cof": ("master", -0.002096750705806691),
                },
            },
            "FD": {
                "workers": {
                    "mean": ("worker_5", 14.055299539170507),
                    "max": ("worker_4", 15),
                    "reg_cof": ("worker_6", 0.0001952869169673985),
                },
                "master": {
                    "mean": ("master", 24.341013824884794),
                    "max": ("master", 29),
                    "reg_cof": ("master", -0.014530691432141685),
                },
            },
        }
    },
}

deps/wazuh_testing/wazuh_testing/tools/performance/csv_parser.py

AdriiiPRodri

LGTM!

Rebits

LGTM

Selutario self-assigned this Oct 13, 2021

Selutario added 6 commits October 26, 2021 12:57

Add script to parse and obtain stats from cluster CSVs

bfa4462

Convert defaultdicts to dicts. Minor changes.

4e97633

Update docstring

3623008

Add max field to stats calculation

06b65e9

Add new ClusterEnvInfo class

2de0be6

Use master logs to define when setup phase starts for every node

7275730

Selutario force-pushed the feature/1938-cluster-stas-script branch from 2d9cdae to 7275730 Compare October 26, 2021 10:57

AdriiiPRodri suggested changes Oct 27, 2021

View reviewed changes

deps/wazuh_testing/wazuh_testing/tools/performance/csv_parser.py Outdated Show resolved Hide resolved

Use ternary operators

c7ba26a

AdriiiPRodri approved these changes Oct 27, 2021

View reviewed changes

davidjiglesias approved these changes Oct 27, 2021

View reviewed changes

Rebits approved these changes Oct 28, 2021

View reviewed changes

Selutario mentioned this pull request Nov 2, 2021

Add cluster performance test #2130

Merged

snaow merged commit d140cb6 into master Nov 30, 2021

snaow deleted the feature/1938-cluster-stas-script branch November 30, 2021 22:45

Selutario added team/framework test/performance and removed team/framework test/performance labels Jan 24, 2022

Selutario mentioned this pull request Jan 24, 2022

Update cluster CSV parser tool and test thresholds #2468

Closed

2 tasks

snaow mentioned this pull request Jan 27, 2022

QA Release - Rev 430031 #2500

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add script to parse and obtain stats from cluster CSVs #2032

Add script to parse and obtain stats from cluster CSVs #2032

Selutario commented Oct 13, 2021 •

edited

Loading

AdriiiPRodri left a comment

Rebits left a comment

Add script to parse and obtain stats from cluster CSVs #2032

Add script to parse and obtain stats from cluster CSVs #2032

Conversation

Selutario commented Oct 13, 2021 • edited Loading

Description

Configuration options

Output example

AdriiiPRodri left a comment

Choose a reason for hiding this comment

Rebits left a comment

Choose a reason for hiding this comment

Selutario commented Oct 13, 2021 •

edited

Loading