
Memory-heavy workloads may be scaled too high #1030

Open · 2 tasks done
sharnoff opened this issue Aug 9, 2024 · 1 comment
Assignees: sharnoff
Labels: c/autoscaling/autoscaler-agent (Component: autoscaling: autoscaler-agent), c/autoscaling/vm-monitor (Component: autoscaling: vm-monitor), t/bug (Issue Type: Bug)

Comments

sharnoff (Member) commented Aug 9, 2024

Problem description / Motivation

Currently, the vm-monitor:

  1. Reserves ~75% of memory for LFC
  2. Asks for scale-up when postgres exceeds the remainder (without looking at what is actually being used by the LFC)

This works OK as a naive solution for most OLTP workloads, but it means that certain memory-heavy workloads can be scaled higher than they actually need. (Here, "memory-heavy" excludes cache usage by the LFC; the allocations are elsewhere, e.g. during a pgvector index build.)

Meanwhile, the autoscaler-agent only triggers memory-based upscaling if postgres' memory usage exceeds 75% of total memory, so in practice scale-up is basically always handled by the vm-monitor first.
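For concreteness, here's a minimal sketch of how the two current triggers compare. This is not the actual vm-monitor/autoscaler-agent code; the names are made up, and only the ~75%/25% thresholds come from the description above.

```go
package sketch

// Hypothetical constants, for illustration only; the 75% figures come from
// this issue's description.
const (
	lfcReservedFraction  = 0.75 // vm-monitor: ~75% of memory reserved for LFC
	agentUpscaleFraction = 0.75 // autoscaler-agent: upscale if postgres > 75% of memory
)

// vmMonitorWantsUpscale sketches the vm-monitor rule: scale up as soon as
// postgres' usage exceeds whatever is left after the LFC reservation,
// regardless of how much of that reservation the LFC actually uses.
func vmMonitorWantsUpscale(postgresBytes, totalBytes float64) bool {
	remainder := totalBytes * (1 - lfcReservedFraction) // ~25% of memory
	return postgresBytes > remainder
}

// agentWantsUpscale sketches the autoscaler-agent rule, which only fires at a
// much higher threshold, so in practice the vm-monitor triggers first.
func agentWantsUpscale(postgresBytes, totalBytes float64) bool {
	return postgresBytes > agentUpscaleFraction*totalBytes
}
```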

This came up in this thread: https://neondb.slack.com/archives/C03TN5G758R/p1723127762991289

Feature idea(s) / DoD

We should be more careful about how we treat memory usage as a scaling signal, so that memory-heavy workloads are no longer scaled up beyond what's necessary, while making sure we don't harm performance for workloads that are memory-heavy and rely on the LFC being in the OS page cache.

Implementation ideas

See https://www.notion.so/neondatabase/0f75b15d47ad479094861302a99114af

Tasks

sharnoff added the c/autoscaling/autoscaler-agent, c/autoscaling/vm-monitor, and t/bug labels Aug 9, 2024
sharnoff self-assigned this Aug 9, 2024
sharnoff added a commit that referenced this issue Aug 9, 2024
In short: In addition to scaling when there's a lot of memory used by
postgres, we should also scale up to make sure that enough of the LFC is
able to fit into the page cache alongside it.

To answer "how much is enough of the LFC", we take the minimum of
5-minute LFC working set size (from window size) and the cached memory
(from the 'Cached' field of /proc/meminfo, via vector metrics).

Part of #1030. Must be deployed before the vm-monitor changes in order
to make sure we don't have worse performance for workloads that are both
memory-heavy and rely on LFC being in the VM's page cache.
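As a rough illustration of that rule, here is one plausible reading of the agent-side goal as a sketch. The min-of-working-set-and-Cached logic comes from the commit message above; the function name and signature are hypothetical.

```go
package sketch

import "math"

// agentMemoryGoalBytes sketches the goal described above: make room for
// postgres plus "enough of the LFC", where "enough" is the minimum of the
// estimated LFC working set size and the 'Cached' value from /proc/meminfo.
// Illustrative only, not the real implementation.
func agentMemoryGoalBytes(postgresBytes, lfcWorkingSetBytes, cachedBytes float64) float64 {
	lfcNeed := math.Min(lfcWorkingSetBytes, cachedBytes)
	return postgresBytes + lfcNeed
}
```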
sharnoff added a commit to neondatabase/neon that referenced this issue Aug 9, 2024
In short: Currently we reserve 75% of memory for the LFC, meaning that we scale up to keep postgres using less than 25% of the compute's memory.

This means that for certain memory-heavy workloads, we end up scaling
much higher than is actually needed — in the worst case, up to 4x,
although in practice it tends not to be quite so bad.

Part of neondatabase/autoscaling#1030. Must be deployed after the
autoscaler-agent changes in order to make sure we don't have worse
performance for workloads that are both memory-heavy and rely on LFC
being in the VM's page cache.
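A rough worked example of where the "up to 4x" comes from, with hypothetical numbers (sketch only, not taken from a real workload):

```go
package main

import "fmt"

func main() {
	const gib = float64(1 << 30)

	// Hypothetical workload: a large allocation outside the LFC (e.g. a
	// pgvector index build) with only a small LFC working set.
	postgres := 4 * gib
	lfcWorkingSet := 0.5 * gib

	// Old rule: keep postgres under 25% of the compute's memory.
	oldGoal := postgres / 0.25 // ~16 GiB
	// New rule (roughly): postgres plus the LFC's actual need.
	newGoal := postgres + lfcWorkingSet // ~4.5 GiB

	fmt.Printf("old goal ~%.1f GiB, new goal ~%.1f GiB (%.1fx higher)\n",
		oldGoal/gib, newGoal/gib, oldGoal/newGoal)
}
```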
sharnoff added a commit that referenced this issue Aug 17, 2024
In short: In addition to scaling when there's a lot of memory used by
postgres, we should also scale up to make sure that enough of the LFC is
able to fit into the page cache alongside it.

To answer "how much is enough of the LFC", we take the minimum of the
estimated working set size and the cached memory (from the 'Cached'
field of /proc/meminfo, via vector metrics).

Part of #1030. Must be deployed before the vm-monitor changes in order
to make sure we don't have worse performance for workloads that are both
memory-heavy and rely on LFC being in the VM's page cache.
sharnoff added a commit that referenced this issue Sep 6, 2024
sharnoff added a commit that referenced this issue Sep 10, 2024
sharnoff added a commit that referenced this issue Sep 19, 2024
sharnoff added a commit that referenced this issue Sep 19, 2024
sharnoff added a commit that referenced this issue Sep 19, 2024
sharnoff added a commit to neondatabase/neon that referenced this issue Oct 7, 2024
In short: Currently we reserve 75% of memory for the LFC, meaning that we scale up to keep postgres using less than 25% of the compute's memory.

This means that for certain memory-heavy workloads, we end up scaling
much higher than is actually needed — in the worst case, up to 4x,
although in practice it tends not to be quite so bad.

Part of neondatabase/autoscaling#1030.
sharnoff (Member, Author) commented Oct 8, 2024

Now that neondatabase/neon#8668 has been merged, this will be fixed with the next compute release containing it.

erikgrinaker pushed a commit to neondatabase/neon that referenced this issue Oct 8, 2024