additional changes to LR2 branch #1989

vroldanbet · 2024-07-19T12:56:57Z

Based on #1905 with extra optimizations I found that have improved performance

There are several optimizations here, mostly around reducing allocations and GC pressure.

changes to `default_query_exec_mode`

This is a long-standing issue with SpiceDB and prepared statements, and the changes here should address support for PG proxies. As part of the profiling and performance testing I did locally, I observed compelling reasons to move to exec mode. We still need to observe the impact in a real cluster since the protocol is now likely chattier (it's "text" based mode instead of "binary") but remain confident the overhead in the protocol is amortized by the removal of a full roundtrip and the computation of a custom query plan.

Moving default_query_exec_mode=exec showed better LR2 throughput and also reduced allocations. I took the changes from 45c300d (hi @bradengroom 👋) and adjusted them so the default mode is now exec which disables prepared plans, which have shown to be counterproductive with SpiceDB.

SpiceDB has been ignoring generic plans for a long time now. The hypothesis is that since SpiceDB queries have so much argument variability the generic plans become suboptimal leading to slow queries. Consequently, we switched to force_custom_plan, meaning a new query plan is created for each query. This wasn't the most efficient choice but it was the available workaround at the time as it could be triggered via URI parameters, and then it was made the default behavior in the code.

From the PG docs:

Prepared statements (either explicitly prepared or implicitly generated, for example by PL/pgSQL) can be executed using custom or generic plans. Custom plans are made afresh for each execution using its specific set of parameter values, while generic plans do not rely on the parameter values and can be re-used across executions. Thus, use of a generic plan saves planning time, but if the ideal plan depends strongly on the parameter values then a generic plan may be inefficient. The choice between these options is normally made automatically, but it can be overridden with plan_cache_mode. The allowed values are auto (the default), force_custom_plan and force_generic_plan. This setting is considered when a cached plan is to be executed, not when it is prepared. For more information see PREPARE.

exec query exec mode does not use the binary protocol, so I had to add support for text support for some of the arguments, specifically xid8 and the map[string]any used for the context column which is mapped to JSONB. Without query plans and with the text mode, there is heavy reliance by pgx over reflection and custom interfaces. I made xid8 implement a pgx interface that bypasses reflection, and map[string]any didn't need a decoder since a JSONB decoder for maps already exists, I just had to register the mapping.

I'm debating if this should be the new default or if we should be more conservative about it. The current configuration is very inefficient and I can't think of reasons this wouldn't be universally beneficial.

Reducing allocations when generating queries

An important reduction in allocations (10% reduction!) which in turn also has a dramatic impact in throughput happens in FilterToResourceIDs and FilterWithSubjectsSelectors. Both concatenated strings to create schema filterers used by LR code. This was extremely inefficient, and by switching to a strings.Builder an important reduction in allocations on a LR-based workload is observed

I also reduced the allocations in other parts of the code that showed up in the profiles and weren't as impactful but seemed like low-hanging fruit (~ 1% reduction).

changes to chunk size

We also observed improvements for certain workloads with wide relations when the dispatch chunk size was changed. Right now I just hardcoded an increase to experiment with it, but the idea is to turn it into a flag. This is still TODO.
The changes to chunk size helps a ton, so I'll turn it into a configurable flag instead of the hardcoded value of 100

…ources2 This implementation should be much faster for intersections, exclusions and caveats due to early tree shearing and check hints

… LR2

…dispatched resources when checking those already received from another dispatch Adds some parallelism back into LR2

Also adds additional testing to ensure check hints are used in LR2

uses string.Builder and strings.Repeat (which also uses string.Builder) to reduce CPU and allocations showed up in generating 10% of allocations and 5% of CPU on a LookupResources-based workload. This reduces 66.25% allocations and 92% CPU

github-actions · 2024-07-19T12:57:34Z

CLA Assistant Lite bot All contributors have signed the CLA ✍️ ✅

bradengroom · 2024-07-19T14:40:58Z

I have read the CLA Document and I hereby sign the CLA

bradengroom · 2024-07-19T14:41:19Z

recheck

we've observed improvements on LR workloads when increasing the dispatch chunk size

josephschorr and others added 10 commits July 19, 2024 10:12

Implement a new, experimental variant of LookupResources as LookupRes…

9f4f9e4

…ources2 This implementation should be much faster for intersections, exclusions and caveats due to early tree shearing and check hints

Add additional steelthread tests for LookupResources and fix issue in…

e3acb60

… LR2

Switch experimental LookupResources2 to request additional chunks of …

3f5b5b5

…dispatched resources when checking those already received from another dispatch Adds some parallelism back into LR2

Re-engineer how check hints are handled to use protos for performance

a6fb421

Also adds additional testing to ensure check hints are used in LR2

experiment with chunk size 500

ab2dacd

optimize FilterToResourceIDs

281e50e

uses string.Builder and strings.Repeat (which also uses string.Builder) to reduce CPU and allocations showed up in generating 10% of allocations and 5% of CPU on a LookupResources-based workload. This reduces 66.25% allocations and 92% CPU

optimization: reduce allocations in FilterWithSubjectsSelectors

08e972b

optimization: reduce allocations in syncONRSet

3ac693f

fixup: schema filterer

244cb44

optimization: avoid proto allocation

d575119

github-actions bot added area/CLI Affects the command line area/api v1 Affects the v1 API area/datastore Affects the storage system area/tooling Affects the dev or user toolchain (e.g. tests, ci, build tools) area/dispatch Affects dispatching of requests labels Jul 19, 2024

bradengroom and others added 2 commits July 19, 2024 15:21

Disable prepared statements via pgx config

131bce9

fixup

cf2417a

vroldanbet force-pushed the lr2-2 branch from cf14214 to cf2417a Compare July 19, 2024 14:21

authzedbot added a commit to authzed/cla that referenced this pull request Jul 19, 2024

@bradengroom has signed the CLA in authzed/spicedb#1989

4d79d05

vroldanbet self-assigned this Jul 19, 2024

This was referenced Jul 23, 2024

Maybe a bug or not! Spicedb connect to Pgbouncer report error "prepared statement name is already in use (SQLSTATE 08P01)" #1992

Closed

Implement a new, experimental variant of LookupResources as LookupResources2 #1905

Merged

josephschorr and others added 2 commits July 24, 2024 17:01

add support for configurable dispatch chunk size

3705adb

we've observed improvements on LR workloads when increasing the dispatch chunk size

adjust documentation

6a75f5f

vroldanbet closed this Jul 24, 2024

github-actions bot locked and limited conversation to collaborators Jul 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

additional changes to LR2 branch #1989

additional changes to LR2 branch #1989

vroldanbet commented Jul 19, 2024 •

edited

Loading

github-actions bot commented Jul 19, 2024 •

edited

Loading

bradengroom commented Jul 19, 2024

bradengroom commented Jul 19, 2024

additional changes to LR2 branch #1989

additional changes to LR2 branch #1989

Conversation

vroldanbet commented Jul 19, 2024 • edited Loading

changes to default_query_exec_mode

Reducing allocations when generating queries

changes to chunk size

github-actions bot commented Jul 19, 2024 • edited Loading

bradengroom commented Jul 19, 2024

bradengroom commented Jul 19, 2024

vroldanbet commented Jul 19, 2024 •

edited

Loading

changes to `default_query_exec_mode`

github-actions bot commented Jul 19, 2024 •

edited

Loading