Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize ACS pipeline for runtime and memory #5832

Draft
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

dralley
Copy link
Contributor

@dralley dralley commented Sep 24, 2024

So far, I can tell that it at least reduced runtimes by about 75%, but the memory use improvement is minimal.

Lift some queries that were performed per-batch outside of the loop,
resolving an N+1 where N=number of batches. Also use .iterator()

[noissue]
Instead of performing a gigantic AND-ed OR clause query, break up the
list of remote artifacts by checksum type and perform one IN query per
type of checksum, which ought to be easily indexable.

[noissue]
Lift the ACS domain check out of the stage entirely.

[noissue]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant