Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any way we can filter by the total number of versions? #1771

Open
davosMW opened this issue Jul 12, 2024 · 1 comment
Open

Any way we can filter by the total number of versions? #1771

davosMW opened this issue Jul 12, 2024 · 1 comment

Comments

@davosMW
Copy link

davosMW commented Jul 12, 2024

Hi, I'm trying to reduce the mirror size because now the deletion takes too long.
(approx 1 day with xargs + rsync)

I've found that there are 440k packages that only have 2 versions, which are quite obviously not used by anyone else but the authors themselves, and want to find a way to filter them.

Is there any way we can do this?

image

@davosMW davosMW closed this as completed Jul 12, 2024
@davosMW davosMW reopened this Jul 12, 2024
@cooperlees
Copy link
Contributor

Hi,

I don't see a filter plugin that can do filtering based on the number of versions a project has. It should be rather easy to calculate off the metadata and add into bandersnatch. I would accept a PR for this. Please add unittests showing it working tho.

I would imagine it to be similar to size_project_metadata. Maybe call it versions_min_metadata and for bonus points support a versions_max_metadata (although I don't see a use case except maybe to filter out packages that get released daily).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants