boto3 paginator key listing runs very slow #66

Open
metazool opened this issue Jan 29, 2025 · 0 comments
metazool (Collaborator) commented Jan 29, 2025

We originally used s3fs as the interface to S3 object storage in this project.

It was replaced with the lower-level boto3 in this PR, for consistency with other projects and developer familiarity. That meant a one-line call to s3.ls() was replaced with a more involved key listing method borrowed from Stack Overflow.

Now (with 10x more data, but still only thousands of objects) the boto3 listing returns very slowly, whereas s3.ls() still takes seconds. There is an observable lag on each call to the paginator.
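To see where the lag is, one option is to time each page as the paginator fetches it. The following is a minimal sketch rather than the method currently in the repo: `timed_key_pages` is a hypothetical helper name, and the bucket, prefix and endpoint in the usage comment are placeholders. It assumes the listing uses the `list_objects_v2` paginator, which may not match the existing code.

```python
import time
from typing import Any, Iterator, List, Tuple


def timed_key_pages(
    client: Any, bucket: str, prefix: str = "", page_size: int = 1000
) -> Iterator[Tuple[float, List[str]]]:
    """Yield (seconds_to_fetch_page, keys_in_page) for each page of a listing.

    `client` is expected to be a boto3 S3 client (assumption); anything with
    a compatible get_paginator() will work.
    """
    paginator = client.get_paginator("list_objects_v2")
    pages = paginator.paginate(
        Bucket=bucket,
        Prefix=prefix,
        PaginationConfig={"PageSize": page_size},
    )
    start = time.perf_counter()
    for page in pages:
        # Time spent waiting for this page, i.e. one request to the store.
        elapsed = time.perf_counter() - start
        # "Contents" is absent when a page has no objects.
        keys = [obj["Key"] for obj in page.get("Contents", [])]
        yield elapsed, keys
        # Restart the clock only after the caller has finished with the page.
        start = time.perf_counter()


# Usage (placeholder names, not taken from this project):
# import boto3
# client = boto3.client("s3", endpoint_url="https://example-endpoint")
# for elapsed, keys in timed_key_pages(client, "my-bucket", "some/prefix/"):
#     print(f"{len(keys)} keys in {elapsed:.2f}s")
```

If every page takes a similar, long time regardless of how many keys it holds, that would point towards per-request latency or throttling rather than the volume of data.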

Is it worth digging into why (e.g. possible request throttling at the JASMIN end, since it feels like this behaviour has changed), or should we take the simplest course and just return to s3fs? I don't have expertise in S3.

@metazool changed the title from "s3 paginator key listing runs very slow" to "boto3 paginator key listing runs very slow" Jan 29, 2025