Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S3Client::GetObject is not thread safe #3350

Open
1 task
mikejiang opened this issue Mar 19, 2025 · 2 comments
Open
1 task

S3Client::GetObject is not thread safe #3350

mikejiang opened this issue Mar 19, 2025 · 2 comments
Labels
bug This issue is a bug. closing-soon This issue will automatically close in 4 days unless further comments are made. dependencies This issue is a problem in a dependency.

Comments

@mikejiang
Copy link

Describe the bug

below is part of threat sanitizer alert ,

    #0 aws_cpu_has_feature <null> (libexternal_Saws_Slibaws-c-common.so+0x2023b)
    #1 aws_checksums_crc64nvme <null> (libexternal_Saws_Slibaws-checksums.so+0x14a6)
    #2 aws_checksums_crc64nvme_ex <null> (libexternal_Saws_Slibaws-checksums.so+0x1564)
    #3 Aws::Crt::Checksum::ComputeCRC64NVME(aws_byte_cursor, unsigned long) <null> (libexternal_Saws_Slibaws-crt-cpp.so+0x65691)
    #4 Aws::Utils::Crypto::CRCChecksum<unsigned long, &Aws::Crt::Checksum::ComputeCRC64NVME, &(Aws::Utils::Array<unsigned char> Aws::Utils::Crypto::ConvertToBuffer<unsigned long>(unsigned long))>::Update(unsigned char*, unsigned long) <null> (libexternal_Saws_Slibaws-sdk-core.so+0x31c3cc)
    #5 Aws::Utils::Crypto::CRC64::Update(unsigned char*, unsigned long) <null> (libexternal_Saws_Slibaws-sdk-core.so+0x306663)
    #6 WriteData(char*, unsigned long, unsigned long, void*) <null> (libexternal_Saws_Slibaws-sdk-core.so+0x27be9b)
    #7 Curl_client_write <null> (libexternal_Scurl_Slibcurl.so+0x6b47b)
    #8 Curl_readwrite <null> (libexternal_Scurl_Slibcurl.so+0x8721a)
    #9 multi_runsingle <null> (libexternal_Scurl_Slibcurl.so+0x5f675)
    #10 curl_multi_perform <null> (libexternal_Scurl_Slibcurl.so+0x61b19)
    #11 curl_easy_perform <null> (libexternal_Scurl_Slibcurl.so+0x3328a)
    #12 Aws::Http::CurlHttpClient::MakeRequest(std::shared_ptr<Aws::Http::HttpRequest> const&, Aws::Utils::RateLimits::RateLimiterInterface*, Aws::Utils::RateLimits::RateLimiterInterface*) const <null> (libexternal_Saws_Slibaws-sdk-core.so+0x283d08)
    #13 std::_Function_handler<std::shared_ptr<Aws::Http::HttpResponse> (), Aws::Client::AWSClient::AttemptOneRequest(std::shared_ptr<Aws::Http::HttpRequest> const&, Aws::AmazonWebServiceRequest const&, char const*, char const*, char const*) const::{lambda()#3}>::_M_invoke(std::_Any_data const&) <null> (libexternal_Saws_Slibaws-sdk-core.so+0x156688)
    #14 std::shared_ptr<Aws::Http::HttpResponse> smithy::components::tracing::TracingUtils::MakeCallWithTiming<std::shared_ptr<Aws::Http::HttpResponse> >(std::function<std::shared_ptr<Aws::Http::HttpResponse> ()>, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, smithy::components::tracing::Meter const&, std::map<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::less<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >, std::allocator<std::pair<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > > >&&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) <null> (libexternal_Saws_Slibaws-sdk-core.so+0x17c886)
    #15 Aws::Client::AWSClient::AttemptOneRequest(std::shared_ptr<Aws::Http::HttpRequest> const&, Aws::AmazonWebServiceRequest const&, char const*, char const*, char const*) const <null> (libexternal_Saws_Slibaws-sdk-core.so+0x1615d1)
    #16 Aws::Client::AWSClient::AttemptExhaustively(Aws::Http::URI const&, Aws::AmazonWebServiceRequest const&, Aws::Http::HttpMethod, char const*, char const*, char const*) const <null> (libexternal_Saws_Slibaws-sdk-core.so+0x1644b3)
    #17 Aws::Client::AWSClient::MakeRequestWithUnparsedResponse(Aws::Http::URI const&, Aws::AmazonWebServiceRequest const&, Aws::Http::HttpMethod, char const*, char const*, char const*) const <null> (libexternal_Saws_Slibaws-sdk-core.so+0x1695d5)
    #18 Aws::Client::AWSClient::MakeRequestWithUnparsedResponse(Aws::AmazonWebServiceRequest const&, Aws::Endpoint::AWSEndpoint const&, Aws::Http::HttpMethod, char const*, char const*, char const*) const <null> (libexternal_Saws_Slibaws-sdk-core.so+0x16a1a9)
    #19 Aws::S3::S3Client::GetObject(Aws::S3::Model::GetObjectRequest const&) const::{lambda()#1}::opera r()() const <null> (libexternal_Saws_Slibaws.so+0x8256f3)
    #20 std::_Function_handler<Aws::Utils::Outcome<Aws::S3::Model::GetObjectResult, Aws::S3::S3Error> (), Aws::S3::S3Client::GetObject(Aws::S3::Model::GetObjectRequest const&) const::{lambda()#1}>::_M_invoke(std::_Any_data const&) <null> (libexternal_Saws_Slibaws.so+0x825c8c)
    #21 Aws::S3::S3Client::GetObject(Aws::S3::Model::GetObjectRequest const&) const <null> (libexternal_Saws_Slibaws.so+0x7793b0)
    #22 Aws::Transfer::TransferManager::DoSinglePartDownload(std::shared_ptr<Aws::Transfer::TransferHandle> const&) <null> (libexternal_Saws_Slibaws.so+0xbc0fc3)
    #23 Aws::Transfer::TransferManager::DoDownload(std::shared_ptr<Aws::Transfer::TransferHandle> const&) <null> (libexternal_Saws_Slibaws.so+0xbc2d5e)
    #24 std::_Function_handler<void (), std::_Bind<Aws::Transfer::TransferManager::DownloadFile(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::function<std::iostream* ()>, Aws::Transfer::DownloadConfiguration const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::shared_ptr<Aws::Client::AsyncCallerContext const> const&)::{lambda()#1} ()> >::_M_invoke(std::_Any_data const&) <null> (libexternal_Saws_Slibaws.so+0xbc4ec8)
    #25 Aws::Utils::Threading::ThreadTask::MainTaskRunner() <null> (libexternal_Saws_Slibaws-sdk-core.so+0x3413c5)
    #26 std::thread::_State_impl<std::thread::_Invoker<std::tuple<std::_Bi

Aws::S3::S3Client::GetObject call will hit Aws::Crt::Checksum::ComputeCRC64NVME , which mutates some static function pointers and variables, causing racing.
I didn't find any reference in terms of how to properly initialize these checkssum related global state.
had to do the hack by computing it ahead of time in my code

Aws::Utils::Crypto::CRC32 crc32;
  Aws::Utils::Crypto::CRC32C crc32c;
  Aws::Utils::Crypto::CRC64 crc64;

  Aws::String dummy = "dummy";
  crc32.Calculate(dummy);
  crc32c.Calculate(dummy);
  crc64.Calculate(dummy);

so that these statics get set before kicks off the concurrent download logic.

what is the kosher way of doing this?

Regression Issue

  • Select this option if this issue appears to be a regression.

Expected Behavior

these gets set during Aws::InitAPI

Current Behavior

they are not set until getObject() is invoked

Reproduction Steps

 Aws::InitAPI();
 Aws::Client::ClientConfiguration conf;
  conf.region = aws_region;
  Aws::S3::S3Client client(conf);

auto executor = ozette::Executor::Create(3);
  executor->ParallelFor(0, 3, [&](int64_t i) {
  Aws::Http::URI uri(s3_uri.c_str());
  Aws::S3::Model::GetObjectRequest get_request;
  get_request.SetBucket(uri.GetAuthority());
  get_request.SetKey(uri.GetPath());
  auto get_object_outcome = s3_client.GetObject(get_request);

  });

Possible Solution

No response

Additional Information/Context

No response

AWS CPP SDK version used

1.11.504

Compiler and Version used

gcc9

Operating System and version

ubuntu

@mikejiang mikejiang added bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels Mar 19, 2025
@SergeyRyabinin
Copy link
Contributor

Hi!

Thank you for the report! We have reviewed it, and CRT library is going to fix it sometime.
In the meanwhile, we can confirm this is somewhat a false-positive report: this is a side-effect of lazy selection of performance optimized method for the checksum computation.
This init happens on the very first call, so if there are multiple concurrent first calls to the checksum computation - it results in the reported race condition.
However, there must be no real memory/data corruption, or any race condition side-effects, as the result of concurrent optimized checksum method selection will be the same on all invocations.

I submitted an internal tracking ticket on the CRT library for this report.

Best regards,
Sergey

@SergeyRyabinin SergeyRyabinin added dependencies This issue is a problem in a dependency. and removed needs-triage This issue or PR still needs to be triaged. labels Mar 19, 2025
@DmitriyMusatkin
Copy link
Contributor

should be fixed in 1.11.536

@SergeyRyabinin SergeyRyabinin added the closing-soon This issue will automatically close in 4 days unless further comments are made. label Apr 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug This issue is a bug. closing-soon This issue will automatically close in 4 days unless further comments are made. dependencies This issue is a problem in a dependency.
Projects
None yet
Development

No branches or pull requests

3 participants