Verbatim PFB manifest fails due to missing DUOS registration #6871

Open
nadove-ucsc opened this issue Jan 31, 2025 · 0 comments
Labels
+ [priority] High bug [type] A defect preventing use of the system as specified manifests [subject] Generation and contents of manifests orange [process] Done by the Azul team

We started observing these errors after #6847 was merged to anvildev:

CloudWatch Logs Insights
region: us-east-1
log-group-names: /aws/lambda/azul-service-anvildev-manifest
start-time: -86400s
end-time: 0s
query-string:

fields @timestamp, @message
| filter @message like /ERROR|timed out/
| sort @timestamp asc
| limit 1

@timestamp @message
2025-01-31 19:30:12.821 [ERROR] TypeError: object of type 'NoneType' has no len()
Traceback (most recent call last):
  File "/var/task/azul/chalice.py", line 189, in patched_event_source_handler
    return old_handler(self_, event, context)
  File "/var/task/chalice/app.py", line 1761, in __call__
    return self.handler(event_obj)
  File "/var/task/chalice/app.py", line 1707, in __call__
    return self._original_func(event.to_dict(), event.context)
  File "/var/task/app.py", line 1526, in generate_manifest
    return app.manifest_controller.get_manifest(event)
  File "/var/task/azul/service/manifest_controller.py", line 97, in get_manifest
    result = self.service.get_manifest(format=manifest_key.format,
  File "/var/task/azul/service/manifest_service.py", line 636, in get_manifest
    return self._generate_manifest(generator, manifest_key, partition)
  File "/var/task/azul/service/manifest_service.py", line 645, in _generate_manifest
    partition = generator.write(manifest_key, partition)
  File "/var/task/azul/service/manifest_service.py", line 1341, in write
    file_path, base_name = self.create_file()
  File "/var/task/azul/service/manifest_service.py", line 2158, in create_file
    avro_pfb.write_pfb_entities(pfb_entities(), pfb_schema, path)
  File "/var/task/azul/service/avro_pfb.py", line 90, in write_pfb_entities
    fastavro.writer(fh, parsed_schema, entities, validator=True)
  File "fastavro/_write.pyx", line 795, in fastavro.write.writer
  File "/var/task/azul/service/manifest_service.py", line 2154, in pfb_entities
    yield avro_pfb.PFBEntity.for_replica(id, dict(replica)).to_json(())
  File "/var/task/azul/service/avro_pfb.py", line 198, in for_replica
    return cls(id=id, name=name, object=object)
  File "", line 7, in __init__
    self.__attrs_post_init__()
  File "/var/task/azul/service/avro_pfb.py", line 174, in __attrs_post_init__
    reject(len(self.id) > 254, 'Terra requires IDs be no longer than 254 chars', )
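
The failure in the final frame of the traceback can be reproduced in isolation. The following is a minimal sketch, not Azul's code: `Entity` is a hypothetical stand-in built on the standard library's `dataclasses` rather than `attrs`, and it assumes that the `id` reaching the length check was `None`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entity:
    # Hypothetical stand-in for the PFB entity class; not Azul's implementation
    id: Optional[str]

    def __post_init__(self):
        # Mirrors the length check seen in the traceback: len() is applied
        # before anything verifies that the ID is actually a string
        if len(self.id) > 254:
            raise ValueError('Terra requires IDs be no longer than 254 chars')


try:
    Entity(id=None)
except TypeError as e:
    message = str(e)
    print(message)
```

Under that assumption, the length check raises the same `TypeError` as in the log before it gets a chance to report an overlong ID.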

The cause is this replica, present on anvildev (but not on anvilbox due to #6609):

GET azul_v2_anvildev_anvil_replica/_search
{
  "query": {
    "term": {
      "replica_type.keyword": {
        "value": "duos_dataset_registration"
      }
    }
  }
}

...

{
    "_index" : "azul_v2_anvildev_anvil_replica",
    "_type" : "_doc",
    "_id" : "anvil_dataset_677dd55c-3fa3-4b07-8c98-985d94d7577e_6523fde7b93a5c3f90f87f197dded8dedd1fa21d",
    "_score" : 9.701045,
    "_source" : {
      "entity_id" : "677dd55c-3fa3-4b07-8c98-985d94d7577e",
      "contents" : {
        "duos_id" : null,
        "description" : null,
        "dataset_id" : "385290c3-dff5-fb6d-2501-fa0ba3ad1c35",
        "version" : "2022-06-01T00:00:00.000000Z"
      },
      "replica_type" : "duos_dataset_registration",
      "hub_ids" : [
        "677dd55c-3fa3-4b07-8c98-985d94d7577e"
      ]
    }
  },
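
To spot replicas like this one in a larger result set, a small helper can list the null fields in each hit's contents. This is only a sketch: `null_content_fields` is a hypothetical name, and the `hit` literal below is copied from the response above rather than fetched from the index.

```python
def null_content_fields(hit):
    """Return the sorted names of the null fields in a replica's contents."""
    contents = hit['_source']['contents']
    return sorted(key for key, value in contents.items() if value is None)


# The replica from the search response above, trimmed to the relevant fields
hit = {
    '_source': {
        'entity_id': '677dd55c-3fa3-4b07-8c98-985d94d7577e',
        'contents': {
            'duos_id': None,
            'description': None,
            'dataset_id': '385290c3-dff5-fb6d-2501-fa0ba3ad1c35',
            'version': '2022-06-01T00:00:00.000000Z',
        },
        'replica_type': 'duos_dataset_registration',
    }
}

nulls = null_content_fields(hit)
print(nulls)
```

For this replica the helper reports both `description` and `duos_id` as null.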
@nadove-ucsc nadove-ucsc added the orange [process] Done by the Azul team label Jan 31, 2025
@nadove-ucsc nadove-ucsc self-assigned this Jan 31, 2025
@nadove-ucsc nadove-ucsc added bug [type] A defect preventing use of the system as specified manifests [subject] Generation and contents of manifests + [priority] High labels Jan 31, 2025