feat(agentic idp): First version of agentic IDP using strands #48

kazmer97 · 2025-09-04T16:51:08Z

Summary

• Add agentic extraction module using Strands agents for structured data output without JSON parsing errors • Enhance
extraction service with dynamic Pydantic model generation from configuration attributes • Maintain full backward
compatibility - traditional extraction remains default behavior

Changes Made

• New: lib/idp_common_pkg/idp_common/extraction/agentic_idp.py - Core agentic functionality with tool-based
extraction
• Modified: lib/idp_common_pkg/idp_common/extraction/service.py - Added dynamic model generation and agentic
integration
• Updated: lib/idp_common_pkg/idp_common/extraction/README.md - Comprehensive documentation with usage examples

Benefits

• Eliminates JSON parsing errors through validated Pydantic models
• Improves extraction accuracy with self-correcting agent tools
• Provides dual extraction paths (traditional vs agentic) based on configuration
• Zero breaking changes - existing workflows continue unchanged

This enhancement significantly improves extraction reliability while preserving all existing functionality and
configuration patterns.

lib/idp_common_pkg/idp_common/extraction/agentic_idp.py

template.yaml

rstrahan · 2025-10-01T20:48:43Z

patterns/pattern-2/template.yaml

              - system_prompt
              - task_prompt
            properties:
+              agentic:


How do prompts work.. Does the agent still use the existing System prompt and Task prompt? (I had imagined the agent would need different prompt variants, but if not, that's great!)

How Prompts Work with Agentic Extraction
The agentic extraction uses the existing System and Task prompts, but in a different way than traditional extraction:

Traditional Extraction
System prompt: Sent directly to Bedrock as system message
Task prompt: Sent as user message
Model responds with JSON text that needs parsing
Agentic Extraction
System prompt → Passed via custom_instruction parameter and appended to agentic system prompt
Task prompt → Sent as user message (content blocks with text/images)
Uses Strands agent with tools for structured output
No JSON parsing needed - returns validated Pydantic model
Key Difference
The agentic system prompt (in agentic_idp.py) provides extraction guidelines and tool usage instructions. Your existing system/task prompts are incorporated as custom instructions to guide the extraction without changing the core agent behavior.

publish.py

notebooks/examples/step3_extraction_using_agent_idp.ipynb

kazmer97 · 2025-10-02T11:39:03Z

How Prompts Work with Agentic Extraction

The agentic extraction uses the existing System and Task prompts, but in a different way than traditional extraction:

Traditional Extraction

System prompt: Sent directly to Bedrock as system message
Task prompt: Sent as user message
Model responds with JSON text that needs parsing

Agentic Extraction

System prompt → Passed via custom_instruction parameter and appended to agentic system prompt
Task prompt → Sent as user message (content blocks with text/images)
Uses Strands agent with tools for structured output
No JSON parsing needed - returns validated Pydantic model

Key Difference

The agentic system prompt (in agentic_idp.py) provides extraction guidelines and tool usage instructions. Your existing system/task prompts are incorporated as custom instructions to guide the extraction without changing the core agent behavior.

Result: Same prompts work for both methods, just applied differently under the hood.

lib/idp_common_pkg/tests/conftest.py

rstrahan · 2025-10-07T19:33:59Z

patterns/pattern-2/template.yaml

+                    description: This introduces a second agent to review the first agents work. Only use with highly complex workflows as it increases token usage.
+                    order: 1
+                    default: false
+                  enable_caching:


We should be able to automate this.. already in idp_common bedrock client we have a list of models that support caching, so use that to auto detect and cache if model supports it.

config_library/pattern-2/lending-package-sample/config.yaml

rstrahan · 2025-10-07T20:58:15Z

docs/extraction.md


 Configure extraction behavior through several components:

+### Agentic Extraction (Recommended for Production)


Is it too soon to say this.. Better to say (Preview) or (Experimental) for now, IMHO, till we have more testing and ironed out bugs..

rstrahan · 2025-10-07T21:01:51Z

docs/extraction.md

+
+#### When to Enable Agentic Extraction
+
+Enable agentic extraction when you need:


Can we say something about this enabling future extensions involving MCP servers to integrate validation or enrichment into the extraction process... (eg validate names, addresses, etc from external services)

rstrahan · 2025-10-07T21:03:35Z

docs/extraction.md

    When a field is not present, indicate this explicitly rather than guessing.
-    
+
  task_prompt: |


Can you explain how the system_prompt and task_prompts apply to the agentic extraction.. Does the agent use the same prompt as 'traditional' method?

rstrahan · 2025-10-07T21:05:44Z

lib/idp_common_pkg/idp_common/extraction/agentic_idp.py

See slack comment on throttle catching..

it failed with
Execution Failed
Error: ValueError: Agent invocation failed: An error occurred (ThrottlingException) when calling the ConverseStream operation (reached max retries: 4): Too many tokens, please wait before trying again.
We need to keep retrying... for much much longer.. @kaznb can you check
idp_common bedrock client for exponential backoff/retry logic
state machine for retries at lambda invocation level..
We should never fail due to throttling.. at least not for several hours..
Note - I seem to get this error every time with Sonnet 4.5 in my isengard account when processing just 1 lending_package.pdf (Isengard has really low quota), but, whatever the customer quota is, we still need to defend against throttling when volumes are large and demand exceeds quota

feat(agentic idp): First version of agentic IDP using strands

kazmer97 changed the title ~~feat(agentic idp): First version of agentic IDP using strands~~ Draft: feat(agentic idp): First version of agentic IDP using strands Sep 4, 2025

kazmer97 marked this pull request as draft September 4, 2025 17:31

kazmer97 force-pushed the feat/agentic-idp branch 4 times, most recently from cd18bbb to c3345ea Compare September 12, 2025 13:21

kazmer97 marked this pull request as ready for review September 18, 2025 09:09

kazmer97 force-pushed the feat/agentic-idp branch from c3345ea to e2580e0 Compare September 18, 2025 09:09

kazmer97 changed the title ~~Draft: feat(agentic idp): First version of agentic IDP using strands~~ feat(agentic idp): First version of agentic IDP using strands Sep 18, 2025

kazmer97 force-pushed the feat/agentic-idp branch 2 times, most recently from 239141a to 99565e2 Compare September 22, 2025 12:13

kazmer97 commented Sep 24, 2025

View reviewed changes

lib/idp_common_pkg/idp_common/extraction/agentic_idp.py Outdated Show resolved Hide resolved

kazmer97 force-pushed the feat/agentic-idp branch from 99565e2 to 6e9e5db Compare September 26, 2025 15:59

rstrahan requested changes Oct 1, 2025

View reviewed changes

kazmer97 force-pushed the feat/agentic-idp branch 2 times, most recently from eabe44b to 4b23c91 Compare October 2, 2025 11:21

kazmer97 force-pushed the feat/agentic-idp branch 4 times, most recently from 9b9c42b to db43335 Compare October 7, 2025 16:47

rstrahan requested changes Oct 7, 2025

View reviewed changes

lib/idp_common_pkg/tests/conftest.py Outdated Show resolved Hide resolved

kazmer97 force-pushed the feat/agentic-idp branch 2 times, most recently from d2373c5 to 83fd4a8 Compare October 7, 2025 20:58

rstrahan requested changes Oct 7, 2025

View reviewed changes

kazmer97 force-pushed the feat/agentic-idp branch 2 times, most recently from 1de7a71 to b4c1f69 Compare October 8, 2025 14:29

kazmer97 added 2 commits October 8, 2025 16:13

feat(agentic idp): First version of agentic IDP using strands

8d25cc7

feat(agentic idp): First version of agentic IDP using strands

Dockerised deployment of Pattern 2

a108144

agentic idp test case

8a23073

kazmer97 force-pushed the feat/agentic-idp branch from b4c1f69 to 8a23073 Compare October 8, 2025 15:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(agentic idp): First version of agentic IDP using strands #48

feat(agentic idp): First version of agentic IDP using strands #48

Uh oh!

kazmer97 commented Sep 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

rstrahan Oct 1, 2025

Uh oh!

kazmer97 Oct 7, 2025

Uh oh!

Uh oh!

Uh oh!

kazmer97 commented Oct 2, 2025

Uh oh!

Uh oh!

rstrahan Oct 7, 2025

Uh oh!

Uh oh!

rstrahan Oct 7, 2025

Uh oh!

rstrahan Oct 7, 2025

Uh oh!

rstrahan Oct 7, 2025

Uh oh!

rstrahan Oct 7, 2025

Uh oh!

Uh oh!


		Configure extraction behavior through several components:

		### Agentic Extraction (Recommended for Production)


		#### When to Enable Agentic Extraction

		Enable agentic extraction when you need:

		When a field is not present, indicate this explicitly rather than guessing.


		task_prompt: \|

feat(agentic idp): First version of agentic IDP using strands #48

Are you sure you want to change the base?

feat(agentic idp): First version of agentic IDP using strands #48

Uh oh!

Conversation

kazmer97 commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes Made

Benefits

Uh oh!

Uh oh!

Uh oh!

rstrahan Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

kazmer97 Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kazmer97 commented Oct 2, 2025

How Prompts Work with Agentic Extraction

Traditional Extraction

Agentic Extraction

Key Difference

Uh oh!

Uh oh!

rstrahan Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rstrahan Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

rstrahan Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

rstrahan Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

rstrahan Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kazmer97 commented Sep 4, 2025 •

edited

Loading