Skip to content

Latest commit

 

History

History
35 lines (23 loc) · 1001 Bytes

document-to-json.md

File metadata and controls

35 lines (23 loc) · 1001 Bytes

document-to-json

This agent converts an unstructured blob of text (like a pdf document) into a JSON structured string.

Example

Example as a step in a pipeline

- name: "Convert to structured data"
  type: "document-to-json"
  intput: "input-topic"
  output: "output-topic"
  configuration:
    text-field: text
    copy-properties: true

With the configuration above and an input of "Hello there", the output is {"text": "Hello there"}.

Topics

Input

  • Unstructured only text (blob of text) ?
  • Implicit topic ?

Output

  • Structured text ?
  • Implicit topic ?

Configuration

Checkout the full configuration properties in the API Reference page.