Skip to content

Document ids in Spark Datasets #845

Open
@bwf93

Description

@bwf93

What kind an issue is this?

  • Bug report. If you’ve found a bug, please provide a code snippet or test to reproduce it below.
    The easier it is to track down the bug, the faster it is solved.
  • [ X] Feature Request. Start by telling us what problem you’re trying to solve.
    Often a solution already exists! Don’t send pull requests to implement new features without
    first getting our support. Sometimes we leave features out on purpose to keep the project small.

Feature description

Currently if using elasticsearch-hadoop with spark to read data you get a Tuple with the document id and the document itself. However if you read the data into a Dataset you do not get access to the document id. It would be the document id is included in the dataset that is backed by elasticsearch.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions