Skip to content

Text Summarization: 'int' object has no attribute 'isnumeric' #1355

@onefanwu

Description

@onefanwu

Search before asking

  • I have searched the EvaDB issues and found no similar bug report.

Bug

evadb=#SELECT TextSummarizer(article) FROM cnn_news_test;
@status: ResponseStatus.FAIL
@batch: 
 None
@error: 'int' object has no attribute 'isnumeric'

When I run the queries in the text_summarization benchmark, I get the above error.

The queries used are as follows:

DROP TABLE IF EXISTS cnn_news_test;

CREATE TABLE IF NOT EXISTS cnn_news_test(
        id TEXT(128),
        article TEXT(4096),
        highlights TEXT(1024)
    );

DROP FUNCTION IF EXISTS TextSummarizer;

CREATE FUNCTION IF NOT EXISTS TextSummarizer
      TYPE HuggingFace
      TASK 'summarization'
      MODEL 'benchmark/models/distilbart-cnn-12-6'
      MIN_LENGTH 5
      MAX_LENGTH 100;


DROP TABLE IF EXISTS cnn_news_summary;

LOAD CSV 'benchmark/datasets/text/cnn_dailymail/test.csv'
INTO cnn_news_test;

CREATE TABLE IF NOT EXISTS cnn_news_summary AS
SELECT TextSummarizer(article) FROM cnn_news_test;

The error may be due to the following section in hf_abstract_function.py:

        for entry in function_obj.metadata:
            if entry.value.isnumeric():
                pipeline_args[entry.key] = int(entry.value)
            else:
                pipeline_args[entry.key] = entry.value

Environment

  • EvaDB v0.3.8

Are you willing to submit a PR?

  • Yes I'd like to help by submitting a PR!

Metadata

Metadata

Assignees

Labels

Bug 🐞EVA is not working as expected

Type

No type

Projects

Status

In Progress

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions