Proposal: Support saving the model with PMML format #2064

Open
@weiguoz

Currently, we train a model with SQLFlow and use it later for prediction. This works fine within SQLFlow.
However, when a model trained by SQLFlow is used for prediction on other platforms or tools, we run into compatibility problems. Meanwhile, it is not easy to make those platforms support the SQLFlow model format.

So I propose that SQLFlow support exporting models in different formats, e.g. PMML, which is widely used in the industry. More generally, letting the user specify the model format would be more flexible.

Proposals:

  1. Specify the model format by a suffix on the model name in the SQL statement, like:
    SELECT *
    TO TRAIN DNNClassifier
    COLUMN col1, col2, col3, col4
    INTO my_model.pmml   -- my_model.{user-defined model format}
  2. Specify the format at the SQLFlow server level via an environment variable.
    This method lacks flexibility but adds nothing to the SQL syntax.
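
To make the two options concrete, here is a rough sketch of how the format could be resolved. It is only an illustration, not existing SQLFlow code: the function `resolve_model_format`, the environment variable `SQLFLOW_MODEL_FORMAT`, and the default format name `sqlflow` are all hypothetical names chosen for this example. The suffix on the `INTO` target wins if it is a known format; otherwise the server-level setting is used.

```python
import os

# Hypothetical set of exportable formats; only PMML is named in this proposal.
SUPPORTED_FORMATS = {"pmml"}
# Hypothetical placeholder name for SQLFlow's native model format.
DEFAULT_FORMAT = "sqlflow"


def resolve_model_format(into_target, env=None):
    """Return (model_name, model_format) for the target of an INTO clause.

    Proposal 1: a recognized suffix on the model name selects the format.
    Proposal 2: otherwise fall back to a server-level environment variable
    (SQLFLOW_MODEL_FORMAT is an assumed name, not an existing setting).
    """
    env = os.environ if env is None else env
    name, dot, suffix = into_target.rpartition(".")
    if dot and suffix.lower() in SUPPORTED_FORMATS:
        return name, suffix.lower()
    # Unknown or missing suffix: keep the full target as the model name.
    fmt = env.get("SQLFLOW_MODEL_FORMAT", DEFAULT_FORMAT).lower()
    return into_target, fmt


if __name__ == "__main__":
    print(resolve_model_format("my_model.pmml"))
    print(resolve_model_format("my_model", env={"SQLFLOW_MODEL_FORMAT": "pmml"}))
```

One design point this sketch surfaces: a suffix is only treated as a format when it is in the supported set, so a target that already contains a dot for another reason (for example a made-up `mydb.my_model`) is left untouched.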

I prefer the first proposal because of its flexibility.
What do you think? @Yancey1989 @typhoonzero @tonyyang-svail @wangkuiyi @llxxxll @shendiaomo @lhw362950217
