Open
Description
Currently, We train a model, which is used for prediction later. This works fine under SQLFlow.
But, when we trained a model by SQLFlow, then use the model for prediction in other platforms/tools, we will encounter compatibility problems. Meanwhile, it's not easy to make such other platforms
support SQLFlow format.
So, I propose SQLFlow support exporting different types of model formats, e.g. PMML which is widely used in the industry. Generally, it will be more flexible in a user-defined way to specify the model format.
Proposals:
- Specify the model format by a suffix of the model name in the SQL, like:
SELECT *
TO TRAIN DNNClassifier
COLUMN col1, col2, col3, col4
INTO my_model.pmml --- my_mode.{user-defined model format}
- Specify the format in the SQLFlow server level via an environment variable.
This method lacks flexibility but has no addition to SQL expressions.
I like the first proposal because of flexibility.
How do you think? @Yancey1989 @typhoonzero @tonyyang-svail @wangkuiyi @llxxxll @shendiaomo @lhw362950217