Submit User model to InferX platform

After deploy InferX platform with https://github.com/inferx-net/inferx/wiki/InferX-platform-0.0.1-deployment, we can submit user model to the platform.

Create API Key

In the InferX dashbord, click "Apikey" and input username and password created in Keycloak. If deployment is based on Keycloak database image, the default username is "testuser1" and password is "test".
In the Apikey page, input apikey name under textbox under "Add Apikey" and then click button "Add Apikey name".
Then the new API Key will generated for the user

Set API Key

Open local file path of https://github.com/inferx-net/inferx/blob/main/config/models.txt
Edit IFERX_APIKEY with the generated apikey such as "97146956-2fd9-4b7e-8308-835d81dff85e"
Open a bash terminal and copy and paste

export KEYCLOAK_URL="http://localhost:1260/authn"
export INFX_GATEWAY_URL="http://localhost:4000"
export IFERX_APIKEY="d485cf31-c313-426f-a3a8-116a7fa97d44"

Go to curent repo config folder

cd inferx/config

Create tenant: Copy and paste

/opt/inferx/bin/ixctl create public.json

Create namespace: Copy and paste

/opt/inferx/bin/ixctl create TinyLlama_namespace.json

Download model image from hugginface

In the example, TinyLlama/TinyLlama-1.1B-Chat-v1.0 is downloaded to "/home/brad/cache" folder, any there is one folder named /home/brad/cache/hub/models--TinyLlama--TinyLlama-1.1B-Chat-v1.0

Update TinyLlama-1.1B-Chat-v1.0.json

Please update the "mounts" ==> "hostpath" to your models folder. In the example, the folder is "/home/brad/cache".

Submit the model

/opt/inferx/bin/ixctl create TinyLlama-1.1B-Chat-v1.0.json

Snapshotting

When submitting is done, the InferX will start snapshoting. You can find a new pod created at Pods (http://[inferxhost]:81/listpod).

Snapshot ready

After snapshot is done, a new model in will appear in Models page (http://[inferxhost]:81/listfunc)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Submit User model to InferX platform

Create API Key

Set API Key

Go to curent repo config folder

Create tenant: Copy and paste

Create namespace: Copy and paste

Download model image from hugginface

Update TinyLlama-1.1B-Chat-v1.0.json

Submit the model

Snapshotting

Snapshot ready

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally