You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the helm chart is using upstream image huggingface/downloader:0.17.3 to download models. This image is buggy and quite old, and according to HF community feedback, they don't plan to maintain it any more.
We need to create our own image to download the model, and leverage the latest HF_HUB_ENABLE_HF_TRANSFER for fast model download.
The text was updated successfully, but these errors were encountered:
Btw. Downloader Job that could be run before OPEA applications to download all specified models to given hostPath or PVC would be useful though, as it would:
makes it sure that app startup does not fail to disk being full (models filling disk would have happened earlier)
allow apps' PVCs to use readOnlyMany mode (AFAIK supported by all cloud provides, unlike the writeMany mode)
(token) secret and additional privileges would be needed only for single Job
better support offline (no internet / private cloud) use-cases
Priority
Undecided
OS type
Ubuntu
Hardware type
Xeon-GNR
Running nodes
Single Node
Description
Currently, the helm chart is using upstream image
huggingface/downloader:0.17.3
to download models. This image is buggy and quite old, and according to HF community feedback, they don't plan to maintain it any more.We need to create our own image to download the model, and leverage the latest HF_HUB_ENABLE_HF_TRANSFER for fast model download.
The text was updated successfully, but these errors were encountered: