No support for multi-GPU #106

Open
parviste-fortum opened this issue Oct 13, 2023 · 3 comments

@parviste-fortum

It seems that it's not possible to run models on multiple GPUs, e.g. by passing `device_map="auto"` to pipelines.

Is there any way to work around this limitation?
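For reference, the usage that fails is roughly the following; a minimal sketch assuming `transformers` with `accelerate` installed, where the model name is just an illustrative example:

```python
# Minimal sketch of the usage in question. With `accelerate` installed,
# device_map="auto" shards the model's layers across all visible GPUs.
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="meta-llama/Llama-2-13b-hf",  # illustrative example model
    device_map="auto",                  # spread layers across available GPUs
)
print(pipe("Hello, world", max_new_tokens=20)[0]["generated_text"])
```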

@philschmid
Collaborator

What model are you trying to run? For LLMs, we recommend using the LLM (TGI) container.
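
For anyone landing here later, a multi-GPU deployment with the Hugging Face LLM (TGI) container looks roughly like the sketch below, using the `sagemaker` Python SDK; the model ID, container version, and instance type are illustrative placeholders, not values from this thread:

```python
# Rough sketch of a multi-GPU deployment with the Hugging Face LLM (TGI)
# container on SageMaker; model ID, version, and instance type are placeholders.
import sagemaker
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

# Assumes this runs inside SageMaker (e.g., a notebook instance);
# otherwise, pass an IAM role ARN explicitly.
role = sagemaker.get_execution_role()
image_uri = get_huggingface_llm_image_uri("huggingface", version="1.0.3")

model = HuggingFaceModel(
    image_uri=image_uri,
    env={
        "HF_MODEL_ID": "meta-llama/Llama-2-13b-hf",  # example model
        "SM_NUM_GPUS": "4",  # shard the model across the instance's 4 GPUs
    },
    role=role,
)

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # 4x A10G GPUs
)
print(predictor.predict({"inputs": "Hello, world"}))
```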

@parviste-fortum
Author

Yes, I moved over to the TGI container. I had started with the generic container since that's what is used by https://registry.terraform.io/modules/philschmid/sagemaker-huggingface/aws/latest. The lack of multi-GPU support was just quite surprising, especially since there doesn't really seem to be any specific reason for not having it (I suppose no one has implemented it yet).

@MoritzLaurer

MoritzLaurer commented Jul 12, 2024

I think this blog post should answer your question, @parviste-fortum: https://www.philschmid.de/sagemaker-multi-replica
