-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
azure-ai-inference Sample Issue (load_client method failed with (404) "Resource not found") #39310
Comments
@jerryshia Not all endpoints support the /info route that the get_model_info() method uses. Which endpoint are you using? Free tier GitHub models endpoint? A model deployed via AI Foundry? Serverless API? Managed endpoint? Can you share the endpoint URL here? or IM me on Teams so we can share more info, including SDK logs as needed. |
Hi @dargilco, not the OP here but having similar issues. I guessed that we might not have support to run More specifically, I have different Meta Llama models deployed to an 'Azure AI Services' endpoint, and I specify the model name using the arg I've double checked for silly mistakes I can think of, e.g. incorrect endpoints (I have tried both with and without the The docs also indicate that my API key should be 32 characters, but I can confirm that the API key provided by the Azure AI model inference endpoint is longer than this. One additional note: If I send the request with the MODEL_NAME = 'Llama-3.3-70B-Instruct'
client = ChatCompletionsClient(
endpoint = ENDPOINT,
credential = AzureKeyCredential(API_KEY)
)
response = client.complete(
messages=[
SystemMessage(content="You are a helpful assistant."),
UserMessage(content="Explain Riemann's conjecture in 1 paragraph"),
],
model = "Meta-Llama-3.1-405B-Instruct"
) Which returns:
Appreciate that this might just be a daft user error, but I can't think of anything else to check and can't find any other docs or discussions about this system. Hopefully this is also helpful to OP, since it seems like our issues are relevant? Happy to reach out on teams (I don't know your address?) but thought I'd post this anyway in the interest of sharing with others. |
Additional info: I can also confirm the following versions for these azure packages are installed in my venv:
|
@mtcogco please open a new GitHub issue for your issue, and I will try to help you there. It's unrelated to what @jerryshia wrote. |
I reached out directly to @jerryshia over IM, waiting for a response. |
@jerryshia closing since I have not heard back from you. Please reactivate if necessary. |
Section link1, link2, link3, link4,link5:
Reason:

ResourceNotFoundError: (404) Resource not found
@rohit-ganguly , @lmazuel , @achandmsft , @mayurid , @dargilco for notification.
The text was updated successfully, but these errors were encountered: