Replace hf_xxxxxxxxxxxx with process.env.HF_TOKEN in examples #1764


Merged 1 commit on Jun 4, 2025

Conversation

Wauplin (Contributor) commented on Jun 4, 2025

Follow-up PR after huggingface/huggingface.js#1514

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

SBrandeis added a commit to huggingface/huggingface.js that referenced this pull request Jun 4, 2025
Solve #1361.

Long-awaited feature for @gary149. I did not go for the cleanest
solution, but it works well and should be robust and flexible enough if we
need to fix something in the future.

## EDIT: breaking change => the access token must now be passed as
`opts.accessToken` in `snippets.getInferenceSnippets`
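To illustrate the new call shape, here is a minimal, self-contained sketch; the stub function and its positional arguments are hypothetical stand-ins, since this PR only confirms that the token now travels in `opts.accessToken`:

```javascript
// Hypothetical stand-in for snippets.getInferenceSnippets, used only to
// illustrate the new options shape; the real signature may differ.
function getInferenceSnippets(model, provider, opts) {
  // After this change, the access token is read from opts.accessToken
  // instead of being passed separately.
  return `InferenceClient(api_key="${opts.accessToken}")  // ${provider}:${model}`;
}

const snippet = getInferenceSnippets(
  "meta-llama/Llama-3.1-8B-Instruct",
  "hf-inference",
  { accessToken: "hf_dummy" } // in real code, read from process.env.HF_TOKEN
);
console.log(snippet);
```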

## TODO

once merged:
- [ ] adapt in moon-landing for snippets on model page
huggingface-internal/moon-landing#13964
- [ ] adapt in doc-builder for `<inferencesnippet>` html tag (used in
hub-docs) huggingface/doc-builder#570
- [ ] update hardcoded examples in hub-docs
huggingface/hub-docs#1764

## Some examples:

### JS client
```js
import { InferenceClient } from "@huggingface/inference";

const client = new InferenceClient(process.env.HF_TOKEN);

const chatCompletion = await client.chatCompletion({
    provider: "hf-inference",
    model: "meta-llama/Llama-3.1-8B-Instruct",
    messages: [
        {
            role: "user",
            content: "What is the capital of France?",
        },
    ],
});

console.log(chatCompletion.choices[0].message);
```

### Python client
```py
import os
from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="hf-inference",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)
```

### openai client
```py
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.1-8B-Instruct/v1",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)
```

### curl
```sh
curl https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.1-8B-Instruct/v1/chat/completions \
    -H "Authorization: Bearer $HF_TOKEN" \
    -H 'Content-Type: application/json' \
    -d '{
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ],
        "model": "meta-llama/Llama-3.1-8B-Instruct",
        "stream": false
    }'
```
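All of the examples above share one pattern: the token is read from the `HF_TOKEN` environment variable rather than hardcoded. A minimal Node sketch of that pattern (the helper name is illustrative, not part of any library):

```javascript
// Fail fast with a clear message when the token is missing, instead of
// letting a request fail later with an opaque 401.
function getHfToken(env = process.env) {
  const token = env.HF_TOKEN;
  if (!token) {
    throw new Error("HF_TOKEN is not set; export it before running the examples.");
  }
  return token;
}

console.log(getHfToken({ HF_TOKEN: "hf_dummy" })); // prints hf_dummy
```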

### Check out the PR diff for more examples

---------

Co-authored-by: Simon Brandeis <[email protected]>
@Wauplin merged commit f400c7e into main on Jun 4, 2025 (2 checks passed) and deleted the update-snippets-examples branch on June 4, 2025 at 12:22.