
draft: add hyperbolic support #1191

Merged
merged 21 commits into huggingface:main on Feb 14, 2025

Conversation

Kaihuang724
Contributor

Added Hyperbolic as an inference provider

@julien-c julien-c force-pushed the kai/hyperbolic-integration branch from c507b1e to 33771e6 on February 7, 2025 at 23:30

julien-c commented Feb 7, 2025

Thanks @Kaihuang724! I've rebased on top of main, as we changed things a bit in the past few days. Let me know if my commits make sense.

As you pointed out, 2 out of 4 Hyperbolic tests currently don't pass.

I haven't checked why yet, but here's how you can run just the Hyperbolic tests locally (from inside packages/inference):

pnpm test -- -t "Hyperbolic"

When they do pass you can record the VCR tapes (so the CI doesn't run actual requests):

VCR_MODE=record pnpm test -- -t "Hyperbolic"
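For context, the "VCR tapes" are recorded HTTP responses that the test suite replays instead of hitting the live API. A rough sketch of how such record/replay wrapping works (names like `makeVcrFetch` are illustrative, not the actual test harness):

```typescript
// Hypothetical sketch of VCR-style record/replay around a fetch-like
// function. Names (Tape, makeVcrFetch) are illustrative only.
type Tape = Record<string, string>;

function makeVcrFetch(
  mode: "record" | "replay",
  tape: Tape,
  realFetch: (url: string) => Promise<string>
): (url: string) => Promise<string> {
  return async (url: string) => {
    if (mode === "replay") {
      // Replay mode (what CI uses): serve only from the tape, never
      // touch the network.
      if (!(url in tape)) throw new Error(`No tape recorded for ${url}`);
      return tape[url];
    }
    // Record mode: make the real request once and save the response.
    const body = await realFetch(url);
    tape[url] = body;
    return body;
  };
}
```

In record mode the wrapper hits the network and saves each response; in replay mode it serves exclusively from the tape, so CI stays deterministic and offline.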

Will help early next week if you're stuck

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Comment on lines 1185 to 1188
"meta-llama/Llama-3.2-3B-Instruct": "meta-llama/Llama-3.2-3B-Instruct",
"meta-llama/Llama-3.3-70B-Instruct": "meta-llama/Llama-3.3-70B-Instruct",
"stabilityai/stable-diffusion-2": "stabilityai/stable-diffusion-2",
"meta-llama/Llama-3.1-405B-BASE-FP8": "meta-llama/Llama-3.1-405B-BASE-FP8",
Member

Note that the keys should be HF model ids (it's not the case for the last one at least)
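One way to catch such mismatched keys (e.g. `meta-llama/Llama-3.1-405B-BASE-FP8`, which is not an existing HF repo id) would be to check each key against the public Hub API at `https://huggingface.co/api/models/{id}`. A hedged sketch; `keyExistsOnHub` is a hypothetical helper, not part of the codebase:

```typescript
// Hypothetical helper: returns true if a mapping key resolves to a real
// model repo on the Hugging Face Hub. The fetch function is injectable
// so the check can be exercised without network access.
async function keyExistsOnHub(
  modelId: string,
  fetchFn: (url: string) => Promise<{ ok: boolean }> = fetch
): Promise<boolean> {
  // The Hub API returns 200 for existing repos and 404 otherwise.
  const res = await fetchFn(`https://huggingface.co/api/models/${modelId}`);
  return res.ok;
}
```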

@SBrandeis SBrandeis self-assigned this Feb 10, 2025
@SBrandeis SBrandeis left a comment

Thank you @Kaihuang724 !

Would you mind updating the VCR tapes (=pre-cached API responses for online testing), please?

We need them for the CI tests.

You can do so by running the following command:

VCR_MODE=cache pnpm run test

@julien-c
Member

@Kaihuang724 let us know if any help is needed!

@Kaihuang724
Contributor Author

> Thank you @Kaihuang724 !
>
> Would you mind updating the VCR tapes (= pre-cached API responses for online testing), please?
>
> We need them for the CI tests.
>
> You can do so by running the following command:
>
> VCR_MODE=cache pnpm run test

I went ahead and updated the VCR tapes, but I still can't get the tests to run successfully. I'm not sure why; it seems like it's trying to call our API with this model: mistralai/Mixtral-8x7B-Instruct-v0.1 even though I'm passing the model name meta-llama/Llama-3.1-405B for the textGeneration test. Any ideas why?

@SBrandeis
Contributor

> it seems like it's trying to call our API with this model: mistralai/Mixtral-8x7B-Instruct-v0.1 even though I'm passing the model name meta-llama/Llama-3.1-405B for the textGeneration test. Any ideas why?

It seems that your chat completion API defaults to the Mistral model when no model is provided in the body; is that correct?

Note that we only add the model argument in the body when the task is chatCompletion or chatCompletionStream:
https://github.com/Kaihuang724/huggingface.js/blob/cb1ff636a5170c815f966aea9b0954845b8da77a/packages/inference/src/lib/makeRequestOptions.ts#L146-L150


Make sure the URL and body output by makeRequestOptions match what you expect on your side.
https://github.com/Kaihuang724/huggingface.js/blob/cb1ff636a5170c815f966aea9b0954845b8da77a/packages/inference/src/tasks/custom/request.ts#L18

If not, you will have to implement an adapter in the associated task method, e.g. here for text-to-image:
https://github.com/Kaihuang724/huggingface.js/blob/cb1ff636a5170c815f966aea9b0954845b8da77a/packages/inference/src/tasks/cv/textToImage.ts#L23-L40
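In other words (a simplified, hypothetical rendering of the behavior described above; the real logic is in makeRequestOptions.ts, linked): the body only carries `model` for chat-completion tasks, so a text-generation request reaches the provider without one, and the provider's own default model takes over.

```typescript
// Simplified sketch of the body-building behavior described above.
// Hypothetical names; the actual implementation lives in makeRequestOptions.ts.
type InferenceTask = "chatCompletion" | "chatCompletionStream" | "textGeneration" | "textToImage";

function buildRequestBody(
  task: InferenceTask,
  model: string,
  args: Record<string, unknown>
): Record<string, unknown> {
  const body: Record<string, unknown> = { ...args };
  // Only chat-completion style tasks include the model id in the body.
  // A provider that expects the model in the body will therefore fall
  // back to its own default for other tasks (e.g. Mixtral, as observed).
  if (task === "chatCompletion" || task === "chatCompletionStream") {
    body.model = model;
  }
  return body;
}
```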

@connorch
Contributor

Thank you @SBrandeis ! This PR should be ready for review now.

@SBrandeis
Contributor

Hi @connorch - thank you for your changes!
I pushed a few updates:

  • b2fbceb: Add a type for Hyperbolic's text-to-image generations
  • 6bdd200: Remove some unneeded code in the baseUrl computation

But most importantly, de26ffa:

```ts
describe.concurrent(
  "Hyperbolic",
  () => {
    HARDCODED_MODEL_ID_MAPPING.hyperbolic = {
      "meta-llama/Llama-3.2-3B-Instruct": "meta-llama/Llama-3.2-3B-Instruct",
      "meta-llama/Llama-3.3-70B-Instruct": "meta-llama/Llama-3.3-70B-Instruct",
      "stabilityai/stable-diffusion-2": "stabilityai/stable-diffusion-2",
      "meta-llama/Llama-3.1-405B": "meta-llama/Meta-Llama-3.1-405B-Instruct",
    };

    it("chatCompletion - hyperbolic", async () => {
      const res = await chatCompletion({
        accessToken: env.HF_HYPERBOLIC_KEY,
        model: "meta-llama/Llama-3.2-3B-Instruct",
        provider: "hyperbolic",
        messages: [{ role: "user", content: "Complete this sentence with words, one plus one is equal " }],
        temperature: 0.1,
      });
      expect(res).toBeDefined();
      expect(res.choices).toBeDefined();
      expect(res.choices?.length).toBeGreaterThan(0);
      if (res.choices && res.choices.length > 0) {
        const completion = res.choices[0].message?.content;
        expect(completion).toBeDefined();
        expect(typeof completion).toBe("string");
        expect(completion).toContain("two");
      }
    });

    it("chatCompletion stream", async () => {
      const stream = chatCompletionStream({
        accessToken: env.HF_HYPERBOLIC_KEY,
        model: "meta-llama/Llama-3.3-70B-Instruct",
        provider: "hyperbolic",
        messages: [{ role: "user", content: "Complete the equation 1 + 1 = , just the answer" }],
      }) as AsyncGenerator<ChatCompletionStreamOutput>;
      let out = "";
      for await (const chunk of stream) {
        if (chunk.choices && chunk.choices.length > 0) {
          out += chunk.choices[0].delta.content;
        }
      }
      expect(out).toContain("2");
    });

    it("textToImage", async () => {
      const res = await textToImage({
        accessToken: env.HF_HYPERBOLIC_KEY,
        model: "stabilityai/stable-diffusion-2",
        provider: "hyperbolic",
        inputs: "award winning high resolution photo of a giant tortoise",
        parameters: {
          height: 128,
          width: 128,
        },
      } satisfies TextToImageArgs);
      expect(res).toBeInstanceOf(Blob);
    });

    it("textGeneration", async () => {
      const res = await textGeneration({
        accessToken: env.HF_HYPERBOLIC_KEY,
        model: "meta-llama/Llama-3.1-405B",
        provider: "hyperbolic",
        inputs: "Paris is",
        parameters: {
          temperature: 0,
          top_p: 0.01,
          max_new_tokens: 10,
        },
      });
      expect(res).toMatchObject({ generated_text: "...the capital and most populous city of France," });
    });
  },
  TIMEOUT
);
```

I updated the tests to match our types and expected APIs, which revealed the need to implement adapters that transform inputs and outputs for the text-generation and text-to-image tasks to match what's expected on the Hyperbolic side.

Namely:

  • For text-generation, convert the TextGenerationInput to the expected payload shape on Hyperbolic (which seems to be similar to ChatCompletionInput?)
  • For text-to-image, you seem to expect model_name in the body to determine which model to run inference with. You can implement the transformation here
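For illustration, a minimal sketch of what those two adapters could look like. The `model_name` field comes from the comment above; every other field name on the Hyperbolic side is an assumption:

```typescript
// Hypothetical adapter sketches; field names other than "model_name"
// are assumptions about Hyperbolic's API, not confirmed.
interface TextGenerationInput {
  inputs: string;
  parameters?: { temperature?: number; top_p?: number; max_new_tokens?: number };
}

// text-generation input -> chat-completion-shaped payload
function adaptTextGeneration(args: TextGenerationInput, model: string) {
  return {
    model,
    messages: [{ role: "user", content: args.inputs }],
    temperature: args.parameters?.temperature,
    top_p: args.parameters?.top_p,
    max_tokens: args.parameters?.max_new_tokens,
  };
}

// text-to-image: the provider expects the model under "model_name"
function adaptTextToImage(
  args: { inputs: string; parameters?: Record<string, unknown> },
  model: string
): Record<string, unknown> {
  return { model_name: model, prompt: args.inputs, ...args.parameters };
}
```

The reverse direction (mapping the provider's response back to a `generated_text`-shaped TextGenerationOutput) would follow the same pattern.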

@SBrandeis SBrandeis requested a review from julien-c February 14, 2025 11:03
@julien-c julien-c left a comment

Thanks for pushing this over the finish line, @SBrandeis <3

@julien-c julien-c merged commit 3e78986 into huggingface:main Feb 14, 2025
5 checks passed
@connorch
Contributor

Thank you @SBrandeis ! I appreciate you cleaning things up and getting this merged 🙌
