Qualcomm AI Engine Direct - Reland GA Static QWEN2.5 0.5B (#12582)
### Summary
The previous PR was merged unintentionally:
#12054
On top of the previous PR, this also includes a new commit addressing code review feedback.
examples/qualcomm/oss_scripts/llama/README.md (2 additions, 1 deletion)
@@ -1,10 +1,11 @@
 # Summary
 
 ## Overview
-This file provides you the instructions to run LLAMA model with different parameters via Qualcomm HTP backend. We currently support the following models:
+This file provides you the instructions to run LLM Decoder model with different parameters via Qualcomm HTP backend. We currently support the following models:
 1. LLAMA2 Stories 110M
 2. LLAMA3.2 1B
 3. LLAMA3.2 3B
+4. QWEN2.5 0.5B
 
 We offer the following modes to execute the model: