How can I use T-MAC with llama.cpp in Android system,,any guidence? #12
-
Hi developers,I am using llama.cpp in my Android Project,and I want to know how can I use T-MAC with llama.cpp. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
We are also working on simplifying the deployment on Android and hopefully we can release instructions ASAP. |
Beta Was this translation helpful? Give feedback.
First, follow the normal workflow to install T-MAC (OSX/Ubuntu recommended).
By specifying
-u
forrun_pipeline.py
, you can use our prebuilt kernels for ARM. The performance of the kernels may not be optimal, but should be very close.If you want to tune and codegen kernels for your own device, you need to follow android rpc to setup TVM RPC and codegen doc to generate kernels.
For STEP.0, STEP.1, STEP.2 and STEP.3 of
run_pipeline.py
, you don't need to do any modifications.For STEP.4 and STEP.5, you should follow llama.cpp Android section to cross-compile for Android.
Finally, you can run STEP.6 in Android termux or adb shell.
We are also working on simplifying the deployment …