How can I use T-MAC with llama.cpp on Android? Any guidance? #12

Answered by kaleid-liner
FranzKafkaYu asked this question in Q&A

  1. First, follow the normal workflow to install T-MAC (OSX/Ubuntu recommended).

  2. Pass -u to run_pipeline.py to use our prebuilt kernels for ARM. These kernels may not be optimal for your device, but their performance should be very close.

    If you want to tune and codegen kernels for your own device, follow the android rpc doc to set up TVM RPC, then the codegen doc to generate the kernels.

  3. STEP.0, STEP.1, STEP.2 and STEP.3 of run_pipeline.py need no modifications.

  4. For STEP.4 and STEP.5, follow the Android section of the llama.cpp documentation to cross-compile for Android.

  5. Finally, run STEP.6 on the Android device, either in Termux or through adb shell.
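
As a rough illustration of steps 4 and 5, the Python sketch below only assembles the command lines one might use and prints them; it runs nothing. It is not part of T-MAC or run_pipeline.py. The CMake options (CMAKE_TOOLCHAIN_FILE, ANDROID_ABI, ANDROID_PLATFORM) are the standard Android NDK ones, and `adb shell` / `adb push` are standard adb commands. Everything else is an assumption made for the example: the helper names, the NDK path /opt/android-ndk, the device directory /data/local/tmp/tmac, the file names, and the idea that CMake is invoked from a build subdirectory (the trailing `..`).

```python
import shlex
from pathlib import PurePosixPath


def android_cmake_args(ndk_home, abi="arm64-v8a", platform="android-23"):
    """Return CMake configure arguments for an Android NDK cross-build.

    Hypothetical helper: ndk_home is wherever your NDK is installed.
    """
    toolchain = PurePosixPath(ndk_home) / "build" / "cmake" / "android.toolchain.cmake"
    return [
        "cmake",
        f"-DCMAKE_TOOLCHAIN_FILE={toolchain}",
        f"-DANDROID_ABI={abi}",
        f"-DANDROID_PLATFORM={platform}",
        "..",  # assumes the command is run from a build/ subdirectory
    ]


def adb_deploy_commands(local_files, device_dir="/data/local/tmp/tmac"):
    """Return adb commands that create device_dir and push each file into it.

    Hypothetical helper: device_dir is an arbitrary choice for this sketch.
    """
    cmds = [["adb", "shell", "mkdir", "-p", device_dir]]
    for f in local_files:
        cmds.append(["adb", "push", str(f), f"{device_dir}/"])
    return cmds


if __name__ == "__main__":
    # Placeholder paths and file names; substitute your own build outputs.
    plan = [android_cmake_args("/opt/android-ndk")]
    plan += adb_deploy_commands(["build/bin/main", "model.gguf"])
    for cmd in plan:
        print(shlex.join(cmd))
```

Printing the commands instead of executing them keeps the sketch safe to run on a machine without an NDK or a connected device. The actual build flags should come from the llama.cpp Android section mentioned in step 4.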

We are also working on simplifying the deployment …
