[Doc] Update Llama-2 Android Support (mlc-ai#880)
Hzfengsy authored Sep 7, 2023
1 parent 20f125f commit f3ea48a
Showing 1 changed file with 8 additions and 1 deletion.
9 changes: 8 additions & 1 deletion docs/compilation/compile_models.rst
@@ -414,6 +414,13 @@ This section lists compile commands for more models that you can try out.
python3 -m mlc_llm.build --model Llama-2-7b-chat-hf --target iphone --max-seq-len 768 --quantization q3f16_1
.. tab:: Android

.. code:: shell
python3 -m mlc_llm.build --model Llama-2-7b-chat-hf --target android --max-seq-len 768 --quantization q4f16_1
.. tab:: Vicuna-v1-7B

Please check this page on :doc:`how to get the Vicuna model weights </compilation/get-vicuna-weight>`.
@@ -470,7 +477,7 @@ This section lists compile commands for more models that you can try out.

.. code:: shell
- python3 -m mlc_llm.build --model vicuna-v1-7b --target android --max-seq-len 768 --quantization q4f16_0
+ python3 -m mlc_llm.build --model vicuna-v1-7b --target android --max-seq-len 768 --quantization q4f16_1
.. tab:: RedPajama-v1-3B
