
LLMFarm-MiniCPM

Chinese version

Based on LLMFarm, we run MiniCPM on iOS devices. Note that the models running on iOS are quantized to 4-bit and may lose some accuracy compared with the originals. The original models can be found here.

Deploy MiniCPM on iOS

The first method is to download our converted model directly; you can then skip the conversion steps below.

The second method is to download the original model from Hugging Face and follow the steps below to convert and quantize it yourself.
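
If you take the second route, one way to fetch a checkpoint is with the `huggingface-cli` tool. The repo id below is only an example (the 2B SFT build); substitute whichever MiniCPM variant you want to deploy:

```bash
# Requires the Hugging Face hub CLI: pip install -U huggingface_hub
# The repo id is an example -- pick the MiniCPM variant you want to deploy.
huggingface-cli download openbmb/MiniCPM-2B-sft-bf16 --local-dir ./MiniCPM-2B-sft-bf16
```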

Convert the model

  1. Download the model
  2. git clone https://github.com/OpenBMB/llama.cpp.git
  3. cd llama.cpp && make -j8
  4. python3 convert.py ${hf_model_dir} --vocab-type hfft --outtype f32
  5. ./quantize ${hf_model_dir}/ggml-model-f32.gguf ${output_dir}/minicpm-q4_1.gguf q4_1
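
Put together, the steps above might look like the following sketch. The paths are placeholders you should adjust, and the checkpoint directory assumes the 2B SFT model from the download step:

```bash
# Placeholder paths -- adjust to your environment.
hf_model_dir=$(pwd)/MiniCPM-2B-sft-bf16   # the downloaded Hugging Face checkpoint
output_dir=$(pwd)/gguf-out                # where the quantized model will land
mkdir -p ${output_dir}

# Build OpenBMB's llama.cpp fork.
git clone https://github.com/OpenBMB/llama.cpp.git
cd llama.cpp && make -j8

# Convert the Hugging Face checkpoint to a 32-bit GGUF file.
python3 convert.py ${hf_model_dir} --vocab-type hfft --outtype f32

# Quantize f32 -> 4-bit (q4_1) for on-device inference.
./quantize ${hf_model_dir}/ggml-model-f32.gguf ${output_dir}/minicpm-q4_1.gguf q4_1
```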

Compile

  1. git clone https://github.com/OpenBMB/LLMFarm-MiniCPM.git
  2. cd LLMFarm-MiniCPM && git submodule update --init --recursive
  3. Open the project in Xcode
  4. Set Signing & Capabilities
  5. Select a run destination (My Mac or your iPhone)
  6. Run
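
The first three steps on the command line might look like this (`xed` opens the checkout in Xcode; signing, run destination, and running are then set in the Xcode UI):

```bash
git clone https://github.com/OpenBMB/LLMFarm-MiniCPM.git
cd LLMFarm-MiniCPM
git submodule update --init --recursive   # pulls in the bundled llama.cpp sources

# Open the checkout in Xcode, then configure Signing & Capabilities there.
xed .
```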

Chat

  1. Add a chat
  2. Select a model
  3. Set the template to CPM (see the note below)
  4. Start chatting
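A note on step 3: the CPM template corresponds to MiniCPM's native chat format which, per the upstream MiniCPM documentation, wraps each exchange in `<用户>` (user) and `<AI>` markers, i.e. `<用户>{user message}<AI>{model reply}`. Other templates may not match what the model was trained on.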
