• Stars
    star
    773
  • Rank 56,428 (Top 2 %)
  • Language
    Swift
  • License
    MIT License
  • Created 11 months ago
  • Updated about 1 month ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

llama and other large language models on iOS and MacOS offline using GGML library.

LLMFarm

If you are experiencing the error extras.otherInstrOffset != 0 && "Kind::arm64_adrp_ldr missing extra info" see the solution here.

Icon Icon

Install Stable                          Install Latest

Icon Icon

LLMFarm is an iOS and MacOS app to work with large language models (LLM). It allows you to load different LLMs with certain parameters.With LLMFarm, you can test the performance of different LLMs on iOS and macOS and find the most suitable model for your project.
Based on ggml and llama.cpp by Georgi Gerganov.

Also used sources from:

Features

  • MacOS (13+)
  • iOS (16+)
  • Various inferences
  • Various sampling methods
  • Metal (dont work on intel Mac)
  • Model setting templates
  • LoRA adapters support (read more)
  • LoRA finetune support
  • LoRA export as model support
  • Restore context state (now only chat history)

Inferences

Multimodal

Note: For Falcon, Alpaca, GPT4All, Chinese LLaMA / Alpaca and Chinese LLaMA-2 / Alpaca-2, Vigogne (French), Vicuna, Koala, OpenBuddy (Multilingual), Pygmalion/Metharme, WizardLM, Baichuan 1 & 2 + derivations, Aquila 1 & 2, Mistral AI v0.1, Refact, Persimmon 8B, MPT, Bloom select llama inferece in model settings.

Sampling methods

Getting Started

You can find answers to some questions in the FAQ section.

Inference options

When creating a chat, a JSON file is generated in which you can specify additional inference options. The chat files are located in the "chats" directory. You can see all inference options here.

Models

You can download some of the supported models here.

Development

llmfarm_core has been moved to a separate repository. To build llmfarm, you need to clone this repository recursively:

git clone --recurse-submodules https://github.com/guinmoon/LLMFarm

P.S.

The number of open source models is constantly growing. One of the advantages of using such models is the possibility of preserving their original content without censorship. However, a disadvantage may be the irrelevance of the information they contain. It is also possible to get answers to questions from different industries, for example, there are models that specialize in medical terms or programming. In addition, you can use these models to create stories, songs, music, play quests, and so on...