• Stars
    star
    173
  • Rank 218,876 (Top 5 %)
  • Language
    C++
  • License
    MIT License
  • Created 12 months ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

LLaVA server (llama.cpp).

LLaVA C++ Server

Bart Trzynadlowski, 2023

Simple API server for llama.cpp implementation of LLaVA.

Usage

Download one of ggml-model-*.gguf and mmproj-model-f16.gguf from here. Then, simply invoke:

bin/llava-server -m ggml-model-q5_k.gguf --mmproj mmproj-model-f16.gguf

This will start a server on localhost:8080. You can change the hostname and port with --host and --port, respectively, and enable HTTP logging with --log-http. You should be able to interact with the server at localhost:8080 in a web browser.

API

The LLaVA endpoint is at /llava. The request body takes the following parameters:

Name Type Required Description
user_prompt string yes The prompt (e.g., "what is this?")
image_file file yes Image data in binary form.
system_prompt string no System prompt.

Build Instructions

The llama.cpp and cpp-httplib repositories are included as gitmodules. After cloning, make sure to first run:

git submodule init
git submodule update

Then to build, simply run:

make

So far, this has only been tested on macOS, but should work anywhere else llama.cpp builds.

More Repositories

1

ChatARKit

Using ChatGPT to create AR experiences with natural language.
C
421
star
2

Supermodel

Official repository of the Sega Model 3 arcade emulator.
C
250
star
3

FasterRCNN

Clean and readable implementations of Faster R-CNN in PyTorch and TensorFlow 2 with Keras.
Python
130
star
4

robot-arm

Imitation learning with iPhone based teleoperation of a low-cost robot arm.
Python
40
star
5

hololens

HoloLens projects.
C#
15
star
6

PixArt

PAJ7025R2 for 6dof tracking: code (Arduino, Windows) and PCB (KiCad).
C++
11
star
7

model3

A simple test program and board support package for Sega Model 3 Step 1.0 boards. Works on actual hardware.
C
10
star
8

Quest-Portal-Example

Simple example of stereo portal rendering in VR with Unity on Meta Quest.
C#
10
star
9

Laughprop

Stable Diffusion 2.1 party game. Laugh propagation... get it? :D
JavaScript
6
star
10

MRLaserTag

Mixed reality VR laser tag using Oculus Quest 2 and OAK-D depth cameras. First prize winner for North America region in OpenCV AI Competition 2021.
C++
4
star
11

esp32s3-wifi-speed-test

Seeed XIAO ESP32S3 Sense WiFi speed test.
Python
2
star
12

SharedMemory-Win32-Example

Example of inter-process communication on Windows using shared memory (memory-mapped files and events).
C++
1
star
13

Quest-XRNode-Test

C#
1
star
14

pathfinding

Path finding/graph search experiments.
C#
1
star
15

realsense

Intel RealSense SR300 experiments.
C++
1
star
16

VGG16

Training VGG-16 on ImageNet with TensorFlow and Keras, replicating the results of the paper by Simonyan and Zisserman.
Python
1
star
17

posterize-go

Example of image posterization using k-means clustering in C++ with Go integration.
C++
1
star
18

multimodal-cloud-tests

Testing deployment of multimodal AI models on cloud providers.
Python
1
star
19

simple_ik_visualizer

Simple visualizer tool for IK solver results.
Python
1
star
20

urdf_to_dh

URDF to Denavit-Hartenberg parameters.
Python
1
star