Hopiu/llm

mirror of https://github.com/Hopiu/llm.git synced 2026-03-20 14:40:25 +00:00

Initial CLI support and plugin hook for embeddings, refs #185

* Embeddings plugin hook + OpenAI implementation
* llm.get_embedding_model(name) function
* llm embed command, for returning embeddings or saving them to SQLite
* Tests using an EmbedDemo embedding model
* llm embed-models list and emeb-models default commands
* llm embed-db path and llm embed-db collections commands

2023-08-27 22:24:10 -07:00

804 B

Raw Blame History

(embeddings-binary)=

Binary embedding formats

The default output format of the llm embed command is a JSON array of floating point numbers.

LLM stores embeddings in a more space-efficient format: little-endian binary sequences of 32-bit floating point numbers, each represented using 4 bytes.

The following Python functions can be used to convert between the two formats:

import struct

def encode(values):
    return struct.pack("<" + "f" * len(values), *values)

def decode(binary):
    return struct.unpack("<" + "f" * (len(binary) // 4), binary)

When using llm embed directly, the default output format is JSON.

Use --format blob for the binary output, --format hex for that binary output as hexadecimal and --format base64 for that binary output encoded using base64.

804 B Raw Blame History

Binary embedding formats

804 B

Raw Blame History