llm/docs/embeddings/binary.md

(embeddings-binary)=
# Binary embedding formats

The default output format of the `llm embed` command is a JSON array of floating point numbers.

LLM stores embeddings in a more space-efficient format: little-endian binary sequences of 32-bit floating point numbers, each represented using 4 bytes.

The following Python functions can be used to convert between the two formats:

```python
import struct

def encode(values):
    return struct.pack("<" + "f" * len(values), *values)

def decode(binary):
    return struct.unpack("<" + "f" * (len(binary) // 4), binary)
```
When using `llm embed` directly, the default output format is JSON.

Use `--format blob` for the binary output, `--format hex` for that binary output as hexadecimal and `--format base64` for that binary output encoded using base64.
Initial CLI support and plugin hook for embeddings, refs #185 * Embeddings plugin hook + OpenAI implementation * llm.get_embedding_model(name) function * llm embed command, for returning embeddings or saving them to SQLite * Tests using an EmbedDemo embedding model * llm embed-models list and emeb-models default commands * llm embed-db path and llm embed-db collections commands 2023-08-28 05:24:10 +00:00			`(embeddings-binary)=`
			`# Binary embedding formats`

			The default output format of the `llm embed` command is a JSON array of floating point numbers.

			`LLM stores embeddings in a more space-efficient format: little-endian binary sequences of 32-bit floating point numbers, each represented using 4 bytes.`

			`The following Python functions can be used to convert between the two formats:`

			```python
			`import struct`

			`def encode(values):`
			`return struct.pack("<" + "f" * len(values), *values)`

			`def decode(binary):`
			`return struct.unpack("<" + "f" * (len(binary) // 4), binary)`
			```
			When using `llm embed` directly, the default output format is JSON.

			Use `--format blob` for the binary output, `--format hex` for that binary output as hexadecimal and `--format base64` for that binary output encoded using base64.