llm/docs/embeddings/storage.md

(embeddings-storage)=
# Embedding storage format

The default output format of the `llm embed` command is a JSON array of floating point numbers.

LLM stores embeddings in space-efficient format: a little-endian binary sequences of 32-bit floating point numbers, each represented using 4 bytes.

These are stored in a `BLOB` column in a SQLite database.

The following Python functions can be used to convert between this format and an array of floating point numbers:

```python
import struct

def encode(values):
    return struct.pack("<" + "f" * len(values), *values)

def decode(binary):
    return struct.unpack("<" + "f" * (len(binary) // 4), binary)
```

These functions are available as `llm.encode()` and `llm.decode()`.

If you are using [NumPy](https://numpy.org/) you can decode one of these binary values like this:

```python
import numpy as np

numpy_array = np.frombuffer(value, "<f4")
```
The `<f4` format string here ensures NumPy will treat the data as a little-endian sequence of 32-bit floats.
Renamed binary.md to storage.md and documented --binary embeddings, refs #264 2023-09-12 18:15:17 +00:00			`(embeddings-storage)=`
Various documentation copy improvements, refs #264 2023-09-12 18:04:45 +00:00			`# Embedding storage format`
Initial CLI support and plugin hook for embeddings, refs #185 * Embeddings plugin hook + OpenAI implementation * llm.get_embedding_model(name) function * llm embed command, for returning embeddings or saving them to SQLite * Tests using an EmbedDemo embedding model * llm embed-models list and emeb-models default commands * llm embed-db path and llm embed-db collections commands 2023-08-28 05:24:10 +00:00
			The default output format of the `llm embed` command is a JSON array of floating point numbers.

Various documentation copy improvements, refs #264 2023-09-12 18:04:45 +00:00			`LLM stores embeddings in space-efficient format: a little-endian binary sequences of 32-bit floating point numbers, each represented using 4 bytes.`
Initial CLI support and plugin hook for embeddings, refs #185 * Embeddings plugin hook + OpenAI implementation * llm.get_embedding_model(name) function * llm embed command, for returning embeddings or saving them to SQLite * Tests using an EmbedDemo embedding model * llm embed-models list and emeb-models default commands * llm embed-db path and llm embed-db collections commands 2023-08-28 05:24:10 +00:00
Various documentation copy improvements, refs #264 2023-09-12 18:04:45 +00:00			These are stored in a `BLOB` column in a SQLite database.

			`The following Python functions can be used to convert between this format and an array of floating point numbers:`
Initial CLI support and plugin hook for embeddings, refs #185 * Embeddings plugin hook + OpenAI implementation * llm.get_embedding_model(name) function * llm embed command, for returning embeddings or saving them to SQLite * Tests using an EmbedDemo embedding model * llm embed-models list and emeb-models default commands * llm embed-db path and llm embed-db collections commands 2023-08-28 05:24:10 +00:00
			```python
			`import struct`

			`def encode(values):`
			`return struct.pack("<" + "f" * len(values), *values)`

			`def decode(binary):`
			`return struct.unpack("<" + "f" * (len(binary) // 4), binary)`
			```
Documentation for building binary embedding plugins, refs #264 2023-09-12 18:32:12 +00:00
			These functions are available as `llm.encode()` and `llm.decode()`.
NumPy decoding docs, plus extra tests for llm.encode/decode !stable-docs Refs https://discord.com/channels/823971286308356157/1128504153841336370/1151975583237034056 2023-09-14 21:01:27 +00:00
			`If you are using [NumPy](https://numpy.org/) you can decode one of these binary values like this:`

			```python
			`import numpy as np`

			`numpy_array = np.frombuffer(value, "<f4")`
			```
			The `<f4` format string here ensures NumPy will treat the data as a little-endian sequence of 32-bit floats.