Moved some docs into schemas.md, refs #788

2026-04-29 17:34:45 +00:00 · 2025-02-27 11:20:50 -08:00 · 2025-02-27 11:20:50 -08:00 · 74baf33a56
commit 74baf33a56
parent a1ea85ecbd
3 changed files with 58 additions and 82 deletions
--- a/docs/logging.md
+++ b/docs/logging.md
@ -198,88 +198,10 @@ You can also use [Datasette](https://datasette.io/) to browse your logs like thi
 datasette "$(llm logs path)"
 ```

-(logging-schemas)=
+### Browsing data collected using schemas

-## JSON objects created using schemas
+The `--schema X` option can be used to view responses that used the specified schema. This can be combined with `--data` and `--data-array` and `--data-key` to extract just the returned JSON data - consult the {ref}`schemas documentation <schemas-logs>` for details.

-You can use {ref}`usage-schemas` to collect structured JSON data from text and images that you feed into LLM.
-
-The JSON produced by these is logged in the database. You can use special options to extract just those JSON objects in a useful format.
-
-The `--schema X` filter option can be used to filter just for responses that were created using the specified schema. You can pass the full schema JSON, a path to the schema on disk or the schema ID.
-
-The `--data` option causes just the JSON data collected by that schema to be outputted, as newline-delimited JSON.
-
-If you instead want a JSON array of objects (with starting and ending square braces) you can use `--data-array` instead.
-
-Consider this schema file, called `dogs.schema.json`:
-
-```json
-{
-  "$schema": "http://json-schema.org/draft-07/schema#",
-  "type": "object",
-  "properties": {
-    "dogs": {
-      "type": "array",
-      "items": {
-        "type": "object",
-        "properties": {
-          "name": {
-            "type": "string",
-            "minLength": 1
-          },
-          "ten_word_bio": {
-            "type": "string",
-            "minLength": 1
-          }
-        },
-        "required": ["name", "ten_word_bio"],
-        "additionalProperties": false
-      }
-    }
-  },
-  "required": ["dogs"],
-  "additionalProperties": false
-}
-```
-You can use this several times to invent several cool dogs:
-```bash
-llm --schema dogs.schema.json 'invent 3 cool dogs'
-llm --schema dogs.schema.json 'invent 2 cool dogs'
-```
-Having logged the cool dogs, you can see just the data that was returned by those prompts like this:
-```bash
-llm logs --schema dogs.schema.json --data
-```
-Output:
-```
-{"dogs": [{"name": "Robo", "ten_word_bio": "A cybernetic dog with laser eyes and super intelligence."}, {"name": "Flamepaw", "ten_word_bio": "Fire-resistant dog with a talent for agility and tricks."}]}
-{"dogs": [{"name": "Bolt", "ten_word_bio": "Lightning-fast border collie, loves frisbee and outdoor adventures."}, {"name": "Luna", "ten_word_bio": "Mystical husky with mesmerizing blue eyes, enjoys snow and play."}, {"name": "Ziggy", "ten_word_bio": "Quirky pug who loves belly rubs and quirky outfits."}]}
-```
-Note that the dogs are nested in that `"dogs"` key. To access the list of items from that key use `--data-key dogs`:
-```bash
-llm logs --schema dogs.schema.json --data-key dogs
-```
-Output:
-```
-{"name": "Bolt", "ten_word_bio": "Lightning-fast border collie, loves frisbee and outdoor adventures."}
-{"name": "Luna", "ten_word_bio": "Mystical husky with mesmerizing blue eyes, enjoys snow and play."}
-{"name": "Ziggy", "ten_word_bio": "Quirky pug who loves belly rubs and quirky outfits."}
-{"name": "Robo", "ten_word_bio": "A cybernetic dog with laser eyes and super intelligence."}
-{"name": "Flamepaw", "ten_word_bio": "Fire-resistant dog with a talent for agility and tricks."}
-```
-Finally, to output a JSON array instead of newline-delimited JSON use `--data-array`:
-```bash
-llm logs --schema dogs.schema.json --data-key dogs --data-array
-```
-Output:
-```json
-[{"name": "Bolt", "ten_word_bio": "Lightning-fast border collie, loves frisbee and outdoor adventures."},
- {"name": "Luna", "ten_word_bio": "Mystical husky with mesmerizing blue eyes, enjoys snow and play."},
- {"name": "Ziggy", "ten_word_bio": "Quirky pug who loves belly rubs and quirky outfits."},
- {"name": "Robo", "ten_word_bio": "A cybernetic dog with laser eyes and super intelligence."},
- {"name": "Flamepaw", "ten_word_bio": "Fire-resistant dog with a talent for agility and tricks."}]
-```
 (logging-sql-schema)=

 ## SQL schema
--- a/docs/schemas.md
+++ b/docs/schemas.md
@ -118,7 +118,7 @@ To return multiple items matching your schema, use the `--schema-multi` option.
 ```bash
 llm --schema-multi 'name,description,fave_toy' 'invent 3 dogs'
 ```
-The Python utility function `llm.schema_dsl(schema)` can be used to convert this syntax into the equivalent JSON schema dictionary when working with schemas {ref}`in the Python API <python-api-schemas`.
+The Python utility function `llm.schema_dsl(schema)` can be used to convert this syntax into the equivalent JSON schema dictionary when working with schemas {ref}`in the Python API <python-api-schemas>`.

 You can experiment with the syntax using the `llm schemas dsl` command, which converts the input into a JSON schema:
 ```bash
@ -141,4 +141,58 @@ Output:
    "age"
  ]
 }
+```
+
+(schemas-logs)=
+
+## Browsing logged JSON objects created using schemas
+
+By default, all JSON produced using schemas is logged to {ref}`a SQLite database <logging>`. You can use special options to the `llm logs` command to extract just those JSON objects in a useful format.
+
+The `llm logs --schema X` filter option can be used to filter just for responses that were created using the specified schema. You can pass the full schema JSON, a path to the schema on disk or the schema ID.
+
+The `--data` option causes just the JSON data collected by that schema to be outputted, as newline-delimited JSON.
+
+If you instead want a JSON array of objects (with starting and ending square braces) you can use `--data-array` instead.
+
+Let's invent some dogs:
+
+```bash
+llm --schema-multi 'name, ten_word_bio' 'invent 3 cool dogs'
+llm --schema-multi 'name, ten_word_bio' 'invent 2 cool dogs'
+```
+Having logged these cool dogs, you can see just the data that was returned by those prompts like this:
+```bash
+llm logs --schema-multi 'name, ten_word_bio' --data
+```
+We need to use `--schema-multi` here because we used that when we first created these records. The `--schema` option is also supported, and can be passed a filename or JSON schema or schema ID as well.
+
+Output:
+```
+{"items": [{"name": "Robo", "ten_word_bio": "A cybernetic dog with laser eyes and super intelligence."}, {"name": "Flamepaw", "ten_word_bio": "Fire-resistant dog with a talent for agility and tricks."}]}
+{"items": [{"name": "Bolt", "ten_word_bio": "Lightning-fast border collie, loves frisbee and outdoor adventures."}, {"name": "Luna", "ten_word_bio": "Mystical husky with mesmerizing blue eyes, enjoys snow and play."}, {"name": "Ziggy", "ten_word_bio": "Quirky pug who loves belly rubs and quirky outfits."}]}
+```
+Note that the dogs are nested in that `"items"` key. To access the list of items from that key use `--data-key items`:
+```bash
+llm logs --schema-multi 'name, ten_word_bio' --data-key items
+```
+Output:
+```
+{"name": "Bolt", "ten_word_bio": "Lightning-fast border collie, loves frisbee and outdoor adventures."}
+{"name": "Luna", "ten_word_bio": "Mystical husky with mesmerizing blue eyes, enjoys snow and play."}
+{"name": "Ziggy", "ten_word_bio": "Quirky pug who loves belly rubs and quirky outfits."}
+{"name": "Robo", "ten_word_bio": "A cybernetic dog with laser eyes and super intelligence."}
+{"name": "Flamepaw", "ten_word_bio": "Fire-resistant dog with a talent for agility and tricks."}
+```
+Finally, to output a JSON array instead of newline-delimited JSON use `--data-array`:
+```bash
+llm logs --schema-multi 'name, ten_word_bio' --data-key items --data-array
+```
+Output:
+```json
+[{"name": "Bolt", "ten_word_bio": "Lightning-fast border collie, loves frisbee and outdoor adventures."},
+ {"name": "Luna", "ten_word_bio": "Mystical husky with mesmerizing blue eyes, enjoys snow and play."},
+ {"name": "Ziggy", "ten_word_bio": "Quirky pug who loves belly rubs and quirky outfits."},
+ {"name": "Robo", "ten_word_bio": "A cybernetic dog with laser eyes and super intelligence."},
+ {"name": "Flamepaw", "ten_word_bio": "Fire-resistant dog with a talent for agility and tricks."}]
 ```
--- a/docs/usage.md
+++ b/docs/usage.md
@ -182,7 +182,7 @@ llm --schema a75b7b3f00e065247e6e364304338aa5 'five dogs'

 Be warned that different models may support different dialects of the JSON schema specification.

-See {ref}`logging-schemas` for tips on using the `llm logs --schema X` command to access JSON objects you have previously logged using this option.
+See {ref}`schemas-logs` for tips on using the `llm logs --schema X` command to access JSON objects you have previously logged using this option.

 (usage-conversation)=
 ### Continuing a conversation