llm/docs/openai-models.md

(openai-models)=

# OpenAI models

LLM ships with a default plugin for talking to OpenAI's API. OpenAI offer both language models and embedding models, and LLM can access both types.

(openai-models-configuration)=

## Configuration

All OpenAI models are accessed using an API key. You can obtain one from [the API keys page](https://platform.openai.com/api-keys) on their site.

Once you have created a key, configure LLM to use it by running:

```bash
llm keys set openai
```
Then paste in the API key.

(openai-models-language)=

## OpenAI language models

Run `llm models` for a full list of available models. The OpenAI models supported by LLM are:

<!-- [[[cog
from click.testing import CliRunner
from llm.cli import cli
result = CliRunner().invoke(cli, ["models", "list"])
models = [line for line in result.output.split("\n") if line.startswith("OpenAI ")]
cog.out("```\n{}\n```".format("\n".join(models)))
]]] -->
```
OpenAI Chat: gpt-4o (aliases: 4o)
OpenAI Chat: chatgpt-4o-latest (aliases: chatgpt-4o)
OpenAI Chat: gpt-4o-mini (aliases: 4o-mini)
OpenAI Chat: gpt-4o-audio-preview
OpenAI Chat: gpt-4o-audio-preview-2024-12-17
OpenAI Chat: gpt-4o-audio-preview-2024-10-01
OpenAI Chat: gpt-4o-mini-audio-preview
OpenAI Chat: gpt-4o-mini-audio-preview-2024-12-17
OpenAI Chat: gpt-4.1 (aliases: 4.1)
OpenAI Chat: gpt-4.1-mini (aliases: 4.1-mini)
OpenAI Chat: gpt-4.1-nano (aliases: 4.1-nano)
OpenAI Chat: gpt-3.5-turbo (aliases: 3.5, chatgpt)
OpenAI Chat: gpt-3.5-turbo-16k (aliases: chatgpt-16k, 3.5-16k)
OpenAI Chat: gpt-4 (aliases: 4, gpt4)
OpenAI Chat: gpt-4-32k (aliases: 4-32k)
OpenAI Chat: gpt-4-1106-preview
OpenAI Chat: gpt-4-0125-preview
OpenAI Chat: gpt-4-turbo-2024-04-09
OpenAI Chat: gpt-4-turbo (aliases: gpt-4-turbo-preview, 4-turbo, 4t)
OpenAI Chat: gpt-4.5-preview-2025-02-27
OpenAI Chat: gpt-4.5-preview (aliases: gpt-4.5)
OpenAI Chat: o1
OpenAI Chat: o1-2024-12-17
OpenAI Chat: o1-preview
OpenAI Chat: o1-mini
OpenAI Chat: o3-mini
OpenAI Chat: o3
OpenAI Chat: o4-mini
OpenAI Chat: gpt-5
OpenAI Chat: gpt-5-mini
OpenAI Chat: gpt-5-nano
OpenAI Chat: gpt-5-2025-08-07
OpenAI Chat: gpt-5-mini-2025-08-07
OpenAI Chat: gpt-5-nano-2025-08-07
OpenAI Completion: gpt-3.5-turbo-instruct (aliases: 3.5-instruct, chatgpt-instruct)
```
<!-- [[[end]]] -->

See [the OpenAI models documentation](https://platform.openai.com/docs/models) for details of each of these.

`gpt-4o-mini` (aliased to `4o-mini`) is the least expensive model, and is the default for if you don't specify a model at all. Consult [OpenAI's model documentation](https://platform.openai.com/docs/models) for details of the other models.

[o1-pro](https://platform.openai.com/docs/models/o1-pro) is not available  through the Chat Completions API used by LLM's default OpenAI plugin. You can install the new [llm-openai-plugin](https://github.com/simonw/llm-openai-plugin) plugin to access that model.

## Model features

The following features work with OpenAI models:

- {ref}`System prompts <usage-system-prompts>` can be used to provide instructions that have a higher weight than the prompt itself.
- {ref}`Attachments <usage-attachments>`. Many OpenAI models support image inputs - check which ones using `llm models --options`. Any model that accepts images can also accept PDFs.
- {ref}`Schemas <usage-schemas>` can be used to influence the JSON structure of the model output.
- {ref}`Model options <usage-model-options>` can be used to set parameters like `temperature`. Use `llm models --options` for a full list of supported options.

(openai-models-embedding)=

## OpenAI embedding models

Run `llm embed-models` for a list of {ref}`embedding models <embeddings>`. The following OpenAI embedding models are supported by LLM:

```
ada-002 (aliases: ada, oai)
3-small
3-large
3-small-512
3-large-256
3-large-1024
```

The `3-small` model is currently the most inexpensive. `3-large` costs more but is more capable - see [New embedding models and API updates](https://openai.com/blog/new-embedding-models-and-api-updates) on the OpenAI blog for details and benchmarks.

An important characteristic of any embedding model is the size of the vector it returns. Smaller vectors cost less to store and query, but may be less accurate.

OpenAI `3-small` and `3-large` vectors can be safely truncated to lower dimensions without losing too much accuracy. The `-int` models provided by LLM are pre-configured to do this, so `3-large-256` is the `3-large` model truncated to 256 dimensions.

The vector size of the supported OpenAI embedding models are as follows:

| Model | Size |
| --- | --- |
| ada-002 | 1536 |
| 3-small | 1536 |
| 3-large | 3072 |
| 3-small-512 | 512 |
| 3-large-256 | 256 |
| 3-large-1024 | 1024 |

(openai-completion-models)=

## OpenAI completion models

The `gpt-3.5-turbo-instruct` model is a little different - it is a completion model rather than a chat model, described in [the OpenAI completions documentation](https://platform.openai.com/docs/api-reference/completions/create).

Completion models can be called with the `-o logprobs 3` option (not supported by chat models) which will cause LLM to store 3 log probabilities for each returned token in the SQLite database. Consult [this issue](https://github.com/simonw/llm/issues/284#issuecomment-1724772704) for details on how to read these values.

(openai-extra-models)=

## Adding more OpenAI models

OpenAI occasionally release new models with new names. LLM aims to ship new releases to support these, but you can also configure them directly, by adding them to a `extra-openai-models.yaml` configuration file.

Run this command to find the directory in which this file should be created:

```bash
dirname "$(llm logs path)"
```
On my Mac laptop I get this:
```
~/Library/Application Support/io.datasette.llm
```
Create a file in that directory called `extra-openai-models.yaml`.

Let's say OpenAI have just released the `gpt-3.5-turbo-0613` model and you want to use it, despite LLM not yet shipping support. You could configure that by adding this to the file:

```yaml
- model_id: gpt-3.5-turbo-0613
  model_name: gpt-3.5-turbo-0613
  aliases: ["0613"]
```
The `model_id` is the identifier that will be recorded in the LLM logs. You can use this to specify the model, or you can optionally include a list of aliases for that model. The `model_name` is the actual model identifier that will be passed to the API, which must match exactly what the API expects.

If the model is a completion model (such as `gpt-3.5-turbo-instruct`) add `completion: true` to the configuration.

If the model supports structured extraction using json_schema, add `supports_schema: true` to the configuration.

For reasoning models like `o1` or `o3-mini` add `reasoning: true`.

With this configuration in place, the following command should run a prompt against the new model:

```bash
llm -m 0613 'What is the capital of France?'
```
Run `llm models` to confirm that the new model is now available:
```bash
llm models
```
Example output:
```
OpenAI Chat: gpt-3.5-turbo (aliases: 3.5, chatgpt)
OpenAI Chat: gpt-3.5-turbo-16k (aliases: chatgpt-16k, 3.5-16k)
OpenAI Chat: gpt-4 (aliases: 4, gpt4)
OpenAI Chat: gpt-4-32k (aliases: 4-32k)
OpenAI Chat: gpt-3.5-turbo-0613 (aliases: 0613)
```
Running `llm logs -n 1` should confirm that the prompt and response has been correctly logged to the database.
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`(openai-models)=`

			`# OpenAI models`

			`LLM ships with a default plugin for talking to OpenAI's API. OpenAI offer both language models and embedding models, and LLM can access both types.`

			`(openai-models-configuration)=`

			`## Configuration`

			`All OpenAI models are accessed using an API key. You can obtain one from [the API keys page](https://platform.openai.com/api-keys) on their site.`

			`Once you have created a key, configure LLM to use it by running:`

			```bash
			`llm keys set openai`
			```
			`Then paste in the API key.`

			`(openai-models-language)=`

			`## OpenAI language models`

			Run `llm models` for a full list of available models. The OpenAI models supported by LLM are:

gpt-4o model, refs #490 2024-05-13 19:49:45 +00:00			`<!-- [[[cog`
			`from click.testing import CliRunner`
			`from llm.cli import cli`
			`result = CliRunner().invoke(cli, ["models", "list"])`
			`models = [line for line in result.output.split("\n") if line.startswith("OpenAI ")]`
Fix for broken markdown on openai-models page Refs #558 !stable-docs 2024-08-26 01:03:46 +00:00			cog.out("```\n{}\n```".format("\n".join(models)))
gpt-4o model, refs #490 2024-05-13 19:49:45 +00:00			`]]] -->`
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			```
Show attachment types in llm models --options, closes #612 2024-11-06 06:49:26 +00:00			`OpenAI Chat: gpt-4o (aliases: 4o)`
Updated docs with new chatgpt-4o-latest model, refs #752 2025-02-16 01:46:07 +00:00			`OpenAI Chat: chatgpt-4o-latest (aliases: chatgpt-4o)`
Show attachment types in llm models --options, closes #612 2024-11-06 06:49:26 +00:00			`OpenAI Chat: gpt-4o-mini (aliases: 4o-mini)`
			`OpenAI Chat: gpt-4o-audio-preview`
New OpenAI audio models, closes #677 2024-12-17 19:14:42 +00:00			`OpenAI Chat: gpt-4o-audio-preview-2024-12-17`
			`OpenAI Chat: gpt-4o-audio-preview-2024-10-01`
gpt-4o-mini-audio-preview, closes #677 2024-12-18 04:28:57 +00:00			`OpenAI Chat: gpt-4o-mini-audio-preview`
			`OpenAI Chat: gpt-4o-mini-audio-preview-2024-12-17`
Add GPT-4.1 model family to default OpenAI plugin (#965) * openai: add gpt-4.1 models * Refactor and run cog --------- Co-authored-by: Simon Willison <swillison@gmail.com> 2025-05-04 17:27:12 +00:00			`OpenAI Chat: gpt-4.1 (aliases: 4.1)`
			`OpenAI Chat: gpt-4.1-mini (aliases: 4.1-mini)`
			`OpenAI Chat: gpt-4.1-nano (aliases: 4.1-nano)`
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`OpenAI Chat: gpt-3.5-turbo (aliases: 3.5, chatgpt)`
gpt-4o model, refs #490 2024-05-13 19:49:45 +00:00			`OpenAI Chat: gpt-3.5-turbo-16k (aliases: chatgpt-16k, 3.5-16k)`
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`OpenAI Chat: gpt-4 (aliases: 4, gpt4)`
			`OpenAI Chat: gpt-4-32k (aliases: 4-32k)`
			`OpenAI Chat: gpt-4-1106-preview`
			`OpenAI Chat: gpt-4-0125-preview`
gpt-4-turbo model ID, closes #493 2024-05-13 20:37:23 +00:00			`OpenAI Chat: gpt-4-turbo-2024-04-09`
			`OpenAI Chat: gpt-4-turbo (aliases: gpt-4-turbo-preview, 4-turbo, 4t)`
Ran cog, refs #795 2025-02-27 22:50:02 +00:00			`OpenAI Chat: gpt-4.5-preview-2025-02-27`
Assign gpt-4.5 default alias, refs #795 2025-02-27 22:51:09 +00:00			`OpenAI Chat: gpt-4.5-preview (aliases: gpt-4.5)`
o1 Closes #676 2024-12-22 22:06:47 +00:00			`OpenAI Chat: o1`
			`OpenAI Chat: o1-2024-12-17`
o1-preview and o1-mini, refs #570 (#573) 2024-09-12 23:08:04 +00:00			`OpenAI Chat: o1-preview`
			`OpenAI Chat: o1-mini`
o3-mini and reasoning_effort option, refs #728 2025-01-31 20:14:02 +00:00			`OpenAI Chat: o3-mini`
llm/default_plugins: add o3 model (#945) * llm/default_plugins: add o3 model This is the newest model released by OpenAI and is available through the API. * Ran cog --------- Co-authored-by: Simon Willison <swillison@gmail.com> 2025-05-04 23:01:55 +00:00			`OpenAI Chat: o3`
o4-mini, closes #976 2025-05-04 23:04:28 +00:00			`OpenAI Chat: o4-mini`
GPT-5 model IDs, refs #1229 2025-08-07 18:41:38 +00:00			`OpenAI Chat: gpt-5`
			`OpenAI Chat: gpt-5-mini`
			`OpenAI Chat: gpt-5-nano`
			`OpenAI Chat: gpt-5-2025-08-07`
			`OpenAI Chat: gpt-5-mini-2025-08-07`
			`OpenAI Chat: gpt-5-nano-2025-08-07`
Fix for broken markdown on openai-models page Refs #558 !stable-docs 2024-08-26 01:03:46 +00:00			`OpenAI Completion: gpt-3.5-turbo-instruct (aliases: 3.5-instruct, chatgpt-instruct)`
			```
gpt-4o model, refs #490 2024-05-13 19:49:45 +00:00			`<!-- [[[end]]] -->`
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00
			`See [the OpenAI models documentation](https://platform.openai.com/docs/models) for details of each of these.`

Improved OpenAI model docs Refs #839, closes #840 2025-03-22 01:31:20 +00:00			`gpt-4o-mini` (aliased to `4o-mini`) is the least expensive model, and is the default for if you don't specify a model at all. Consult [OpenAI's model documentation](https://platform.openai.com/docs/models) for details of the other models.
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00
Improved OpenAI model docs Refs #839, closes #840 2025-03-22 01:31:20 +00:00			`[o1-pro](https://platform.openai.com/docs/models/o1-pro) is not available through the Chat Completions API used by LLM's default OpenAI plugin. You can install the new [llm-openai-plugin](https://github.com/simonw/llm-openai-plugin) plugin to access that model.`
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00
Extra OpenAI docs including mention of PDFs, closes #834 2025-03-26 02:28:10 +00:00			`## Model features`

			`The following features work with OpenAI models:`

			- {ref}`System prompts <usage-system-prompts>` can be used to provide instructions that have a higher weight than the prompt itself.
			- {ref}`Attachments <usage-attachments>`. Many OpenAI models support image inputs - check which ones using `llm models --options`. Any model that accepts images can also accept PDFs.
			- {ref}`Schemas <usage-schemas>` can be used to influence the JSON structure of the model output.
			- {ref}`Model options <usage-model-options>` can be used to set parameters like `temperature`. Use `llm models --options` for a full list of supported options.

Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`(openai-models-embedding)=`

			`## OpenAI embedding models`

			Run `llm embed-models` for a list of {ref}`embedding models <embeddings>`. The following OpenAI embedding models are supported by LLM:

			```
			`ada-002 (aliases: ada, oai)`
			`3-small`
			`3-large`
			`3-small-512`
			`3-large-256`
			`3-large-1024`
			```

			The `3-small` model is currently the most inexpensive. `3-large` costs more but is more capable - see [New embedding models and API updates](https://openai.com/blog/new-embedding-models-and-api-updates) on the OpenAI blog for details and benchmarks.

			`An important characteristic of any embedding model is the size of the vector it returns. Smaller vectors cost less to store and query, but may be less accurate.`

			OpenAI `3-small` and `3-large` vectors can be safely truncated to lower dimensions without losing too much accuracy. The `-int` models provided by LLM are pre-configured to do this, so `3-large-256` is the `3-large` model truncated to 256 dimensions.

			`The vector size of the supported OpenAI embedding models are as follows:`

			`\| Model \| Size \|`
			`\| --- \| --- \|`
			`\| ada-002 \| 1536 \|`
			`\| 3-small \| 1536 \|`
			`\| 3-large \| 3072 \|`
			`\| 3-small-512 \| 512 \|`
			`\| 3-large-256 \| 256 \|`
			`\| 3-large-1024 \| 1024 \|`

Improved OpenAI model docs Refs #839, closes #840 2025-03-22 01:31:20 +00:00			`(openai-completion-models)=`

			`## OpenAI completion models`

			The `gpt-3.5-turbo-instruct` model is a little different - it is a completion model rather than a chat model, described in [the OpenAI completions documentation](https://platform.openai.com/docs/api-reference/completions/create).

			Completion models can be called with the `-o logprobs 3` option (not supported by chat models) which will cause LLM to store 3 log probabilities for each returned token in the SQLite database. Consult [this issue](https://github.com/simonw/llm/issues/284#issuecomment-1724772704) for details on how to read these values.

Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`(openai-extra-models)=`

			`## Adding more OpenAI models`

			OpenAI occasionally release new models with new names. LLM aims to ship new releases to support these, but you can also configure them directly, by adding them to a `extra-openai-models.yaml` configuration file.

			`Run this command to find the directory in which this file should be created:`

			```bash
			`dirname "$(llm logs path)"`
			```
			`On my Mac laptop I get this:`
			```
			`~/Library/Application Support/io.datasette.llm`
			```
			Create a file in that directory called `extra-openai-models.yaml`.

			Let's say OpenAI have just released the `gpt-3.5-turbo-0613` model and you want to use it, despite LLM not yet shipping support. You could configure that by adding this to the file:

			```yaml
			`- model_id: gpt-3.5-turbo-0613`
Improved docs for extra-openai-models.yaml (#957) - Mention mandatory model_name field - Document support_schema option 2025-05-04 17:30:37 +00:00			`model_name: gpt-3.5-turbo-0613`
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`aliases: ["0613"]`
			```
Add model_name parameter to OpenAI extra models documentation (#950) Explain difference between `model_id` and `model_name`. Refs #925. 2025-05-04 21:56:28 +00:00			The `model_id` is the identifier that will be recorded in the LLM logs. You can use this to specify the model, or you can optionally include a list of aliases for that model. The `model_name` is the actual model identifier that will be passed to the API, which must match exactly what the API expects.
Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00
			If the model is a completion model (such as `gpt-3.5-turbo-instruct`) add `completion: true` to the configuration.

Improved docs for extra-openai-models.yaml (#957) - Mention mandatory model_name field - Document support_schema option 2025-05-04 17:30:37 +00:00			If the model supports structured extraction using json_schema, add `supports_schema: true` to the configuration.

Allow "reasoning" for extra-openai-models.yaml (#766) * Allow "reasoning" for extra-openai-models.yaml Currently you get an error when trying to use `-o reasoning_effort high` with a model that has been defined in `extra-openai-models.yaml`. This allows a `reasoning` field. * Mention reasoning: true in other OpenAI models docs --------- Co-authored-by: Simon Willison <swillison@gmail.com> 2025-02-27 05:50:14 +00:00			For reasoning models like `o1` or `o3-mini` add `reasoning: true`.

Documentation section about OpenAI models Closes #398, closes #396, closes #394 2024-01-26 00:21:41 +00:00			`With this configuration in place, the following command should run a prompt against the new model:`

			```bash
			`llm -m 0613 'What is the capital of France?'`
			```
			Run `llm models` to confirm that the new model is now available:
			```bash
			`llm models`
			```
			`Example output:`
			```
			`OpenAI Chat: gpt-3.5-turbo (aliases: 3.5, chatgpt)`
			`OpenAI Chat: gpt-3.5-turbo-16k (aliases: chatgpt-16k, 3.5-16k)`
			`OpenAI Chat: gpt-4 (aliases: 4, gpt4)`
			`OpenAI Chat: gpt-4-32k (aliases: 4-32k)`
			`OpenAI Chat: gpt-3.5-turbo-0613 (aliases: 0613)`
			```
			Running `llm logs -n 1` should confirm that the prompt and response has been correctly logged to the database.