llm/docs/plugins/directory.md

(plugin-directory)=
# Plugin directory

The following plugins are available for LLM. Here's {ref}`how to install them <installing-plugins>`.

## Local models

These plugins all help you run LLMs directly on your own computer:

- **[llm-llama-cpp](https://github.com/simonw/llm-llama-cpp)** uses [llama.cpp](https://github.com/ggerganov/llama.cpp) to run models published in the GGUF format.
- **[llm-mlc](https://github.com/simonw/llm-mlc)** can run local models released by the [MLC project](https://mlc.ai/mlc-llm/), including models that can take advantage of the GPU on Apple Silicon M1/M2 devices.
- **[llm-gpt4all](https://github.com/simonw/llm-gpt4all)** adds support for various models released by the [GPT4All](https://gpt4all.io/) project that are optimized to run locally on your own machine. These models include versions of Vicuna, Orca, Falcon and MPT - here's [a full list of models](https://observablehq.com/@simonw/gpt4all-models).
- **[llm-mpt30b](https://github.com/simonw/llm-mpt30b)** adds support for the [MPT-30B](https://huggingface.co/mosaicml/mpt-30b) local model.
- **[llm-ollama](https://github.com/taketwo/llm-ollama)** adds support for local models run using [Ollama](https://ollama.ai/).
- **[llm-llamafile](https://github.com/simonw/llm-llamafile)** adds support for local models that are running locally using [llamafile](https://github.com/Mozilla-Ocho/llamafile).

## Remote APIs

These plugins can be used to interact with remotely hosted models via their API:

- **[llm-mistral](https://github.com/simonw/llm-mistral)** adds support for [Mistral AI](https://mistral.ai/)'s language and embedding models.
- **[llm-gemini](https://github.com/simonw/llm-gemini)** adds support for Google's [Gemini](https://ai.google.dev/docs) models.
- **[llm-claude](https://github.com/tomviner/llm-claude)** by Tom Viner adds support for Claude 2.1 and Claude Instant 2.1 by Anthropic.
- **[llm-claude-3](https://github.com/simonw/llm-claude-3)** supports Anthropic's [Claude 3 family](https://www.anthropic.com/news/claude-3-family) of models.
- **[llm-command-r](https://github.com/simonw/llm-command-r)** supports Cohere's Command R and [Command R Plus](https://txt.cohere.com/command-r-plus-microsoft-azure/) API models.
- **[llm-reka](https://github.com/simonw/llm-reka)** supports the [Reka](https://www.reka.ai/) family of models via their API.
- **[llm-perplexity](https://github.com/hex/llm-perplexity)** by Alexandru Geana supports the [Perplexity Labs](https://docs.perplexity.ai/) API models, including `sonar-medium-online` which can search for things online and `llama-3-70b-instruct`.
- **[llm-groq](https://github.com/angerman/llm-groq)** by Moritz Angermann provides access to fast models hosted by [Groq](https://console.groq.com/docs/models).
- **[llm-anyscale-endpoints](https://github.com/simonw/llm-anyscale-endpoints)** supports models hosted on the [Anyscale Endpoints](https://app.endpoints.anyscale.com/) platform, including Llama 2 70B.
- **[llm-replicate](https://github.com/simonw/llm-replicate)** adds support for remote models hosted on [Replicate](https://replicate.com/), including Llama 2 from Meta AI.
- **[llm-fireworks](https://github.com/simonw/llm-fireworks)** supports models hosted by [Fireworks AI](https://fireworks.ai/).
- **[llm-palm](https://github.com/simonw/llm-palm)** adds support for Google's [PaLM 2 model](https://developers.generativeai.google/).
- **[llm-openrouter](https://github.com/simonw/llm-openrouter)** provides access to models hosted on [OpenRouter](https://openrouter.ai/).
- **[llm-cohere](https://github.com/Accudio/llm-cohere)** by Alistair Shepherd provides `cohere-generate` and `cohere-summarize` API models, powered by [Cohere](https://cohere.com/).
- **[llm-bedrock-anthropic](https://github.com/sblakey/llm-bedrock-anthropic)** by Sean Blakey adds support for Claude and Claude Instant by Anthropic via Amazon Bedrock.
- **[llm-bedrock-meta](https://github.com/flabat/llm-bedrock-meta)** by Fabian Labat adds support for Llama 2 and Llama 3 by Meta via Amazon Bedrock.
- **[llm-together](https://github.com/wearedevx/llm-together)** adds support for the [Together AI](https://www.together.ai/) extensive family of hosted openly licensed models.

If an API model host provides an OpenAI-compatible API you can also [configure LLM to talk to it](https://llm.datasette.io/en/stable/other-models.html#openai-compatible-models) without needing an extra plugin.

## Embedding models

{ref}`Embedding models <embeddings>` are models that can be used to generate and store embedding vectors for text.

- **[llm-sentence-transformers](https://github.com/simonw/llm-sentence-transformers)** adds support for embeddings using the [sentence-transformers](https://www.sbert.net/) library, which provides access to [a wide range](https://www.sbert.net/docs/pretrained_models.html) of embedding models.
- **[llm-clip](https://github.com/simonw/llm-clip)** provides the [CLIP](https://openai.com/research/clip) model, which can be used to embed images and text in the same vector space, enabling text search against images. See [Build an image search engine with llm-clip](https://simonwillison.net/2023/Sep/12/llm-clip-and-chat/) for more on this plugin.
- **[llm-embed-jina](https://github.com/simonw/llm-embed-jina)** provides Jina AI's [8K text embedding models](https://jina.ai/news/jina-ai-launches-worlds-first-open-source-8k-text-embedding-rivaling-openai/).
- **[llm-embed-onnx](https://github.com/simonw/llm-embed-onnx)** provides seven embedding models that can be executed using the ONNX model framework.

## Extra commands

- **[llm-cmd](https://github.com/simonw/llm-cmd)** accepts a prompt for a shell command, runs that prompt and populates the result in your shell so you can review it, edit it and then hit `<enter>` to execute or `ctrl+c` to cancel.
- **[llm-python](https://github.com/simonw/llm-python)** adds a `llm python` command for running a Python interpreter in the same virtual environment as LLM. This is useful for debugging, and also provides a convenient way to interact with the LLM {ref}`python-api` if you installed LLM using Homebrew or `pipx`.
- **[llm-cluster](https://github.com/simonw/llm-cluster)** adds a `llm cluster` command for calculating clusters for a collection of embeddings. Calculated clusters can then be passed to a Large Language Model to generate a summary description.

## Just for fun

- **[llm-markov](https://github.com/simonw/llm-markov)** adds a simple model that generates output using a [Markov chain](https://en.wikipedia.org/wiki/Markov_chain). This example is used in the tutorial [Writing a plugin to support a new model](https://llm.datasette.io/en/latest/plugins/tutorial-model-plugin.html).
Move plugin directory into LLM repo, refs #173 2023-08-21 05:17:13 +00:00			`(plugin-directory)=`
			`# Plugin directory`

			The following plugins are available for LLM. Here's {ref}`how to install them <installing-plugins>`.

			`## Local models`

			`These plugins all help you run LLMs directly on your own computer:`

llm-mistral and llm-gemini !stable-docs 2023-12-15 05:36:45 +00:00			`- [llm-llama-cpp](https://github.com/simonw/llm-llama-cpp) uses [llama.cpp](https://github.com/ggerganov/llama.cpp) to run models published in the GGUF format.`
Move plugin directory into LLM repo, refs #173 2023-08-21 05:17:13 +00:00			`- [llm-mlc](https://github.com/simonw/llm-mlc) can run local models released by the [MLC project](https://mlc.ai/mlc-llm/), including models that can take advantage of the GPU on Apple Silicon M1/M2 devices.`
			`- [llm-gpt4all](https://github.com/simonw/llm-gpt4all) adds support for various models released by the [GPT4All](https://gpt4all.io/) project that are optimized to run locally on your own machine. These models include versions of Vicuna, Orca, Falcon and MPT - here's [a full list of models](https://observablehq.com/@simonw/gpt4all-models).`
			`- [llm-mpt30b](https://github.com/simonw/llm-mpt30b) adds support for the [MPT-30B](https://huggingface.co/mosaicml/mpt-30b) local model.`
Add ollama to plugin directory (#395) * Add ollama to plugin directory !stable-docs 2024-01-25 21:57:40 +00:00			`- [llm-ollama](https://github.com/taketwo/llm-ollama) adds support for local models run using [Ollama](https://ollama.ai/).`
List llm-llamafile in plugins directory, closes #470 2024-05-13 19:55:22 +00:00			`- [llm-llamafile](https://github.com/simonw/llm-llamafile) adds support for local models that are running locally using [llamafile](https://github.com/Mozilla-Ocho/llamafile).`
Move plugin directory into LLM repo, refs #173 2023-08-21 05:17:13 +00:00
			`## Remote APIs`

			`These plugins can be used to interact with remotely hosted models via their API:`

llm-mistral and llm-gemini !stable-docs 2023-12-15 05:36:45 +00:00			`- [llm-mistral](https://github.com/simonw/llm-mistral) adds support for [Mistral AI](https://mistral.ai/)'s language and embedding models.`
Fix typo and broken link for Gemini in directory Refs #389 !stable-docs 2024-01-14 01:20:23 +00:00			`- [llm-gemini](https://github.com/simonw/llm-gemini) adds support for Google's [Gemini](https://ai.google.dev/docs) models.`
llm-claude-3 !stable-docs 2024-03-04 18:48:57 +00:00			`- [llm-claude](https://github.com/tomviner/llm-claude) by Tom Viner adds support for Claude 2.1 and Claude Instant 2.1 by Anthropic.`
			`- [llm-claude-3](https://github.com/simonw/llm-claude-3) supports Anthropic's [Claude 3 family](https://www.anthropic.com/news/claude-3-family) of models.`
llm-command-r !stable-docs Refs https://github.com/simonw/llm-command-r/issues/1 2024-04-04 14:41:03 +00:00			`- [llm-command-r](https://github.com/simonw/llm-command-r) supports Cohere's Command R and [Command R Plus](https://txt.cohere.com/command-r-plus-microsoft-azure/) API models.`
llm-reka in plugin directory !stable-docs 2024-04-18 02:38:41 +00:00			`- [llm-reka](https://github.com/simonw/llm-reka) supports the [Reka](https://www.reka.ai/) family of models via their API.`
llm-perplexity Refs https://github.com/hex/llm-perplexity/issues/2 !stable-docs 2024-04-21 23:18:37 +00:00			- [llm-perplexity](https://github.com/hex/llm-perplexity) by Alexandru Geana supports the [Perplexity Labs](https://docs.perplexity.ai/) API models, including `sonar-medium-online` which can search for things online and `llama-3-70b-instruct`.
llm-groq !stable-docs 2024-04-22 03:33:23 +00:00			`- [llm-groq](https://github.com/angerman/llm-groq) by Moritz Angermann provides access to fast models hosted by [Groq](https://console.groq.com/docs/models).`
llm-anyscale-endpoints !stable-docs 2023-08-23 20:46:48 +00:00			`- [llm-anyscale-endpoints](https://github.com/simonw/llm-anyscale-endpoints) supports models hosted on the [Anyscale Endpoints](https://app.endpoints.anyscale.com/) platform, including Llama 2 70B.`
llm-mistral and llm-gemini !stable-docs 2023-12-15 05:36:45 +00:00			`- [llm-replicate](https://github.com/simonw/llm-replicate) adds support for remote models hosted on [Replicate](https://replicate.com/), including Llama 2 from Meta AI.`
llm-fireworks Refs https://github.com/simonw/llm-fireworks/issues/1 !stable-docs 2024-04-19 00:20:09 +00:00			`- [llm-fireworks](https://github.com/simonw/llm-fireworks) supports models hosted by [Fireworks AI](https://fireworks.ai/).`
llm-mistral and llm-gemini !stable-docs 2023-12-15 05:36:45 +00:00			`- [llm-palm](https://github.com/simonw/llm-palm) adds support for Google's [PaLM 2 model](https://developers.generativeai.google/).`
			`- [llm-openrouter](https://github.com/simonw/llm-openrouter) provides access to models hosted on [OpenRouter](https://openrouter.ai/).`
llm-cohere in plugin directory !stable-docs 2023-09-13 23:40:34 +00:00			- [llm-cohere](https://github.com/Accudio/llm-cohere) by Alistair Shepherd provides `cohere-generate` and `cohere-summarize` API models, powered by [Cohere](https://cohere.com/).
Added llm-together !stable-docs 2024-01-27 18:40:40 +00:00			`- [llm-bedrock-anthropic](https://github.com/sblakey/llm-bedrock-anthropic) by Sean Blakey adds support for Claude and Claude Instant by Anthropic via Amazon Bedrock.`
Update directory.md (#486) * Update directory.md Added support for Bedrock Llama 3 2024-05-13 20:01:33 +00:00			`- [llm-bedrock-meta](https://github.com/flabat/llm-bedrock-meta) by Fabian Labat adds support for Llama 2 and Llama 3 by Meta via Amazon Bedrock.`
Added llm-together !stable-docs 2024-01-27 18:40:40 +00:00			`- [llm-together](https://github.com/wearedevx/llm-together) adds support for the [Together AI](https://www.together.ai/) extensive family of hosted openly licensed models.`
Move plugin directory into LLM repo, refs #173 2023-08-21 05:17:13 +00:00
			`If an API model host provides an OpenAI-compatible API you can also [configure LLM to talk to it](https://llm.datasette.io/en/stable/other-models.html#openai-compatible-models) without needing an extra plugin.`

Embedding models in plugin directory, refs #207 2023-09-04 02:39:10 +00:00			`## Embedding models`

			{ref}`Embedding models <embeddings>` are models that can be used to generate and store embedding vectors for text.

			`- [llm-sentence-transformers](https://github.com/simonw/llm-sentence-transformers) adds support for embeddings using the [sentence-transformers](https://www.sbert.net/) library, which provides access to [a wide range](https://www.sbert.net/docs/pretrained_models.html) of embedding models.`
llm-clip !stable-docs 2023-09-13 23:43:46 +00:00			`- [llm-clip](https://github.com/simonw/llm-clip) provides the [CLIP](https://openai.com/research/clip) model, which can be used to embed images and text in the same vector space, enabling text search against images. See [Build an image search engine with llm-clip](https://simonwillison.net/2023/Sep/12/llm-clip-and-chat/) for more on this plugin.`
llm-embed-jina in plugins directory !stable-docs Refs: - https://github.com/simonw/llm-embed-jina/issues/1 2023-10-26 01:29:30 +00:00			`- [llm-embed-jina](https://github.com/simonw/llm-embed-jina) provides Jina AI's [8K text embedding models](https://jina.ai/news/jina-ai-launches-worlds-first-open-source-8k-text-embedding-rivaling-openai/).`
llm-embed-onnx in plugin directory !stable-docs Refs https://github.com/simonw/llm-embed-onnx/issues/1 2024-01-28 22:27:13 +00:00			`- [llm-embed-onnx](https://github.com/simonw/llm-embed-onnx) provides seven embedding models that can be executed using the ONNX model framework.`
Embedding models in plugin directory, refs #207 2023-09-04 02:39:10 +00:00
llm-cluster !stable-docs Refs https://github.com/simonw/llm-cluster/issues/1 2023-09-04 16:36:56 +00:00			`## Extra commands`

llm-cmd !stable-docs Refs https://github.com/simonw/llm-cmd/issues/1 2024-03-26 15:58:48 +00:00			- [llm-cmd](https://github.com/simonw/llm-cmd) accepts a prompt for a shell command, runs that prompt and populates the result in your shell so you can review it, edit it and then hit `<enter>` to execute or `ctrl+c` to cancel.
llm-python in plugin directory Refs https://github.com/simonw/llm-python/issues/1 !stable-docs 2023-10-27 05:42:04 +00:00			- [llm-python](https://github.com/simonw/llm-python) adds a `llm python` command for running a Python interpreter in the same virtual environment as LLM. This is useful for debugging, and also provides a convenient way to interact with the LLM {ref}`python-api` if you installed LLM using Homebrew or `pipx`.
llm-cluster !stable-docs Refs https://github.com/simonw/llm-cluster/issues/1 2023-09-04 16:36:56 +00:00			- [llm-cluster](https://github.com/simonw/llm-cluster) adds a `llm cluster` command for calculating clusters for a collection of embeddings. Calculated clusters can then be passed to a Large Language Model to generate a summary description.

Move plugin directory into LLM repo, refs #173 2023-08-21 05:17:13 +00:00			`## Just for fun`

			`- [llm-markov](https://github.com/simonw/llm-markov) adds a simple model that generates output using a [Markov chain](https://en.wikipedia.org/wiki/Markov_chain). This example is used in the tutorial [Writing a plugin to support a new model](https://llm.datasette.io/en/latest/plugins/tutorial-model-plugin.html).`