LLM


A CLI utility and Python library for interacting with Large Language Models, both via remote APIs and models that can be installed and run on your own machine.

Run prompts from the command-line, store the results in SQLite, generate embeddings and more.

Consult the LLM plugins directory for plugins that provide access to remote and local models.

Full documentation: llm.datasette.io

Installation

Install this tool using pip:

pip install llm

Or using Homebrew:

brew install llm

Detailed installation instructions.

Getting started

If you have an OpenAI API key you can get started using the OpenAI models right away.

As an alternative to OpenAI, you can install plugins to access models by other providers, including models that can be installed and run on your own device.

Save your OpenAI API key like this:

llm keys set openai

This will prompt you for your key like so:

Enter key: <paste here>

Now that you've saved a key you can run a prompt like this:

llm "Five cute names for a pet penguin"
1. Waddles
2. Pebbles
3. Bubbles
4. Flappy
5. Chilly
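
LLM is also a Python library, so the same prompt can be run from Python code. This is a minimal sketch based on the documented Python API; the model ID here is just an example:

import llm

# Uses the key previously saved with "llm keys set openai"
model = llm.get_model("gpt-4o-mini")  # example model ID
response = model.prompt("Five cute names for a pet penguin")
print(response.text())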

Read the usage instructions for more.

Installing a model that runs on your own machine

LLM plugins can add support for alternative models, including models that run on your own machine.

To download and run Mistral 7B Instruct locally, you can install the llm-gpt4all plugin:

llm install llm-gpt4all

Then run this command to see which models it makes available:

llm models
gpt4all: all-MiniLM-L6-v2-f16 - SBert, 43.76MB download, needs 1GB RAM
gpt4all: orca-mini-3b-gguf2-q4_0 - Mini Orca (Small), 1.84GB download, needs 4GB RAM
gpt4all: mistral-7b-instruct-v0 - Mistral Instruct, 3.83GB download, needs 8GB RAM
...

Each model file will be downloaded once the first time you use it. Try Mistral out like this:

llm -m mistral-7b-instruct-v0 'difference between a pelican and a walrus'

You can also start a chat session with the model using the llm chat command:

llm chat -m mistral-7b-instruct-v0
Chatting with mistral-7b-instruct-v0
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> 
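
The Python library can hold a multi-turn conversation as well, similar to llm chat. This is a minimal sketch, assuming the llm-gpt4all plugin from above is installed:

import llm

model = llm.get_model("mistral-7b-instruct-v0")
# A conversation object keeps context across successive prompts
conversation = model.conversation()
print(conversation.prompt("Tell me a pelican joke").text())
print(conversation.prompt("Explain that joke").text())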

Using a system prompt

You can use the -s/--system option to set a system prompt, providing instructions for processing other input to the tool.

To describe how the code in a file works, try this:

cat mycode.py | llm -s "Explain this code"
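
From Python, the equivalent is the system= argument to prompt(). This is a minimal sketch, assuming an OpenAI key has been saved as shown earlier; the model ID is just an example:

import llm
from pathlib import Path

model = llm.get_model("gpt-4o-mini")  # example model ID
code = Path("mycode.py").read_text()
# The file contents are the prompt, the system prompt carries the instructions
response = model.prompt(code, system="Explain this code")
print(response.text())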

Help

For help, run:

llm --help

You can also use:

python -m llm --help