Commit graph

312 commits

Author SHA1 Message Date
Simon Willison
ba75c674cb
llm.get_async_model(), llm.AsyncModel base class and OpenAI async models (#613)
- https://github.com/simonw/llm/issues/507#issuecomment-2458639308

* register_model is now async aware

Refs https://github.com/simonw/llm/issues/507#issuecomment-2458658134

* Refactor Chat and AsyncChat to use _Shared base class

Refs https://github.com/simonw/llm/issues/507#issuecomment-2458692338

* fixed function name

* Fix for infinite loop

* Applied Black

* Ran cog

* Applied Black

* Add Response.from_row() classmethod back again

It does not matter that this is a blocking call, since it is a classmethod

* Made mypy happy with llm/models.py

* mypy fixes for openai_models.py

I am unhappy with this, had to duplicate some code.

* First test for AsyncModel

* Still have not quite got this working

* Fix for not loading plugins during tests, refs #626

* audio/wav not audio/wave, refs #603

* Black and mypy and ruff all happy

* Refactor to avoid generics

* Removed obsolete response() method

* Support text = await async_mock_model.prompt("hello")

* Initial docs for llm.get_async_model() and await model.prompt()

Refs #507

* Initial async model plugin creation docs

* duration_ms ANY to pass test

* llm models --async option

Refs https://github.com/simonw/llm/pull/613#issuecomment-2474724406

* Removed obsolete TypeVars

* Expanded register_models() docs for async

* await model.prompt() now returns AsyncResponse

Refs https://github.com/simonw/llm/pull/613#issuecomment-2475157822

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-13 17:51:00 -08:00
Hiepler
5a984d0c87
docs: add llm-grok (#629)
Adds`llm-grok` xAI API (https://github.com/Hiepler/llm-grok) to the plugin directory.

!stable-docs
2024-11-13 17:21:04 -08:00
Simon Willison
7520671176 audio/wav not audio/wave, refs #603 2024-11-12 21:43:07 -08:00
Travis Northcutt
c0cb1697bc
Update default model information (#622)
The default model is now 4o-mini; this change updates the usage page of the docs to reflect that
2024-11-12 19:06:16 -08:00
Simon Willison
dff53a9cae Better --help for llm keys get, refs #623 2024-11-11 09:53:24 -08:00
Simon Willison
561784df6e llm keys get command, refs #623 2024-11-11 09:47:13 -08:00
Simon Willison
febbc04fb6
Run cog -r in PRs, use that to update logging.md with new tables (#616)
* Create cog.yml
* Document attachments and prompt_attachments table schemas

Closes #615

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-06 06:56:19 -08:00
Simon Willison
98d2c19876 Promote alternative model providers in llm --help 2024-11-06 06:38:53 -08:00
Simon Willison
245e025270 Ran cog, refs #612 2024-11-05 23:45:17 -08:00
Chris Mungall
3b2e5263a3
Allow passing of can_stream in openai_models.py (#600)
* Allow passing of can_stream in openai_models.py

Fixes #599 

* Only set can_stream: false if it is false

Refs https://github.com/simonw/llm/pull/600#issuecomment-2458825866

* Docs for can_stream: false

---------

Co-authored-by: Simon Willison <swillison@gmail.com>
2024-11-05 23:04:13 -08:00
Simon Willison
12df1a3b2a Show attachment types in llm models --options, closes #612 2024-11-05 22:49:26 -08:00
Simon Willison
0cc4072bcd Support attachments without prompts, closes #611 2024-11-05 21:27:18 -08:00
Simon Willison
41cb5c3387 Ran cog, refs #608 2024-11-05 21:13:36 -08:00
Simon Willison
fe1e09706f
llm-lambda-labs
!stable-docs
2024-11-04 10:26:02 -08:00
Simon Willison
a44ba49c21 Release 0.17
Refs #587, #590, #591
2024-10-28 19:36:12 -07:00
Simon Willison
ba1ccb3a4a Release 0.17a0
Refs #587, #590
2024-10-28 15:46:52 -07:00
Simon Willison
1f822d820b Update docs with cog 2024-10-28 15:41:34 -07:00
Simon Willison
f0ed54abf1 Docs for CLI attachments, refs #587 2024-10-28 15:41:34 -07:00
Simon Willison
570a3eccae Python attachment documentation, plus fixed a mimetype detection bug
Refs #587
2024-10-28 15:41:34 -07:00
Simon Willison
1126393ba1 Docs for writing models that accept attachments, refs #587 2024-10-28 15:41:34 -07:00
Simon Willison
7e6031e382
llm-gguf, llm-jq
!stable-docs
2024-10-26 22:44:06 -07:00
Simon Willison
d654c95212 Release notes for 0.16 2024-09-12 16:20:12 -07:00
Simon Willison
bfcfd2c91b
o1-preview and o1-mini, refs #570 (#573) 2024-09-12 16:08:04 -07:00
Kian-Meng Ang
50520c7c1c
Fix typos (#567)
Found via `codespell -H -L wit,thre`

!stable-docs
2024-09-08 08:44:43 -07:00
Simon Willison
7d6ece2a31 Fix for broken markdown on openai-models page
Refs #558 !stable-docs
2024-08-25 18:03:46 -07:00
Simon Willison
6deed8f976 get_model() improvement, get_default_model() / set_default_wodel() now documented
Refs #553
2024-08-18 17:37:31 -07:00
Simon Willison
d075336c69 Release 0.15
Refs #515, #525, #536, #537
2024-07-18 12:31:14 -07:00
Simon Willison
562fefb374 Use 3-small in docs instead of ada-002
Spotted while working on #537
2024-07-18 12:23:49 -07:00
Simon Willison
fcba89d73b Update docs to reflect new gpt-4o-mini default, refs #536 2024-07-18 12:16:03 -07:00
Simon Donohue
50454c1957
Update outdated reference to gpt-4-turbo (#525)
Looks like this alias was overlooked in 8171c9a. This commit makes it
match with the usage of gpt-4o in the associated example.
2024-07-18 12:10:40 -07:00
Simon Willison
2881576dd0 Re-ran cog, refs #536 2024-07-18 12:00:35 -07:00
Simon Willison
96db13f537
Link to new video
!stable-docs
2024-06-17 10:18:24 -07:00
Simon Willison
68df9721de
github repo static badge
!stable-docs
2024-05-13 18:41:07 -07:00
Simon Willison
45245413bd
GitHub stars badge
!stable-docs
2024-05-13 15:09:56 -07:00
Simon Willison
9a3236db61 gpt-4-turbo model ID, closes #493 2024-05-13 13:37:23 -07:00
Simon Willison
ab1cc4fd1f Release 0.14
Refs #404, #431, #470, #490, #491
2024-05-13 13:26:48 -07:00
Fabian Labat
6cdc29c8d6
Update directory.md (#486)
* Update directory.md

Added support for Bedrock Llama 3
2024-05-13 13:01:33 -07:00
Simon Willison
3cc588f247 List llm-llamafile in plugins directory, closes #470 2024-05-13 12:55:22 -07:00
Simon Willison
8171c9a6bf Update help for GPT-4o, closes #490 2024-05-13 12:53:31 -07:00
Simon Willison
73bbbec372 gpt-4o model, refs #490 2024-05-13 12:49:45 -07:00
Simon Willison
04915e95f8
llm-groq
!stable-docs
2024-04-21 20:33:23 -07:00
Simon Willison
2a9b6113f5
llm-perplexity
Refs https://github.com/hex/llm-perplexity/issues/2

!stable-docs
2024-04-21 16:18:37 -07:00
Simon Willison
99a2836638
llm-fireworks
Refs https://github.com/simonw/llm-fireworks/issues/1

!stable-docs
2024-04-18 17:20:09 -07:00
Simon Willison
9ad9ac68dc
llm-reka in plugin directory
!stable-docs
2024-04-17 19:38:41 -07:00
Simon Willison
12e027d3e4
llm-command-r
!stable-docs

Refs https://github.com/simonw/llm-command-r/issues/1
2024-04-04 07:41:03 -07:00
Simon Willison
008efae86a
llm-cmd
!stable-docs

Refs https://github.com/simonw/llm-cmd/issues/1
2024-03-26 08:58:48 -07:00
Simon Willison
fb63c92cd2 llm logs -r/--response option, closes #431 2024-03-04 13:29:07 -08:00
Simon Willison
5852aa1574
llm-claude-3
!stable-docs
2024-03-04 10:48:57 -08:00
Simon Willison
de6af1ca2f
llm-embed-onnx in plugin directory
!stable-docs

Refs https://github.com/simonw/llm-embed-onnx/issues/1
2024-01-28 14:27:13 -08:00
Simon Willison
5293b9dd72
Added llm-together
!stable-docs
2024-01-27 10:40:40 -08:00