Simon Willison
c52cfee881
llm.get_models() and llm.get_async_models(), closes #640
2024-11-20 20:09:06 -08:00
Simon Willison
8a7b0c4f5d
response.usage() and await aresponse.usage(), closes #644
2024-11-19 21:25:37 -08:00
Simon Willison
cfb10f4afd
Log input tokens, output tokens and token details ( #642 )
...
* Store input_tokens, output_tokens, token_details on Response, closes #610
* llm prompt -u/--usage option
* llm logs -u/--usage option
* Docs on tracking token usage in plugins
* OpenAI default plugin logs usage
2024-11-19 20:21:59 -08:00
Simon Willison
4a059d722b
Log --async responses to DB, closes #641
...
Refs #507
2024-11-19 18:11:52 -08:00
Simon Willison
157b29ddeb
Test for basic async conversation, refs #632
2024-11-14 14:28:17 -08:00
Simon Willison
ba75c674cb
llm.get_async_model(), llm.AsyncModel base class and OpenAI async models ( #613 )
...
- https://github.com/simonw/llm/issues/507#issuecomment-2458639308
* register_model is now async aware
Refs https://github.com/simonw/llm/issues/507#issuecomment-2458658134
* Refactor Chat and AsyncChat to use _Shared base class
Refs https://github.com/simonw/llm/issues/507#issuecomment-2458692338
* fixed function name
* Fix for infinite loop
* Applied Black
* Ran cog
* Applied Black
* Add Response.from_row() classmethod back again
It does not matter that this is a blocking call, since it is a classmethod
* Made mypy happy with llm/models.py
* mypy fixes for openai_models.py
I am unhappy with this, had to duplicate some code.
* First test for AsyncModel
* Still have not quite got this working
* Fix for not loading plugins during tests, refs #626
* audio/wav not audio/wave, refs #603
* Black and mypy and ruff all happy
* Refactor to avoid generics
* Removed obsolete response() method
* Support text = await async_mock_model.prompt("hello")
* Initial docs for llm.get_async_model() and await model.prompt()
Refs #507
* Initial async model plugin creation docs
* duration_ms ANY to pass test
* llm models --async option
Refs https://github.com/simonw/llm/pull/613#issuecomment-2474724406
* Removed obsolete TypeVars
* Expanded register_models() docs for async
* await model.prompt() now returns AsyncResponse
Refs https://github.com/simonw/llm/pull/613#issuecomment-2475157822
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-13 17:51:00 -08:00
Simon Willison
7520671176
audio/wav not audio/wave, refs #603
2024-11-12 21:43:07 -08:00
Simon Willison
561784df6e
llm keys get command, refs #623
2024-11-11 09:47:13 -08:00
Simon Willison
5d1d723d4b
Special case treat audio/wave as audio/wav, closes #603
2024-11-07 17:13:54 -08:00
Simon Willison
12df1a3b2a
Show attachment types in llm models --options, closes #612
2024-11-05 22:49:26 -08:00
Simon Willison
1a60fa1667
Test to exercise gpt-4o-audio-preview, closes #608
2024-11-05 21:50:00 -08:00
Simon Willison
39d61d433a
Automated tests for attachments, refs #587
2024-10-28 19:21:11 -07:00
Simon Willison
758ff9ac17
Upgrade to pytest-httpx>=0.33.0
2024-10-28 15:41:34 -07:00
Simon Willison
db1d77f486
assert_all_responses_were_requested=False
2024-10-28 15:41:34 -07:00
Simon Willison
286cf9fcd9
attachments= keyword argument, tests pass again - refs #587
2024-10-28 15:41:34 -07:00
Andrew Wason
a466ddf3cd
Fix broken tests. Drop python 3.8. ( #580 )
...
* Fix broken tests. Drop python 3.8.
* Test on Python 3.13 too
---------
Co-authored-by: Simon Willison <swillison@gmail.com>
2024-10-27 11:26:47 -07:00
Simon Willison
6deed8f976
get_model() improvement, get_default_model() / set_default_wodel() now documented
...
Refs #553
2024-08-18 17:37:31 -07:00
Simon Willison
a83421607a
Switch default model to gpt-4o-mini (from gpt-3.5-turbo), refs #536
2024-07-18 11:57:19 -07:00
Simon Willison
964f4d9934
Fix for llm logs -q plus -m bug, closes #515
2024-06-16 14:35:38 -07:00
Simon Willison
fb63c92cd2
llm logs -r/--response option, closes #431
2024-03-04 13:29:07 -08:00
Simon Willison
8021e12aaa
Windows readline fix, plus run CI against macOS and Windows
...
* Run CI on Windows and macOS as well as Ubuntu, refs #407
* Use pyreadline3 on win32
* Back to fail-fast since we have a bigger matrix now
* Mark some tests as xfail on windows
2024-01-26 16:24:58 -08:00
Simon Willison
9119b03a07
Chmod 600 keys.json on creation, refs #351
2024-01-26 13:18:13 -08:00
Simon Willison
214fcaaf86
Upgrade to run against OpenAI >= 1.0
...
* strategy: fail-fast: false - to help see all errors
* Apply latest Black
Refs #325
2024-01-25 22:00:44 -08:00
Simon Willison
469644c2af
Test using test-llm-load-plugins.sh
...
* Run test-llm-load-plugins.sh, closes #378
2024-01-25 17:44:34 -08:00
Simon Willison
8b78ac6099
Fix for bug where embed did not use default model, closes #317
2023-10-31 21:19:59 -07:00
e. alvarez
839b4d7161
Fix issues: #274 , #280 ( #282 )
...
* Fix issue with reading directories in `iterate_files()` (#280 )
* Add directory checking logic in `iterate_files()` (#274 )
* Added tests for #282 , #274 , #280
---------
Co-authored-by: Simon Willison <swillison@gmail.com>
2023-09-18 23:14:30 -07:00
Simon Willison
b4ec54ef19
NotImplementedError for system prompts with OpenAI completion models, refs #284
...
Signed-off-by: Simon Willison <swillison@gmail.com>
2023-09-18 22:51:22 -07:00
Simon Willison
f76b2120e4
Revert "Handle system prompts for completion models, refs #284 "
...
This reverts commit 4eed871cf1 .
Decesion made in #288
2023-09-18 22:44:38 -07:00
Simon Willison
4eed871cf1
Handle system prompts for completion models, refs #284
2023-09-18 22:36:38 -07:00
Simon Willison
fcff36c6bc
completion: true to register completion models, refs #284
2023-09-18 22:17:26 -07:00
Simon Willison
4fea46113f
logprobs support for OpenAI completion models, refs #284
2023-09-18 22:04:28 -07:00
Simon Willison
2b504279d9
Test for OpenAI chat streaming, closes #287
2023-09-18 21:27:36 -07:00
Simon Willison
4d18da4e11
Bump default gpt-3.5-turbo-instruct max tokens to 256, refs #284
2023-09-18 20:29:39 -07:00
Simon Willison
4d46ebaa32
OpenAI completion models including gpt-3.5-turbo-instruct, refs #284
2023-09-18 18:34:32 -07:00
Simon Willison
356fcb72f6
NumPy decoding docs, plus extra tests for llm.encode/decode
...
!stable-docs
Refs https://discord.com/channels/823971286308356157/1128504153841336370/1151975583237034056
2023-09-14 14:01:47 -07:00
Simon Willison
33dee4762e
llm embed-multi --batch-size option, closes #273
2023-09-13 16:33:27 -07:00
Simon Willison
b9478e6a17
batch_size= argument to embed_multi(), refs #273
2023-09-13 16:24:04 -07:00
Simon Willison
6c43948325
llm.user_dir() creates directory if needed, closes #275
...
Would have fixed this bug too:
- https://github.com/simonw/llm-sentence-transformers/issues/9
2023-09-13 15:58:18 -07:00
Simon Willison
603da35e37
Fixed flaky order tests, refs #271
2023-09-12 11:37:36 -07:00
Simon Willison
9e529bb36a
Wrap !multi in single quotes, for consistency with exit/quit
2023-09-12 10:45:02 -07:00
Simon Willison
f54f2c659d
response.__str__ method, closes #268
2023-09-12 10:36:29 -07:00
Simon Willison
9c33d30843
llm chat !multi support, closes #267
2023-09-12 09:31:20 -07:00
Simon Willison
22a59f795e
llm collections defaults to llm collections list, close #265
2023-09-11 23:08:11 -07:00
Simon Willison
591ad6f571
Revert "Reuse embeddings for hashed content, --store now works on second run - closes #224 "
...
This reverts commit 267e2ea999 .
It's broken, see:
https://github.com/simonw/llm/issues/224#issuecomment-1715014393
2023-09-11 22:56:52 -07:00
Simon Willison
267e2ea999
Reuse embeddings for hashed content, --store now works on second run - closes #224
2023-09-11 22:44:22 -07:00
Simon Willison
52cec1304b
Binary embeddings ( #254 )
...
* Binary embeddings support, refs #253
* Write binary content to content_blob, with tests - refs #253
* supports_text and supports_binary embedding validation, refs #253
2023-09-11 18:58:44 -07:00
Simon Willison
5ba34dbe36
llm embed-db is now llm collections, refs #229
2023-09-10 14:24:27 -07:00
Alexis Métaireau
df32d7685d
Updated error message for invalid or missing embedding model ( #257 )
...
* Updated error message for missing embedding model
---------
Co-authored-by: Simon Willison <swillison@gmail.com>
2023-09-10 11:56:29 -07:00
Simon Willison
2246e8f4fd
Make tests robust against extra plugins, closes #258
2023-09-10 11:21:04 -07:00
Simon Willison
ae7f4f6de7
llm chat -o/--option - refs #244
2023-09-10 11:14:28 -07:00