Simon Willison
656d8fa3c4
--xl/--extract-last flag for prompt and log list commands ( #718 )
...
Closes #717
2025-01-24 10:52:46 -08:00
web-sst
6f7ea406bf
Register full embedding model names ( #654 )
...
Provide backward compatible aliases.
This makes available the same model names that ttok uses.
2025-01-22 20:14:03 -08:00
Csaba Henk
88a8cfd9e4
llm logs -x/--extract option ( #693 )
...
* llm logs -x/--extract option
* Update docs/help.md for llm logs -x
* Added test for llm logs -x/--extract, refs #693
* llm logs -xr behaves same as llm logs -x
* -x/--extract in llm logging docs
---------
Co-authored-by: Simon Willison <swillison@gmail.com>
2025-01-10 15:53:04 -08:00
Simon Willison
b452effa09
llm models -q/--query option, closes #700
2025-01-09 11:37:33 -08:00
Simon Willison
000e984def
--extract support for templates, closes #681
2024-12-19 07:16:48 -08:00
Simon Willison
67d4a99645
llm prompt -x/--extract option, closes #681
2024-12-19 06:40:05 -08:00
Simon Willison
571f4b2a4d
Fix for UTC warnings
...
Closes #672
2024-12-12 14:57:23 -08:00
Simon Willison
b6be09aa28
Fix get_models() and get_async_models() duplicates bug
...
Closes #667 , refs #640
2024-12-05 13:44:07 -08:00
Simon Willison
f9af563df5
response.on_done() mechanism, closes #653
2024-12-01 15:47:23 -08:00
Simon Willison
c52cfee881
llm.get_models() and llm.get_async_models(), closes #640
2024-11-20 20:09:06 -08:00
Simon Willison
8a7b0c4f5d
response.usage() and await aresponse.usage(), closes #644
2024-11-19 21:25:37 -08:00
Simon Willison
cfb10f4afd
Log input tokens, output tokens and token details ( #642 )
...
* Store input_tokens, output_tokens, token_details on Response, closes #610
* llm prompt -u/--usage option
* llm logs -u/--usage option
* Docs on tracking token usage in plugins
* OpenAI default plugin logs usage
2024-11-19 20:21:59 -08:00
Simon Willison
4a059d722b
Log --async responses to DB, closes #641
...
Refs #507
2024-11-19 18:11:52 -08:00
Simon Willison
157b29ddeb
Test for basic async conversation, refs #632
2024-11-14 14:28:17 -08:00
Simon Willison
ba75c674cb
llm.get_async_model(), llm.AsyncModel base class and OpenAI async models ( #613 )
...
- https://github.com/simonw/llm/issues/507#issuecomment-2458639308
* register_model is now async aware
Refs https://github.com/simonw/llm/issues/507#issuecomment-2458658134
* Refactor Chat and AsyncChat to use _Shared base class
Refs https://github.com/simonw/llm/issues/507#issuecomment-2458692338
* fixed function name
* Fix for infinite loop
* Applied Black
* Ran cog
* Applied Black
* Add Response.from_row() classmethod back again
It does not matter that this is a blocking call, since it is a classmethod
* Made mypy happy with llm/models.py
* mypy fixes for openai_models.py
I am unhappy with this, had to duplicate some code.
* First test for AsyncModel
* Still have not quite got this working
* Fix for not loading plugins during tests, refs #626
* audio/wav not audio/wave, refs #603
* Black and mypy and ruff all happy
* Refactor to avoid generics
* Removed obsolete response() method
* Support text = await async_mock_model.prompt("hello")
* Initial docs for llm.get_async_model() and await model.prompt()
Refs #507
* Initial async model plugin creation docs
* duration_ms ANY to pass test
* llm models --async option
Refs https://github.com/simonw/llm/pull/613#issuecomment-2474724406
* Removed obsolete TypeVars
* Expanded register_models() docs for async
* await model.prompt() now returns AsyncResponse
Refs https://github.com/simonw/llm/pull/613#issuecomment-2475157822
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-11-13 17:51:00 -08:00
Simon Willison
7520671176
audio/wav not audio/wave, refs #603
2024-11-12 21:43:07 -08:00
Simon Willison
561784df6e
llm keys get command, refs #623
2024-11-11 09:47:13 -08:00
Simon Willison
5d1d723d4b
Special case treat audio/wave as audio/wav, closes #603
2024-11-07 17:13:54 -08:00
Simon Willison
12df1a3b2a
Show attachment types in llm models --options, closes #612
2024-11-05 22:49:26 -08:00
Simon Willison
1a60fa1667
Test to exercise gpt-4o-audio-preview, closes #608
2024-11-05 21:50:00 -08:00
Simon Willison
39d61d433a
Automated tests for attachments, refs #587
2024-10-28 19:21:11 -07:00
Simon Willison
758ff9ac17
Upgrade to pytest-httpx>=0.33.0
2024-10-28 15:41:34 -07:00
Simon Willison
db1d77f486
assert_all_responses_were_requested=False
2024-10-28 15:41:34 -07:00
Simon Willison
286cf9fcd9
attachments= keyword argument, tests pass again - refs #587
2024-10-28 15:41:34 -07:00
Andrew Wason
a466ddf3cd
Fix broken tests. Drop python 3.8. ( #580 )
...
* Fix broken tests. Drop python 3.8.
* Test on Python 3.13 too
---------
Co-authored-by: Simon Willison <swillison@gmail.com>
2024-10-27 11:26:47 -07:00
Simon Willison
6deed8f976
get_model() improvement, get_default_model() / set_default_wodel() now documented
...
Refs #553
2024-08-18 17:37:31 -07:00
Simon Willison
a83421607a
Switch default model to gpt-4o-mini (from gpt-3.5-turbo), refs #536
2024-07-18 11:57:19 -07:00
Simon Willison
964f4d9934
Fix for llm logs -q plus -m bug, closes #515
2024-06-16 14:35:38 -07:00
Simon Willison
fb63c92cd2
llm logs -r/--response option, closes #431
2024-03-04 13:29:07 -08:00
Simon Willison
8021e12aaa
Windows readline fix, plus run CI against macOS and Windows
...
* Run CI on Windows and macOS as well as Ubuntu, refs #407
* Use pyreadline3 on win32
* Back to fail-fast since we have a bigger matrix now
* Mark some tests as xfail on windows
2024-01-26 16:24:58 -08:00
Simon Willison
9119b03a07
Chmod 600 keys.json on creation, refs #351
2024-01-26 13:18:13 -08:00
Simon Willison
214fcaaf86
Upgrade to run against OpenAI >= 1.0
...
* strategy: fail-fast: false - to help see all errors
* Apply latest Black
Refs #325
2024-01-25 22:00:44 -08:00
Simon Willison
469644c2af
Test using test-llm-load-plugins.sh
...
* Run test-llm-load-plugins.sh, closes #378
2024-01-25 17:44:34 -08:00
Simon Willison
8b78ac6099
Fix for bug where embed did not use default model, closes #317
2023-10-31 21:19:59 -07:00
e. alvarez
839b4d7161
Fix issues: #274 , #280 ( #282 )
...
* Fix issue with reading directories in `iterate_files()` (#280 )
* Add directory checking logic in `iterate_files()` (#274 )
* Added tests for #282 , #274 , #280
---------
Co-authored-by: Simon Willison <swillison@gmail.com>
2023-09-18 23:14:30 -07:00
Simon Willison
b4ec54ef19
NotImplementedError for system prompts with OpenAI completion models, refs #284
...
Signed-off-by: Simon Willison <swillison@gmail.com>
2023-09-18 22:51:22 -07:00
Simon Willison
f76b2120e4
Revert "Handle system prompts for completion models, refs #284 "
...
This reverts commit 4eed871cf1 .
Decesion made in #288
2023-09-18 22:44:38 -07:00
Simon Willison
4eed871cf1
Handle system prompts for completion models, refs #284
2023-09-18 22:36:38 -07:00
Simon Willison
fcff36c6bc
completion: true to register completion models, refs #284
2023-09-18 22:17:26 -07:00
Simon Willison
4fea46113f
logprobs support for OpenAI completion models, refs #284
2023-09-18 22:04:28 -07:00
Simon Willison
2b504279d9
Test for OpenAI chat streaming, closes #287
2023-09-18 21:27:36 -07:00
Simon Willison
4d18da4e11
Bump default gpt-3.5-turbo-instruct max tokens to 256, refs #284
2023-09-18 20:29:39 -07:00
Simon Willison
4d46ebaa32
OpenAI completion models including gpt-3.5-turbo-instruct, refs #284
2023-09-18 18:34:32 -07:00
Simon Willison
356fcb72f6
NumPy decoding docs, plus extra tests for llm.encode/decode
...
!stable-docs
Refs https://discord.com/channels/823971286308356157/1128504153841336370/1151975583237034056
2023-09-14 14:01:47 -07:00
Simon Willison
33dee4762e
llm embed-multi --batch-size option, closes #273
2023-09-13 16:33:27 -07:00
Simon Willison
b9478e6a17
batch_size= argument to embed_multi(), refs #273
2023-09-13 16:24:04 -07:00
Simon Willison
6c43948325
llm.user_dir() creates directory if needed, closes #275
...
Would have fixed this bug too:
- https://github.com/simonw/llm-sentence-transformers/issues/9
2023-09-13 15:58:18 -07:00
Simon Willison
603da35e37
Fixed flaky order tests, refs #271
2023-09-12 11:37:36 -07:00
Simon Willison
9e529bb36a
Wrap !multi in single quotes, for consistency with exit/quit
2023-09-12 10:45:02 -07:00
Simon Willison
f54f2c659d
response.__str__ method, closes #268
2023-09-12 10:36:29 -07:00