gptel

Author	SHA1	Message	Date
Karthik Chikmagalur	66a63e6c82	gptel-ollama: switch to chat API * gptel-ollama.el (gptel-curl--parse-stream, gptel--parse-response, gptel--request-data, gptel--parse-buffer, gptel--ollama-context, gptel--ollama-token-count, gptel-make-ollama): Switch to Ollama's chat API from the completions API. This makes interacting with Ollama fully stateless, like all the other APIs, and should help significantly with issues like #249 and #279. Support non-streaming responses from Ollama in the process. Remove `gptel--ollama-context` as it is no longer needed. Add a `gptel--ollama-token-count` for tracking token costs. A UI affordance for this is not implemented yet, but is planned.	2024-04-22 12:39:21 -07:00
Karthik Chikmagalur	b31c9be5e0	gptel-ollama: Adjust Ollama stream parser for libjansson * gptel-ollama.el (gptel-curl--parse-stream): libjansson and json.el behave differently w.r.t moving point when there is a parsing error. Fix by explicitly handling point when there is an error. (#255)	2024-03-16 20:17:52 -07:00
Karthik Chikmagalur	f58ad9435c	gptel: Use libjansson support if available Using the libjansson JSON parser gives us a modest boost in speed. It's not as significant a speedup as it is for LSP clients since our jSON payloads are smaller and less frequent -- but we might as well use it. * gptel.el (gptel--json-read, gptel--json-encode, gptel--url-get-response, gptel--parse-response): Define macros to use the libjansson-supported `json-parse-buffer` and `json-serialize`. Replace use of `json-encode` and `json-read` appropriately. * gptel-openai.el: (gptel-curl--parse-stream) : Use `gptel--json-read` instead of `json-read`. * gptel-ollama.el (gptel-curl--parse-stream): Use `gptel--json-read` instead of `json-read`. * gptel-gemini.el (gptel-curl--parse-stream): Use `gptel--json-read` instead of `json-read`. * gptel-curl.el (gptel-curl--get-args, gptel-curl--get-response, gptel-curl--log-response, gptel-curl--stream-cleanup, gptel-curl--parse-response): Use `gptel--json-read` and `gptel--json-encode` in place of the json.el versions. * gptel-anthropic.el (gptel-curl--parse-stream): Use `gptel--json-read` instead of `json-read`. * test/gptel-org-test.el: Use `gptel--json-read`.	2024-03-14 20:28:21 -07:00
r0man	43f625ecb9	gptel-openai: curl-args slot in gptel-backend (#221 ) gptel-openai.el (gptel-backend, gptel-make-openai, gptel-make-azure): Add a curl-args slot to the backend struct for additional Curl arguments. Usage example: This can be used to set the `--cert` and `--key` options in a custom backend that uses mutal TLS to communicate with an OpenAI proxy/gateway. gptel-curl.el (gptel-curl--get-args): Add backend-specific curl-args when creating HTTP requests. gptel-gemini.el (gptel-make-gemini): Add a curl-args slot to the constructor. gptel-kagi.el (gptel-make-kagi): Ditto. gptel-ollama.el (gptel-make-ollama): Ditto.	2024-02-20 15:21:46 -08:00
Karthik Chikmagalur	af5444a2ea	gptel: docstrings for multi-LLM support, bump version * gptel.el (gptel, gptel--insert-response, gptel-temperature, gptel-pre-response-hook, gptel-response-filter-functions, gptel-stream, gptel--convert-org, gptel-pre-response-hook, gptel-max-tokens, gptel-mode, gptel-request, gptel--insert-response, gptel-set-topic, gptel--convert-org): Replace mentions of "ChatGPT" with "LLM" or equivalent. Update package description and remove commented out obsolete code. Bump version to 0.6.5. * gptel-transient.el (gptel--crowdsourced-prompts-url, gptel--crowdsourced-prompts, gptel-menu, gptel-system-prompt, gptel-rewrite-menu, gptel--infix-max-tokens, gptel--suffix-system-message): Ditto. * gptel-ollama.el (gptel--request-data): Ditto * gptel-kagi.el (gptel--request-data): Ditto * gptel-curl.el (gptel-curl--stream-insert-response, gptel-curl--stream-cleanup, gptel-curl--stream-filter): Ditto	2024-02-03 14:29:12 -08:00
Karthik Chikmagalur	d0c685e501	gptel: checkdoc linting and indentation rules * gptel.el (gptel-use-header-line, gptel--parse-buffer): Docstrings. * gptel-transient.el (gptel--transient-read-variable): Docstring. * gptel-openai.el (gptel-make-openai, gptel-make-azure): Add indent declaration. * gptel-ollama.el (gptel-make-ollama): Add indent declaration. * gptel-kagi.el (gptel-make-kagi): Add indent declaration. * gptel-gemini.el (gptel-make-gemini): Add indent declaration.	2024-01-19 14:19:22 -08:00
Karthik Chikmagalur	bea31e33e2	gptel-ollama: Use default host in gptel-make-ollama	2024-01-12 22:27:20 -08:00
Karthik Chikmagalur	38095eaed5	gptel: Fix prompt collection bug + linting * gptel.el: Update package description. * gptel-gemini.el(gptel--request-data, gptel--parse-buffer): Add model temperature to request correctly. * gptel-ollama.el(gptel--parse-buffer): Ensure that newlines are trimmed correctly even when `gptel-prompt-prefix-string` and `gptel-response-prefix-string` are absent. Fix formatting and linter warnings. * gptel-openai.el(gptel--parse-buffer): Ditto.	2023-12-20 15:40:56 -08:00
daedsidog	644e341244	Add multiline prefixes & AI response prefixes (#142 ) gptel: Add customizable prompt/response prefixes gptel.el (gptel-prompt-prefix-alist, gptel-response-prefix-alist, gptel-prompt-prefix-string, gptel-response-prefix-string, gptel--url-get-response): Add customizable response prefixes (per major-mode) in `gptel-response-prefix-alist`. Rename `gptel-prompt-string` -> `gptel-prompt-prefix-string` The function `gptel-response-prefix-string` returns the prefix string for the response in the current major-mode. gptel-openai.el, gptel-ollama.el (gptel--parse-buffer): Remove the prompt and response prefixes when creating prompt strings to send to the LLM API. gptel-curl.el (gptel-curl--stream-cleanup, gptel-curl--stream-insert-response): Insert the response prefix for the current major-mode before inserting the LLM API response.	2023-12-15 09:30:16 -08:00
Karthik Chikmagalur	66d2bafad6	gptel-ollama: Fix buffer parsing * gptel-ollama.el (gptel--parse-buffer): The prompt construction for Ollama fails when starting from (point-min). Fix by checking if a valid text-property match object is found in the parsing.	2023-11-07 22:37:59 -08:00
Karthik Chikmagalur	cee5893d79	gptel: Appease the byte compiler.	2023-11-07 20:36:37 -08:00
Karthik Chikmagalur	c97778d5a8	gptel: address byte-compile and checkdoc warnings * gptel.el, gptel-transient.el, gptel-openai.el, gptel-ollama.el	2023-11-07 20:36:37 -08:00
Karthik Chikmagalur	1434bbac7b	gptel-ollama, gptel-openai: Add example of backend creation README: Fix error with Ollama backend instructions	2023-10-29 00:31:56 -07:00
Karthik Chikmagalur	6419e8f021	gptel: Add multi-llm support README.org: Update README with new information and a multi-llm demo. gptel.el (gptel-host, gptel--known-backends, gptel--api-key, gptel--create-prompt, gptel--request-data, gptel--parse-buffer, gptel-request, gptel--parse-response, gptel--openai, gptel--debug, gptel--restore-state, gptel, gptel-backend): Integrate multiple LLMs through the introcution of gptel-backends. Each backend is composed of two pieces: 1. An instance of a cl-struct, containing connection, authentication and model information. See the cl-struct `gptel-backend` for details. A separate cl-struct type is defined for each supported backend (OpenAI, Azure, GPT4All and Ollama) that inherits from the generic gptel-backend type. 2. cl-generic implementations of specific tasks, like gathering up and formatting context (previous user queries and LLM responses), parsing responses or responses streams etc. The four tasks currently specialized this way are carried out by `gptel--parse-buffer` and `gptel--request-data` (for constructing the query) and `gptel--parse-response` and `gptel-curl--parse-stream` (for parsing the response). See their implementations for details. Some effort has been made to limit the number of times dispatching is done when reading streaming responses. When a backend is created, it is registered in the collection `gptel--known-backends` and can be accessed by name later, such as from the transient menu. Only one of these backends is active at any time in a buffer, stored in the buffer-local variable `gptel-backend`. Most messaging, authentication etc accounts for the active backend, although there might be some leftovers. When using `gptel-request` or `gptel-send`, the active backend can be changed or let-bound. - Obsolete `gptel-host` - Fix the rear-sticky property when restoring sessions from files. - Document some variables (not user options), like `gptel--debug` gptel-openai.el (gptel-backend, gptel-make-openai, gptel-make-azure, gptel-make-gpt4all): This file (currently always loaded) sets up the generic backend struct and includes constructors for creating OpenAI, GPT4All and Azure backends. They all use the same API so a single set of defgeneric implemenations suffices for all of them. gptel-ollama.el (gptel-make-ollama): This file includes the cl-struct, constructor and requisite defgeneric implementations for Ollama support. gptel-transient.el (gptel-menu, gptel-provider-variable, gptel--infix-provider, gptel-suffix-send): - Provide access to all available LLM backends and models from `gptel-menu`. - Adjust keybindings in gptel-menu: setting the model and query parameters is now bound to two char keybinds, while redirecting input and output is bound to single keys.	2023-10-28 23:57:47 -07:00

14 commits