Model Options Support #288
Replies: 2 comments
-
I have some concerns regarding Options-related logic. Currently at startup time, a (default) Options is created, and at runtime, a (runtime) Options is created, then some logic will build (native) Options from these two Options using different merge methods.
I believe that writing logic to merge is error prone especially when copying values from different Options types (e.g. ChatOptionsImpl/PortableFunctionCallingOptions/AzureOpenAiChatOptions) to (native) Options (i.e. ChatCompletionsOptions). Also it is impossible to clear a default value at runtime (i.e. use a null value at runtime instead of a non-null default value). Can we reconsider how to support portable options (without merging) to avoid these issues? Thanks. |
Beta Was this translation helpful? Give feedback.
-
Hi @tzolov . I was looking for a support to dynamically change API KEY at runtime through options. I did not find any relevant information related to this. Can you help me know if this is supported? Right now if I add spring.ai.openai.api-key=YOUR_API_KEY in application.properties, its working, but if I want to set the API key at runtime, then I dont see any method options through OpenAIChatOptions . |
Beta Was this translation helpful? Give feedback.
-
With the new
ModelOptions
mechanism you can set any model-specific configurations on startup or at runtime.For example:
seed
,model
,temperature
,logitBias
,topP
… in case OpenAIChatClient ornum-thread
,top-k
,num-gpu
... for the OllamaChatClient.Use the auto-configuration
options
properties to set the default (e.g. start-up) options common for all model requests.Example for OpenAiChatClient:
or use the dedicated options builders, to create the options programmatically.
NOTE: The runtime options override the default one.
For more information, check the updated openai-chat , and openai-embeddings documentations.
Gradually we will enable the options support to all model clients.
The Azure OpenAI Chat/Embeddings Options PR is underway: #286 and the OllamaChatClient has a partial support.
Following diagram illustrates the data (and processing) flow for all model types:
Beta Was this translation helpful? Give feedback.
All reactions