Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Separate OpenAI compatible server support for "local server" to keep using OpenAI in parallel #3214

Closed
1 task done
jens-f opened this issue Jul 28, 2024 · 4 comments
Labels
category: providers Local & remote inference providers move to Cortex needs info Not enough info, more logs/data required type: feature request A new feature

Comments

@jens-f
Copy link

jens-f commented Jul 28, 2024

  • I have searched the existing issues

Is your feature request related to a problem? Please describe it

I'm switching between using OpenAI and a local open AI compatible endpoints a lot. Since swapping out the base url whenever I switch is tedious, I was thinking of using one of the other endpoints such as https://jan.ai/docs/remote-models/openrouter. But According to the manual at https://jan.ai/docs/remote-models/generic-openai we should use the OpenAI server.

Describe the solution

Would it be possible to add another Server to the configurations panel for an OpenAI compatible endpoint(s)? Ideally allowing us to give it a name (so it's easier to tell in the chat what model is used). But even a generic name would be sufficient right now. This way the user can tell in the chats if they are actually talking to OpenAI or a local model.

Note: This might be related to #2840 but felt different enough to open another request

Teachability, documentation, adoption, migration strategy

No response

What is the motivation / use case for changing the behavior?

Reduce manual steps needed to switch urls frequently. Increase usability since the chats will actually tell the user that the conversation is not with a real OpenAI model

@Van-QA
Copy link
Contributor

Van-QA commented Aug 21, 2024

Hi there, if you're looking for alternative solutions for local servers with local models, you might want to check out Cortex (https://github.com/janhq/cortex) for headless AI operations.
I appreciate you taking the time to report this. Let us know if you have any other questions! 😁

@jens-f
Copy link
Author

jens-f commented Aug 21, 2024

@Van-QA thanks for looking into this. This feature request is not about how to run local models - in our setup the OpenAI compatible serving is the standardized way to access models. The feature request was really just about having the ability to extend the Jan UI to allow us to run another OpenAI endpoint with a custom URL, so we don't have to manually swap out the url of the "official" OpenAI model in Jan all the time.

@Van-QA Van-QA reopened this Aug 21, 2024
@freelerobot freelerobot added the needs info Not enough info, more logs/data required label Sep 5, 2024
@freelerobot freelerobot moved this from Completed to Icebox in Jan & Cortex Sep 5, 2024
@freelerobot
Copy link
Contributor

freelerobot commented Oct 14, 2024

Related #3773
#3505

@imtuyethan
Copy link
Contributor

imtuyethan commented Dec 10, 2024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: providers Local & remote inference providers move to Cortex needs info Not enough info, more logs/data required type: feature request A new feature
Projects
Archived in project
Development

No branches or pull requests

4 participants