discussions: Remote API Extension #3505

Closed
6 of 15 tasks
dan-menlo opened this issue Aug 30, 2024 · 6 comments
Assignees
Labels
category: providers Local & remote inference providers P1: important Important feature / fix type: epic A major feature or initiative

Comments


dan-menlo commented Aug 30, 2024

Goal

  • Remote API extensions are modular (e.g. in a separate GitHub repo)
  • Remote API extensions can refresh their model list on demand (preferably by calling their list API)
  • OR: users can specify the model list in Remote Extensions (e.g. as a comma-separated list)
  • Remote API Extensions can fetch an updated model list (e.g. via params that can be passed in)
  • Users can select a model from the list once
  • Users should only see major models (i.e. not nightly builds)
  • We will not need a model.yaml for each remote model
  • [Stretch] Users can add additional model names in the Remote API's Settings page (e.g. nightly)

Out-of-scope

  • Getting people to add Remote API Extensions
  • Refactor Remote API Extensions into separate repo/npm package (e.g. groq-extension)

Tasklist

Remote API Extensions

Existing Issues

@dan-menlo dan-menlo converted this from a draft issue Aug 30, 2024
@imtuyethan imtuyethan added the type: epic A major feature or initiative label Aug 30, 2024
@freelerobot freelerobot added the P1: important Important feature / fix label Sep 5, 2024
freelerobot commented:

Dupe of #3374

@freelerobot freelerobot moved this from Planning to Need Investigation in Jan & Cortex Sep 5, 2024
@dan-menlo dan-menlo changed the title epic: Remote API Extension that is modular and with easily updateable model lists epic: Remote API Extension Revamp Sep 9, 2024
@dan-menlo dan-menlo self-assigned this Sep 10, 2024
@dan-menlo dan-menlo moved this from Need Investigation to Planning in Jan & Cortex Sep 10, 2024
@dan-menlo dan-menlo removed the status in Jan & Cortex Sep 10, 2024
@dan-menlo dan-menlo moved this to Planning in Jan & Cortex Sep 10, 2024
@imtuyethan imtuyethan added the category: providers Local & remote inference providers label Sep 18, 2024
@dan-menlo dan-menlo changed the title epic: Remote API Extension Revamp architecture: Remote API Extension Revamp Sep 27, 2024

louis-jan commented Sep 27, 2024

Separation of Concerns

  1. How does the model list work?
    • Remote extensions should auto-populate their models, i.e. via the provider's /models list endpoint.
    • We cannot build hundreds of model.json files manually.
    • The current extension framework is actually designed to handle this; it's just an implementation issue in the extensions, which can be improved.
    • There was a hacky UI implementation where we pre-populated models, then disabled all of them until the API key was set. That behavior should be part of the extension, not the Jan app.
    • Extension builders still ship default available models. We don't close the door; we improve the example.
    // Before
    override async onLoad(): Promise<void> {
      super.onLoad()
      // Register Settings (API Key, Endpoints)
      this.registerSettings(SETTINGS)

      // Pre-populate models - persist model.json files
      // MODELS are model.json files that come with the extension.
      this.registerModels(MODELS)
    }

    // After
    override async onLoad(): Promise<void> {
      super.onLoad()
      // Register Settings (API Key, Endpoints)
      this.registerSettings(SETTINGS)

      // Fetch models from the provider's models endpoint - just a simple fetch
      // Defaults to `/models`
      get('/models').then((models) => {
        // Model builder will construct the model template (aka preset)
        // This operation builds Model DTOs that work with the app.
        this.registerModels(this.modelBuilder.build(models))
      })
    }
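For illustration, the model-builder step could look something like the sketch below. This is a hypothetical reading of the design, not the actual Jan framework API: ProviderModel, ModelDTO, and buildModels are illustrative names.

```typescript
// Hypothetical sketch of the model-builder step: map a raw /models
// response onto the DTO shape the app consumes, applying one shared
// parameter preset. All names here are illustrative.
interface ProviderModel {
  id: string
}

interface ModelDTO {
  id: string
  name: string
  engine: string
  parameters: Record<string, unknown>
}

function buildModels(
  raw: ProviderModel[],
  engine: string,
  preset: Record<string, unknown> = { max_tokens: 4096, temperature: 0.7 }
): ModelDTO[] {
  return raw.map((m) => ({
    id: m.id,
    name: m.id, // display name defaults to the provider's model id
    engine,
    parameters: { ...preset }, // copy so models don't share one object
  }))
}
```

The key point is that no per-model model.json ships with the extension; the preset supplies shared defaults and the provider's list supplies the ids.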
Remote Provider Extension diagram (Draw.io): https://drive.google.com/file/d/1pl9WjCzKl519keva85aHqUhx2u0onVf4/view?usp=sharing
  1. Supported parameters?
    • Each provider works with different parameters, but they all share the same basic function as the ones currently defined.
    • We already support transformPayload and transformResponse to adapt to these cases.
    • So users still see consistent parameters from model to model, but the magic happens behind the scenes, where the transformations are handled under the hood.
    /**
     * transformPayload Example
     * Transform the payload before sending it to the inference endpoint.
     * The new preview models such as o1-mini and o1-preview replaced the
     * max_tokens parameter with max_completion_tokens. Others did not.
     */
    transformPayload = (payload: OpenAIPayloadType): OpenAIPayloadType => {
      // Transform the payload for preview models
      if (this.previewModels.includes(payload.model)) {
        const { max_tokens, ...params } = payload
        return { ...params, max_completion_tokens: max_tokens }
      }
      // Pass through for official models
      return payload
    }
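The transformResponse counterpart is mentioned above but not shown; here is a hedged sketch. The incoming `output_text` field is a hypothetical non-OpenAI provider shape chosen for illustration, not a real provider's API.

```typescript
// Sketch of the transformResponse hook: normalize a provider-specific
// response shape into the OpenAI-style one the app reads.
// The `output_text` field is a hypothetical provider shape.
type ChatChoice = { message: { role: string; content: string } }
type NormalizedResponse = { choices: ChatChoice[] }
type RawResponse = Partial<NormalizedResponse> & { output_text?: string }

const transformResponse = (response: RawResponse): NormalizedResponse => {
  // Normalize providers that return a bare output_text field
  if (response.output_text !== undefined && !response.choices) {
    return {
      choices: [
        { message: { role: 'assistant', content: response.output_text } },
      ],
    }
  }
  // OpenAI-compatible responses pass through untouched
  return { choices: response.choices ?? [] }
}
```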
  2. Decoration?
    {
      "name": "openai-extension",
      "displayName": "OpenAI Extension Provider",
      "icon": "https://openai.com/logo.png"
    }
  3. Just remove the hacky parts from Jan.
  • Model Dropdown: it currently checks whether the engine is nitro or not to split the local and cloud sections, so a new local engine (e.g. cortex.cpp) would be treated as a remote engine. -> Filter by extension type instead (class name or type, e.g. LocalOAIEngine vs RemoteOAIEngine).
  • All models from a cloud provider are disabled by default if no API key is set. But what if I use a self-hosted endpoint without API key restrictions? Model availability should be determined by the extension: when there are no credentials meeting the requirements, the result is an empty section, indicating no available models. When users input the API key on the extension's settings page, the extension fetches the model list automatically and caches it. Users can also refresh the model list from there (we should not fetch too often - we are building a local-first application).
  • Application settings can be a bit confusing, with Model Providers and Core Extensions listed separately. Where do other extensions fit in?
Extension settings do not have a community or "others" section
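The type-based dropdown filter suggested above could be sketched as follows. Only the LocalOAIEngine/RemoteOAIEngine names come from this thread; the class bodies and partitionEngines helper are illustrative stubs.

```typescript
// Sketch: partition engines by base class instead of matching the
// engine name string "nitro". Class bodies are illustrative stubs.
abstract class OAIEngine {}
class LocalOAIEngine extends OAIEngine {}
class RemoteOAIEngine extends OAIEngine {}

function partitionEngines(engines: OAIEngine[]) {
  return {
    // Local section: any engine extending LocalOAIEngine (e.g. cortex.cpp)
    local: engines.filter((e) => e instanceof LocalOAIEngine),
    // Cloud section: remote provider extensions
    remote: engines.filter((e) => e instanceof RemoteOAIEngine),
  }
}
```

With this, a newly added local engine lands in the local section automatically, with no string matching in the app.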
  1. Extension installation should be a straightforward process that requires minimal effort.
  • There is no official way to install an extension from a GitHub repository URL, and users typically don't know how to package and install software from source.
  • There should be a shortcut on the settings page that lets users input the URL, pops up the extension repository details, and then installs from there.
  2. It would be helpful to provide a list of community extensions, allowing users to easily find the right extension for their specific use case without having to search.

dan-menlo commented:

Idea from @norrybul: janhq/models#23 (comment)

louis-jan commented:

> Idea from @norrybul: janhq/models#23 (comment)

Hi @dan-homebrew, that's what we initially thought we should do, but there are a couple of problems, so we've pushed back the Custom OAI Extension:

  1. Limitations of UI support in extensions.
  2. Model pre-population would establish a 1-1 mapping between model.json and extension settings. Once the Model Cache work is complete, the extension will no longer rely on model.json.
  3. Why not use the existing extensions instead, e.g. OpenAI, OpenRouter...?
  4. Could it serve as a good example of a community extension?

@dan-menlo dan-menlo moved this from Scheduled to Investigating in Jan & Cortex Sep 29, 2024
@dan-menlo dan-menlo changed the title architecture: Remote API Extension Revamp architecture: Remote API Extension Oct 13, 2024

freelerobot commented Oct 14, 2024

Using /models to auto-populate available remote models.

✅ This is a great idea

Let's account for:

  • A remote inference provider may have a differently named /models endpoint, e.g. /v1/models or /v2/models (silly example, but the path should be flexible and easy to configure)
  • A remote inference provider may not have a models-list endpoint at all. Dumb idea: can we ask the extension builder to just provide a hardcoded JSON file with what the endpoint would otherwise return? E.g. we expect to read a local models.json file.
  • I actually don't think the edge case where a user wants to add additional models when there is already a /models endpoint is that common - do we need to handle this?
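The first two points above could be covered by making the models source a configurable path with a bundled fallback. A sketch under assumed names (listModels and the injected fetchJson are hypothetical, not an existing API):

```typescript
// Sketch: the models source is a configurable endpoint path with a
// bundled JSON fallback for providers without a list endpoint.
type ModelEntry = { id: string }

async function listModels(
  fetchJson: (path: string) => Promise<ModelEntry[]>,
  opts: { modelsPath?: string; fallback?: ModelEntry[] } = {}
): Promise<ModelEntry[]> {
  // Path is configurable: /models, /v1/models, /v2/models, ...
  const path = opts.modelsPath ?? '/models'
  try {
    return await fetchJson(path)
  } catch {
    // No list endpoint: fall back to the extension's bundled models.json
    return opts.fallback ?? []
  }
}
```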

Right Panel: Inference Parameters UI Extensions

Inference parameters will vary across APIs.

Extensions DevEx

@louis-jan what's the extensions DevEx? Can you provide a full example for adding OpenAI?
Flows:

  • User registers a new remote provider endpoint
  • User configures models list (or just uses default)
  • User configures settings parameters (or just uses default)
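The flows above would be driven by what the extension registers as settings. A hypothetical SETTINGS fragment for illustration (keys, controllerType values, and overall shape are assumptions, not the real framework schema):

```typescript
// Hypothetical SETTINGS fragment for the flows above: credentials, an
// endpoint the user can point at any provider, and an editable models
// list. Keys and shape are illustrative, not the real framework schema.
const SETTINGS = [
  {
    key: 'api-key',
    title: 'API Key',
    controllerType: 'input',
    controllerProps: { value: '', type: 'password' },
  },
  {
    key: 'chat-completions-endpoint',
    title: 'Chat Completions Endpoint',
    controllerType: 'input',
    controllerProps: { value: 'https://api.openai.com/v1/chat/completions' },
  },
  {
    key: 'models-list',
    title: 'Models',
    controllerType: 'input',
    controllerProps: { value: 'gpt-4o,gpt-4o-mini' }, // comma-separated override
  },
]
```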

Extensions Hub

I like that we're thinking about how to showcase and list available community extensions.
I think we should take it out of the scope of this particular epic.
I've created a separate epic for us to think through this here: #3788
For the scope of this epic, let's assume we will package extensions into a monorepo


louis-jan commented Oct 14, 2024

Hey @0xSage, an extension-builder can offer a JSON, then call registerModels from the extension. This JSON can either be a local file or the result of a remote fetch. We support both. So it's not restricted to the /models path, but rather the fetch action.

We don't offer an API for users to register a new remote provider endpoint via extension code, as it's merely a utility or a small example they can copy over from our provided examples. The extension framework is designed to supply APIs that facilitate the interaction of extensions with the application.

I already added examples of:

  1. Extension builders can register models from their model sources, which can be either a JSON file or the result of a fetch operation.
  2. Extension builders can transform setting parameters - the setting interfaces stay the same, but the payload is transformed under the hood.

#3505 (comment)

// Register models with a JSON file
override async onLoad(): Promise<void> {
  super.onLoad()
  // Register Settings (API Key, Endpoints)
  this.registerSettings(SETTINGS)

  // Pre-populate models - persist model.json files
  // MODELS are model.json files that come with the extension.
  this.registerModels(MODELS)
}

// Register models with the provider's models list endpoint
override async onLoad(): Promise<void> {
  super.onLoad()
  // Register Settings (API Key, Endpoints)
  this.registerSettings(SETTINGS)

  // Fetch models from the provider's models endpoint - just a simple fetch
  // Defaults to `/models`
  get('/models').then((models) => {
    // Model builder will construct the model template (aka preset)
    // This operation builds Model DTOs that work with the app.
    // They can transform model parameters right here for supported settings
    this.registerModels(this.modelBuilder.build(models))
  })
}

/**
 * transformPayload Example
 * Transform the payload before sending it to the inference endpoint.
 * The new preview models such as o1-mini and o1-preview replaced the
 * max_tokens parameter with max_completion_tokens. Others did not.
 */
transformPayload = (payload: OpenAIPayloadType): OpenAIPayloadType => {
  // Transform the payload for preview models
  if (this.previewModels.includes(payload.model)) {
    const { max_tokens, ...params } = payload
    return { ...params, max_completion_tokens: max_tokens }
  }
  // Pass through for official models
  return payload
}
