Enable preparing nested OutputSchemas for serialization #1357

dbogunowicz · 2023-10-26T08:40:33Z

Currently, the prep_outputs_for_serialization function assumes that pipeline_outputs cannot be arbitrarily nested (allows up to one level of nesting). This is why, for more complex BaseModels returned by the python (e.g. TextGenerationOutput), the function will not properly convert nested numpy arrays to lists for serialization.

My diff makes sure that any pipeline_output: BaseModel that contains an arbitrary number and nesting depth of any fields that are either a BaseModel, list of numpy.ndarray is supported.

As a result, we can now serialize logits of the LLM output, that previously was not properly converted from numpy array to a list.

import requests

model_path = "hf:mgoin/TinyStories-1M-deepsparse"
prompt =  ["name one former president of the USA"]
server_address = "http://0.0.0.0:5543/v2/models/gen/infer"

payload = {"prompt": prompt, "output_scores": True, "include_prompt_logits": True}
response = requests.post(server_address, json=payload)
response = response.json()

num_tokens_prompt_and_generated = len(response["generations"][0]["score"])

payload = {"prompt": prompt, "output_scores": True}
response = requests.post(server_address, json=payload)
response = response.json()

num_tokens_generated = len(response["generations"][0]["score"])
num_tokens_prompt = num_tokens_prompt_and_generated - num_tokens_generated
print(f"Number of prompt tokens: {num_tokens_prompt}")
print(f"Number of generated tokens: {num_tokens_generated}")

returns

Number of prompt tokens: 6
Number of generated tokens: 3

src/deepsparse/server/helpers.py

* initial commit to unblock derek * ready for review * add unit test

dbogunowicz added 2 commits October 26, 2023 08:40

initial commit to unblock derek

213dde4

ready for review

a9b628f

dbogunowicz changed the title ~~[WiP] Properly unserialize nested OutputSchemas~~ Enable preparing nested OutputSchemas for serialization Oct 26, 2023

dbogunowicz requested a review from bfineran October 26, 2023 09:29

dbogunowicz assigned rahul-tuli Oct 26, 2023

dbogunowicz requested review from dsikka and rahul-tuli October 26, 2023 09:30

dbogunowicz assigned dbogunowicz and unassigned rahul-tuli Oct 27, 2023

bfineran previously approved these changes Oct 30, 2023

View reviewed changes

src/deepsparse/server/helpers.py Show resolved Hide resolved

dbogunowicz and others added 2 commits October 31, 2023 08:39

Merge branch 'main' into feature/damian/nested_unserialisation

8a0c099

add unit test

78dc97f

dbogunowicz dismissed bfineran’s stale review via 78dc97f October 31, 2023 09:08

dbogunowicz requested a review from bfineran October 31, 2023 09:08

Satrat approved these changes Oct 31, 2023

View reviewed changes

bfineran approved these changes Oct 31, 2023

View reviewed changes

dbogunowicz merged commit 8dbb4ad into main Nov 1, 2023
13 checks passed

dbogunowicz deleted the feature/damian/nested_unserialisation branch November 1, 2023 13:36

dbogunowicz added a commit that referenced this pull request Nov 2, 2023

Enable preparing nested OutputSchemas for serialization (#1357)

3e4fa09

* initial commit to unblock derek * ready for review * add unit test

bfineran pushed a commit that referenced this pull request Nov 2, 2023

Enable preparing nested OutputSchemas for serialization (#1357)

ae291dd

* initial commit to unblock derek * ready for review * add unit test

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable preparing nested OutputSchemas for serialization #1357

Enable preparing nested OutputSchemas for serialization #1357

dbogunowicz commented Oct 26, 2023 •

edited

Loading

Enable preparing nested OutputSchemas for serialization #1357

Enable preparing nested OutputSchemas for serialization #1357

Conversation

dbogunowicz commented Oct 26, 2023 • edited Loading

dbogunowicz commented Oct 26, 2023 •

edited

Loading