Bring back Python backend based PyTorch backend #6518
Conversation
```python
import sys
import unittest.mock

import torch

sys.modules["triton_python_backend_utils"] = unittest.mock.MagicMock()
from py_runtime import _gather_torch_tensors, _scatter_torch_tensors
```
Can you add a comment noting that this imports from the Python-based implementation of the PyTorch backend? It wasn't clear to me at first, and I had to look around to figure that out. Could you also mention that the mock is only there to satisfy the import for the Python model implementation, since the test uses just the helpers and not the model itself?
```
backend: "onnxruntime"
runtime: ""
```
Should auto-complete fill in this field with the correct value as well (i.e., `/opt/tritonserver/backends/onnxruntime/libtriton_onnxruntime.so`)?
That was my original thought, but the issue is: how does auto-complete determine whether the model uses the C++ or the Python runtime? A simple solution is to have a backend-specific auto-complete implementation. For example, if the model uses the PyTorch backend and a `*.pt` model file is provided without any `*.py` file, use C++; if a `*.py` file is provided, use Python. See this commit.

However, from previous discussions with @nnshah1 and @rmccorm4, we want to stop adding more backend-specific logic into auto-complete and opt for a generic implementation, under which the Python runtime falls into the category of a custom backend as far as auto-complete is concerned. Backends can implement either the C++ or the Python runtime (or both), and we do not want to auto-complete to the Python runtime on a C++-only backend, which would require checking the backend installation directory. The current structure limits auto-complete to seeing only the model directory, not the backend directory, because the latter is resolved only when the model is loaded, which happens after auto-complete runs. An easier approach is to resolve the runtime, if not already filled in, when the model is loaded, at which point both the model and backend directory information are available.

Going back to the original question: auto-complete does not touch the runtime field; it is determined and filled in when the model is loaded, if it is not already filled.
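As a rough illustration of the load-time resolution described above, here is a minimal sketch. The function name, config shape, and the `model.py` file-name convention are assumptions for illustration only, not the actual Triton core implementation:

```python
import os

# Hypothetical sketch: resolve the runtime at model load time,
# once both the model and backend directories are known.
def resolve_runtime(config: dict, backend_name: str, backend_dir: str) -> str:
    # Respect a user-specified runtime; load-time resolution only fills gaps.
    if config.get("runtime"):
        return config["runtime"]
    # Prefer the Python runtime only if the backend actually ships one
    # (assumed file name "model.py" for this sketch).
    if os.path.exists(os.path.join(backend_dir, "model.py")):
        return "model.py"
    # Otherwise fall back to the backend's C++ shared library.
    return f"libtriton_{backend_name}.so"
```

This mirrors the approach in the reply: the backend directory is only known at load time, so the gap-filling happens there rather than in auto-complete.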
```
@@ -0,0 +1,147 @@
#!/bin/bash
```
Can you add a generic test that tries to set the `runtime` field to some invalid values, and verify that Triton handles them gracefully?
```
# config.pbtxt
# unexpected/bad value
runtime: "invalid_value"
```
and
```
# config.pbtxt
# no python implementation found
backend: "onnxruntime"
runtime: "model.py"
```
Maybe we can have something like this:

- `L0_runtime_invalid`
- `L0_runtime_pytorch_python`

and future tests can be `L0_runtime_*` for organization.
This could be a quick follow-up if you wanted to separate it.
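A minimal sketch of what such a graceful-handling check could look like, assuming the server is started with one of the bad configs above. The model name and the use of `tritonclient` are illustrative assumptions, not part of this PR:

```python
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# The server should survive the bad "runtime" value rather than crash...
assert client.is_server_live()

# ...and the offending model should simply fail to load.
# "invalid_runtime_model" is a hypothetical model name for this sketch.
assert not client.is_model_ready("invalid_runtime_model")
```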
I think we can simply add the invalid runtime test to `L0_model_config`, because many of the existing Python runtime related model config tests are already there, and the runtime escape test is also added there in this PR.

I like the idea of having `L0_runtime_*` for better organization. I think we can do a larger refactor later on when we see fit.
🚀
* Patch L0_model_config with runtime
* Add L0_pytorch_python_runtime
* Update expected runtime field
* Add test for escaping runtime
* Add comments on unit test imports
* Add invalid runtime test
* User to build PyTorch env
* Update copyright
This reverts commit 64433b3.
…"" This reverts commit 40e8ae8.
Related PRs:

* Minor changes to L0_model_config due to adding the "runtime" field to the model configuration.
* Adds the L0_pytorch_python_runtime test for the Python backend based PyTorch backend.