Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Torchfx dynamic shapes additional support #8

Conversation

cavusmustafa
Copy link
Owner

Details:

  • item1
  • ...

Tickets:

  • ticket-id

cavusmustafa and others added 30 commits June 30, 2024 18:05
### Details:
 - Fuse QKV FCs to one FC 

### Tickets:
 - 142815
…penvinotoolkit#25249)

### Details:
 - Skip fc fake alignment when fused desc has full tensor input

### Tickets:
 - 144293
### Details:
- *update shape for user when current node is the input of fused prim of
user even if the user is updated shape by other node*
 - *...*

### Tickets:
 - *145756*
### Details:
 - Coverity fixes

### Tickets:
 - *CVS-145082*

---------

Co-authored-by: Pavel Durandin <[email protected]>
…vinotoolkit#25356)

### Details:
 - Fixed mem leak in cl engine in roi align rotated test.

### Tickets:
 - *CVS-145836*
…penvinotoolkit#25365)

### Details:
- *aten::layer_norm support extended for support normalized shapes
represented as non constant and multiple dims*
 tested with stable-audio model

### Tickets:
 - TBD
…sed QKV FC (openvinotoolkit#25304)

### Details:
- Convert QKV FC + StridedSlices pattern to QKV FC + VariadicSplit for
potential optimization regarding crop runtime buffer fusing

### Tickets:
 - 144394
### Details:
 - Enabled whole Tensor comparison

### Tickets:
 - CVS-137211
…otoolkit#25253)

### Details:
CPU plugin doesn't natively support `boolean` data type, thus it's
replaced with `u8` during the convert precision transformation pass.
However, casting numerical types to `bool` implies clamping the
numerical values with the interval [0; 1] (either true or false), so to
mimic this behavior a Clamp operation should be inserted before the
modified convert.

### Tickets:
 - CVS-145166
### Details:
 - *item1*
 - *...*

### Tickets:
 - *CVS-136937*
…5354)

### Details:
 - *Fix regression introduced by openvinotoolkit#25077*

### Tickets:
 - *CVS-144695*
 - *CVS-144789*
Add unique models to precommit PA tests

Change the list of models covering all the big real models.

Removed:
hf-tiny-model-private/tiny-random-GPT2LMHeadModel
hf-tiny-model-private/tiny-random-BartForCausalLM,xfail
hf-tiny-model-private/tiny-random-BigBirdForCausalLM,xfail
hf-tiny-model-private/tiny-random-BigBirdPegasusForCausalLM,xfail
hf-tiny-model-private/tiny-random-BlenderbotSmallForCausalLM,xfail
hf-tiny-model-private/tiny-random-ErnieForCausalLM,xfail
hf-tiny-model-private/tiny-random-GPTNeoXJapaneseForCausalLM,xfail
hf-tiny-model-private/tiny-random-MBartForCausalLM,xfail
hf-tiny-model-private/tiny-random-MvpForCausalLM,xfail
hf-tiny-model-private/tiny-random-PLBartForCausalLM,xfail

PrunaAI/hf-tiny-model-private-tiny-random-BloomForCausalLM-bnb-4bit-smashed,xfail

PrunaAI/hf-tiny-model-private-tiny-random-BloomForCausalLM-bnb-8bit-smashed,xfail

PrunaAI/hf-tiny-model-private-tiny-random-BloomForCausalLM-HQQ-2bit-smashed,xfail
hf-internal-testing/tiny-random-MegatronBertForCausalLM,xfail
hf-internal-testing/tiny-random-BlenderbotSmallForCausalLM,xfail

Added:
hf-internal-testing/Mixtral-tiny
hf-internal-testing/tiny-random-gpt2
katuni4ka/tiny-random-xverse
katuni4ka/tiny-random-baichuan2-13b
katuni4ka/tiny-random-qwen
katuni4ka/tiny-random-aquilachat
katuni4ka/tiny-random-aquila2
katuni4ka/tiny-random-qwen1.5-moe
katuni4ka/tiny-random-codegen2
katuni4ka/tiny-random-olmo-hf
katuni4ka/tiny-random-baichuan2
katuni4ka/tiny-random-jais
katuni4ka/tiny-random-internlm
katuni4ka/tiny-random-internlm2
katuni4ka/tiny-random-minicpm
fxmarty/tiny-random-GemmaForCausalLM
fxmarty/tiny-dummy-qwen2
fxmarty/really-tiny-falcon-testing
Xenova/tiny-random-Phi3ForCausalLM
facebook/opt-350m
katuni4ka/tiny-random-dbrx
katuni4ka/tiny-random-falcon-40b
katuni4ka/tiny-random-orion,xfail,No ScaledDotProductAttention operation
observed in the graph
katuni4ka/tiny-random-chatglm2,xfail,Model references undeclared
parameters: beam_idx ()
katuni4ka/tiny-random-glm4,xfail,Model references undeclared parameters
beam_idx () attention_mask ()


### Tickets:
 - CVS-145243

Signed-off-by: Andrii Staikov <[email protected]>

---------

Signed-off-by: Andrii Staikov <[email protected]>
…it#25311)

### Details:
- Call `getPartialShape` method on dynamic input/output lead to error.
This change fix that.
### Details:
 - *item1*
 - *...*

### Tickets:
 - 145078

---------

Co-authored-by: Karol Blaszczak <[email protected]>
Reason: LLM beam search perf regression on XEON platforms

Signed-off-by: Chen Peter <[email protected]>
…openvinotoolkit#25235)

### Details:
 - *Propagate information from runtime loop info to serialized graphs*

### Tickets:
 - *-*
Incorrect check resulted into decompostion of different RNN operations
### Details:
 - *Support `aten::col2im`*

### Tickets:
 - *CVS-101054*
Transformation that undoing make_stateful from optimum-intel.

### How to use in Python
```python
import openvino as ov
from openvino._offline_transformations import stateful_to_stateless_transformation
core = ov.Core()
model = core.read_model('your_chatty_stateful_model_right_from_vanilla_optimum_intel.xml')
stateful_to_stateless_transformation(model)
# use `model`
```

### How to use in C++
```c++
#include <openvino/openvino.hpp>
#include <openvino/pass/stateful_to_stateless.hpp>

int main() {
    auto core = ov::Core();
    auto model = core.read_model("your_chatty_stateful_model_right_from_vanilla_optimum_intel.xml");
    ov::pass::StatefulToStateless().run_on_model(model);
    // use `model`
}
```

### TODO

- [x] Restore the original order of inputs/output (now they are not
globally ordered, but kv inputs corresponds to kv outputs by indices
with a proper offset).
- [x] Restore the original names of inputs and outputs based on
optimum-intel conventions in make_stateful.
### Details:
- Added path to tokenizers build to find ov wheel
(openvinotoolkit/openvino_tokenizers#183)
 
### Tickets:
 - *ticket-id*
@cavusmustafa cavusmustafa merged commit 99d1fcd into torchfx_dynamic_shapes_split_slice_support Jul 8, 2024
19 of 30 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment