Adding QEFFAutoModel i.e. model loader for loading any type of model. #31

Merged
merged 20 commits into from
Jun 6, 2024

Conversation

ochougul
Contributor

@ochougul ochougul commented May 28, 2024

  • Added QEfficient.QEFFAutoModel that acts as loader for any kind of GENAI model
  • Added QEFFBaseModel that acts as parent class for any GENAI model loaded via QEFFAutoModel.from_pretrained
  • Added common interface for export and transform functions and made them compatible with QEFFBaseModel
  • Cleaned __init__ files
  • Cleaned infer API file

@ochougul ochougul requested a review from anujgupt-github May 29, 2024 08:23
@ochougul ochougul self-assigned this May 29, 2024
@ochougul ochougul added the enhancement New feature or request label May 29, 2024
Contributor

@vbaddi vbaddi left a comment

Also, in general, can we start capturing the necessary info via logger.info()? The log file can sit in the same qeff_models root directory.
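To make the logging suggestion concrete, here is a minimal sketch using only the standard `logging` module. The helper name `make_file_logger`, the logger name `qeff_sketch`, and the temporary directory standing in for the qeff_models root are all assumptions for illustration; none of them come from this PR.

```python
import logging
import tempfile
from pathlib import Path


def make_file_logger(root_dir: Path, name: str = "qeff_sketch") -> logging.Logger:
    """Return a logger that writes INFO records to <root_dir>/<name>.log.

    Hypothetical helper: the name and file layout are illustrative only.
    """
    root_dir.mkdir(parents=True, exist_ok=True)
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    # Attach the file handler once, so repeated calls do not duplicate lines.
    if not logger.handlers:
        handler = logging.FileHandler(root_dir / f"{name}.log", encoding="utf-8")
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger


# A temporary directory stands in for the real qeff_models root (assumption).
demo_root = Path(tempfile.mkdtemp()) / "qeff_models"
log = make_file_logger(demo_root)
log.info("exported model %s", "gpt2")

log_text = (demo_root / "qeff_sketch.log").read_text(encoding="utf-8")
```

Keeping the handler setup in one helper means the export, compile, and infer paths could share a single log file without each configuring logging on its own.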

QEfficient/exporter/export_hf_to_cloud_ai_100.py Outdated Show resolved Hide resolved
QEfficient/loader/loader_factory.py Outdated Show resolved Hide resolved
QEfficient/utils/_utils.py Outdated Show resolved Hide resolved
tests/utils.py Outdated Show resolved Hide resolved
@anujgupt-github
Contributor

@ochougul - it would help if you also pushed the README updates as part of this PR, so that it's easy to visualize how you want users to use the interface, and the review can be staged along with the intent.

@ochougul
Contributor Author

Also, in general, can we start capturing the necessary info via logger.info()? The log file can sit in the same qeff_models root directory.

Will do this in a separate PR.

@ochougul
Contributor Author

@ochougul - it would help if you also pushed the README updates as part of this PR, so that it's easy to visualize how you want users to use the interface, and the review can be staged along with the intent.

Updated README and notebooks

QEfficient/cross_compile.py Outdated Show resolved Hide resolved
QEfficient/cloud/infer.py Outdated Show resolved Hide resolved
QEfficient/cloud/infer.py Show resolved Hide resolved
@vbaddi
Contributor

vbaddi commented Jun 3, 2024

Also, as discussed, I feel it does make sense to wrap transform() inside the pre_trained() API, since that makes the interface equivalent to the transformers repo. ref: https://github.com/huggingface/optimum-nvidia?tab=readme-ov-file#generate
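A rough sketch of what that wrapping could look like, in plain Python. `TinyModel`, `transform`, and `SketchAutoModel` are hypothetical stand-ins, not the actual QEfficient or transformers classes, and the `apply_transform` flag is an assumption added for illustration.

```python
class TinyModel:
    """Stand-in for a loaded model; not a real transformers class."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.is_transformed = False


def transform(model: TinyModel) -> TinyModel:
    """Hypothetical transform step: here it only flips a flag."""
    model.is_transformed = True
    return model


class SketchAutoModel:
    """Hypothetical wrapper that applies transform() while loading."""

    def __init__(self, model: TinyModel, apply_transform: bool = True) -> None:
        # The transform runs inside __init__, so every construction path
        # (including from_pretrained) yields an already-transformed model.
        self.model = transform(model) if apply_transform else model

    @classmethod
    def from_pretrained(cls, name: str, apply_transform: bool = True) -> "SketchAutoModel":
        # A real loader would fetch weights here; the sketch builds a stub.
        return cls(TinyModel(name), apply_transform=apply_transform)


wrapped = SketchAutoModel.from_pretrained("demo-model")
raw = SketchAutoModel.from_pretrained("demo-model", apply_transform=False)
```

With this shape the caller never invokes transform() directly, which mirrors how a single from_pretrained call is used in transformers.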

@quic-rishinr quic-rishinr mentioned this pull request Jun 3, 2024
@ochougul
Contributor Author

ochougul commented Jun 4, 2024

[Image: class hierarchy diagram]

This is the hierarchy of classes being followed.
QEFFCommonModelLoader recognizes the type of the model from the model's config and tries to initialize the matching model class, e.g. QEFFAutoModelForCausalLM.
This path is used when the user passes only the name of the model to the infer API.

If, instead, the user wants to use the low-level APIs, they can initialize QEFFAutoModelForCausalLM or QEFFAutoModel directly, in the same way they would initialize transformers.AutoModelForCausalLM.
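The dispatch described above can be sketched roughly as follows. This is an illustration only: `SketchConfig`, the `Sketch*` classes, the suffix table, and the fallback to a generic class are all assumptions, and the real QEFFCommonModelLoader may identify the model type differently.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Type


@dataclass
class SketchConfig:
    """Stand-in for a model config; only the field the sketch needs."""

    architectures: List[str] = field(default_factory=list)


class SketchBaseModel:
    def __init__(self, config: SketchConfig) -> None:
        self.config = config


class SketchCausalLM(SketchBaseModel):
    """Plays the role of a causal-LM specific class."""


class SketchGenericModel(SketchBaseModel):
    """Plays the role of the generic fallback class."""


# Hypothetical lookup table: architecture-name suffix -> wrapper class.
_SUFFIX_TO_CLASS: Dict[str, Type[SketchBaseModel]] = {
    "ForCausalLM": SketchCausalLM,
    "LMHeadModel": SketchCausalLM,
}


def load_common(config: SketchConfig) -> SketchBaseModel:
    """Pick a wrapper class by inspecting the config, then instantiate it."""
    for arch in config.architectures:
        for suffix, cls in _SUFFIX_TO_CLASS.items():
            if arch.endswith(suffix):
                return cls(config)
    # Assumption: unknown architectures fall back to the generic class.
    return SketchGenericModel(config)


causal = load_common(SketchConfig(["LlamaForCausalLM"]))
generic = load_common(SketchConfig(["BertModel"]))
```

A user of the low-level path would skip `load_common` and construct `SketchCausalLM` directly, which corresponds to initializing QEFFAutoModelForCausalLM by hand.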

In this PR, I am adding support only for the from_pretrained method on these classes; in subsequent PRs, the following methods will be enabled for all of these classes:

  1. transform
  2. export
  3. execute (based on runtime)
  4. compile
    etc.
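One way to picture a common interface for the methods listed above is an abstract base class. The sketch below is an assumption about shape only: the class names, the `Runtime` members, and the no-argument signatures are hypothetical and are not taken from the QEfficient code.

```python
from abc import ABC, abstractmethod
from enum import Enum
from typing import List


class Runtime(Enum):
    """Hypothetical runtime choices; the member names are illustrative."""

    CPU = "cpu"
    ACCELERATOR = "accelerator"


class SketchBase(ABC):
    """Declares the methods every concrete model wrapper must provide."""

    @abstractmethod
    def transform(self) -> None: ...

    @abstractmethod
    def export(self) -> None: ...

    @abstractmethod
    def compile(self) -> None: ...

    @abstractmethod
    def execute(self, runtime: Runtime) -> None: ...


class RecordingModel(SketchBase):
    """Toy subclass that records which steps were called, in order."""

    def __init__(self) -> None:
        self.calls: List[str] = []

    def transform(self) -> None:
        self.calls.append("transform")

    def export(self) -> None:
        self.calls.append("export")

    def compile(self) -> None:
        self.calls.append("compile")

    def execute(self, runtime: Runtime) -> None:
        self.calls.append(f"execute:{runtime.value}")


model = RecordingModel()
model.transform()
model.export()
model.compile()
model.execute(Runtime.CPU)
```

Because the base class is abstract, a subclass that omits any of the four methods cannot be instantiated, which keeps the interface uniform across model types.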

@ochougul ochougul requested a review from irajagop as a code owner June 5, 2024 09:57
ochougul added 2 commits June 6, 2024 11:44
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
ochougul added 13 commits June 6, 2024 11:44
Signed-off-by: Onkar Chougule <[email protected]>
…n loader files * separated cross-compile script *separated utils funcs

Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
@ochougul ochougul force-pushed the feature/loader_api branch from 36baa5b to 4ae5825 Compare June 6, 2024 06:14
@ochougul ochougul requested a review from quic-jhugo June 6, 2024 08:41
Contributor

@vbaddi vbaddi left a comment

LGTM +1

@ochougul ochougul merged commit 4c8b4c9 into quic:main Jun 6, 2024
2 checks passed
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 7, 2024
…quic#31)

* all changes

Signed-off-by: Onkar Chougule <[email protected]>

* only loader changes

Signed-off-by: Onkar Chougule <[email protected]>

* removed unused imports

Signed-off-by: Onkar Chougule <[email protected]>

* allowed to initialize QEFFAUtoLMModel

Signed-off-by: Onkar Chougule <[email protected]>

* fixed tests bugs

Signed-off-by: Onkar Chougule <[email protected]>

* renamed utils.py to _utils.py

Signed-off-by: Onkar Chougule <[email protected]>

* added more type hinting and docstrings

Signed-off-by: Onkar Chougule <[email protected]>

* addressed review comments, added test file for new interface

Signed-off-by: Onkar Chougule <[email protected]>

* enabled CLI APIs

Signed-off-by: Onkar Chougule <[email protected]>

* *Updated README, notebooks *Removed circular import *Added comments on loader files * separated cross-compile script *separated utils funcs

Signed-off-by: Onkar Chougule <[email protected]>

* bug-fix infer

Signed-off-by: Onkar Chougule <[email protected]>

* using QEfficient.export, compile in cloud APIs

Signed-off-by: Onkar Chougule <[email protected]>

* cleaner infer, export APIs

Signed-off-by: Onkar Chougule <[email protected]>

* addressed review comments

Signed-off-by: Onkar Chougule <[email protected]>

* *updated notebooks, readme *moved class desc to base.py *Added Runtime Enum

Signed-off-by: Onkar Chougule <[email protected]>

* updated cloud_ai_100_exec_kv to be callable from QEfficient package

Signed-off-by: Onkar Chougule <[email protected]>

* fixed tests

Signed-off-by: Onkar Chougule <[email protected]>

* cleaned notebook

Signed-off-by: Onkar Chougule <[email protected]>

* *Added transform call within init *renamed cross_compile *updated notebooks *updated README

Signed-off-by: Onkar Chougule <[email protected]>

* addressed review comments

Signed-off-by: Onkar Chougule <[email protected]>

---------

Signed-off-by: Onkar Chougule <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 7, 2024
ochougul added a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 12, 2024
ochougul added a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 12, 2024
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 25, 2024
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 25, 2024
Signed-off-by: quic-amitraj <[email protected]>
abukhoy pushed a commit to abukhoy/efficient-transformers that referenced this pull request Jun 28, 2024

Signed-off-by: root <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 16, 2024
Signed-off-by: quic-amitraj <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 16, 2024
Signed-off-by: quic-amitraj <[email protected]>
Signed-off-by: amitraj <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 19, 2024
Signed-off-by: quic-amitraj <[email protected]>
Signed-off-by: amitraj <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 24, 2024
Signed-off-by: quic-amitraj <[email protected]>
Signed-off-by: amitraj <[email protected]>
shubhagr-quic pushed a commit to shubhagr-quic/efficient-transformers that referenced this pull request Oct 4, 2024