[Misc] Update to Transformers 4.48 #12120

Open · wants to merge 1 commit into main

Conversation

@tlrmchlsmth (Collaborator) commented Jan 16, 2025

Update Transformers to 4.48. Split off from #10909 to isolate any 4.48-related changes here and for easier debugging.

From @fabianlim, we have the following open issues:

  • Basic Correctness Test: we are observing memory access errors with Gemma, probably related to this. (Skip and wait for a patch fix?)
  • Language Model Test: this happens for a custom model whose remote code imports something from transformers/pytorch_utils.py that no longer exists (hard to fix since it is not code we maintain).
  • Multi GPU Test: I can't reproduce this one locally yet, since I'm running into some Ray trouble. My guess is that it is 4.48-related, since it passed before I upgraded Transformers, or that it is an intermittent error (is it possible to retrigger the test?).
  • Quantization Test: I reproduced this one locally; it actually comes from this PR. It happens on the checkpoint neuralmagic/Llama-3.2-1B-quantized.w8a8, where quantization_config.quantization_status == "frozen". In 4.47 it reaches this line and runs as compressed, but in 4.48 the new logic determines that, because of the frozen tag, the model is not compressed and skips to the other line (see the sketch after this list).
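
For illustration, a minimal sketch of the behavior difference described in the last bullet (hypothetical function names and a simplified status check, not the actual transformers source):

def runs_as_compressed_in_4_47(quantization_config: dict) -> bool:
    # Simplified 4.47-era behavior: a compressed-tensors checkpoint takes the
    # compressed/decompression path regardless of its quantization_status.
    return True

def runs_as_compressed_in_4_48(quantization_config: dict) -> bool:
    # Simplified 4.48-era behavior: only a checkpoint explicitly tagged
    # "compressed" takes the compressed path; "frozen" does not.
    return quantization_config.get("quantization_status") == "compressed"

cfg = {"quantization_status": "frozen"}  # like neuralmagic/Llama-3.2-1B-quantized.w8a8
print(runs_as_compressed_in_4_47(cfg))  # True  -> ran as compressed in 4.47
print(runs_as_compressed_in_4_48(cfg))  # False -> skipped in 4.48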

Signed-off-by: Tyler Michael Smith <[email protected]>

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

  • Add the ready label to the PR
  • Enable auto-merge.

🚀

@mergify mergify bot added the ci/build label Jan 16, 2025
@tlrmchlsmth (Collaborator Author)

FYI @mgoin, @dsikka, @robertgshaw2-redhat on the neuralmagic/Llama-3.2-1B-quantized.w8a8 issue

@@ -2,7 +2,7 @@
 # This file is autogenerated by pip-compile with Python 3.12
 # by the following command:
 #
-# python3.12 -m piptools compile requirements-test.in -o requirements-test.txt
+# pip-compile --output-file=requirements-test.txt requirements-test.in
Member

Please keep the original command as we want to keep this in Python 3.12 to maintain consistency.

@fabianlim commented Jan 16, 2025

@DarkLight1337 yeah, this is what I got when I used Python 3.12 and the following versions to compile with the same command:

python3.12 -m piptools compile requirements-test.in -o requirements-test.txt -P transformers

but what comes out is the command that you see above. Can you suggest the preferred way to compile? Or should I just manually update the command?

versions:

build==1.2.2.post1
click==8.1.8
packaging==24.2
pip-tools==7.4.1
pyproject_hooks==1.2.0
setuptools==75.8.0
wheel==0.45.1

@tlrmchlsmth (Collaborator Author)

@DarkLight1337 I thought the same thing when I saw it in #10909, but when you run

python3.12 -m piptools compile requirements-test.in -o requirements-test.txt

it generates

# This file is autogenerated by pip-compile with Python 3.12
# by the following command:
#
#    pip-compile --output-file=requirements-test.txt requirements-test.in

Do you still think I should update the comment? I can definitely see a case for being explicit about Python 3.12 in the command. If so, I'll update the comment to indicate that the command should be left as-is.


Yeah, it doesn't print the command that was actually run; it massages it into this format, which is actually consistent with the pip-tools docs.

Member

cc @khluu since you made this change.

@dsikka (Contributor) commented Jan 16, 2025

The Transformers 4.48 release has the required change in how AutoModel is expected to handle compressed vs. frozen models. The model in question has the frozen state, which really should be compressed; the checkpoint is likely just outdated.

I'll look into updating the model
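
A hypothetical illustration of the kind of checkpoint update implied here, assuming the status lives under quantization_config in the model's config.json (the file path and key names are assumptions, not the exact change that was made):

import json

# Flip an outdated "frozen" status to "compressed" in a local copy of the
# checkpoint's config.json (path and keys assumed for illustration only).
with open("config.json") as f:
    cfg = json.load(f)

cfg["quantization_config"]["quantization_status"] = "compressed"

with open("config.json", "w") as f:
    json.dump(cfg, f, indent=2)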

@fabianlim

The custom model causing the Language Model Test to fail is this one in particular. What can we do about this, given that the code is only modifiable by its maintainers?

[2025-01-15T23:58:08Z]     from transformers.modeling_outputs import BaseModelOutputWithPast, CausalLMOutputWithPast, SequenceClassifierOutputWithPast
[2025-01-15T23:58:08Z]     from transformers.modeling_utils import PreTrainedModel
[2025-01-15T23:58:08Z] >   from transformers.pytorch_utils import ALL_LAYERNORM_LAYERS, is_torch_greater_or_equal_than_1_13
[2025-01-15T23:58:08Z] E   ImportError: cannot import name 'is_torch_greater_or_equal_than_1_13' from 'transformers.pytorch_utils' (/usr/local/lib/python3.12/dist-packages/transformers/pytorch_utils.py)
[2025-01-15T23:58:08Z]
[2025-01-15T23:58:08Z] /root/.cache/huggingface/modules/transformers_modules/openbmb/MiniCPM3-4B/e5715484011d723a1892db91da8b59d979d14aee/modeling_minicpm.py:41: ImportError
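
One possible backward-compatible workaround in the remote modeling code (a sketch only, assuming the packaging library is available; the actual upstream fix may differ) is to guard the import and fall back to a local definition:

# is_torch_greater_or_equal_than_1_13 was removed from
# transformers.pytorch_utils in 4.48; compute it locally if the import fails.
try:
    from transformers.pytorch_utils import is_torch_greater_or_equal_than_1_13
except ImportError:
    import torch
    from packaging import version

    is_torch_greater_or_equal_than_1_13 = version.parse(
        torch.__version__
    ) >= version.parse("1.13")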

@dsikka (Contributor) commented Jan 16, 2025

> The custom model causing the Language Model Test to fail is this one in particular. What can we do about this, given that the code is only modifiable by its maintainers?

The same issue has appeared in a few other models as a result of the release, e.g. DeepSeek.

@tlrmchlsmth (Collaborator Author) commented Jan 16, 2025

> The custom model causing the Language Model Test to fail is this one in particular. What can we do about this, given that the code is only modifiable by its maintainers?
>
> The same issue has appeared in a few other models as a result of the release, e.g. DeepSeek.

I have a PR up to fix it just for MiniCPM here: https://huggingface.co/openbmb/MiniCPM3-4B/discussions/39

But alternatively, if there are multiple models with the same issue, I think it would be better to fix this via huggingface/transformers#35734 -- @dsikka do you have links to other models with the same issue?

@dsikka (Contributor) commented Jan 16, 2025

FYI the model has been updated and the tests now pass (at least locally). @tlrmchlsmth @fabianlim

@ani300 commented Jan 17, 2025

The basic correctness tests pass for me when using huggingface/transformers#35681. For now we can wait until it gets merged, or skip the tests until the next point release of Transformers.

@tlrmchlsmth added the ready label on Jan 18, 2025
@tlrmchlsmth (Collaborator Author)

Added the ready label to see if the Multi GPU test failure is intermittent.

@tlrmchlsmth (Collaborator Author)

Looks like the multi-GPU tests are passing, so we should be good to go once huggingface/transformers#35681 lands. I'm inclined to wait for a Transformers point release, but let me know if you disagree.

@fabianlim

@tlrmchlsmth thanks! I'm absolutely on board with waiting for the point release.

@@ -533,6 +533,7 @@ tqdm==4.66.6
# lm-eval
# nltk
# peft
# pqdm
# sentence-transformers
# tqdm-multiprocess
# transformers
Member

We should update the transformers version in this file as well. Some of the resulting errors have been fixed in #12187.


@tlrmchlsmth ah yes, when you generate the file you need to pass the -P argument so that the transformers version gets updated:

python3.12 -m piptools compile requirements-test.in -o requirements-test.txt -P transformers

Labels: ci/build, ready
5 participants