Improve BatchFeature: stack list and lists of torch tensors #42750

yonigozlan · 2025-12-09T21:41:14Z

I have been wanting to change that for a while, it shouldn't be a breaking change, but align what we support in BatchFeature between numpy arrays and torch tensors.
The issue was that np.array() works on lists of array and even nested list of arrays), but torch.tensor() doesn't, so if we tried to call batch feature, with lists of tensors, we would get an error. I haven't added support for nested lists of tensors as I haven't seen the need anywhere.
Also the errors we were getting were very generic, and not very useful, this should help with that.

…ests

HuggingFaceDocBuilderDev · 2025-12-09T21:53:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp

Great PR! Left a few questions to make sure I got it right

zucchini-nlp · 2025-12-10T09:25:24Z

src/transformers/models/eomt/image_processing_eomt_fast.py

+        return BatchFeature(
+            data={"pixel_values": processed_images, "patch_offsets": patch_offsets},
+            tensor_type=return_tensors,
+            skip_tensor_conversion=["patch_offsets"],


For my understanding, do we skip the conversion because offsets aren't padded?

Yep! We used to create the batch then add the non-converted path_offsets, but I fell it's cleaner to be explicit this way

zucchini-nlp · 2025-12-10T09:31:27Z

src/transformers/models/fuyu/image_processing_fuyu.py

    BatchFeature class for Fuyu image processor and processor.

    The outputs dictionary from the processors contains a mix of tensors and lists of tensors.
    """


looks like Fuyu has its own BatchFeature because we couldn't skip conversion or convert nested lists? 🤔 Can you inspect and maybe remove it if possible, now that the base class supports skipping?

Yes I can do that in another PR! I didn't look into too much details what the Fuyu batch feature does, but it would be great to get rid of it

zucchini-nlp · 2025-12-10T09:35:13Z

src/transformers/feature_extraction_utils.py

+
+                # stack list of tensors if tensor_type is PyTorch (# torch.tensor() does not support list of tensors)
+                if isinstance(value, (list, tuple)) and len(value) > 0 and torch.is_tensor(value[0]):
+                    return torch.stack(value)
+
+                # convert list of numpy arrays to numpy array (stack) if tensor_type is Numpy


i dunno if you saw the PR. Community member noticed that VideoMetadata objects throw and error when return type is 'pt', because they can't be converted to tensors

I think we can add the fix here by checking if value is a list/array/etc and early existing otherwise. We won't be able to convert non-list objects anyway

Agreed, I was just wondering why we restricted BatchFeature to only be able to contain arrays/tensors structures in the first place, just to make sure we wouldn't break an important assumption by silently allowing other objects in BatchFeature.
Also these changes should be made along with changes to the ".to()" method no?

zucchini-nlp · 2025-12-10T09:42:32Z

tests/models/owlv2/test_image_processing_owlv2.py


            mean_value = round(pixel_values.mean().item(), 4)
-            self.assertEqual(mean_value, 0.2353)
+            self.assertEqual(mean_value, -0.2303)


to make sure: was it a typo or did this PR change pixel outputs?

I'll have to check on a CI runner if that value is correct, but this tests failed on main on my end (using an A10)

zucchini-nlp · 2025-12-10T09:43:08Z

tests/utils/test_feature_extraction_utils.py

+class BatchFeatureTester(unittest.TestCase):
+    """Tests for the BatchFeature class and tensor conversion."""
+


thanks for adding a test 🤩

github-actions · 2025-12-10T18:33:03Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: beit, bridgetower, cohere2_vision, convnext, deepseek_vl, deepseek_vl_hybrid, depth_pro, dinov3_vit, donut, dpt, efficientloftr, efficientnet, eomt, flava, fuyu, gemma3

github-actions · 2025-12-10T18:40:17Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=42750&sha=e2ceac

yonigozlan added 2 commits December 9, 2025 20:30

stack lists of tensors in BatchFeature, improve error messages, add t…

e5d1092

…ests

remove unnecessary stack in fast image processors and video processors

3a4cecd

yonigozlan force-pushed the improve-batch-feature branch from 5be88b5 to 3a4cecd Compare December 9, 2025 21:41

yonigozlan requested review from ArthurZucker, Cyrilvallez and zucchini-nlp December 9, 2025 21:42

make style

7f1ef5d

yonigozlan added 2 commits December 9, 2025 22:10

Merge remote-tracking branch 'upstream/main' into improve-batch-feature

2b69cc0

fix tests

875c36e

zucchini-nlp reviewed Dec 10, 2025

View reviewed changes

yonigozlan force-pushed the improve-batch-feature branch from e2ceac2 to 875c36e Compare December 10, 2025 18:33

yonigozlan changed the title ~~Improve BatchFeature: stack list and nested lists of torch tensors~~ Improve BatchFeature: stack list and lists of torch tensors Dec 10, 2025

		class BatchFeatureTester(unittest.TestCase):
		"""Tests for the BatchFeature class and tensor conversion."""

Improve BatchFeature: stack list and lists of torch tensors #42750

Are you sure you want to change the base?

Improve BatchFeature: stack list and lists of torch tensors #42750

Conversation

yonigozlan commented Dec 9, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 9, 2025

Uh oh!

zucchini-nlp left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants