Fixed left_pad_sequence - correctly flip dims based on batch_first #1523

Merged
2 commits merged into pytorch:main from fix-leftpad-sequence on Sep 8, 2024

Conversation

mirceamironenco (Contributor)

Context

What is the purpose of this PR? Is it to

  • add a new feature
  • fix a bug
  • update tests and/or documentation
  • other (please add here)

Changelog

  • Fix left_pad_sequence (in data/_collate.py) to have feature parity with pad_sequence from torch.nn.utils.rnn. The previous version did not account for the case where batch_first is False when flipping the padded sequence back (see the sketch after the note below).
  • Added corresponding test.

Note that (I believe) future versions of PyTorch (> 2.4.1) will expose a padding_side argument, which could replace this utility altogether.
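For context, here is a minimal sketch of the flip-based approach the fix targets (illustrative only, not the exact torchtune implementation; the helper name and defaults are assumptions): reverse each sequence, right-pad with pad_sequence, then flip the result back along the time dimension, which is dim 1 when batch_first=True and dim 0 when batch_first=False.

```python
import torch
from torch.nn.utils.rnn import pad_sequence

def left_pad(sequences, batch_first=False, padding_value=0):
    # Illustrative helper (not the torchtune code): reverse each sequence,
    # right-pad, then flip the padded result back along the time dimension.
    padded = pad_sequence(
        [seq.flip(dims=[0]) for seq in sequences],
        batch_first=batch_first,
        padding_value=padding_value,
    )
    # Time is dim 1 when batch_first=True and dim 0 otherwise; flipping the
    # wrong dimension for batch_first=False is the bug this PR fixes.
    return padded.flip(dims=[1]) if batch_first else padded.flip(dims=[0])

seqs = [torch.tensor([1, 2, 3]), torch.tensor([4, 5, 6, 7]), torch.tensor([8, 9, 10, 11, 12])]
print(left_pad(seqs, batch_first=True))
# tensor([[ 0,  0,  1,  2,  3],
#         [ 0,  4,  5,  6,  7],
#         [ 8,  9, 10, 11, 12]])
```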

Test plan

Please make sure to do each of the following if applicable to your PR. (If you're not sure about any one of these, just ask and we will happily help. We also have a contributing page for some guidance on contributing.)

  • run pre-commit hooks and linters (make sure you've first installed via pre-commit install)
  • add unit tests for any new functionality
  • update docstrings for any new or updated methods or classes
  • run unit tests via pytest tests
  • run recipe tests via pytest tests -m integration_test
  • manually run any new or modified recipes with sufficient proof of correctness
  • include relevant commands and any other artifacts in this summary (pastes of loss curves, eval results, etc.)

UX

If your function changed a public API, please add a dummy example of what the user experience will look like when calling it.
Example of docstring:


Example in our docs: https://pytorch.org/torchtune/main/tutorials/qat_finetune.html#applying-qat-to-llama3-models

  • I did not change any public API;
  • I have added an example to docs or docstrings;

pytorch-bot (bot) commented on Sep 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1523

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f30e1f9 with merge base 5d5caca:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label on Sep 8, 2024. (This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.)
RdoubleA (Contributor) left a comment


Thanks for the fix!

@@ -57,6 +57,9 @@ def test_left_pad_sequence(self):
expected = torch.tensor([[0, 0, 1, 2, 3], [0, 4, 5, 6, 7], [8, 9, 10, 11, 12]])
assert torch.equal(result, expected)

Collaborator

Could I be annoying and ask if you could explicitly write out the expected tensor, please? It helps a lot to understand what's going on at a glance.

mirceamironenco (Contributor, Author)

Sure! I've added the expected tensor for the batch_first=False case.
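For reference, assuming the new test reuses the same input sequences as the existing batch_first=True case, the batch_first=False output has shape (max_seq_len, batch_size), so the expected tensor is the transpose of the one above. A sketch of what such an assertion might look like (the import path and exact values are assumptions, not necessarily the code added in commit f30e1f9):

```python
import torch
from torchtune.data import left_pad_sequence  # import path assumed for illustration

result = left_pad_sequence(
    [torch.tensor([1, 2, 3]), torch.tensor([4, 5, 6, 7]), torch.tensor([8, 9, 10, 11, 12])],
    batch_first=False,
)
# With batch_first=False the output is (max_seq_len, batch_size): each column
# is one left-padded sequence, i.e. the transpose of the batch_first=True case.
expected = torch.tensor(
    [[0, 0, 8], [0, 4, 9], [1, 5, 10], [2, 6, 11], [3, 7, 12]]
)
assert torch.equal(result, expected)
```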

SalmanMohammadi merged commit 68d4f3e into pytorch:main on Sep 8, 2024
17 checks passed
mirceamironenco deleted the fix-leftpad-sequence branch on September 8, 2024 at 19:07