Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change the chunk_iter function to handle #16730

Merged
merged 2 commits into from
Apr 12, 2022
Merged

Conversation

Narsil
Copy link
Contributor

@Narsil Narsil commented Apr 12, 2022

the subtle cases where the last chunk gets ignored since all the
data is in the left_strided data.

We need to remove the right striding on the previous item.

What does this PR do?

Change the chunk_iter function to handle

the subtle cases where the last chunk gets ignored since all the
data is in the left_strided data.

We need to remove the right striding on the previous item.

Fixes #16671

@LysandreJik @patrickvonplaten

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

the subtle cases where the last chunk gets ignored since all the
data is in the `left_strided` data.

We need to remove the right striding on the previous item.
@patrickvonplaten
Copy link
Contributor

Very nice - thanks for fixing it!

@patrickvonplaten
Copy link
Contributor

@sgugger - I think the build doc failing test is unrelated here no?

@sgugger
Copy link
Collaborator

sgugger commented Apr 12, 2022

Yes, will look into that.

@patrickvonplaten patrickvonplaten merged commit a192f61 into main Apr 12, 2022
@patrickvonplaten patrickvonplaten deleted the fix_chunking_bug branch April 12, 2022 16:25
elusenji pushed a commit to elusenji/transformers that referenced this pull request Jun 12, 2022
* Change the chunk_iter function to handle

the subtle cases where the last chunk gets ignored since all the
data is in the `left_strided` data.

We need to remove the right striding on the previous item.

* Remove commented line.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

ASR Pipeline: End of transcripts missing when chunking enabled
3 participants