-
Notifications
You must be signed in to change notification settings - Fork 151
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parquet is missing rows #81
Comments
Thanks for the report! Can you share a parquet file that has this issue? |
Unfortunately no, it's business-related. But nothing special, 30 or so columns with mostly UTF8 and two INT32 types. |
One of column does contain very large values, but other than that, normal stuff. |
@mariussoutier multiprocessio/datastation#278 should fix it! |
Closed in #82 now available in dsq 0.21.0 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I have a Parquet file that should have 30,000+ rows, but
SELECT COUNT(*) FROM {}
returns7000
. Another one with more than 40,000 rows returns exactly8000
. Converting the same data to JSON works fine.The text was updated successfully, but these errors were encountered: