Release AWS Data Wrangler 2.15.0 · aws/aws-sdk-pandas

Noteworthy

⚠️ Dropped Python 3.6 support

⚠️ For platforms without PyArrow 7 support (e.g. MWAA, EMR, Glue PySpark Job):

➡️ pip install pyarrow==2 awswrangler

New Functionalities

Amazon Neptune module 🚀 #1084 Check out the tutorial. Thanks to @bechbd & @sakti-mishra !
ARM64 Support for Python 3.8 and 3.9 layers 🔥 #1129 Many thanks @cnfait !

Enhancements

Timestream module - support multi-measure records #1214
Warnings for implicit float conversion of nulls in to_parquet #1221
Support additional sql params in Redshift COPY operation #1210
Add create_ctas_table to Athena module #1207
S3 Proxy support #1206
Add Athena get_named_query_statement #1183
Add manifest parameter to 'redshift.copy_from_files' method #1164

Documentation

Update install section #1242
Update lambda layers section #1236

Bug Fix

Give precedence to user path for Athena UNLOAD S3 Output Location #1216
Honor User specified workgroup in athena.read_sql_query with unload_approach=True #1178
Support map type in Redshift copy #1185
data_api.rds.read_sql_query() does not preserve data type when column is all NULLS - switches to Boolean #1158
Allow decimal values within struct when writing to parquet #1179

Thanks

We thank the following contributors/users for their work on this release:

@bechbd, @sakti-mishra, @mateogianolio, @jasadams, @malachi-constant, @cnfait, @jaidisido, @kukushking

P.S. The AWS Lambda Layer file (.zip) and the AWS Glue file (.whl) are available below. Just upload it and run or use them from our S3 public bucket!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AWS Data Wrangler 2.15.0