Skip to content

AWS Data Wrangler 2.15.0

Compare
Choose a tag to compare
@kukushking kukushking released this 28 Mar 14:36
· 1060 commits to main since this release

Noteworthy

⚠️ Dropped Python 3.6 support

⚠️ For platforms without PyArrow 7 support (e.g. MWAA, EMR, Glue PySpark Job):

➡️ pip install pyarrow==2 awswrangler

New Functionalities

Enhancements

  • Timestream module - support multi-measure records #1214
  • Warnings for implicit float conversion of nulls in to_parquet #1221
  • Support additional sql params in Redshift COPY operation #1210
  • Add create_ctas_table to Athena module #1207
  • S3 Proxy support #1206
  • Add Athena get_named_query_statement #1183
  • Add manifest parameter to 'redshift.copy_from_files' method #1164

Documentation

  • Update install section #1242
  • Update lambda layers section #1236

Bug Fix

  • Give precedence to user path for Athena UNLOAD S3 Output Location #1216
  • Honor User specified workgroup in athena.read_sql_query with unload_approach=True #1178
  • Support map type in Redshift copy #1185
  • data_api.rds.read_sql_query() does not preserve data type when column is all NULLS - switches to Boolean #1158
  • Allow decimal values within struct when writing to parquet #1179

Thanks

We thank the following contributors/users for their work on this release:

@bechbd, @sakti-mishra, @mateogianolio, @jasadams, @malachi-constant, @cnfait, @jaidisido, @kukushking


P.S. The AWS Lambda Layer file (.zip) and the AWS Glue file (.whl) are available below. Just upload it and run or use them from our S3 public bucket!