Update from upstream #6

Merged
merged 189 commits into from
Nov 1, 2023

Commits on May 13, 2023

  1. Configuration menu
    Copy the full SHA
    8649548 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ac9bec1 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    ddc1098 View commit details
    Browse the repository at this point in the history
  4. chore: clippy

    roeap committed May 13, 2023
    Configuration menu
    Copy the full SHA
    e7178f4 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    c57d8db View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    ab59f9a View commit details
    Browse the repository at this point in the history
  7. fix: simpler defaults

    roeap committed May 13, 2023
    Configuration menu
    Copy the full SHA
    05642da View commit details
    Browse the repository at this point in the history
  8. Update rust/src/data_catalog/unity/mod.rs

    Co-authored-by: nohajc <[email protected]>
    roeap and nohajc committed May 13, 2023
    Configuration menu
    Copy the full SHA
    a1ea5aa View commit details
    Browse the repository at this point in the history
  9. fix: imports

    roeap committed May 13, 2023
    Configuration menu
    Copy the full SHA
    052f60b View commit details
    Browse the repository at this point in the history
  10. fix: add some defaults

    roeap committed May 13, 2023
    Configuration menu
    Copy the full SHA
    18317e1 View commit details
    Browse the repository at this point in the history

Commits on May 14, 2023

  1. test: add failing provider test

    roeap committed May 14, 2023
    Configuration menu
    Copy the full SHA
    5d5fd91 View commit details
    Browse the repository at this point in the history

Commits on May 15, 2023

  1. feat: list catalogs

    roeap committed May 15, 2023
    Configuration menu
    Copy the full SHA
    697b66f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    75c745a View commit details
    Browse the repository at this point in the history

Commits on May 21, 2023

  1. Configuration menu
    Copy the full SHA
    5930da1 View commit details
    Browse the repository at this point in the history

Commits on May 22, 2023

  1. Configuration menu
    Copy the full SHA
    a04b45f View commit details
    Browse the repository at this point in the history

Commits on May 27, 2023

  1. Configuration menu
    Copy the full SHA
    378e41c View commit details
    Browse the repository at this point in the history

Commits on Jun 2, 2023

  1. merge main

    roeap committed Jun 2, 2023
    Configuration menu
    Copy the full SHA
    66e83cb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1020898 View commit details
    Browse the repository at this point in the history
  3. fix: remove artifact

    roeap committed Jun 2, 2023
    Configuration menu
    Copy the full SHA
    80b25ca View commit details
    Browse the repository at this point in the history

Commits on Jun 3, 2023

  1. Configuration menu
    Copy the full SHA
    83dcb6d View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2023

  1. Start python api docs

    wjones127 committed Jul 26, 2023
    Configuration menu
    Copy the full SHA
    2fccbdb View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2023

  1. docs: update Readme (delta-io#1440)

    # Description
    
    With summit coming up I thought we might update our README, since
    delta-rs has evolved quite a bit since the README was first written...
    
    Just opening the Draft to get feedback on the general "patterns" i.e.
    how the tables are formatted, how detailed we want to show the features
    and mostly the looks of the header.
    
    Also hoping our community experts may have some content they want to add
    here 😆.
    
    cc @dennyglee @MrPowers @wjones127 @rtyler @houqp @fvaleye
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    Co-authored-by: R. Tyler Croy <[email protected]>
    3 people authored Sep 15, 2023
    Configuration menu
    Copy the full SHA
    4638fcf View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9a20dc7 View commit details
    Browse the repository at this point in the history
  3. Pin chrono to 0.4.30

    v0.4.31 was just released which introduces some spurious deprecation warnings
    rtyler committed Sep 15, 2023
    Configuration menu
    Copy the full SHA
    ef17f9e View commit details
    Browse the repository at this point in the history

Commits on Sep 17, 2023

  1. docs: update Readme (delta-io#1633)

    # Description
    - Changed the icons as, at first glance, it looked like AWS was not
    supported (in blue), while the green open icon looked like it was
    completed
    - Added one line linking to the Delta Lake docker
    - Fixed some minor grammar issues
    
    Including community experts @roeap @MrPowers @wjones127 @rtyler @houqp
    @fvaleye to ensure these updates make sense. Thanks!
    dennyglee authored Sep 17, 2023
    Configuration menu
    Copy the full SHA
    f6cbdc9 View commit details
    Browse the repository at this point in the history
  2. chore: update datafusion to 31, arrow to 46 and object_store to 0.7 (d…

    …elta-io#1634)
    
    # Description
    
    Update datafusion to 31
    houqp authored Sep 17, 2023
    Configuration menu
    Copy the full SHA
    9169e23 View commit details
    Browse the repository at this point in the history
  3. chore: relax chrono pin to 0.4 (delta-io#1635)

    # Description
    
    relax chrono pin to improve downstream compatibility.
    houqp authored Sep 17, 2023
    Configuration menu
    Copy the full SHA
    63c14b3 View commit details
    Browse the repository at this point in the history
  4. make create_checkpoint_for public

    haruband authored and rtyler committed Sep 17, 2023
    Configuration menu
    Copy the full SHA
    f8efbd9 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    dbd02d2 View commit details
    Browse the repository at this point in the history
  6. Implement parsing for the new domainMetadata actions in the commit log

    The Delta Lake protocol which will be released in conjunction with "3.0.0"
    (currently at RC1) introduces `domainMetadata` actions to the commit log to
    enable system or user-provided metadata about the commits to be added to the
    log. With DBR 13.3 in the Databricks ecosystem, tables are already being written
    with this action via the "liquid clustering" feature.
    
    This change enables the clean reading of these tables, but at present nothing
    novel is done with this information.
    
    [Read more here](https://www.databricks.com/blog/announcing-delta-lake-30-new-universal-format-and-liquid-clustering)
    
    Fixes delta-io#1626
    
    Sponsored-by: Databricks Inc
    rtyler committed Sep 17, 2023
    Configuration menu
    Copy the full SHA
    47fc849 View commit details
    Browse the repository at this point in the history
  7. fix: include in-progress row group when calculating in-memory buffer …

    …length (delta-io#1638)
    
    # Description
    `PartitionWriter.buffer_len()` is documented as returning: 
    
    > the current byte length of the in memory buffer.
    
    However, this doesn't currently include the length of the in-progress
    row group. This means that until a row group is flushed, `buffer_len()`
    returns `0`. Based on the documented description, its length should
    probably include the bytes currently in-memory as part of an unflushed
    row group.
    
    `buffered_record_batch_count` _does_ include in-progress row groups, so
    this change also means record count and buffered bytes are reported
    consistently.
    
    # Related Issue(s)
    - closes delta-io#1637
    
    # Documentation
    
    [`buffer_len` on
    `RecordBatchWriter`](https://docs.rs/deltalake/0.15.0/deltalake/writer/record_batch/struct.RecordBatchWriter.html#method.buffer_len)
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    BnMcG and wjones127 authored Sep 17, 2023
    Configuration menu
    Copy the full SHA
    9579b04 View commit details
    Browse the repository at this point in the history
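
The `buffer_len()` fix described in delta-io#1638 above can be illustrated with a small Python model. This is only a sketch: `ToyPartitionWriter`, its fields, and the row-group size are hypothetical names and values invented for illustration, not the delta-rs `PartitionWriter`. It models nothing more than the accounting the commit message describes, namely flushed bytes plus the bytes of the in-progress row group.

```python
from dataclasses import dataclass, field


@dataclass
class ToyPartitionWriter:
    """Hypothetical stand-in for a buffering writer; not the delta-rs type."""

    row_group_size: int = 3  # batches per row group (assumed for the demo)
    flushed_bytes: int = 0   # bytes already flushed into the in-memory buffer
    in_progress: list = field(default_factory=list)  # unflushed batches
    batch_count: int = 0     # counts every batch, flushed or not

    def write(self, batch: bytes) -> None:
        self.in_progress.append(batch)
        self.batch_count += 1
        if len(self.in_progress) >= self.row_group_size:
            # Flush the row group into the buffer.
            self.flushed_bytes += sum(len(b) for b in self.in_progress)
            self.in_progress.clear()

    def buffer_len_old(self) -> int:
        # Behaviour reported as the bug: only flushed row groups are counted.
        return self.flushed_bytes

    def buffer_len(self) -> int:
        # Behaviour after the fix: the in-progress row group is counted too.
        return self.flushed_bytes + sum(len(b) for b in self.in_progress)


writer = ToyPartitionWriter()
writer.write(b"abcd")
writer.write(b"efgh")
print(writer.buffer_len_old(), writer.buffer_len(), writer.batch_count)
```

With two 4-byte batches written and no row group flushed yet, the old calculation reports 0 bytes while the batch count is already 2; the corrected one reports 8 bytes, so both figures describe the same buffered data.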

Commits on Sep 19, 2023

  1. feat: allow multiple incremental commits in optimize

    Currently "optimize" executes the whole plan in one commit, which might
    fail. The larger the table, the more likely it is to fail and the more
    expensive the failure is.
    
    Add an option in OptimizeBuilder that allows specifying a commit
    interval. If that is provided, the plan executor will periodically
    commit the accumulated actions.
    kvap authored and rtyler committed Sep 19, 2023
    Configuration menu
    Copy the full SHA
    fae39b1 View commit details
    Browse the repository at this point in the history
  2. fix: explicitly require chrono 0.4.31 or greater

    The Python binding relies on `timestamp_nanos_opt()`, which requires chrono
    0.4.31 or greater since it did not previously exist.
    
    As a [cargo dependency
    refresher](https://doc.rust-lang.org/cargo/reference/specifying-dependencies.html#specifying-dependencies-from-cratesio)
    this version range is >=0.4.31, <0.5.0, which I believe is what we need for
    optimal downstream compatibility.
    rtyler committed Sep 19, 2023
    Configuration menu
    Copy the full SHA
    fdebcd6 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    719266e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b2ba0d7 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    b1fab03 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    9119ee6 View commit details
    Browse the repository at this point in the history
  7. fix: avoid excess listing of log files

    eeroel authored and rtyler committed Sep 19, 2023
    Configuration menu
    Copy the full SHA
    ba89ab1 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    fff05c7 View commit details
    Browse the repository at this point in the history
  9. feat: pass known file sizes to filesystem in Python (delta-io#1630)

    # Description
    Currently the Filesystem implementation always makes a HEAD request when
    opening a file, to determine the file size. The proposed change is to
    read the file sizes from the delta log instead, and to pass them down to
    the `open_input_file` call, eliminating the HEAD request.
    
    eeroel authored Sep 19, 2023
    Configuration menu
    Copy the full SHA
    d2fc649 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    51db313 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    645abb9 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    9b3a0e5 View commit details
    Browse the repository at this point in the history
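
The commit-interval idea from "feat: allow multiple incremental commits in optimize" (first commit of Sep 19) can be sketched in a few lines of Python. This is an illustration under assumptions, not the `OptimizeBuilder` implementation: `run_plan`, its parameters, and the string "actions" are hypothetical, and the clock is injected so the example is deterministic.

```python
from typing import Callable, Iterable, List, Optional


def run_plan(
    tasks: Iterable[Callable[[], List[str]]],
    commit: Callable[[List[str]], None],
    now: Callable[[], float],
    min_commit_interval: Optional[float] = None,
) -> None:
    """Run tasks and commit accumulated actions at most once per interval.

    Without an interval, everything is committed once at the end.
    """
    pending: List[str] = []
    last_commit = now()
    for task in tasks:
        pending.extend(task())
        due = (
            min_commit_interval is not None
            and now() - last_commit >= min_commit_interval
        )
        if due and pending:
            commit(pending)
            pending = []
            last_commit = now()
    if pending:
        commit(pending)  # commit whatever is left when the plan finishes


clock = {"t": 0.0}


def make_task(i: int) -> Callable[[], List[str]]:
    def task() -> List[str]:
        clock["t"] += 1.0  # pretend every task takes one second
        return [f"add file-{i}", f"remove old-{i}"]

    return task


commits: List[List[str]] = []
run_plan(
    [make_task(i) for i in range(5)],
    commits.append,
    lambda: clock["t"],
    min_commit_interval=2.0,
)
print([len(c) for c in commits])
```

With five one-second tasks and a two-second interval, the actions land in three smaller commits instead of one large commit, so a failure late in the plan loses only the work since the last commit.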

Commits on Sep 20, 2023

  1. Merge pull request delta-io#1338 from roeap/unity-catalog

    feat: integrate unity catalog with datafusion
    rtyler authored Sep 20, 2023
    Configuration menu
    Copy the full SHA
    6746dd4 View commit details
    Browse the repository at this point in the history
  2. Proposed updated CODEOWNERS to allow better review notifications

    Based on current pull request feedback and maintenance trends I'm suggesting
    these rules to get the right people on the reviews by default.
    
    Closes delta-io#1553
    rtyler committed Sep 20, 2023
    Configuration menu
    Copy the full SHA
    81d5a99 View commit details
    Browse the repository at this point in the history
  3. fix: add support for Microsoft OneLake

    This change introduces tests and support for Microsoft OneLake. This specific
    commit is a rebase of the work done by our pals at Microsoft.
    
    Co-authored-by: Mohammed Muddassir <[email protected]>
    Co-authored-by: Christopher Watford <[email protected]>
    2 people authored and rtyler committed Sep 20, 2023
    Configuration menu
    Copy the full SHA
    6eef77b View commit details
    Browse the repository at this point in the history
  4. Ignore failing integration tests which require a special environment …

    …to operate
    
    The OneLake support should be considered unsupported and experimental until such
    time when we can add integration testing to our CI process
    rtyler committed Sep 20, 2023
    Configuration menu
    Copy the full SHA
    d13056e View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    734baab View commit details
    Browse the repository at this point in the history
  6. Compensate for invalid log files created by Delta Live Tables

    It would appear that in some cases Delta Live Tables will create a Delta table
    which does not adhere to the Delta Table protocol.
    
    The metaData action has a **required** `schemaString` property which simply
    doesn't exist. Since it appears that this only exists at version zero of the
    transaction log, and the _actual_ schema exists in the following versions of the
    table (e.g. 1), this change introduces a default deserializer on the MetaData
    action which provides a simple empty schema.
    
    This is an alternative implementation to delta-io#1305 which is a bit more invasive and
    makes our schema_string struct member `Option<String>` which I do not believe is
    worth it for this unfortunate compatibility issue
    
    Closes delta-io#1305, delta-io#1302, delta-io#1357
    
    Sponsored-by: Databricks Inc
    rtyler committed Sep 20, 2023
    Configuration menu
    Copy the full SHA
    f6cb3ca View commit details
    Browse the repository at this point in the history
  7. chore: fix the incorrect Slack link in our readme

    Not sure what the deal is with the go.delta.io service; no idea where that lives.
    
    Fixes delta-io#1636
    rtyler committed Sep 20, 2023
    Configuration menu
    Copy the full SHA
    9ef52a8 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    5fe86a8 View commit details
    Browse the repository at this point in the history
  9. Merge pull request delta-io#1649 from rtyler/fix-readme-slack

    chore: fix the incorrect Slack link in our readme
    rtyler authored Sep 20, 2023
    Configuration menu
    Copy the full SHA
    a74589b View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    f6824fe View commit details
    Browse the repository at this point in the history
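
The default-deserializer approach from "Compensate for invalid log files created by Delta Live Tables" (sixth commit of Sep 20) can be sketched in Python. The change itself is in Rust; here `parse_metadata_action` is a hypothetical helper, the log lines are made-up data, and the JSON used for the "empty schema" is an assumption, since the commit message does not state the exact default.

```python
import json

# Assumed representation of an "empty schema"; the exact default used by
# delta-rs is not stated in the commit message.
EMPTY_SCHEMA = json.dumps({"type": "struct", "fields": []})


def parse_metadata_action(line: str) -> dict:
    """Parse one commit-log line and return its metaData action.

    A missing `schemaString` is replaced by an empty schema instead of
    raising, which mirrors the idea of a default deserializer.
    """
    action = json.loads(line)
    meta = dict(action["metaData"])
    meta.setdefault("schemaString", EMPTY_SCHEMA)
    return meta


# Made-up log lines: version 0 lacks the schema, version 1 carries it.
real_schema = {
    "type": "struct",
    "fields": [{"name": "a", "type": "long", "nullable": True, "metadata": {}}],
}
v0 = json.dumps({"metaData": {"id": "t1", "partitionColumns": []}})
v1 = json.dumps(
    {
        "metaData": {
            "id": "t1",
            "partitionColumns": [],
            "schemaString": json.dumps(real_schema),
        }
    }
)

field_counts = []
for line in (v0, v1):
    schema = json.loads(parse_metadata_action(line)["schemaString"])
    field_counts.append(len(schema["fields"]))
print(field_counts)
```

The reader stays tolerant of the malformed version 0 entry and picks up the actual schema from the later version, without turning the schema field into an optional value everywhere.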

Commits on Sep 21, 2023

  1. enable offset listing for s3

    eeroel committed Sep 21, 2023
    Configuration menu
    Copy the full SHA
    d151a97 View commit details
    Browse the repository at this point in the history
  2. Merge pull request delta-io#1654 from eeroel/fix/offset_listing_in_s3

    fix: enable offset listing for s3
    rtyler authored Sep 21, 2023
    Configuration menu
    Copy the full SHA
    7d2bb27 View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2023

  1. Configuration menu
    Copy the full SHA
    5131f76 View commit details
    Browse the repository at this point in the history
  2. Merge pull request delta-io#1647 from rtyler/dlt-compatibility

    fix: compensate for invalid log files created by Delta Live Tables
    rtyler authored Sep 22, 2023
    Configuration menu
    Copy the full SHA
    98b33e1 View commit details
    Browse the repository at this point in the history
  3. Make docs.rs build docs with all features enabled

    I was confused that I could not find the documentation on integrating datafusion with delta-rs.
    
    With this PR, everything should show up. Perhaps docs for a feature-gated method should also mention which feature is required, similar to what Tokio does. That could be done in follow-up PRs.
    simonvandel authored Sep 22, 2023
    Configuration menu
    Copy the full SHA
    312a7cb View commit details
    Browse the repository at this point in the history
  4. Merge pull request delta-io#1658 from simonvandel/patch-1

    docs: make docs.rs build docs with all features enabled
    rtyler authored Sep 22, 2023
    Configuration menu
    Copy the full SHA
    81a5451 View commit details
    Browse the repository at this point in the history
  5. feat: expose min_commit_interval to optimize.compact and `optimize.…

    …z_order` (delta-io#1645)
    
    # Description
    Exposes min_commit_interval in the Python API to `optimize.compact` and
    `optimize.z_order`. Added one test-case to verify the
    min_commit_interval.
    
    # Related Issue(s)
    closes delta-io#1640
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    ion-elgreco and wjones127 authored Sep 22, 2023
    Configuration menu
    Copy the full SHA
    72f8531 View commit details
    Browse the repository at this point in the history

Commits on Sep 23, 2023

  1. Configuration menu
    Copy the full SHA
    9eccd4c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6ef0d7c View commit details
    Browse the repository at this point in the history

Commits on Sep 24, 2023

  1. Configuration menu
    Copy the full SHA
    9fef678 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    18d7a33 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1aa8429 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    2a6492c View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    dda2d9d View commit details
    Browse the repository at this point in the history
  6. chore: format

    roeap committed Sep 24, 2023
    Configuration menu
    Copy the full SHA
    839d1d7 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    4ed5ac0 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    6c3c726 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    815b075 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    c02c4c8 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    f9842cd View commit details
    Browse the repository at this point in the history
  12. docs: fix some typos (delta-io#1662)

    # Description
    Fixed two typos and marked merge in Rust as half-done, with a comment on
    its current limitation.
    ion-elgreco authored Sep 24, 2023
    Configuration menu
    Copy the full SHA
    62e853a View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2023

  1. Configuration menu
    Copy the full SHA
    77de967 View commit details
    Browse the repository at this point in the history
  2. feat: use url parsing from object store

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    02b3cea View commit details
    Browse the repository at this point in the history
  3. fix: ensure config for ms fabric

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    c7f2e1a View commit details
    Browse the repository at this point in the history
  4. chore: drive-by simplify test files

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    9db777b View commit details
    Browse the repository at this point in the history
  5. fix: update aws http config key

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    84cb6b7 View commit details
    Browse the repository at this point in the history
  6. fix: feature gate azure update

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    7cb24ca View commit details
    Browse the repository at this point in the history
  7. feat: more robust azure config handling

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    6fb62c0 View commit details
    Browse the repository at this point in the history
  8. fix: in memory store handling

    roeap authored and rtyler committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    21b9483 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    05e6444 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    7e2afb6 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    a73e362 View commit details
    Browse the repository at this point in the history
  12. Merge pull request delta-io#1646 from rtyler/update-codeowners

    chore: proposed updated CODEOWNERS to allow better review notifications
    rtyler authored Sep 25, 2023
    Configuration menu
    Copy the full SHA
    a482e4f View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    a6d8c56 View commit details
    Browse the repository at this point in the history
  14. Merge pull request delta-io#1661 from roeap/partition-values

    fix: more consistent handling of partition values and file paths
    rtyler authored Sep 25, 2023
    Configuration menu
    Copy the full SHA
    56e1e87 View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2023

  1. refactor: re-organize top level modules (delta-io#1434)

    # Description
    
    ~This contains changes from delta-io#1432, will rebase once that's merged.~
    
    This PR constitutes the bulk of re-organising our top level modules.
    - move `DeltaTable*` structs into new `table` module
    - move table configuration into `table` module
    - move schema related modules into `schema` module
    - rename `action` module to `protocol` - hoping to isolate everything
    that can one day be the log kernel.
    
    ~It also removes the deprecated commit logic from `DeltaTable` and
    updates call sites and tests accordingly.~
    
    I am planning one more follow up, where I hope to make `transactions`
    currently within `operations` a top level module. While the number of
    touched files here is already massive, I want to do this in a follow up,
    as it will also include some updates to the transactions itself, that
    should be more carefully reviewed.
    
    # Related Issue(s)
    
    closes: delta-io#1136
    
    roeap authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    65179b6 View commit details
    Browse the repository at this point in the history
  2. chore: increment python library version (delta-io#1664)

    wjones127 authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    b447934 View commit details
    Browse the repository at this point in the history
  3. fix exception string in writer.py

    The exception message is ambiguous as it interchanges the table and data schemas.
    sebdiem authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    12eedfe View commit details
    Browse the repository at this point in the history
  4. Merge pull request delta-io#1665 from sebdiem/patch-1

    fix: exception string in writer.py
    rtyler authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    0a5aa39 View commit details
    Browse the repository at this point in the history
  5. Update docs

    ion-elgreco committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    c4c8cfd View commit details
    Browse the repository at this point in the history
  6. add read me

    ion-elgreco committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    83ff71e View commit details
    Browse the repository at this point in the history
  7. Add space

    ion-elgreco committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    b14d7b5 View commit details
    Browse the repository at this point in the history
  8. Merge pull request delta-io#1666 from ion-elgreco/docs/update_guide

    docs: small consistency update in guide and readme
    rtyler authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    93eb9ce View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2023

  1. feat: allow to set large dtypes for the schema check in `write_deltal…

    …ake` (delta-io#1668)
    
    # Description
    Previously the schema check always used non-large types. I didn't know
    we could change that, so in polars we added schema casting from large to
    non-large types. This became a problem today when I wanted to write 200M
    records at once, because the array was too big to fit in the normal
    string type.
    
    ```python
    ArrowInvalid: Failed casting from large_string to string: input array too large
    ```
    
    Adding this flag will allow libraries like polars to write directly with
    their large dtypes in arrow. If this is merged, I can work on a fix in
    polars to remove the schema casting for these large types.
    ion-elgreco authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    113fd0f View commit details
    Browse the repository at this point in the history
  2. fix: change partitioning schema from large to normal string for pyarr…

    …ow<12 (delta-io#1671)
    
    # Description
    If pyarrow is below v12.0.0 it changes the partitioning schema fields
    from large_string to string.
    
    # Related Issue(s)
    closes delta-io#1669 
    
    # Documentation
    apache/arrow#34546 (comment)
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    ion-elgreco and wjones127 authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    23d1e90 View commit details
    Browse the repository at this point in the history
  3. chore: bump rust crate version

    rtyler committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    ea9ae1b View commit details
    Browse the repository at this point in the history
  4. Merge pull request delta-io#1675 from rtyler/version-bump

    chore: bump rust crate version
    rtyler authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    55a309d View commit details
    Browse the repository at this point in the history
  5. fix: use epoch instead of ce for date stats (delta-io#1672)

    # Description
    date32 statistics logic was subjectively wrong. It was using
    `from_num_days_from_ce_opt` which
    > Makes a new NaiveDate from a day's number in the proleptic Gregorian
    calendar, with January 1, 1 being day 1.
    
    while date32 is commonly represented as days since UNIX epoch
    (1970-01-01)
    
    
    
    # Related Issue(s)
    closes delta-io#1670
    
    # Documentation
    It doesn't seem like parquet actually has a spec for what a `date`
    should be, but many other tools seem to use the epoch logic.
    
    duckdb and polars seem to use the epoch instead of the Gregorian calendar.
    
    Also, the arrow spec states that date32 should be epoch.
    
    For example, if I write using polars
    ```py
    import polars as pl
    
    # %%
    df = pl.DataFrame(
        {
            "a": [
                10561,
                9200,
                9201,
                9202,
                9203,
                9204,
                9205,
                9206,
                9207,
                9208,
                9199,
            ]
        }
    )
    # %%
    
    df.select(pl.col("a").cast(pl.Date)).write_delta("./db/polars/")
    ```
    the stats are correctly interpreted
    ```
    {"add":{"path":"0-7b8f11ab-a259-4673-be06-9deedeec34ff-0.parquet","size":557,"partitionValues":{},"modificationTime":1695779554372,"dataChange":true,"stats":"{\"numRecords\": 11, \"minValues\": {\"a\": \"1995-03-10\"}, \"maxValues\": {\"a\": \"1998-12-01\"}, \"nullCount\": {\"a\": 0}}"}}
    ```
    universalmind303 authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    94cb9b3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    44d6594 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    6115195 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    c04d46e View commit details
    Browse the repository at this point in the history
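
The difference between the two interpretations discussed in "fix: use epoch instead of ce for date stats" (delta-io#1672) can be checked with Python's standard library. This is a sketch, not the delta-rs code: `date32_as_epoch` and `date32_as_ce` are hypothetical helper names, and `date.fromordinal` is used here as a stand-in for chrono's `from_num_days_from_ce_opt`, since both count 0001-01-01 as day 1.

```python
from datetime import date, timedelta

UNIX_EPOCH = date(1970, 1, 1)


def date32_as_epoch(days: int) -> date:
    """Interpret a date32 value as days since 1970-01-01."""
    return UNIX_EPOCH + timedelta(days=days)


def date32_as_ce(days: int) -> date:
    """Interpret the same integer as a day number in the proleptic Gregorian
    calendar, with 0001-01-01 being day 1 (the behaviour reported as the bug)."""
    return date.fromordinal(days)


# Smallest and largest raw values from the polars example above.
for raw in (9199, 10561):
    print(raw, date32_as_epoch(raw).isoformat(), date32_as_ce(raw).isoformat())
```

Read as days since the epoch, 9199 and 10561 give 1995-03-10 and 1998-12-01, matching the `minValues` and `maxValues` in the log entry above; read as days from the common era, the same integers fall in the first century.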

Commits on Sep 28, 2023

  1. update readme

    Trinity Xia committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    7efddea View commit details
    Browse the repository at this point in the history
  2. fix: update the delta-inspect CLI to be built again by Cargo

    This sort of withered on the vine a bit; this pull request allows it to be built
    properly again.
    rtyler committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    6cb9c8a View commit details
    Browse the repository at this point in the history
  3. update readme

    hnaoto committed Sep 28, 2023
    22d0f6b
  4. Merge pull request delta-io#1676 from delta-io/changelog

    chore: update changelog for the rust-v0.16.0 release
    rtyler authored Sep 28, 2023
    18eec38
  5. 39501cd
  6. 8234c16
  7. Merge pull request delta-io#1677 from hnaoto/docs/update-rust-readme

    docs: update readme in Rust directory
    rtyler authored Sep 28, 2023
    11de699
  8. e7807ea
  9. Merge pull request delta-io#1678 from rtyler/update-delta-inspect

    fix: update the delta-inspect CLI to be built again by Cargo
    rtyler authored Sep 28, 2023
    fcfd1bf

Commits on Sep 29, 2023

  1. 7ca17cd
  2. fix: unify environment variables referenced by Databricks docs

    Long-term fix will be for Databricks to release a Rust SDK for Unity 😄
    
    Fixes delta-io#1627
    rtyler committed Sep 29, 2023
    13e8be1

Commits on Oct 2, 2023

  1. e6699d3

Commits on Oct 3, 2023

  1. Merge pull request delta-io#1673 from rtyler/databricks-env-variables…

    …-1627
    
    fix: unify environment variables referenced by Databricks docs
    rtyler authored Oct 3, 2023
    fac7fdb
  2. docs: get docs.rs configured correctly again (delta-io#1693)

    # Description
    
    The docs build was changed in delta-io#1658 to compile on docs.rs with all
    features, but our crate cannot compile with all-features due to the TLS
    features, which are mutually exclusive.
    
    # Related Issue(s)
    
    For example:
    
    - closes delta-io#1692
    
    This has been tested locally with the following command:
    
    ```
    cargo doc --features azure,datafusion,datafusion,gcs,glue,json,python,s3,unity-experimental
    ```
    wjones127 authored Oct 3, 2023
    dd1fa8c
  3. 575392e
  4. fix!: ensure predicates are parsable (delta-io#1690)

    # Description
    Resolves two issues that impact Datafusion implemented operators
    
    1. When a user has an expression with a built-in scalar function,
    we are unable to parse the output predicate since the
    `DummyContextProvider`'s methods are unimplemented. The provider now
    uses the user-provided state or a default. More work is required in the
    future to allow a user-provided Datafusion state to be used by the
    conflict checker.
    
    2. The string representation was not parsable by sqlparser since it was
    not valid SQL. New code was written to transform an expression into a
    parsable SQL string. The current implementation is not exhaustive; however,
    common use cases are covered.
    
    The delta_datafusion.rs file is getting large so I transformed it into a
    module.
    
    This implementation reuses some code from Datafusion. I've added
    the Apache License at the top of the file. Let me know if anything else is
    required to be compliant.
    
    
    # Related Issue(s)
    - closes delta-io#1625
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    Blajda and wjones127 authored Oct 3, 2023
    4da7d66
  5. 6c5e9dc
  6. fix typo in readme

    JosiahParry authored Oct 3, 2023
    e326473
  7. 03dd2ab
  8. fix: remove an unused import

    rtyler committed Oct 3, 2023
    1ec7cf1

Commits on Oct 5, 2023

  1. feat(python): expose delete operation (delta-io#1687)

    # Description
    Naively expose the delete operation, with the option to provide a
    predicate.
    
    I first tried to expose a richer API with the Python `FilterType` and
    DNF expressions, but from what I understand delta-rs doesn't implement
    generic filters but only `PartitionFilter`. The `DeleteBuilder` also
    only accepts datafusion expressions. So instead of hacking my way around
    or proposing a refactor, I went for the simpler approach of sending a
    string predicate to the Rust lib.
    
    If this implementation is OK I will add tests.
    
    # Related Issue(s)
    - closes delta-io#1417
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    guilhem-dvr and wjones127 authored Oct 5, 2023
    3ba3426
  2. 01696a3
  3. Merge pull request delta-io#1696 from JosiahParry/patch-1

    docs: fix typo in readme
    rtyler authored Oct 5, 2023
    2b37454
  4. 808790d
  5. e5eabf9
  6. Merge pull request delta-io#1688 from GlareDB/universalmind303/create…

    …-or-replace
    
    feat: support CREATE OR REPLACE
    rtyler authored Oct 5, 2023
    bdf1c4e

Commits on Oct 7, 2023

  1. a523639
  2. eea9d2a
  3. 66ad28f
  4. acc2a02
  5. Update the sphinx ignore for building

    =_=
    rtyler committed Oct 7, 2023
    e8ae493
  6. Merge pull request delta-io#1707 from rtyler/rust-173-clipz

    chore: fix new clippy lints introduced in Rust 1.73
    rtyler authored Oct 7, 2023
    30a3800
  7. 83efb17
  8. Merge pull request delta-io#1563 from wjones127/docs/python-api-refer…

    …ence
    
    docs: add Python API reference to mkdocs
    rtyler authored Oct 7, 2023
    0187c46
  9. 583ee12
  10. Merge pull request delta-io#1704 from guilhem-dvr/gdy/python-document…

    …-delete-operation
    
    docs(python): document the delete operation
    rtyler authored Oct 7, 2023
    0a41ebc

Commits on Oct 8, 2023

  1. Enable prebuffer

    ion-elgreco committed Oct 8, 2023
    94b41b7

Commits on Oct 9, 2023

  1. implement issue 1169

    r3stl355 committed Oct 9, 2023
    14e77b4
  2. 506553d
  3. Merge pull request delta-io#1709 from ion-elgreco/feat/enable_prebuff…

    …er_pyarrow
    
    feat: improve read performance by 7x with prebuffer
    rtyler authored Oct 9, 2023
    ab6b0cf
  4. fix format

    r3stl355 authored Oct 9, 2023
    e7bb640
  5. 5abb40e
  6. Merge pull request delta-io#1711 from r3stl355/issue_1169

    docs: add a write example to delta.rs
    rtyler authored Oct 9, 2023
    dfb04e9

Commits on Oct 11, 2023

  1. feat: add version number in .history() and display in reversed chro…

    …nological order (delta-io#1710)
    
    # Description
    Adds the version number to each commit info.
    
    # Related Issue(s)
    <!---
    For example:
    
    - closes delta-io#106 
    --->
    - Closes delta-io#1561
    - Closes delta-io#1680
    
    ---------
    
    Co-authored-by: R. Tyler Croy <[email protected]>
    ion-elgreco and rtyler authored Oct 11, 2023
    f92d9e5
  2. feat(python): expose UPDATE operation (delta-io#1694)

    # Description
    
    - Exposes UPDATE operation to Python.
    - Added two test cases, with predicate and without
    - Took some learnings in simplifying the code (will apply it in MERGE PR
    as well)
    
    
    # Related Issue(s)
    <!---
    For example:
    
    - closes delta-io#106
    --->
    
    Closes delta-io#1505
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    ion-elgreco and wjones127 authored Oct 11, 2023
    3639ac7

Commits on Oct 12, 2023

  1. fix: merge operation with string predicates (delta-io#1705)

    # Description
    Fixes an issue when users use string predicates with the merge
    operation.
    
    Parsing a string predicate did not properly handle table references and
    would always assume a bare table with a table name of the empty string.
    Now the qualifier is `None`; however, a `DFSchema` with qualifiers can be
    supplied where it makes sense.
    
    Now users must provide source and target aliases whenever both sides
    share a column name; otherwise the operation will error out.
    
    Minor refactoring of the expression parser was also done, which allows
    the use of case expressions.
    
    
    # Related Issue(s)
    - closes delta-io#1699
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    Blajda and wjones127 authored Oct 12, 2023
    04576f4

Commits on Oct 14, 2023

  1. refactor!: remove a layer of lifetimes from PartitionFilter (delta-io…

    …#1725)
    
    # Description
    This commit removes a bunch of lifetime restrictions on the
    `PartitionFilter` and `PartitionFilterValue` classes to make them easier
    to use. While the original discussion in Slack and delta-io#1501 made mention of
    using a reference type, there doesn't seem to be a need for it. A
    particular instance of a `PartitionFilter` is created once and just
    borrowed and read for the remainder of its life.
    
    Functions, when necessary, continue to accept the non-container types
    (i.e., `&str` and `&[&str]`), allowing their containerized counterparts
    to continue working with them without needing to borrow or clone the
    containers (i.e., `String` and `Vec<String>`).
    
    # Related Issue(s)
    - resolves delta-io#1501 
    
    # Documentation
    cmackenzie1 authored Oct 14, 2023
    187a58c

Commits on Oct 17, 2023

  1. feat(python): expose MERGE operation (delta-io#1685)

    # Description
    This exposes MERGE commands to the Python API. The updates and
    predicates are first kept in the `TableMerger` class and only dispatched
    to Rust after `TableMerger.execute()`.
    
    This was my first thought on how to implement it since I have limited
    experience with Rust and PyO3 (still learning 😄). Maybe a more elegant
    solution is that every class method on TableMerger is dispatched to Rust
    and then the Rust MergeBuilder gets serialized and sent back to Python
    (back and forth). Let me know your thoughts on this. If this is better,
    I could also do this in the next PR, so we at least can push this one
    out sooner.
    
    There are a couple of issues at the moment that I need feedback on; the
    first one is blocking since I can't test it now:
    
    ~- Source_alias is not applying, somehow during a schema check the
    prefix is missing, however when I printed the lines inside merge, it
    showed the prefix correctly. So not sure where the issue is~
    ~- I had to make datafusion_utils public since I needed to get the
    Expression Struct from it, is this the right way to do that? @Blajda~
    
    Edit:
    I will pull @Blajda's changes from
    delta-io#1705 once merged with develop.
    
    
    # Related Issue(s)
    <!---
    For example:
    
    - closes delta-io#106
    --->
    closes  delta-io#1357
    ion-elgreco authored Oct 17, 2023
    21e369f
  2. da8a4c4
  3. chore: bump the python package version (delta-io#1734)

    # Description
    The description of the main changes of your pull request
    
    # Related Issue(s)
    <!---
    For example:
    
    - closes delta-io#106
    --->
    
    # Documentation
    
    <!---
    Share links to useful documentation
    --->
    wjones127 authored Oct 17, 2023
    19e5cd5
  4. fix: reorder encode_partition_value() checks and add tests (delta-io#…

    …1733)
    
    # Description
    The `isinstance(val, datetime)` check came after `isinstance(val, date)`,
    which meant that it was never reached (every `datetime` is also a `date`).
    I added a test for each encoding type.
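The ordering problem can be reproduced with the standard library alone, because `datetime` is a subclass of `date`. The two functions below are simplified, hypothetical stand-ins for `encode_partition_value()`. The output formats are assumptions chosen for illustration and may differ from what delta-rs emits.

```python
from datetime import date, datetime


def encode_wrong_order(val):
    """Buggy ordering: the date branch also matches datetime values."""
    if isinstance(val, date):
        return val.isoformat()
    if isinstance(val, datetime):  # never reached for datetime inputs
        return val.strftime("%Y-%m-%d %H:%M:%S")
    return str(val)


def encode_fixed_order(val):
    """Fixed ordering: test the more specific type first."""
    if isinstance(val, datetime):
        return val.strftime("%Y-%m-%d %H:%M:%S")
    if isinstance(val, date):
        return val.isoformat()
    return str(val)


ts = datetime(2023, 10, 17, 8, 30, 0)
wrong = encode_wrong_order(ts)  # handled by the date branch
fixed = encode_fixed_order(ts)  # handled by the datetime branch
print(wrong, "|", fixed)
```

Checking the subclass before its parent is the general rule whenever `isinstance` branches overlap.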
    
    ---------
    
    Co-authored-by: Robert Pack <[email protected]>
    ldacey and roeap authored Oct 17, 2023
    1860158

Commits on Oct 18, 2023

  1. 58f83de
  2. Merge pull request delta-io#1735 from wjones127/deprecation-cleanup

    chore: remove deprecated functions
    rtyler authored Oct 18, 2023
    3bcc428

Commits on Oct 20, 2023

  1. Relax pyarrow pin

    dhirschfeld authored and wjones127 committed Oct 20, 2023
    1691030
  2. fix: remove pandas pin (delta-io#1746)

    # Description
    
    Removes the `pandas` pin.
    
    # Related Issue(s)
    
    Resolves delta-io#1745
    dhirschfeld authored Oct 20, 2023
    a9cdd60

Commits on Oct 21, 2023

  1. docs: get docs.rs configured correctly again (delta-io#1693)

    # Description
    
    The docs build was changed in delta-io#1658 to compile on docs.rs with all
    features, but our crate cannot compile with all-features due to the TLS
    features, which are mutually exclusive.
    
    # Related Issue(s)
    
    For example:
    
    - closes delta-io#1692
    
    This has been tested locally with the following command:
    
    ```
    cargo doc --features azure,datafusion,datafusion,gcs,glue,json,python,s3,unity-experimental
    ```
    wjones127 authored and rtyler committed Oct 21, 2023
    5ac8c88
  2. c3776af
  3. f729bea

Commits on Oct 22, 2023

  1. refactor!: update operations to use delta scan (delta-io#1639)

    # Description
    Recently implemented operations did not use `DeltaScan` because it had some
    gaps. These gaps would make it harder to switch to logical plans, which
    is required for merge.
    
    Gaps:
    - It was not possible to include file lineage in the result
    - The subset of files to be scanned is known ahead of time. Users had to
    reconstruct a parquet scan based on those files
    
    The PR introduces a `DeltaScanBuilder` that allows users to specify which
    files to use when constructing the scan, whether the scan should be enhanced
    to include additional metadata columns, and which projection to apply.
    It also retains the previous functionality of pruning based on
    the provided filter when files to scan are not provided.
    
    `DeltaScanConfig` is also introduced, which allows users to deterministically
    obtain the names of any added metadata columns or to specify
    the name if required.
    
    The public interface for `find_files` has changed but functionality
    remains the same.
    
    A new table provider was introduced which accepts a `DeltaScanConfig`.
    This is required for future merge enhancements so unmodified files can
    be pruned prior to writes.
    
    ---------
    
    Co-authored-by: Robert Pack <[email protected]>
    Blajda and roeap authored Oct 22, 2023
    bacb794
  2. chore: update datafusion (delta-io#1741)

    Updates arrow and datafusion dependencies to latest.
    roeap authored Oct 22, 2023
    7681ef0

Commits on Oct 23, 2023

  1. docs: convert docs to use mkdocs (delta-io#1731)

    # Description
    Completed the outstanding tasks in delta-io#1708
    
    Also changed the theme from readthedocs to mkdocs; both are built in, but
    the latter looks sleeker
    
    # Related Issue(s)
    closes delta-io#1708
    
    ---------
    
    Co-authored-by: Robert Pack <[email protected]>
    Co-authored-by: R. Tyler Croy <[email protected]>
    3 people authored Oct 23, 2023
    a67140c

Commits on Oct 24, 2023

  1. docs: dynamodb lock configuration (delta-io#1752)

    # Description
    I have added documentation in the API and also on the Python usage page
    regarding this configuration. Please let me know if it is satisfactory,
    and if not, I am more than happy to address any issues or make any
    necessary adjustments.
    
    # Related Issue(s)
    - closes delta-io#1674
    
    # Documentation
    brayanjuls authored Oct 24, 2023
    cffc4d7

Commits on Oct 25, 2023

  1. 3d85b9b
  2. feat: honor appendOnly table config (delta-io#1747)

    # Description
    Throw an error if a transaction includes a Remove action with data change
    but the Delta table is append-only.
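The rule described above can be sketched as a pre-commit check. This is an illustrative model, not the delta-rs implementation: the `Add`, `Remove`, `AppendOnlyViolation`, and `check_append_only` names are hypothetical, and the table configuration is assumed to be a plain dict keyed by the `delta.appendOnly` table property.

```python
from dataclasses import dataclass
from typing import Iterable, Mapping, Union


@dataclass
class Add:
    path: str
    data_change: bool = True


@dataclass
class Remove:
    path: str
    data_change: bool = True


class AppendOnlyViolation(Exception):
    """Raised when a commit would remove data from an append-only table."""


def check_append_only(
    actions: Iterable[Union[Add, Remove]], table_config: Mapping[str, str]
) -> None:
    # Tables without the property, or with it set to anything but "true",
    # are not restricted.
    if table_config.get("delta.appendOnly", "false").lower() != "true":
        return
    for action in actions:
        # Removes that do not change data (data_change=False) are allowed.
        if isinstance(action, Remove) and action.data_change:
            raise AppendOnlyViolation(
                f"cannot remove {action.path}: table is append-only"
            )


append_only = {"delta.appendOnly": "true"}
check_append_only([Add("part-0.parquet")], append_only)  # passes silently
```

Only Remove actions flagged as a data change are rejected, so rewrites that leave the logical contents untouched still pass in this model.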
    
    # Related Issue(s)
    - closes delta-io#352
    junjunjd authored Oct 25, 2023
    2cbf938

Commits on Oct 26, 2023

  1. Merge tag 'rust-v0.16.2' into naston_upgrade

    Minor docsrs change to remove hdfs feature
    Ryan Aston committed Oct 26, 2023
    415434f

Commits on Oct 27, 2023

  1. chore: fix building/running tests without the datafusion feature

    This looks like an oversight that our CI didn't test because we have the
    datafusion feature typically enabled for our tests. The build error would only
    show up when building tests without it.
    rtyler committed Oct 27, 2023
    e5d2da3
  2. 0ad02d4

Commits on Oct 30, 2023

  1. feat(python): expose FSCK (repair) operation (delta-io#1730)

    # Description
    This PR exposes the FSCK operation as a `repair` method under the
    `DeltaTable` class.
    
    # Related Issue(s)
    <!---
    For example:
    
    - closes delta-io#106
    --->
    - closes delta-io#1727
    
    ---------
    
    Co-authored-by: Will Jones <[email protected]>
    ion-elgreco and wjones127 authored Oct 30, 2023
    b53b686
  2. refactor: perform bulk deletes during metadata cleanup

    In addition to doing bulk deletes, I removed what seems like (at least to me)
    unnecessary code. At its core, files are considered up for deletion
    when their last_modified time is older than the cutoff time AND the version
    is less than the specific version (usually the latest version).
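The deletion condition stated above amounts to a two-part predicate. The sketch below models it with hypothetical names (`LogFile`, `is_expired`, `keep_version`) and made-up timestamps. It is not the Rust code from this commit.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class LogFile:
    version: int
    last_modified: datetime


def is_expired(f: LogFile, cutoff: datetime, keep_version: int) -> bool:
    """A file is up for deletion only if it is both old enough AND below keep_version."""
    return f.last_modified < cutoff and f.version < keep_version


now = datetime(2023, 10, 30, tzinfo=timezone.utc)
cutoff = now - timedelta(days=30)

files = [
    LogFile(0, now - timedelta(days=90)),  # old and below keep_version: delete
    LogFile(1, now - timedelta(days=10)),  # below keep_version but too recent: keep
    LogFile(5, now - timedelta(days=90)),  # old but not below keep_version: keep
]

# Collect every expired version first, so the removals can go out as one batch.
to_delete = [f.version for f in files if is_expired(f, cutoff, keep_version=5)]
print(to_delete)
```

Gathering the matching entries before deleting mirrors the bulk-delete idea: one batched call instead of one request per file.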
    cmackenzie1 authored and rtyler committed Oct 30, 2023
    4622ef2
  3. Make an attempt at improving the utilization of delete_stream for cle…

    …aning up expired logs
    
    This change builds on @cmackenzie1's work and feeds the list stream directly into
    the delete_stream with a predicate function to identify paths for deletion
    rtyler committed Oct 30, 2023
    f9b7080
  4. 953f473
  5. bcd7ea0
  6. attempt to calculate size

    ion-elgreco authored and rtyler committed Oct 30, 2023
    9637d29
  7. add test

    ion-elgreco authored and rtyler committed Oct 30, 2023
    eec0349
  8. chore: bump Python package version

    wjones127 authored and rtyler committed Oct 30, 2023
    d196c56
  9. fix: ignore inf in stats

    wjones127 authored and rtyler committed Oct 30, 2023
    b1c7f1b
  10. doc(README): remove typo

    Deep145757 authored and rtyler committed Oct 30, 2023
    3c34b9e
  11. cd38107
  12. use official Python API for references

    r3stl355 authored and rtyler committed Oct 30, 2023
    8f0b2d7

Commits on Oct 31, 2023

  1. chore: refactor into the deltalake meta crate and deltalake-core crates

    This puts the groundwork in place for starting to partition into smaller crates
    in a simpler and more manageable fashion.
    
    See delta-io#1713
    rtyler committed Oct 31, 2023
    d5ad4fb
  2. 8054882
  3. feat: add deltalake sql crate (delta-io#1757)

    # Description
    
    This is a fairly early draft to create logical plans from SQL using the
    datafusion abstractions. Adopted the patterns over there quite closely
    since the ultimate goal would be to ask the datafusion community if they
    would accept these changes within the datafusion sql crate ...
    
    # Related Issue(s)
    <!---
    For example:
    
    - closes delta-io#106
    --->
    
    # Documentation
    
    <!---
    Share links to useful documentation
    --->
    
    ---------
    
    Co-authored-by: R. Tyler Croy <[email protected]>
    roeap and rtyler authored Oct 31, 2023
    60d757f

Commits on Nov 1, 2023

  1. rollback resolve bucket region change

    Ryan Aston committed Nov 1, 2023
    0971ff2
  2. 51e0461
  3. 8ec92ef