Upgrade v14 #71

Merged 1,463 commits on Sep 23, 2024
64d4172
feat(forms) Handle deleting forms references when hard deleting forms…
chriscollins3456 Jul 1, 2024
8d5f0f3
refactor(ui): Misc improvements to the setup ingestion flow (ingest u…
jjoyce0510 Jul 1, 2024
2e496d5
fix(ingestion/airflow-plugin): pipeline tasks discoverable in search …
dushayntAW Jul 2, 2024
640d42d
feat(ingest/transformer): tags to terms transformer (#10758)
sagar-salvi-apptware Jul 2, 2024
40418d9
fix(ingestion/unity-catalog): fixed issue with profiling with GE turn…
dushayntAW Jul 2, 2024
5e4a3af
feat(forms) Add java SDK for form entity PATCH + CRUD examples (#10822)
chriscollins3456 Jul 2, 2024
a7f4b71
feat(SDK) Add java SDK for structuredProperty entity PATCH + CRUD exa…
chriscollins3456 Jul 2, 2024
b651d81
feat(SDK) Add StructuredPropertyPatchBuilder in python sdk and provid…
chriscollins3456 Jul 2, 2024
3175571
feat(forms) Add CRUD endpoints to GraphQL for Form entities (#10825)
chriscollins3456 Jul 2, 2024
286a3cc
add flag for includeSoftDeleted in scroll entities API (#10831)
kevin1chun Jul 2, 2024
f067573
feat(deprecation) Return actor entity with deprecation aspect (#10832)
chriscollins3456 Jul 2, 2024
65c7adb
feat(structuredProperties) Add CRUD graphql APIs for structured prope…
chriscollins3456 Jul 2, 2024
b153473
add scroll parameters to openapi v3 spec (#10833)
kevin1chun Jul 2, 2024
226b059
fix(ingest): correct profile_day_of_week implementation (#10818)
jordanjeremy Jul 3, 2024
099021c
feat(ingest/glue): allow ingestion of empty databases from Glue (#10666)
skrydal Jul 3, 2024
e45f7a4
feat(cli): add more details to get cli (#10815)
anshbansal Jul 3, 2024
b8af2b9
fix(ingestion/glue): ensure date formatting works on all platforms fo…
sagar-salvi-apptware Jul 3, 2024
ea6bc61
fix(ingestion): fix datajob patcher (#10827)
david-leifker Jul 3, 2024
ec1865c
fix(smoke-test): add suffix in temp file creation (#10841)
sid-acryl Jul 3, 2024
73cdfeb
feat(ingest/glue): add helper method to permit user or group ownershi…
aviv-julienjehannet Jul 3, 2024
6628ef6
feat(): Show data platform instances in policy modal if they are set …
githendrik Jul 3, 2024
80e5a64
docs(patch): add patch documentation for how implementation works (#1…
RyanHolstien Jul 3, 2024
581dc7f
fix(jar): add missing custom-plugin-jar task (#10847)
david-leifker Jul 3, 2024
87ff19e
fix(): also check exceptions/stack trace when filtering log messages …
Masterchen09 Jul 3, 2024
1b923b3
docs(): Update posts.md (#9893)
chardaway Jul 3, 2024
c24d780
chore(ingest): update acryl-datahub-classify version (#10844)
cburroughs Jul 3, 2024
fa3e381
refactor(ingest): Refactor structured logging to support infos, warni…
jjoyce0510 Jul 4, 2024
906bc98
fix(restli): log aspect-not-found as a warning rather than as an erro…
ksrinath Jul 4, 2024
6e50bcc
fix(ingest/nifi): remove duplicate upstream jobs (#10849)
mayurinehate Jul 4, 2024
ae9f1fb
fix(smoke-test): test access to create/revoke personal access tokens …
ksrinath Jul 4, 2024
3f29a0b
fix(smoke-test): missing test for move domain (#10837)
Kunal-kankriya Jul 4, 2024
74a543d
ci: update usernames to not considered for community (#10851)
anshbansal Jul 4, 2024
ebe7b2d
env: change defaults for data contract visibility (#10854)
shirshanka Jul 5, 2024
97638c3
fix(ingest/tableau): quote special characters in external URL (#10842)
ipolding-cais Jul 8, 2024
25dca37
fix(smoke-test): fix flakiness of auto complete test
Kunal-kankriya Jul 8, 2024
54b9d98
ci(ingest): pin dask dependency for feast (#10865)
mayurinehate Jul 8, 2024
43bac36
fix(ingestion/lookml): liquid template resolution and view-to-view cl…
sid-acryl Jul 8, 2024
41b9e15
feat(ingest/audit): add client id and version in system metadata prop…
anshbansal Jul 8, 2024
1c8e8c3
chore(ingest): Mypy 1.10.1 pin (#10867)
treff7es Jul 8, 2024
38faa5b
docs: use acryl-datahub-actions as expected python package to install…
aviv-julienjehannet Jul 8, 2024
8722878
docs: add new js snippet (#10846)
hsheth2 Jul 8, 2024
b6c7fe8
refactor(ingestion): remove company domain for security reason (#10839)
shubhamjagtap639 Jul 9, 2024
d204d56
fix(ingestion/spark): Platform instance and column level lineage fix …
treff7es Jul 9, 2024
3e86192
feat(ingestion/tableau): optionally ingest multiple sites and create …
haeniya Jul 9, 2024
609847f
fix(ingestion/looker): Add sqlglot dependency and remove unused sqlpa…
sid-acryl Jul 9, 2024
cd932c3
fix(manage-tokens): fix manage access token policy (#10853)
david-leifker Jul 10, 2024
623b6f9
Batch get entity endpoints (#10880)
kevin1chun Jul 10, 2024
5327f80
feat(system): support conditional write semantics (#10868)
david-leifker Jul 11, 2024
e5d4271
fix(build): upgrade vercel builds to Node 20.x (#10890)
hsheth2 Jul 11, 2024
418f7e0
feat(ingest/lookml): shallow clone repos (#10888)
hsheth2 Jul 11, 2024
44930df
fix(ingest/looker): add missing dependency (#10876)
hsheth2 Jul 11, 2024
82bd3c2
fix(ingest): only populate audit stamps where accurate (#10604)
hsheth2 Jul 11, 2024
351e434
fix(ingest/dbt): always encode tag urns (#10799)
hsheth2 Jul 11, 2024
89bda5b
fix(ingest/redshift): handle multiline alter table commands (#10727)
hsheth2 Jul 11, 2024
aa92a99
fix(ingestion/looker): column name missing in explore (#10892)
sid-acryl Jul 12, 2024
d77d565
fix(lineage) Fix lineage source/dest filtering with explored per hop …
chriscollins3456 Jul 12, 2024
46530f2
feat(conditional-writes): misc updates and fixes (#10901)
david-leifker Jul 12, 2024
542f6c1
feat(ci): update outdated action (#10899)
anshbansal Jul 12, 2024
423af83
feat(rest-emitter): adding async flag to rest emitter (#10902)
gabe-lyons Jul 12, 2024
a4bce6a
feat(ingest): add snowflake-queries source (#10835)
hsheth2 Jul 12, 2024
ea7d6a9
fix(ingest): improve `auto_materialize_referenced_tags_terms` error h…
hsheth2 Jul 13, 2024
43ae12f
docs: add new company to adoption list (#10909)
shtephlee Jul 15, 2024
8967db0
refactor(redshift): Improve redshift error handling with new structur…
jjoyce0510 Jul 15, 2024
36fd614
feat(ui) Finalize support for all entity types on forms (#10915)
chriscollins3456 Jul 15, 2024
78bffc3
Index ExecutionRequestResults status field (#10811)
noggi Jul 15, 2024
437bacb
feat(ingest): grafana connector (#10891)
anshbansal Jul 15, 2024
5f38d13
fix(gms) Add Form entity type to EntityTypeMapper (#10916)
chriscollins3456 Jul 15, 2024
cfcd216
feat(dataset): add support for external url in Dataset (#10877)
dragontail Jul 15, 2024
643ba0c
docs(saas-overview) added missing features to observe section (#10913)
jayacryl Jul 16, 2024
642fb40
fix(ingest/spark): Fixing Micrometer warning (#10882)
treff7es Jul 16, 2024
ea2216a
fix(structured properties): allow application of structured propertie…
gabe-lyons Jul 16, 2024
ee18a17
fix(data-contracts-web) handle other schedule types (#10919)
jayacryl Jul 16, 2024
bb1ba09
fix(ingestion/tableau): human-readable message for PERMISSIONS_MODE_S…
sid-acryl Jul 16, 2024
12ee485
Add feature flag for view defintions (#10914)
ethan-cartwright Jul 16, 2024
ff1c6b8
feat(ingest/BigQuery): refactor+parallelize dataset metadata extracti…
mayurinehate Jul 16, 2024
bb24651
fix(airflow): add error handling around render_template() (#10907)
hsheth2 Jul 16, 2024
a8b07c5
feat(ingestion/sqlglot): add optional `default_dialect` parameter to …
nadavgross Jul 16, 2024
1565fb0
feat(mcp-mutator): new mcp mutator plugin (#10904)
david-leifker Jul 16, 2024
4b83adf
fix(ingest/bigquery): changes helper function to decode unicode scape…
PatrickfBraz Jul 16, 2024
298c299
feat(ingest/postgres): fetch table sizes for profile (#10864)
pie1nthesky Jul 16, 2024
13b6feb
feat(ingest/abs): Adding azure blob storage ingestion source (#10813)
joelmataKPN Jul 17, 2024
82e5a04
fix(ingest/redshift): reduce severity of SQL parsing issues (#10924)
hsheth2 Jul 17, 2024
11c5c3e
fix(build): fix lint fix web react (#10896)
anshbansal Jul 17, 2024
ec788df
fix(ingest/bigquery): handle quota exceeded for project.list requests…
sagar-salvi-apptware Jul 17, 2024
79e1e2e
feat(ingest): report extractor failures more loudly (#10908)
hsheth2 Jul 17, 2024
bccfd8f
feat(ingest/snowflake): integrate snowflake-queries into main source …
hsheth2 Jul 17, 2024
5f79621
fix(ingest): fix docs build (#10926)
hsheth2 Jul 17, 2024
7f3da47
fix(ingest/snowflake): fix test connection (#10927)
hsheth2 Jul 17, 2024
90f0743
fix(ingest/lookml): add view load failures to cache (#10923)
hsheth2 Jul 17, 2024
998d7d1
docs(slack) overhauled setup instructions and screenshots (#10922)
jayacryl Jul 17, 2024
0b64de8
fix(airflow): Add comma parsing of owners to DataJobs (#10903)
eboneil Jul 17, 2024
452b94f
fix(entityservice): fix merging sideeffects (#10937)
david-leifker Jul 17, 2024
1617d62
feat(ingest): Support System Ingestion Sources, Show and hide system …
jjoyce0510 Jul 18, 2024
a7ae99c
chore() Set a default lineage filtering end time on backend when a st…
jjoyce0510 Jul 18, 2024
8266b02
Added relationships APIs to V3. Added these generic APIs to V3 swagge…
ajoymajumdar Jul 18, 2024
8124ecf
docs: add learning center to docs (#10921)
yoonhyejin Jul 18, 2024
e66b2b9
doc: Update hubspot form id (#10943)
yoonhyejin Jul 18, 2024
3733a40
chore(airflow): add python 3.11 w/ Airflow 2.9 to CI (#10941)
hsheth2 Jul 19, 2024
348d449
fix(ingest/Glue): column upstream lineage between S3 and Glue (#10895)
sagar-salvi-apptware Jul 19, 2024
65ef858
fix(ingest/abs): split abs utils into multiple files (#10945)
treff7es Jul 19, 2024
c30f4c7
doc(ingest/looker): fix doc for sql parsing documentation (#10883)
sid-acryl Jul 19, 2024
91e8001
fix(ingest/bigquery): Adding missing BigQuery types (#10950)
treff7es Jul 19, 2024
4fe5f28
fix(ingest/setup): feast and abs source setup (#10951)
treff7es Jul 19, 2024
30e9cfd
fix(connections) Harden adding /gms to connections in backend (#10942)
chriscollins3456 Jul 19, 2024
eace710
feat(siblings) Add flag to prevent combining siblings in the UI (#10952)
chriscollins3456 Jul 19, 2024
44cdb04
fix(docs): make graphql doc gen more automated (#10953)
hsheth2 Jul 19, 2024
20574cf
feat(ingest/athena): Add option for Athena partitioned profiling (#10…
treff7es Jul 19, 2024
aa97cba
fix(spark-lineage): default timeout for future responses (#10947)
deepgarg-visa Jul 22, 2024
9f570a7
feat(datajob/flow): add environment filter using info aspects (#10814)
anshbansal Jul 22, 2024
8b88930
fix(ui/ingest): correct privilege used to show tab (#10483)
anshbansal Jul 22, 2024
a090518
feat(ingest/looker): include dashboard urns in browse v2 (#10955)
hsheth2 Jul 22, 2024
ebe687b
add a structured type to batchGet in OpenAPI V3 spec (#10956)
kevin1chun Jul 22, 2024
c5d3153
fix(ui): scroll on the domain sidebar to show all domains (#10966)
chriscollins3456 Jul 23, 2024
c5dae74
fix(ingest/sagemaker): resolve incorrect variable assignment for Sage…
TristanHeisler Jul 23, 2024
123e84c
fix(airflow/build): Pinning mypy (#10972)
treff7es Jul 23, 2024
f0f226c
Fixed a bug where the OpenAPI V3 spec was incorrect. The bug was intr…
ajoymajumdar Jul 24, 2024
8d874ad
fix(ingest/test): Fix for mssql integration tests (#10978)
treff7es Jul 24, 2024
4921967
fix(entity-service) exist check correctly extracts status (#10973)
jayacryl Jul 24, 2024
959d351
fix(structuredProps) casing bug in StructuredPropertiesValidator (#10…
chriscollins3456 Jul 24, 2024
725df58
bugfix: use anyOf instead of allOf when creating references in openap…
kevin1chun Jul 24, 2024
66f8930
fix(ui): Remove ant less imports (#10988)
asikowitz Jul 25, 2024
fde71d4
feat(ingest/graph): Add get_results_by_filter to DataHubGraph (#10987)
asikowitz Jul 25, 2024
6fe5692
feat(ingest/cli): init does not actually support environment variable…
darnaut Jul 25, 2024
56bb4c8
fix(ingest/graph): Update get_results_by_filter graphql query (#10991)
asikowitz Jul 25, 2024
f4fb89e
feat(ingest/spark): Promote beta plugin (#10881)
treff7es Jul 25, 2024
1fa7998
feat(ingest): support domains in meta -> "datahub" section (#10967)
hsheth2 Jul 25, 2024
5a2fc3c
feat(ingest): add `check server-config` command (#10990)
hsheth2 Jul 25, 2024
dd732d0
feat(cli): Make consistent use of DataHubGraphClientConfig (#10466)
pedro93 Jul 25, 2024
71d1cdb
fix(ingest/s3): Fixing container creation when there is no folder in …
treff7es Jul 25, 2024
b173f60
fix(ingest/looker): support platform instance for dashboards & charts…
sid-acryl Jul 26, 2024
8c3bfd9
feat(ingest/bigquery): improve handling of information schema in sql …
hsheth2 Jul 26, 2024
304fc4e
feat(ingest): improve `ingest deploy` command (#10944)
hsheth2 Jul 26, 2024
1717a30
fix(backend): allow excluding soft-deleted entities in relationship-q…
ksrinath Jul 26, 2024
82412e7
fix(ingest/looker): downgrade missing chart type log level (#10996)
hsheth2 Jul 26, 2024
236043b
doc(acryl-cloud): release docs for 0.3.4.x (#10984)
anshbansal Jul 26, 2024
01b3461
fix(protobuf/build): Fix protobuf check jar script (#11006)
treff7es Jul 26, 2024
0274c70
fix(ui/ingest): Support invalid cron jobs (#10998)
asikowitz Jul 26, 2024
f816a14
fix(ingest): fix graph config loading (#11002)
hsheth2 Jul 26, 2024
1f7c92b
feat(docs): Document __DATAHUB_TO_FILE_ directive (#10968)
pedro93 Jul 26, 2024
bc75f7a
fix(graphql/upsertIngestionSource): Validate cron schedule; parse err…
asikowitz Jul 26, 2024
fdbcb68
feat(ece): support custom ownership type urns in ECE generation (#10999)
hsheth2 Jul 26, 2024
d85da39
feat(assertion-v2): changed Validation tab to Quality and created new…
amit-apptware Jul 26, 2024
a09575f
fix(ingestion/glue): Add support for missing config options for profi…
sagar-salvi-apptware Jul 29, 2024
1c4c858
feat(propagation): Add models for schema field docs, tags, terms (#29…
shirshanka Jul 29, 2024
a6eb1f4
docs: standardize terminology to DataHub Cloud (#11003)
yoonhyejin Jul 30, 2024
da72ba2
fix(ingestion/transformer): replace the externalUrl container (#11013)
sagar-salvi-apptware Jul 30, 2024
7faaf1b
docs(slack) troubleshoot docs (#11014)
jayacryl Jul 30, 2024
43e3cd9
feat(propagation): Add graphql API (#11030)
shirshanka Jul 30, 2024
c2470f1
feat(propagation): Add models for Action feature settings (#11029)
samblackk Jul 30, 2024
9321e94
docs(custom properties): Remove duplicate from sidebar (#11033)
eboneil Jul 30, 2024
6f09b96
feat(models): Introducing Dataset Partitions Aspect (#10997)
jjoyce0510 Jul 30, 2024
d6be56f
feat(propagation): Add Documentation Propagation Settings (#11038)
samblackk Jul 30, 2024
7d4b645
fix(models): chart schema fields mapping, add dataHubAction entity, t…
shirshanka Jul 30, 2024
a734d69
fix(ci): smoke test lint failures (#11044)
shirshanka Jul 30, 2024
32247da
docs: fix learning center color scheme & typo (#11043)
yoonhyejin Jul 31, 2024
bcde06d
feat: add cloud main page (#11017)
yoonhyejin Jul 31, 2024
76f1b23
feat(restore-indices): add additional step to also clear system metad…
Masterchen09 Jul 31, 2024
a3c7ede
docs: fix typo (#11046)
yoonhyejin Jul 31, 2024
27e1130
fix(lint): apply spotless (#11050)
anshbansal Jul 31, 2024
f73149a
docs(airflow): example query to get datajobs for a dataflow (#11034)
eboneil Jul 31, 2024
1955c05
feat(cli): Add run-id option to put sub-command (#11023)
pedro93 Jul 31, 2024
e751189
fix(ingest): improve sql error reporting calls (#11025)
hsheth2 Jul 31, 2024
2333304
fix(airflow): fix CI setup (#11031)
hsheth2 Jul 31, 2024
4b9844d
feat(ingest/dbt): add experimental `prefer_sql_parser_lineage` flag (…
hsheth2 Jul 31, 2024
4512209
fix(ingestion/lookml): enable stack-trace in lookml logs (#10971)
sid-acryl Jul 31, 2024
7fc6556
(chore): Linting fix (#11015)
rtekal Jul 31, 2024
347ac1a
chore(ci): update deprecated github actions (#10977)
anshbansal Jul 31, 2024
52c50f0
Fix ALB configuration example (#10981)
steffengr Jul 31, 2024
edc8fd4
chore(ingestion-base): bump base image packages (#11053)
david-leifker Jul 31, 2024
e14dc91
feat(cli): Trim report of dataHubExecutionRequestResult to max GMS si…
pedro93 Jul 31, 2024
0667470
fix(ingestion/lookml): emit dummy sql condition for lookml custom con…
sid-acryl Jul 31, 2024
dffdef2
fix(ingestion/powerbi): fix issue with broken report lineage (#10910)
sid-acryl Jul 31, 2024
e83550b
feat(ingest/tableau): add retry on timeout (#10995)
hsheth2 Jul 31, 2024
fc7b685
change generate kafka connect properties from env (#10545)
wornjs Jul 31, 2024
2336207
fix(ingest): fix oracle cronjob ingestion (#11001)
lima-renan Jul 31, 2024
b13d990
chore(ci): revert update deprecated github actions (#10977) (#11062)
david-leifker Jul 31, 2024
89933fe
feat(ingest/dbt-cloud): update metadata_endpoint inference (#11041)
hsheth2 Jul 31, 2024
2ab43f3
build: Reduce size of datahub-frontend-react image by 50-ish% (#10878)
xkollar Aug 1, 2024
1e6065e
fix(ci): Fix lint issue in datahub_ingestion_run_summary_provider.py …
pedro93 Aug 1, 2024
e9a0e27
docs(ingest): update developing-a-transformer.md (#11019)
acrylJonny Aug 1, 2024
c83907f
feat(search-test): update search tests from #10408 (#11056)
david-leifker Aug 1, 2024
66ecfae
feat(cli): add aspects parameter to DataHubGraph.get_entity_semityped…
Masterchen09 Aug 1, 2024
2369032
docs(airflow): update min version for plugin v2 (#11065)
hsheth2 Aug 1, 2024
d5eda0d
doc(ingestion/tableau): doc update for derived permission (#11054)
sid-acryl Aug 1, 2024
f78b6c0
fix(py): remove dep on types-pkg_resources (#11076)
hsheth2 Aug 2, 2024
f2e461e
feat(ingest/mode): add option to exclude restricted (#11081)
anshbansal Aug 2, 2024
5013943
fix(ingest): set lastObserved in sdk when unset (#11071)
hsheth2 Aug 2, 2024
6704d44
doc(ingest): Update capabilities (#11072)
treff7es Aug 3, 2024
a8ef7b6
chore(vulnerability): Log Injection (#11090)
pinakipb2 Aug 3, 2024
b544c0a
chore(vulnerability): Information exposure through a stack trace (#11…
pinakipb2 Aug 3, 2024
c572d39
chore(vulnerability): Comparison of narrow type with wide type in loo…
pinakipb2 Aug 3, 2024
aeef69c
chore(vulnerability): Insertion of sensitive information into log fil…
pinakipb2 Aug 3, 2024
2c9e3b7
chore(vulnerability): Risky Cryptographic Algorithm (#11059)
pinakipb2 Aug 3, 2024
210b311
chore(vulnerability): Overly permissive regex range (#11061)
pinakipb2 Aug 3, 2024
903ec87
fix: update customer data (#11075)
yoonhyejin Aug 3, 2024
9e413aa
fix(models): fixing the datasetPartition models (#11085)
jjoyce0510 Aug 5, 2024
8657288
fix(ui): Adding view, forms GraphQL query, remove showing a fallback …
jjoyce0510 Aug 5, 2024
28f1c92
feat(docs-site): hiding learn more from cloud page (#11097)
jayacryl Aug 5, 2024
e4e77c3
fix(docs): Add correct usage of orFilters in search API docs (#11082)
gabe-lyons Aug 5, 2024
41fbae5
fix(ingest/mode): Regexp in mode name matcher didn't allow underscore…
treff7es Aug 5, 2024
018a106
docs: Refactor customer stories section (#10869)
yoonhyejin Aug 6, 2024
63faeec
fix(release): fix full/slim suffix on tag (#11087)
david-leifker Aug 6, 2024
a5a33f0
feat(config): support alternate hashing algorithm for doc id (#10423)
pinakipb2 Aug 6, 2024
090e760
fix(emitter): fix typo in get method of java kafka emitter (#11007)
rtekal Aug 6, 2024
9619553
fix(ingest): use correct native data type in all SQLAlchemy sources b…
Masterchen09 Aug 6, 2024
0400785
chore: Update contributors list in PR labeler (#11105)
skrydal Aug 6, 2024
832093a
feat(ingest): tweak stale entity removal messaging (#11064)
hsheth2 Aug 6, 2024
543e447
fix(ingestion): enforce lastObserved timestamps in SystemMetadata (#1…
david-leifker Aug 6, 2024
2755cf3
fix(ingest/powerbi): fix broken lineage between chart and dataset (#1…
sid-acryl Aug 7, 2024
8bea5d2
feat(ingest/lookml): CLL support for sql set in sql_table_name attrib…
sid-acryl Aug 7, 2024
40e61f9
docs: update graphql docs on forms & structured properties (#11100)
yoonhyejin Aug 7, 2024
900c259
test(search): search openAPI v3 test (#11049)
Kunal-kankriya Aug 7, 2024
edb0f19
fix(ingest/tableau): prevent empty site content urls (#11057)
hsheth2 Aug 7, 2024
c226883
feat(entity-client): implement client batch interface (#11106)
david-leifker Aug 7, 2024
a25df8e
fix(snowflake): avoid reporting warnings/info for sys tables (#11114)
hsheth2 Aug 7, 2024
d6e46b9
fix(ingest): downgrade column type mapping warning to info (#11115)
hsheth2 Aug 7, 2024
e08412e
feat(api): add AuditStamp to the V3 API entity/aspect response (#11118)
ajoymajumdar Aug 8, 2024
a4a887c
fix(ingest/redshift): replace r'\n' with '\n' to avoid token error re…
AndreasHegerNuritas Aug 8, 2024
3d9a954
fix(entiy-client): handle null entityUrn case for restli (#11122)
david-leifker Aug 8, 2024
840b150
fix(sql-parser): prevent bad urns from alter table lineage (#11092)
hsheth2 Aug 8, 2024
78336c9
fix(ingest/bigquery): use small batch size if use_tables_list_query_v…
mayurinehate Aug 9, 2024
aa07e2a
fix(graphql): add missing entities to EntityTypeMapper and EntityType…
Masterchen09 Aug 9, 2024
3a38415
feat(ui): Changes to allow editable dataset name (#10608)
jayasimhankv Aug 9, 2024
5b16252
fix: remove saxo (#11127)
yoonhyejin Aug 9, 2024
080f2a2
feat(mcl-processor): Update mcl processor hooks (#11134)
david-leifker Aug 9, 2024
573c1cb
fix(openapi): fix openapi v2 endpoints & v3 documentation update
david-leifker Aug 9, 2024
469654c
Revert "fix(openapi): fix openapi v2 endpoints & v3 documentation upd…
david-leifker Aug 9, 2024
3dfbbd5
docs(policies): updates to policies documentation (#11073)
david-leifker Aug 9, 2024
479f31d
fix(openapi): fix openapi v2 and v3 docs update (#11139)
david-leifker Aug 9, 2024
946b9f3
feat(auth): grant type and acr values custom oidc parameters support …
RyanHolstien Aug 9, 2024
4d2af40
fix(mutator): mutator hook fixes (#11140)
RyanHolstien Aug 9, 2024
06562f3
feat(search): support sorting on multiple fields (#10775)
RyanHolstien Aug 9, 2024
17868cb
feat(ingest): various logging improvements (#11126)
hsheth2 Aug 9, 2024
b1f16f9
fix(ingestion/lookml): fix for sql parsing error (#11079)
sid-acryl Aug 9, 2024
3f4b8ea
feat(docs-site) cloud page spacing and content polishes (#11141)
jayacryl Aug 9, 2024
796483b
feat(ui) Enable editing structured props on fields (#11042)
chriscollins3456 Aug 12, 2024
c9cc9e1
feat(tests): add md5 and last computed to testResult model (#11117)
RyanHolstien Aug 12, 2024
3155914
test(openapi): openapi regression smoke tests (#11143)
david-leifker Aug 12, 2024
3d4b3b9
fix(airflow): fix tox tests + update docs (#11125)
hsheth2 Aug 12, 2024
9dc85cb
docs: add chime to adoption stories (#11142)
yoonhyejin Aug 13, 2024
5e9188c
fix(ingest/databricks): Updating code to work with Databricks sdk 0.3…
treff7es Aug 13, 2024
2c609b8
fix(kafka-setup): add missing script to image (#11190)
david-leifker Aug 15, 2024
10e5d44
fix(config): fix hash algo config (#11191)
david-leifker Aug 16, 2024
3418ed3
test(smoke-test): updates to smoke-tests (#11152)
david-leifker Aug 13, 2024
668b506
fix(elasticsearch): refactor idHashAlgo setting (#11193)
david-leifker Aug 16, 2024
98ad824
chore(kafka): kafka version bump (#11211)
david-leifker Aug 20, 2024
5d5f60b
upgrade to v14 1st* try
arosanda Sep 3, 2024
dd64ecd
readd UsageStatsWorkUnit
arosanda Sep 3, 2024
1bda024
fix merge problems
arosanda Sep 3, 2024
7524399
change logo
arosanda Sep 23, 2024
The diff you're trying to view is too large. We only load the first 3000 changed files.
9 changes: 7 additions & 2 deletions .dockerignore
@@ -1,6 +1,11 @@
 **/node_modules/
-datahub-frontend/build/
-metadata-ingestion/venv/
+*/build/
+*/*/build/
+**/venv/
+**/.tox/
+**/.mypy_cache/
+**/.pytest_cache/
+**/__pycache__/
 out
 **/*.class
 # Have to copy gradle/wrapper/gradle-wrapper.jar, can't exclude ALL jars
92 changes: 92 additions & 0 deletions .github/actions/ci-optimization/action.yml
@@ -0,0 +1,92 @@
+name: 'Identify CI Optimizations'
+description: 'Determine if code changes are specific to certain modules.'
+
+outputs:
+  frontend-only:
+    description: "Frontend only change"
+    value: ${{ steps.filter.outputs.frontend == 'true' && steps.filter.outputs.ingestion == 'false' && steps.filter.outputs.backend == 'false' }}
+  ingestion-only:
+    description: "Ingestion only change"
+    value: ${{ steps.filter.outputs.frontend == 'false' && steps.filter.outputs.ingestion == 'true' && steps.filter.outputs.backend == 'false' }}
+  backend-only:
+    description: "Backend only change"
+    value: ${{ steps.filter.outputs.frontend == 'false' && steps.filter.outputs.ingestion == 'false' && steps.filter.outputs.backend == 'true' }}
+  backend-change:
+    description: "Backend code has changed"
+    value: ${{ steps.filter.outputs.backend == 'true' }}
+  ingestion-change:
+    description: "Ingestion code has changed"
+    value: ${{ steps.filter.outputs.ingestion == 'true' }}
+  ingestion-base-change:
+    description: "Ingestion base image docker image has changed"
+    value: ${{ steps.filter.outputs.ingestion-base == 'true' }}
+  frontend-change:
+    description: "Frontend code has changed"
+    value: ${{ steps.filter.outputs.frontend == 'true' }}
+  docker-change:
+    description: "Docker code has changed"
+    value: ${{ steps.filter.outputs.docker == 'true' }}
+  kafka-setup-change:
+    description: "Kafka setup docker change"
+    value: ${{ steps.filter.outputs.kafka-setup == 'true' }}
+  mysql-setup-change:
+    description: "Mysql setup docker change"
+    value: ${{ steps.filter.outputs.mysql-setup == 'true' }}
+  postgres-setup-change:
+    description: "Postgres setup docker change"
+    value: ${{ steps.filter.outputs.postgres-setup == 'true' }}
+  elasticsearch-setup-change:
+    description: "Elasticsearch setup docker change"
+    value: ${{ steps.filter.outputs.elasticsearch-setup == 'true' }}
+  smoke-test-change:
+    description: "Smoke test change"
+    value: ${{ steps.filter.outputs.smoke-test == 'true' }}
+runs:
+  using: "composite"
+  steps:
+    - uses: dorny/paths-filter@v2
+      id: filter
+      with:
+        filters: |
+          frontend:
+            - "datahub-frontend/**"
+            - "datahub-web-react/**"
+            - "smoke-test/tests/cypress/**"
+            - "docker/datahub-frontend/**"
+          ingestion:
+            - "metadata-ingestion-modules/**"
+            - "metadata-ingestion/**"
+            - "metadata-models/**"
+            - "smoke-test/**"
+            - "docker/datahub-ingestion**"
+          ingestion-base:
+            - "docker/datahub-ingestion-base/**"
+          docker:
+            - "docker/**"
+          backend:
+            - ".github/**"
+            - "metadata-models/**"
+            - "datahub-upgrade/**"
+            - "entity-registry/**"
+            - "li-utils/**"
+            - "metadata-auth/**"
+            - "metadata-dao-impl/**"
+            - "metadata-events/**"
+            - "metadata-io/**"
+            - "metadata-jobs/**"
+            - "metadata-service/**"
+            - "metadata-utils/**"
+            - "metadata-operation-context/**"
+            - "datahub-graphql-core/**"
+            - "smoke-test/**"
+            - "docker/**"
+          kafka-setup:
+            - "docker/kafka-setup/**"
+          mysql-setup:
+            - "docker/mysql-setup/**"
+          postgres-setup:
+            - "docker/postgres-setup/**"
+          elasticsearch-setup:
+            - "docker/elasticsearch-setup/**"
+          smoke-test:
+            - "smoke-test/**"
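As a rough illustration of how the `*-only` outputs above combine the per-area filter results, here is a minimal Python sketch. It approximates the `dorny/paths-filter` glob matching with `fnmatch` (an assumption; the action uses its own matcher), covers only a subset of the filter patterns, and the names `FILTERS`, `filter_flags`, and `only` are hypothetical.

```python
from fnmatch import fnmatchcase

# Subset of the filters from action.yml. fnmatch is only an approximation
# of the glob semantics used by dorny/paths-filter.
FILTERS = {
    "frontend": [
        "datahub-frontend/**",
        "datahub-web-react/**",
        "smoke-test/tests/cypress/**",
    ],
    "ingestion": ["metadata-ingestion/**", "metadata-models/**", "smoke-test/**"],
    "backend": [
        "metadata-io/**",
        "metadata-service/**",
        "metadata-models/**",
        "smoke-test/**",
        "docker/**",
    ],
}


def filter_flags(changed_files):
    """Return {filter_name: bool}: True if any changed file matches a pattern."""
    return {
        name: any(fnmatchcase(path, pat) for path in changed_files for pat in patterns)
        for name, patterns in FILTERS.items()
    }


def only(flags, area):
    """Mirror the `<area>-only` outputs: this area changed and no other did."""
    return all(changed == (name == area) for name, changed in flags.items())


flags = filter_flags(["datahub-web-react/src/App.tsx"])
print(only(flags, "frontend"))  # True
```

Under this approximation, a change under `smoke-test/tests/cypress/` matches the frontend, ingestion, and backend filters at once, so none of the `-only` outputs would be true for it.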
23 changes: 16 additions & 7 deletions .github/actions/docker-custom-build-and-push/action.yml
@@ -20,7 +20,7 @@ inputs:
     required: false

   images:
-    # e.g. linkedin/datahub-gms
+    # e.g. acryldata/datahub-gms
     description: "List of Docker images to use as base name for tags"
     required: true
   build-args:
@@ -55,7 +55,7 @@ runs:

     # Code for testing the build when not pushing to Docker Hub.
     - name: Build and Load image for testing (if not publishing)
-      uses: docker/build-push-action@v3
+      uses: docker/build-push-action@v5
       if: ${{ inputs.publish != 'true' }}
       with:
         context: ${{ inputs.context }}
@@ -70,27 +70,36 @@ runs:
         push: false
         cache-from: type=registry,ref=${{ steps.docker_meta.outputs.tags }}
         cache-to: type=inline
+    - name: Single Tag
+      if: ${{ inputs.publish != 'true' }}
+      shell: bash
+      run: |
+        TAGS="""
+        ${{ steps.docker_meta.outputs.tags }}
+        """
+        echo "SINGLE_TAG=$(echo $TAGS | tr '\n' ' ' | awk -F' ' '{ print $1 }')" >> $GITHUB_OUTPUT
+      id: single_tag
     - name: Upload image locally for testing (if not publishing)
       uses: ishworkh/docker-image-artifact-upload@v1
       if: ${{ inputs.publish != 'true' }}
       with:
-        image: ${{ steps.docker_meta.outputs.tags }}
+        image: ${{ steps.single_tag.outputs.SINGLE_TAG }}

     # Code for building multi-platform images and pushing to Docker Hub.
     - name: Set up QEMU
-      uses: docker/setup-qemu-action@v2
+      uses: docker/setup-qemu-action@v3
       if: ${{ inputs.publish == 'true' }}
     - name: Set up Docker Buildx
-      uses: docker/setup-buildx-action@v2
+      uses: docker/setup-buildx-action@v3
       if: ${{ inputs.publish == 'true' }}
     - name: Login to DockerHub
-      uses: docker/login-action@v2
+      uses: docker/login-action@v3
       if: ${{ inputs.publish == 'true' }}
       with:
         username: ${{ inputs.username }}
         password: ${{ inputs.password }}
     - name: Build and Push Multi-Platform image
-      uses: docker/build-push-action@v3
+      uses: docker/build-push-action@v5
       if: ${{ inputs.publish == 'true' }}
       with:
         context: ${{ inputs.context }}
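The new `Single Tag` step in this file reduces the multi-line tag list from `docker_meta` to its first entry before handing it to the upload action. A minimal Python sketch of the same selection, assuming tags are separated by whitespace or newlines (the function name `first_tag` and the sample tags are hypothetical):

```python
def first_tag(tags: str) -> str:
    """Return the first whitespace-separated tag, or "" if there is none."""
    # Roughly equivalent to: echo $TAGS | tr '\n' ' ' | awk -F' ' '{ print $1 }'
    parts = tags.split()
    return parts[0] if parts else ""


# Hypothetical tag list in the multi-line shape produced by a metadata step.
sample = "\nacryldata/datahub-gms:head\nacryldata/datahub-gms:abc1234\n"
print(first_tag(sample))  # acryldata/datahub-gms:head
```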
28 changes: 20 additions & 8 deletions .github/scripts/check_policies.py
@@ -20,7 +20,7 @@
     elif urn == "urn:li:dataHubPolicy:editor-platform-policy":
         editor_platform_policy_privileges = policy["info"]["privileges"]
     elif urn == "urn:li:dataHubPolicy:7":
-        all_user_platform_policy_privilges = policy["info"]["privileges"]
+        all_user_platform_policy_privileges = policy["info"]["privileges"]
     try:
         doc_type = policy["info"]["type"]
         privileges = policy["info"]["privileges"]
@@ -54,10 +54,22 @@
 )
 assert len(diff_policies) == 0, f"Missing privileges for root user are {diff_policies}"

-diff_policies = set(editor_platform_policy_privileges).difference(
-    set(all_user_platform_policy_privilges)
-)
-assert "MANAGE_POLICIES" not in all_user_platform_policy_privilges
-assert (
-    len(diff_policies) == 0
-), f"Missing privileges for all user policies are {diff_policies}"
+# All users privileges checks
+assert "MANAGE_POLICIES" not in all_user_platform_policy_privileges
+assert "MANAGE_USERS_AND_GROUPS" not in all_user_platform_policy_privileges
+assert "MANAGE_SECRETS" not in all_user_platform_policy_privileges
+assert "MANAGE_USER_CREDENTIALS" not in all_user_platform_policy_privileges
+assert "MANAGE_ACCESS_TOKENS" not in all_user_platform_policy_privileges
+assert "EDIT_ENTITY" not in all_user_platform_policy_privileges
+assert "DELETE_ENTITY" not in all_user_platform_policy_privileges
+
+# Editor checks
+assert "MANAGE_POLICIES" not in editor_platform_policy_privileges
+assert "MANAGE_USERS_AND_GROUPS" not in editor_platform_policy_privileges
+assert "MANAGE_SECRETS" not in editor_platform_policy_privileges
+assert "MANAGE_USER_CREDENTIALS" not in editor_platform_policy_privileges
+assert "MANAGE_ACCESS_TOKENS" not in editor_platform_policy_privileges
+# These don't prevent a user from modifying entities they are an asset owner of, i.e. their own profile info
+assert "EDIT_CONTACT_INFO" not in editor_platform_policy_privileges
+assert "EDIT_USER_PROFILE" not in editor_platform_policy_privileges
+assert "EDIT_ENTITY_OWNERS" not in editor_platform_policy_privileges
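The repeated assertions in this hunk amount to a deny-list check per policy. The following is a minimal sketch of that idea, not the script's actual implementation: the helper name `forbidden_present` and the sample privilege list are hypothetical, while the forbidden privilege names are copied from the diff.

```python
# Privileges that must not appear in the all-users platform policy.
ALL_USERS_FORBIDDEN = {
    "MANAGE_POLICIES",
    "MANAGE_USERS_AND_GROUPS",
    "MANAGE_SECRETS",
    "MANAGE_USER_CREDENTIALS",
    "MANAGE_ACCESS_TOKENS",
    "EDIT_ENTITY",
    "DELETE_ENTITY",
}

# Privileges that must not appear in the editor platform policy.
EDITOR_FORBIDDEN = {
    "MANAGE_POLICIES",
    "MANAGE_USERS_AND_GROUPS",
    "MANAGE_SECRETS",
    "MANAGE_USER_CREDENTIALS",
    "MANAGE_ACCESS_TOKENS",
    "EDIT_CONTACT_INFO",
    "EDIT_USER_PROFILE",
    "EDIT_ENTITY_OWNERS",
}


def forbidden_present(privileges, forbidden):
    """Return the sorted forbidden privileges that appear in `privileges`."""
    return sorted(set(privileges) & forbidden)


# Hypothetical privilege list standing in for policy["info"]["privileges"].
sample = ["VIEW_ANALYTICS", "MANAGE_SECRETS"]
print(forbidden_present(sample, ALL_USERS_FORBIDDEN))  # ['MANAGE_SECRETS']
```

Compared with one `assert` per privilege, a set intersection reports every offending privilege in a single failure message, at the cost of a less greppable script.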
33 changes: 33 additions & 0 deletions .github/scripts/check_python_package.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
import setuptools
import os

folders = ["./smoke-test/tests"]

for folder in folders:
    print(f"Checking folder {folder}")
    packages = [i for i in setuptools.find_packages(folder) if "cypress" not in i]
    namespace_packages = [
        i for i in setuptools.find_namespace_packages(folder) if "cypress" not in i
    ]

    print("Packages found:", packages)
    print("Namespace packages found:", namespace_packages)

    in_packages_not_namespace = set(packages) - set(namespace_packages)
    in_namespace_not_packages = set(namespace_packages) - set(packages)

    if in_packages_not_namespace:
        print(f"Packages not in namespace packages: {in_packages_not_namespace}")
    if in_namespace_not_packages:
        print(f"Namespace packages not in packages: {in_namespace_not_packages}")
        for pkg in in_namespace_not_packages:
            pkg_path = os.path.join(folder, pkg.replace(".", os.path.sep))
            print(f"Contents of {pkg_path}:")
            print(os.listdir(pkg_path))

    assert (
        len(in_packages_not_namespace) == 0
    ), f"Found packages in {folder} that are not in namespace packages: {in_packages_not_namespace}"
    assert (
        len(in_namespace_not_packages) == 0
    ), f"Found namespace packages in {folder} that are not in packages: {in_namespace_not_packages}"
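Note on check_python_package.py: the check compares regular packages against namespace packages, which in practice flags test directories that lack an `__init__.py`. The stdlib-only sketch below approximates that idea without setuptools. The helper name `dirs_missing_init` and the throwaway directory layout are hypothetical, and the walk is a simplification of what `setuptools.find_namespace_packages` actually does.

```python
import os
import tempfile


def dirs_missing_init(root):
    """Return dotted names of sub-directories under root that lack __init__.py.

    Hypothetical helper for illustration; not part of the PR.
    """
    missing = []
    for current, subdirs, files in os.walk(root):
        subdirs.sort()  # make the traversal order deterministic
        if current == root:
            continue  # the root folder itself is not treated as a package
        if "__init__.py" not in files:
            rel = os.path.relpath(current, root)
            missing.append(rel.replace(os.path.sep, "."))
    return missing


# Build a throwaway tree: "good" has __init__.py, "bad" does not.
with tempfile.TemporaryDirectory() as tmp:
    os.makedirs(os.path.join(tmp, "good"))
    os.makedirs(os.path.join(tmp, "bad"))
    open(os.path.join(tmp, "good", "__init__.py"), "w").close()
    open(os.path.join(tmp, "bad", "test_x.py"), "w").close()
    result = dirs_missing_init(tmp)
    print(result)
```

A directory reported here is one that the script above would likely list under "Namespace packages not in packages".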
14 changes: 7 additions & 7 deletions .github/scripts/docker_helpers.sh
@@ -12,29 +12,29 @@ export SHORT_SHA=$(get_short_sha)
echo "SHORT_SHA: $SHORT_SHA"

function get_tag {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}\,${SHORT_SHA},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g')
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g'),${SHORT_SHA}
}

function get_tag_slim {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-slim\,${SHORT_SHA}-slim,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g')
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-slim,g" -e 's,refs/tags/\(.*\),\1-slim,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g'),${SHORT_SHA}-slim
}

function get_tag_full {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-full\,${SHORT_SHA}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g')
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-full,g" -e 's,refs/tags/\(.*\),\1-full,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g'),${SHORT_SHA}-full
}

function get_python_docker_release_v {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},0.0.0+docker.${SHORT_SHA},g" -e 's,refs/tags/v\(.*\),\1+docker,g' -e 's,refs/pull/\([0-9]*\).*,0.0.0+docker.pr\1,g')
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},1!0.0.0+docker.${SHORT_SHA},g" -e 's,refs/tags/v\(.*\),1!\1+docker,g' -e 's,refs/pull/\([0-9]*\).*,1!0.0.0+docker.pr\1,g')
}

function get_unique_tag {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g')
}

function get_unique_tag_slim {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-slim,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g')
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-slim,g" -e 's,refs/tags/\(.*\),\1-slim,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g')
}

function get_unique_tag_full {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g')
}
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-full,g" -e 's,refs/tags/\(.*\),\1-full,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g')
}
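Note on docker_helpers.sh: the updated rewrites append the `-slim`/`-full` suffix to tag builds as well (the earlier `s,refs/tags/,,g` pattern only stripped the prefix), always append the short SHA as a second tag, and prefix the Python release version with the `1!` epoch. The Python sketch below mirrors two of the updated shell functions to make the mapping easier to follow. It is an approximation under stated assumptions: the defaults `master` and `head` for `main_branch` and `main_branch_tag` are illustrative guesses, and `re.escape` is stricter than the unescaped sed pattern.

```python
import re


def get_tag_slim(github_ref, short_sha, main_branch="master", main_branch_tag="head"):
    """Approximate the updated get_tag_slim; defaults are assumed values."""
    # Main branch -> "<main_branch_tag>-slim" (lambda avoids escape handling).
    tag = re.sub(
        "refs/heads/" + re.escape(main_branch),
        lambda _m: main_branch_tag + "-slim",
        github_ref,
    )
    # Tag builds now keep the tag name and gain the -slim suffix.
    tag = re.sub(r"refs/tags/(.*)", r"\1-slim", tag)
    # Pull requests -> "pr<number>-slim".
    tag = re.sub(r"refs/pull/([0-9]*).*", r"pr\1-slim", tag)
    # The short SHA tag is always appended, comma-separated.
    return f"{tag},{short_sha}-slim"


def get_python_docker_release_v(github_ref, short_sha, main_branch="master"):
    """Approximate the updated version rewrite, including the 1! epoch."""
    v = re.sub(
        "refs/heads/" + re.escape(main_branch),
        lambda _m: f"1!0.0.0+docker.{short_sha}",
        github_ref,
    )
    v = re.sub(r"refs/tags/v(.*)", r"1!\1+docker", v)
    v = re.sub(r"refs/pull/([0-9]*).*", r"1!0.0.0+docker.pr\1", v)
    return v


print(get_tag_slim("refs/tags/v0.14.0", "abc1234"))
print(get_python_docker_release_v("refs/pull/71/merge", "abc1234"))
```

The sample ref names and SHA are made up for the demonstration; refs that match none of the three patterns pass through unchanged, as they do with sed.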
30 changes: 22 additions & 8 deletions .github/workflows/airflow-plugin.yml
@@ -12,7 +12,7 @@ on:
branches:
- "**"
paths:
- ".github/**"
- ".github/workflows/airflow-plugin.yml"
- "metadata-ingestion-modules/airflow-plugin/**"
- "metadata-ingestion/**"
- "metadata-models/**"
@@ -32,32 +32,45 @@ jobs:
strategy:
matrix:
include:
# Note: this should be kept in sync with tox.ini.
- python-version: "3.8"
extra_pip_requirements: "apache-airflow~=2.1.4"
extra_pip_extras: plugin-v1
- python-version: "3.8"
extra_pip_requirements: "apache-airflow~=2.2.4"
extra_pip_extras: plugin-v1
- python-version: "3.10"
extra_pip_requirements: "apache-airflow~=2.4.0"
extra_pip_requirements: "apache-airflow~=2.4.3"
extra_pip_extras: plugin-v2,test-airflow24
- python-version: "3.10"
extra_pip_requirements: 'apache-airflow~=2.6.3 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.6.3/constraints-3.10.txt'
extra_pip_extras: plugin-v2
- python-version: "3.10"
extra_pip_requirements: "apache-airflow~=2.6.0"
extra_pip_requirements: 'apache-airflow~=2.7.3 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.7.3/constraints-3.10.txt'
extra_pip_extras: plugin-v2
- python-version: "3.10"
extra_pip_requirements: "apache-airflow>=2.7.0"
extra_pip_requirements: 'apache-airflow~=2.8.1 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.8.1/constraints-3.10.txt'
extra_pip_extras: plugin-v2
- python-version: "3.11"
extra_pip_requirements: 'apache-airflow~=2.9.3 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.9.3/constraints-3.11.txt'
extra_pip_extras: plugin-v2
fail-fast: false
steps:
- uses: actions/checkout@v3
- name: Set up JDK 17
uses: actions/setup-java@v3
with:
distribution: "zulu"
java-version: 17
- uses: gradle/actions/setup-gradle@v3
- uses: acryldata/sane-checkout-action@v3
- uses: actions/setup-python@v4
with:
python-version: ${{ matrix.python-version }}
cache: "pip"
- name: Install dependencies
run: ./metadata-ingestion/scripts/install_deps.sh
- name: Install airflow package and test (extras ${{ matrix.extra_pip_requirements }})
run: ./gradlew -Pextra_pip_requirements='${{ matrix.extra_pip_requirements }}' -Pextra_pip_extras='${{ matrix.extra_pip_extras }}' :metadata-ingestion-modules:airflow-plugin:lint :metadata-ingestion-modules:airflow-plugin:testQuick
run: ./gradlew -Pextra_pip_requirements='${{ matrix.extra_pip_requirements }}' -Pextra_pip_extras='${{ matrix.extra_pip_extras }}' :metadata-ingestion-modules:airflow-plugin:build
- name: pip freeze show list installed
if: always()
run: source metadata-ingestion-modules/airflow-plugin/venv/bin/activate && pip freeze
@@ -69,15 +82,16 @@ jobs:
**/build/reports/tests/test/**
**/build/test-results/test/**
**/junit.*.xml
!**/binary/**
- name: Upload coverage to Codecov
if: always()
uses: codecov/codecov-action@v3
with:
token: ${{ secrets.CODECOV_TOKEN }}
directory: .
fail_ci_if_error: false
flags: airflow-${{ matrix.python-version }}-${{ matrix.extraPythonRequirement }}
name: pytest-airflow
flags: airflow,airflow-${{ matrix.extra_pip_extras }}
name: pytest-airflow-${{ matrix.python-version }}-${{ matrix.extra_pip_requirements }}
verbose: true

event-file: