Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update internal 0.12.1 #52

Merged
merged 655 commits into from
Mar 12, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
655 commits
Select commit Hold shift + click to select a range
2c01914
fix(metadata-ingestion): adds default value to _resolved_domain_urn i…
alexklavensnyt Oct 30, 2023
f5c8192
ci: tweak to only run relevant workflows (#9052)
anshbansal Oct 30, 2023
9c72bd9
fix(test): Fix for flaky download_lineage_results cypress test (#9132)
kkorchak Oct 30, 2023
300cea3
docs: Update updating-datahub.md (#9131)
pedro93 Oct 30, 2023
58bcedc
fix(ingest/clickhouse): pin version to solve column reflection regres…
hsheth2 Oct 30, 2023
51d6d1f
feat(ingest/looker): cleanup error handling (#9135)
hsheth2 Oct 30, 2023
0bd2d9a
feat(ingest): add `entity_supports_aspect` helper (#9120)
hsheth2 Oct 30, 2023
ce0f36b
feat(sqlparser): support more update syntaxes + fix bug with subqueri…
hsheth2 Oct 30, 2023
94d438d
docs: correct broken doc links (#9137)
sachinsaju Oct 31, 2023
ea12732
feat(ingest): sql parser perf + asyncio fixes (#9119)
hsheth2 Oct 31, 2023
b565a65
feat(quickstart): fix broker InconsistentClusterIdException issues (#…
hsheth2 Oct 31, 2023
2e8954f
fix(policies): remove non-existent policies, fix name (#9150)
anshbansal Oct 31, 2023
b8dcc86
refactor(smoke): Fix for a test that passed on Oss and failed on Saas…
kkorchak Oct 31, 2023
dae320c
docs(teradata): teradata doc external link 404 fix (#9152)
sachinsaju Oct 31, 2023
7a31950
fix(datahub-client): Include relocation for snakeyaml dependency. (#8…
jiateoh Nov 1, 2023
73514ad
fix(ingest): cleanup large images in CI (#9153)
hsheth2 Nov 1, 2023
d231497
build: increase gradle retries (#9091)
hsheth2 Nov 1, 2023
55f1453
feat(ingest): bump sqlglot parser (#9155)
hsheth2 Nov 1, 2023
876de21
feat(ingest/mongodb): support stateful ingestion (#9118)
TonyOuyangGit Nov 1, 2023
f2eb0cf
smoke-test: API test for managing secrets privilege (#9121)
kkorchak Nov 1, 2023
95d9ff2
fix(ingest): handle exceptions in min, max, mean profiling (#9129)
mayurinehate Nov 1, 2023
932eebe
feat: rename Assets tab to Owner Of (#9141)
kushagra-apptware Nov 1, 2023
5078922
fix(ingest/mongodb): fix schema inference for lists of values (#9145)
hsheth2 Nov 1, 2023
f7cd802
fix(ingest/db2): fix handling for table properties (#9128)
deepgarg-visa Nov 2, 2023
bab9d1c
fix(ingest): fully support MCPs in urn_iter primitive (#9157)
hsheth2 Nov 2, 2023
12b4171
fix(ingest/bigquery): use correct row count in null count profiling c…
mayurinehate Nov 2, 2023
11d8988
docs: add feature guides for subscriptions and notifications (#9122)
yoonhyejin Nov 2, 2023
f42cb95
docs: unify oidc guides using tabs (#9068)
yoonhyejin Nov 2, 2023
ec97250
chore(ingest): remove legacy memory_leak_detector (#9158)
hsheth2 Nov 2, 2023
148ad1a
feat(ingest/looker): support emitting unused explores (#9159)
hsheth2 Nov 2, 2023
7ff48b3
refactor(policy): refactor policy locking, no functional difference (…
david-leifker Nov 3, 2023
0731111
API test for managing access token privilege (#9167)
kkorchak Nov 3, 2023
ddb4e1b
fix(mysql-setup): quote database name (#9169)
darnaut Nov 3, 2023
c2bc41d
fix(health): fix health check url authentication (#9117)
david-leifker Nov 3, 2023
efd73a5
fix(elasticsearch): fix elasticsearch-setup for dropped 000001 index …
david-leifker Nov 3, 2023
ac1a5a6
fix(tests): Origin/fix flaky glossary navigation cypress test (#9175)
kkorchak Nov 4, 2023
60131a8
fix: bad lineage link in `LineageGraphOnboardingConfig.tsx` (#9162)
walter9388 Nov 4, 2023
7cfe3c7
OBS-191 | Viewing domains page should not require Manage Domains priv…
sumitappt Nov 4, 2023
81daae8
fix: expand the stats row in search preview cards (#9140)
gaurav2733 Nov 4, 2023
0215666
docs(ingest): clarify adding source guide (#9161)
hsheth2 Nov 6, 2023
4a4c290
chore: stop ingestion-smoke CI errors on forks (#9160)
hsheth2 Nov 6, 2023
86d2b08
docs(ingest): inherit capabilities from superclasses (#9174)
hsheth2 Nov 6, 2023
2c58c63
fix(ingest/datahub-source): Order by version in memory (#9185)
asikowitz Nov 6, 2023
f2ce3ab
lint(frontend): fix HeaderLinks lint error (#9189)
david-leifker Nov 6, 2023
34aa08b
refactor(ui): Refactor entity page loading indicators (#9195)
jjoyce0510 Nov 7, 2023
279fdd5
fix(security): fix for zookeeper CVE-2023-44981 (#9190)
david-leifker Nov 7, 2023
ac9a014
refactor(ui): Rename "dataset details" button text to "view details" …
jjoyce0510 Nov 7, 2023
4577001
feat(ui): Add command-k icons to search bar (#9194)
jjoyce0510 Nov 7, 2023
88cde08
feat(ui): Update Apollo cache to work with union types (#9193)
chriscollins3456 Nov 7, 2023
23c98ec
feat(policy): enable support for 10k+ policies (#9177)
david-leifker Nov 7, 2023
353584c
feat(browsepathv2): Allow system-update to reprocess browse paths v2 …
david-leifker Nov 8, 2023
e73e926
feat(integration/fivetran): Fivetran connector integration (#9018)
shubhamjagtap639 Nov 8, 2023
399e032
feat(neo4j): Allow datahub to connect to specific neo4j database (#9179)
deepgarg-visa Nov 8, 2023
332d4af
feat(subtypes): support subtypes for charts in the UI (#9186)
gabe-lyons Nov 8, 2023
7213591
feat(ui) Debounce auto-complete in search bar (#9205)
chriscollins3456 Nov 8, 2023
70692b4
fix(lineage): magical lineage layout fix (#9187)
gabe-lyons Nov 8, 2023
f87983d
refactor(pdl): Refactoring Assertion model enums out (#9191)
jjoyce0510 Nov 8, 2023
f38c808
feat(auth): Add roles to policy engine validation logic (#9178)
pedro93 Nov 8, 2023
f73ecfd
style(ingest/tableau): Rename tableau_constant to c (#9207)
asikowitz Nov 8, 2023
9174301
docs: update broken link in metadata-modelling (#9184)
sachinsaju Nov 9, 2023
e494a9c
test(): Test policy to create and manage privileges (#9173)
kkorchak Nov 9, 2023
2187d24
docs(security): add security doc to website (#9209)
RyanHolstien Nov 9, 2023
5911a7b
docs(java-sdk-dataset): add dataset via java sdk example (#9136)
sachinsaju Nov 9, 2023
d6cb106
doc(java-sdk-example):example to create tag via java-sdk (#9151)
sachinsaju Nov 9, 2023
1077138
fix(ingest/powerbi): use dataset workspace id as key for parent conta…
looppi Nov 10, 2023
bfa1769
refactor(schema tab): Remove last observed timestamps from schema tab…
jjoyce0510 Nov 10, 2023
9c0f4de
docs: adjust sidebar & create new admin section (#9064)
yoonhyejin Nov 10, 2023
179f103
fix(metadata-io): in Neo4j service use proper algorithm to get lineag…
lix-mms Nov 10, 2023
b851d59
Managed Ingestion UX Improvements (#9216)
purnimagarg1 Nov 10, 2023
89dff8f
chore(ingest): start working on pydantic v2 support (#9220)
hsheth2 Nov 10, 2023
a187127
feat(ingestion): file-based state checkpoint provider (#9029)
shubhamjagtap639 Nov 10, 2023
bb73002
feat(ingestion/airflow): support datajobs as task inlets (#9211)
shubhamjagtap639 Nov 10, 2023
0e3efab
fix(build): set `@cliMajorVersion@` correctly (#9228)
hsheth2 Nov 11, 2023
ebd2e23
fix(datahub-ingestion): remove old jars, sync pyspark version (#9217)
david-leifker Nov 11, 2023
7ba54fd
fix: re-add security.md to sidebar (#9229)
yoonhyejin Nov 13, 2023
582eebe
feat(policies): reduce default access for all users (#9067)
RyanHolstien Nov 13, 2023
4461b60
Update add new company s7 airlines (#9019)
YuriyGavrilov Nov 13, 2023
3844b78
docs(debug): add debug information for cli (#9208)
RyanHolstien Nov 13, 2023
ff90fb6
fix(datahub-ingestion): prevent transitive deps, bump addtional pyspa…
david-leifker Nov 13, 2023
19aa215
feat(ingest/dbt): dbt column-level lineage (#8991)
hsheth2 Nov 14, 2023
906a5b9
chore(ingest): cleanup various methods (#9221)
hsheth2 Nov 14, 2023
f1b6aa7
docs: clarify how to disable telemetry (#9236)
hsheth2 Nov 14, 2023
cfeecd7
feat(ingest/mongodb): support AWS DocumentDB for MongoDB (#9201)
TonyOuyangGit Nov 14, 2023
ec13847
feat(airflow): make RUN_IN_THREAD configurable (#9226)
hsheth2 Nov 14, 2023
486e394
fix(signup): prevent invalid email signup (#9234)
RyanHolstien Nov 14, 2023
f70d8a4
chore(security): version adjustments for security vulns (#9243)
david-leifker Nov 15, 2023
8475fc9
docs(ingest): fix typo in snowflake ingestion docs (#9239)
PGuiv Nov 15, 2023
4201e54
chore(security): jre to headless, removes x11 dependency (#9245)
david-leifker Nov 15, 2023
6655918
feat(recomendations): Make top platforms account only for searchable …
pedro93 Nov 15, 2023
5dd09dd
Feature/prd 770 (#9224)
gaurav2733 Nov 16, 2023
8451c75
fix(search): fix search on paginated lists (#9198)
Salman-Apptware Nov 16, 2023
ee9ee40
fix(): increase the search bar highlight border to double the width (…
gaurav2733 Nov 16, 2023
e6305c0
feat: Add loading indicator to Manage Domains sidebar (#9142)
sumitappt Nov 16, 2023
15efa72
fix(ui): show external url also in entity profile of containers (#8834)
Masterchen09 Nov 16, 2023
78abeb9
feat(ingest/unity): Support specifying catalogs directly; pass env co…
asikowitz Nov 16, 2023
e15e28e
refactor(datahub-web-react): allows proxying to external datahub-fron…
PatrickfBraz Nov 16, 2023
cd789b8
chore(node): update node to non-EOL version (#9252)
david-leifker Nov 16, 2023
9d41a8f
fix(ingest): drop redshift-legacy and redshift-usage-legacy sources (…
hsheth2 Nov 16, 2023
417ffb1
feat(ingest): support advanced configs for aws (#9237)
hsheth2 Nov 16, 2023
d0fa5de
fix(sql-parser): convert platform instance to lowercase when building…
Starkie Nov 16, 2023
5cb0498
test(ingest/unity): Update goldens (#9254)
asikowitz Nov 16, 2023
b778def
build(ingest/hive): Update thrift pin (#8964)
asikowitz Nov 16, 2023
c348f84
docs(airflow): update plugin setup docs to include UI setup approach …
jiateoh Nov 17, 2023
b03515f
feat(usageclient): updates for usageclient (#9255)
david-leifker Nov 17, 2023
bc89a1c
fix(graphql): prevent duplicate index queries for dataproducts (#9260)
david-leifker Nov 17, 2023
2b0811b
logging(search): log level highlight value urn detection (#9262)
david-leifker Nov 17, 2023
d2ca2dd
docs(development): Add Python version in Developer README (#9268)
kevin1chun Nov 17, 2023
b7df9e0
Sync datahub-head on merge (#9267)
noggi Nov 18, 2023
a704290
PRD-742/fix:Settings tab should have 2 scrollable sections (#9218)
Salman-Apptware Nov 20, 2023
1ad4f96
feat: add ingestion overview pages (#9210)
yoonhyejin Nov 20, 2023
0187480
fix(ingest/athena): detect decimal type correctly (#9270)
bossenti Nov 20, 2023
d9de854
fix(ui): Do not show manage in settings when no permissions for it Fi…
gaurav2733 Nov 21, 2023
b51cfc3
build(deps): bump @babel/traverse from 7.22.11 to 7.23.2 in /docs-web…
dependabot[bot] Nov 21, 2023
38adff8
fix(gha): fix gha for single tag (#9283)
david-leifker Nov 21, 2023
15e68bb
fix(node): fix node_options (#9281)
david-leifker Nov 21, 2023
fd129c7
fix: Revamp features page (#8839)
yoonhyejin Nov 22, 2023
cda980b
docs(acryl cloud): release notes 0.2.13 (#9291)
anshbansal Nov 22, 2023
308de49
fix(): stats are spaced out too far (#9292)
gaurav2733 Nov 22, 2023
a62d52e
feat(mysql): upgrade to version 8.2 for quickstart (#9241)
RyanHolstien Nov 22, 2023
37ea292
feat: add townhall RSVP link on the main page (#9277)
yoonhyejin Nov 23, 2023
f794a90
fix(ingest/snowflake): Apply email filter on all usage metrics (#9269)
treff7es Nov 23, 2023
1c5871c
docs(ingestion): Added mention of host without protocol (#9301)
SimonOsipov Nov 24, 2023
5ccb30e
fix(ingest/teradata): Teradata speed up changes (#9059)
treff7es Nov 24, 2023
514c2fb
fix(kafka): fix consumer properties on due consumer (#9304)
david-leifker Nov 24, 2023
298b9be
fix(dbt-cloud): do not pass macros to sorting nodes (#9302)
anshbansal Nov 24, 2023
a34fdfd
fix(ingest/lookml): emit all views with same name and different file …
mayurinehate Nov 24, 2023
cacddf7
fix(deprecation): bring frontend in-sync with model (#9303)
anshbansal Nov 27, 2023
ae5969f
fix(settings): fix the settings height when there are not many items …
Salman-Apptware Nov 27, 2023
3806dc1
docs: update recommended CLI (#9307)
anshbansal Nov 27, 2023
a9650b6
feat(ui): bump frontend dependencies (#8353)
ngamanda Nov 27, 2023
27127eb
fix(java) Fixes NPE ES service (#9311)
chriscollins3456 Nov 27, 2023
03be68c
feat(config): Configurable bootstrap of ownership types (#9308)
skrydal Nov 28, 2023
ac7fa56
fix(sec): update the "json-schema" version from package.json to solve…
kushagra-apptware Nov 28, 2023
ff9876f
fix(ingest/mssql): Add MONEY and SMALLMONEY data types as Number (#9313)
terratrue-daniel Nov 28, 2023
08fb730
fix(ingest): drop deprecated database_alias from sql sources (#9299)
mayurinehate Nov 28, 2023
966cb17
feat(dev): Make repositories configurable for enterprise developers (…
githendrik Nov 28, 2023
3d7962c
fix(ingest/sql): improve handling of views with dots in their names (…
Starkie Nov 28, 2023
2031bd4
docs(ingest): update docs on adding stateful ingestion (#9327)
hsheth2 Nov 28, 2023
3a84037
fix(docker): docker compose health checks port fix (#9326)
david-leifker Nov 29, 2023
bc24136
fix(ui): vulnerability (React): Inefficient Regular Expression Comple…
gaurav2733 Nov 29, 2023
e4c05fa
fix(ui): Fix UI glitch in policies creator (#9266)
chriscollins3456 Nov 29, 2023
10b7a95
fix(sidebar): remove a space reserved for scroll bars when sidebar is…
allizex Nov 29, 2023
ab10e6b
feat(ingest/mssql): enable TLS encryption for SQLServer using pytds (…
terratrue-daniel Nov 29, 2023
c946d26
fix(datahub-frontend): Add playCaffeine as replacement for removed pl…
Nov 29, 2023
4dd6738
fix(ingest): bump pyhive to fix headers issue (#9328)
hsheth2 Nov 29, 2023
0795f0b
feat(gradle): quickstart postgres gradle task (#9329)
david-leifker Nov 29, 2023
f8db909
Upload metadata model to s3 (#9325)
noggi Nov 29, 2023
fe444af
fix(ui) Set explicit height on logo images to fix render bug (#9344)
chriscollins3456 Nov 29, 2023
5e52e31
fix(ingest/browse): Re-emit browse path v2 aspects to avoid race cond…
asikowitz Nov 29, 2023
863894b
feat(ingest/ldap): make ingestion robust to string departmentId (#9258)
hsheth2 Nov 29, 2023
dd09f5e
doc(ingest/teradata): Adding Teradata to list of Integrations (#9336)
treff7es Nov 29, 2023
c00ce51
fix(ui): Complexity in chalk/ansi-regex and minimatch ReDoS Vulnera…
kushagra-apptware Nov 30, 2023
cb72253
build(deps): bump tmpl from 1.0.4 to 1.0.5 in /datahub-web-react (#9345)
dependabot[bot] Nov 30, 2023
c3499f8
fix(): Address @babel/traverse vulnerabilities (#9343)
Salman-Apptware Nov 30, 2023
efaf21d
docs(ingest/looker): mark platform instance as a supported capability…
hsheth2 Nov 30, 2023
65d5034
fix(): Address HIGH vulnerability with Axios (#9353)
Salman-Apptware Nov 30, 2023
ae1169d
fix(ui): show formatted total result count in Search (#9356)
gaurav2733 Nov 30, 2023
a7dc9c9
feat(sdk): autogenerate urn types (#9257)
hsheth2 Nov 30, 2023
a8476ee
fix(airflow): support inlet datajobs correctly in v1 plugin (#9331)
hsheth2 Nov 30, 2023
f9fd946
feat(ingest): clean up DataHubRestEmitter return type (#9286)
hsheth2 Dec 1, 2023
4d9eb12
feat(ingest/dbt): support custom ownership types in dbt meta (#9332)
hsheth2 Dec 1, 2023
82f375d
docs(ingest/lookml): clarify that ssh key has no passphrase (#9348)
hsheth2 Dec 1, 2023
3142efc
fix(migrate): connect with token without dry-run (#9317)
anshbansal Dec 1, 2023
864d3df
fix(ui): Minor: fix unnecessary lineage tab scroll by removing -1 mar…
jjoyce0510 Dec 1, 2023
36c7813
feat(ui): Support dynamic entity profile tab names (#9352)
kushagra-apptware Dec 1, 2023
7b0a8f4
docs: add setup instructions for mac dependencies (#9346)
hsheth2 Dec 1, 2023
f3abfd1
feat(ui): Add caching to search, entity profile for better UX (#9362)
jjoyce0510 Dec 1, 2023
4562d79
refactor(ui): Remove primary color for sort selector + add t… (#9363)
jjoyce0510 Dec 1, 2023
14a463b
feat(ui): Supporting subtypes for data jobs (#9361)
jjoyce0510 Dec 1, 2023
f7f25bd
fix(ingest/bigquery): Fix format arguments for table lineage test (#9…
middagj Dec 4, 2023
7857944
fix(siblingsHook): add logic to account for non dbt upstreams (#9154)
ethan-cartwright Dec 4, 2023
f9b24e0
feat: Support CSV ingestion through the UI (#9280)
purnimagarg1 Dec 4, 2023
6a18010
fix(vulns): node-fetch forwards secure headers to untrusted sites (#9…
Salman-Apptware Dec 4, 2023
c0ef728
fix(ingest/powerbi): Allow old parser to parse [db].[schema].[table] …
asikowitz Dec 4, 2023
4ec3208
feat(ingest): support stdin in `datahub put` (#9359)
hsheth2 Dec 4, 2023
7517c77
fix(ingest): resolve issue with caplog and asyncio (#9377)
hsheth2 Dec 5, 2023
0d9aa26
fix(ingest/airflow): compat with pluggy 1.0 (#9365)
hsheth2 Dec 5, 2023
d123b61
feat(ingest/athena): Enable Athena view ingestion and view lineage (#…
treff7es Dec 5, 2023
3ee8259
fix(ingest/redshift): Identify materialized views properly + fix conn…
treff7es Dec 5, 2023
806f09a
test(ingest/unity): Unity catalog data generation (#8949)
asikowitz Dec 5, 2023
c66619c
fix(elasticsearch): set datahub usage events shard & replica count (#…
david-leifker Dec 5, 2023
7fb6086
feat(gms/search): Adding support for DOUBLE Searchable type (#9369)
siladitya2 Dec 6, 2023
e144741
feat(lint): add spotless for java lint (#9373)
anshbansal Dec 6, 2023
3c0727e
feat(ci): split no cypress test suite (#9387)
anshbansal Dec 6, 2023
a9c5c39
fix(ingest/redshift): too many values unpack (#9394)
anshbansal Dec 6, 2023
2eee333
fix(ingest/redshift): Fix psycopg2 removal from Redshift Source (#9395)
treff7es Dec 6, 2023
7a2b8bf
fix(ui): fixed font src spelling mistake (#9204)
accso-jo Dec 6, 2023
27f23ec
feat(ingest/unity): GE Profiling (#8951)
asikowitz Dec 6, 2023
16fe22a
feat(ui/last-updated): Calculate last updated time as max(properties …
asikowitz Dec 6, 2023
dc6f169
docs: add youtube link to townhall button on docs (#9381)
yoonhyejin Dec 6, 2023
1ce752e
fix: set new sidebar section (#9393)
yoonhyejin Dec 7, 2023
1b48877
fix(ingest/json-schema): take into account environment (#9385)
matthiasdg Dec 7, 2023
4c348a8
feat(datahub-frontend): make Java memory options configurable via ENV…
haeniya Dec 7, 2023
a6726c1
docs(ingest/sql-queries): Add documentation (#9406)
asikowitz Dec 7, 2023
923e76d
docs: fix duplicated overview link for api section (#9402)
yoonhyejin Dec 7, 2023
3096aa6
feat(glossary): add toggle sidebar button and functionality to Busine…
olgadimova Dec 7, 2023
d182667
refactor(ui): Refactor entity registry to be inside App Providers (#9…
jjoyce0510 Dec 7, 2023
81a93dc
feat(ui): handle content prop changes in Editor component (#9400)
hsheth2 Dec 7, 2023
e3e9904
fix(ingest/profiling): Add back db_name to sql_generic_profiler metho…
asikowitz Dec 7, 2023
7247369
feat(observability): add actor urn to GraphQL spans (#9382)
ngamanda Dec 7, 2023
f03c66c
fix(ingest/lookml): make deploy key optional (#9378)
hsheth2 Dec 7, 2023
0e40d38
fix(ingest/powerbi): fix powerbi chart input handling (#9415)
looppi Dec 8, 2023
d52f030
fix(ingest): fix metadata for custom python packages (#9391)
hsheth2 Dec 8, 2023
08a9b9b
fix(ingest): bug fixes and docs updates (#9422)
hsheth2 Dec 8, 2023
3e79a13
Pin alpine base image version to 3.18 (#9421)
noggi Dec 8, 2023
159a013
fix(cypress) Fix flakiness of cypress test for glossary navigation (#…
chriscollins3456 Dec 8, 2023
b4e9d3b
[GDP-1363] Switch on nested meta-mapping permanently (#15)
YusufMahtab May 30, 2023
4885ae6
Add custom Okta functionality for OAuth tokens and team attribute
luos-fc May 25, 2023
d1f494f
Add src/datahub/metadata
maiarareinaldo Jun 8, 2023
86b210d
Set DataHub version to prevent vuln flags
maiarareinaldo Jun 13, 2023
0426011
Add department into user's custom properties
YusufMahtab Jul 4, 2023
2e95556
Add jinja substitution for meta mapping
luos-fc Jul 6, 2023
d84337e
Add platform_instance to browse path for Tableau source and linting
maiarareinaldo Jul 17, 2023
d6e5339
Update tests for okta and kafka
maiarareinaldo Jul 18, 2023
1e26e69
Use schema field path V2 for dbt schema fields
maiarareinaldo Aug 1, 2023
718c890
Fix flake8 issues
maiarareinaldo Aug 1, 2023
2b771c9
Add support for parsing custom Athena SQL
luos-fc Aug 2, 2023
5f55b7e
Update dbt schema field metadata
maiarareinaldo Aug 2, 2023
2350396
Add athena lineage for upstream external tables
luos-fc Aug 9, 2023
3271612
Apply lineage_platform_instance for all upstreams
luos-fc Aug 10, 2023
590a3b8
Allow ignoring of certain upstream lineage platforms (#15)
luos-fc Aug 11, 2023
04b1e13
Revert "feat: add dbt-athena adapter support for column types mapping…
maiarareinaldo Aug 22, 2023
3df187d
Add jinja substitution for meta mapping
luos-fc Jul 6, 2023
1ca0a3b
Allow ignoring of certain upstream lineage platforms (#15)
luos-fc Aug 11, 2023
eea3ef0
Update src/datahub/metadata folder
maiarareinaldo Aug 22, 2023
d07d919
Update datahub cli version
maiarareinaldo Aug 22, 2023
59cd18a
feat(tableau): emit browsePathsV2 for upstream tables
maiarareinaldo Sep 18, 2023
267c1ea
feat(dbt): emit browsePathsV2 for dbt nodes
maiarareinaldo Sep 18, 2023
0914fa1
feat(kafka): emit browsePathsV2
maiarareinaldo Sep 19, 2023
4d943dc
feat(tableau): emit browsePathsV2 for datasources and custom sql
maiarareinaldo Sep 22, 2023
a348afa
Build frontend images (internal branch)
danielcmessias Oct 4, 2023
097ce93
Setup Amplitude in the frontend
danielcmessias Aug 22, 2023
a99c38f
Add departmentName when getting user data for identify calls
danielcmessias Oct 16, 2023
c2628e2
fix: set ignore_upstream_lineage_platforms to [] if unset
danielfordfc Oct 20, 2023
aa043ad
feat: add upstream postgres db whitelisting feature
danielfordfc Oct 23, 2023
7d86089
Apply suggestions from code review
danielfordfc Oct 24, 2023
c913f98
test: fix tests for upstream_lineage and new whitelist feature
danielfordfc Oct 24, 2023
21a70f8
fix(dbt): Test ingestion assertee urn issues
maiarareinaldo Dec 7, 2023
29794a2
Update src/datahub/metadata autogenerated files
maiarareinaldo Feb 20, 2024
80b62f5
Linting
maiarareinaldo Feb 20, 2024
41cb4da
Resolve merge conflicts
maiarareinaldo Feb 20, 2024
4b84c14
Switch on nested metamapping
maiarareinaldo Feb 20, 2024
d13b779
Update datahub version on __init__
maiarareinaldo Feb 23, 2024
ce86642
fix(ingest): avoid git dependency in dbt (#9447)
hsheth2 Dec 12, 2023
5c6015f
feat(ingest/dbt): speed up dbt CLL (#9617)
hsheth2 Jan 16, 2024
4d07a84
Fix tableau bug
maiarareinaldo Feb 28, 2024
290ff47
Fix frontend docker build
maiarareinaldo Mar 4, 2024
9b8933b
feat(ingest/glue): add catalog alias to the urn
maiarareinaldo Mar 4, 2024
4970750
fix(ingest/tableau): add database to upstream Athena tables
maiarareinaldo Mar 5, 2024
8f2a7d4
fix(ingest): fix inferred column lineage for upstream Athena tables
maiarareinaldo Mar 5, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
5 changes: 3 additions & 2 deletions .dockerignore
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
**/node_modules/
datahub-frontend/build/
metadata-ingestion/venv/
*/build/
*/*/build/
*/venv/
out
**/*.class
# Have to copy gradle/wrapper/gradle-wrapper.jar, can't exclude ALL jars
Expand Down
24 changes: 21 additions & 3 deletions .github/actions/docker-custom-build-and-push/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,9 @@ inputs:
# e.g. latest,head,sha12345
description: "List of tags to use for the Docker image"
required: true
target:
description: "Sets the target stage to build"
required: false
outputs:
image_tag:
description: "Docker image tags"
Expand All @@ -49,7 +52,7 @@ runs:
# add git short SHA as Docker tag
tag-custom: ${{ inputs.tags }}
tag-custom-only: true

# Code for testing the build when not pushing to Docker Hub.
- name: Build and Load image for testing (if not publishing)
uses: docker/build-push-action@v3
Expand All @@ -62,14 +65,26 @@ runs:
platforms: linux/amd64
build-args: ${{ inputs.build-args }}
tags: ${{ steps.docker_meta.outputs.tags }}
target: ${{ inputs.target }}
load: true
push: false
cache-from: type=registry,ref=${{ steps.docker_meta.outputs.tags }}
cache-to: type=inline
- name: Single Tag
if: ${{ inputs.publish != 'true' }}
shell: bash
run: |
TAGS="""
${{ steps.docker_meta.outputs.tags }}
"""
echo "SINGLE_TAG=$(echo $TAGS | tr '\n' ' ' | awk -F' ' '{ print $1 }')" >> $GITHUB_OUTPUT
id: single_tag
- name: Upload image locally for testing (if not publishing)
uses: ishworkh/docker-image-artifact-upload@v1
if: ${{ inputs.publish != 'true' }}
with:
image: ${{ steps.docker_meta.outputs.tags }}
image: ${{ steps.single_tag.outputs.SINGLE_TAG }}

# Code for building multi-platform images and pushing to Docker Hub.
- name: Set up QEMU
uses: docker/setup-qemu-action@v2
Expand All @@ -92,6 +107,9 @@ runs:
platforms: ${{ inputs.platforms }}
build-args: ${{ inputs.build-args }}
tags: ${{ steps.docker_meta.outputs.tags }}
target: ${{ inputs.target }}
push: true
cache-from: type=registry,ref=${{ steps.docker_meta.outputs.tags }}
cache-to: type=inline

# TODO add code for vuln scanning?
28 changes: 20 additions & 8 deletions .github/scripts/check_policies.py
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@
elif urn == "urn:li:dataHubPolicy:editor-platform-policy":
editor_platform_policy_privileges = policy["info"]["privileges"]
elif urn == "urn:li:dataHubPolicy:7":
all_user_platform_policy_privilges = policy["info"]["privileges"]
all_user_platform_policy_privileges = policy["info"]["privileges"]
try:
doc_type = policy["info"]["type"]
privileges = policy["info"]["privileges"]
Expand Down Expand Up @@ -54,10 +54,22 @@
)
assert len(diff_policies) == 0, f"Missing privileges for root user are {diff_policies}"

diff_policies = set(editor_platform_policy_privileges).difference(
set(all_user_platform_policy_privilges)
)
assert "MANAGE_POLICIES" not in all_user_platform_policy_privilges
assert (
len(diff_policies) == 0
), f"Missing privileges for all user policies are {diff_policies}"
# All users privileges checks
assert "MANAGE_POLICIES" not in all_user_platform_policy_privileges
assert "MANAGE_USERS_AND_GROUPS" not in all_user_platform_policy_privileges
assert "MANAGE_SECRETS" not in all_user_platform_policy_privileges
assert "MANAGE_USER_CREDENTIALS" not in all_user_platform_policy_privileges
assert "MANAGE_ACCESS_TOKENS" not in all_user_platform_policy_privileges
assert "EDIT_ENTITY" not in all_user_platform_policy_privileges
assert "DELETE_ENTITY" not in all_user_platform_policy_privileges

# Editor checks
assert "MANAGE_POLICIES" not in editor_platform_policy_privileges
assert "MANAGE_USERS_AND_GROUPS" not in editor_platform_policy_privileges
assert "MANAGE_SECRETS" not in editor_platform_policy_privileges
assert "MANAGE_USER_CREDENTIALS" not in editor_platform_policy_privileges
assert "MANAGE_ACCESS_TOKENS" not in editor_platform_policy_privileges
# These don't prevent a user from modifying entities they are an asset owner of, i.e. their own profile info
assert "EDIT_CONTACT_INFO" not in editor_platform_policy_privileges
assert "EDIT_USER_PROFILE" not in editor_platform_policy_privileges
assert "EDIT_ENTITY_OWNERS" not in editor_platform_policy_privileges
18 changes: 17 additions & 1 deletion .github/scripts/docker_helpers.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,15 @@ export SHORT_SHA=$(get_short_sha)
echo "SHORT_SHA: $SHORT_SHA"

function get_tag {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}\,${SHORT_SHA},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g')
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g'),${SHORT_SHA}
}

function get_tag_slim {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-slim,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g'),${SHORT_SHA}-slim
}

function get_tag_full {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g'),${SHORT_SHA}-full
}

function get_python_docker_release_v {
Expand All @@ -21,4 +29,12 @@ function get_python_docker_release_v {

function get_unique_tag {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g')
}

function get_unique_tag_slim {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-slim,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g')
}

function get_unique_tag_full {
echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g')
}
90 changes: 90 additions & 0 deletions .github/workflows/airflow-plugin.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,90 @@
name: Airflow Plugin
on:
push:
branches:
- master
paths:
- ".github/workflows/airflow-plugin.yml"
- "metadata-ingestion-modules/airflow-plugin/**"
- "metadata-ingestion/**"
- "metadata-models/**"
pull_request:
branches:
- "**"
paths:
- ".github/workflows/airflow-plugin.yml"
- "metadata-ingestion-modules/airflow-plugin/**"
- "metadata-ingestion/**"
- "metadata-models/**"
release:
types: [published]

concurrency:
group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
cancel-in-progress: true

jobs:
airflow-plugin:
runs-on: ubuntu-latest
env:
SPARK_VERSION: 3.0.3
DATAHUB_TELEMETRY_ENABLED: false
strategy:
matrix:
include:
- python-version: "3.8"
extra_pip_requirements: "apache-airflow~=2.1.4"
extra_pip_extras: plugin-v1
- python-version: "3.8"
extra_pip_requirements: "apache-airflow~=2.2.4"
extra_pip_extras: plugin-v1
- python-version: "3.10"
extra_pip_requirements: "apache-airflow~=2.4.0"
extra_pip_extras: plugin-v2
- python-version: "3.10"
extra_pip_requirements: "apache-airflow~=2.6.0"
extra_pip_extras: plugin-v2
- python-version: "3.10"
extra_pip_requirements: "apache-airflow>=2.7.0"
extra_pip_extras: plugin-v2
fail-fast: false
steps:
- uses: actions/checkout@v3
- uses: actions/setup-python@v4
with:
python-version: ${{ matrix.python-version }}
cache: "pip"
- name: Install dependencies
run: ./metadata-ingestion/scripts/install_deps.sh
- name: Install airflow package and test (extras ${{ matrix.extra_pip_requirements }})
run: ./gradlew -Pextra_pip_requirements='${{ matrix.extra_pip_requirements }}' -Pextra_pip_extras='${{ matrix.extra_pip_extras }}' :metadata-ingestion-modules:airflow-plugin:lint :metadata-ingestion-modules:airflow-plugin:testQuick
- name: pip freeze show list installed
if: always()
run: source metadata-ingestion-modules/airflow-plugin/venv/bin/activate && pip freeze
- uses: actions/upload-artifact@v3
if: ${{ always() && matrix.python-version == '3.10' && matrix.extra_pip_requirements == 'apache-airflow>=2.7.0' }}
with:
name: Test Results (Airflow Plugin ${{ matrix.python-version}})
path: |
**/build/reports/tests/test/**
**/build/test-results/test/**
**/junit.*.xml
- name: Upload coverage to Codecov
if: always()
uses: codecov/codecov-action@v3
with:
token: ${{ secrets.CODECOV_TOKEN }}
directory: .
fail_ci_if_error: false
flags: airflow-${{ matrix.python-version }}-${{ matrix.extraPythonRequirement }}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[actionlint] reported by reviewdog 🐶
property "extrapythonrequirement" is not defined in object type {extra_pip_extras: string; extra_pip_requirements: string; python-version: number} [expression]

name: pytest-airflow
verbose: true

event-file:
runs-on: ubuntu-latest
steps:
- name: Upload
uses: actions/upload-artifact@v3
with:
name: Event File
path: ${{ github.event_path }}
36 changes: 18 additions & 18 deletions .github/workflows/build-and-test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ on:
- "**.md"
pull_request:
branches:
- master
- "**"
paths-ignore:
- "docs/**"
- "**.md"
Expand All @@ -24,37 +24,37 @@ jobs:
strategy:
fail-fast: false
matrix:
command:
[
"./gradlew build -x :metadata-ingestion:build -x :metadata-ingestion:check -x docs-website:build -x :metadata-integration:java:spark-lineage:test -x :metadata-io:test -x :metadata-ingestion-modules:airflow-plugin:build -x :datahub-frontend:build -x :datahub-web-react:build --parallel",
"./gradlew :datahub-frontend:build :datahub-web-react:build --parallel",
"./gradlew :metadata-ingestion-modules:airflow-plugin:build --parallel"
]
timezone:
[
"UTC",
"America/New_York",
command: [
# metadata-ingestion and airflow-plugin each have dedicated build jobs
"except_metadata_ingestion",
"frontend",
]
timezone: ["UTC", "America/New_York"]
runs-on: ubuntu-latest
timeout-minutes: 60
steps:
- uses: szenius/set-timezone@v1.0
with:
timezoneLinux: ${{ matrix.timezone }}
- uses: actions/checkout@v3
with:
fetch-depth: 800
- uses: hsheth2/sane-checkout-action@v1
- name: Set up JDK 11
uses: actions/setup-java@v3
with:
distribution: "zulu"
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
- name: Gradle build (and test)
python-version: "3.10"
cache: pip
- name: Gradle build (and test) for metadata ingestion
# we only need the timezone runs for frontend tests
if: ${{ matrix.command == 'except_metadata_ingestion' && matrix.timezone == 'America/New_York' }}
run: |
./gradlew build -x :metadata-ingestion:build -x :metadata-ingestion:check -x docs-website:build -x :metadata-integration:java:spark-lineage:test -x :metadata-io:test -x :metadata-ingestion-modules:airflow-plugin:build -x :metadata-ingestion-modules:airflow-plugin:check -x :datahub-frontend:build -x :datahub-web-react:build --parallel
- name: Gradle build (and test) for frontend
if: ${{ matrix.command == 'frontend' }}
run: |
${{ matrix.command }}
./gradlew :datahub-frontend:build :datahub-web-react:build --parallel
env:
NODE_OPTIONS: "--max-old-space-size=3072"
- uses: actions/upload-artifact@v3
Expand All @@ -81,7 +81,7 @@ jobs:
- uses: actions/checkout@v3
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
- name: Download YQ
uses: chrisdickinson/setup-yq@v1.0.1
with:
Expand Down
27 changes: 8 additions & 19 deletions .github/workflows/check-datahub-jars.yml
Original file line number Diff line number Diff line change
Expand Up @@ -4,17 +4,13 @@ on:
push:
branches:
- master
paths-ignore:
- "docker/**"
- "docs/**"
- "**.md"
paths:
- "metadata-integration"
pull_request:
branches:
- master
paths-ignore:
- "docker/**"
- "docs/**"
- "**.md"
- "**"
paths:
- "metadata-integration"
release:
types: [published]

Expand All @@ -28,25 +24,18 @@ jobs:
max-parallel: 1
fail-fast: false
matrix:
command:
[
"datahub-client",
"datahub-protobuf",
"spark-lineage"
]
command: ["datahub-client", "datahub-protobuf", "spark-lineage"]
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
with:
fetch-depth: 800
- uses: hsheth2/sane-checkout-action@v1
- name: Set up JDK 11
uses: actions/setup-java@v3
with:
distribution: "zulu"
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
- name: check ${{ matrix.command }} jar
run: |
./gradlew :metadata-integration:java:${{ matrix.command }}:build --info
Expand Down
4 changes: 3 additions & 1 deletion .github/workflows/close-stale-issues.yml
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,9 @@ jobs:
days-before-issue-stale: 30
days-before-issue-close: 30
stale-issue-label: "stale"
stale-issue-message: "This issue is stale because it has been open for 30 days with no activity. If you believe this is still an issue on the latest DataHub release please leave a comment with the version that you tested it with. If this is a question/discussion please head to https://slack.datahubproject.io. For feature requests please use https://feature-requests.datahubproject.io"
stale-issue-message:
"This issue is stale because it has been open for 30 days with no activity. If you believe this is still an issue on the latest DataHub release please leave a comment with the version that you tested it with. If this is a question/discussion please head to https://slack.datahubproject.io.\
\ For feature requests please use https://feature-requests.datahubproject.io"
close-issue-message: "This issue was closed because it has been inactive for 30 days since being marked as stale."
days-before-pr-stale: -1
days-before-pr-close: -1
Expand Down
Loading
Loading