[fix](merge-on-write) segcompaction should process delete bitmap if necessary (#38369) #39707

Merged
merged 6 commits into from
Aug 21, 2024

zhannngchen
Contributor

Proposed changes

Issue Number: close #xxx

cherry-pick #38369 and #38800

…ecessary (apache#38369)

Issue Number: close #xxx

When loading data into a unique key table with a sequence column, some
rows in the current load job may be marked as deleted because they have
a lower sequence value.
If the load job produces many segments, segcompaction may be triggered.
Segcompaction currently does not process the delete bitmap, which causes
a data correctness issue.

For example:
1. Initially there are 4 segments in the current load job, and because of
the sequence column some rows are marked as deleted.
2. After segcompaction, if we don't process the delete bitmap, its
content still corresponds to the old segment layout, and rows
7, 14 and 15 are not correctly marked as deleted on the newly generated segment 1.
3. This PR converts the old delete bitmap to fit the new segment layout.
It converts delete bitmaps from the old layout to the new one in a similar
way to base/cumulative compaction, but the rowid conversion is simpler.
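
As a rough illustration of step 3, the sketch below re-expresses delete marks from the old segment layout in the new one. It is not the code from this PR: the type aliases, `convert_delete_bitmap`, and the explicit old-to-new row mapping are hypothetical stand-ins, and the delete bitmap is modeled as a plain map of sets rather than Doris's real bitmap structure.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <set>
#include <utility>

// Hypothetical types for illustration only; not Doris's real classes.
using SegmentId = std::uint32_t;
using RowId = std::uint32_t;
using RowLocation = std::pair<SegmentId, RowId>;
// Delete bitmap modeled as: segment id -> set of deleted row ids.
using DeleteBitmap = std::map<SegmentId, std::set<RowId>>;
// Row id conversion: old (segment, row) -> new (segment, row) after segcompaction.
using RowIdConversion = std::map<RowLocation, RowLocation>;

// Re-express every delete mark of the old layout in the new layout.
// Assumption of this sketch: marks with no entry in the conversion are skipped.
DeleteBitmap convert_delete_bitmap(const DeleteBitmap& old_bitmap,
                                   const RowIdConversion& conversion) {
    DeleteBitmap new_bitmap;
    for (const auto& entry : old_bitmap) {
        const SegmentId old_seg = entry.first;
        for (RowId old_row : entry.second) {
            auto it = conversion.find(RowLocation{old_seg, old_row});
            if (it == conversion.end()) {
                continue;  // row was not part of the compacted segments
            }
            const RowLocation& new_loc = it->second;
            // Two old rows must never map to the same new row.
            bool inserted = new_bitmap[new_loc.first].insert(new_loc.second).second;
            assert(inserted);
            (void)inserted;  // avoid an unused-variable warning when NDEBUG is set
        }
    }
    return new_bitmap;
}
```

For instance, if old rows (segment 0, row 1) and (segment 1, row 0) are marked as deleted and the mapping sends them to rows 2 and 1 of new segment 0, the converted bitmap marks rows 1 and 2 of segment 0.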

![whiteboard_exported_image-2](https://github.com/user-attachments/assets/a419b6a4-e583-457a-bf4e-56d9bd2a3544)
## Proposed changes

Issue Number: close #xxx

introduced by apache#38369
@doris-robot

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@zhannngchen
Contributor Author

run buildall

Contributor

@github-actions github-actions bot left a comment

clang-tidy made some suggestions

be/test/olap/segcompaction_mow_test.cpp (4 resolved comments)
@zhannngchen
Contributor Author

run buildall

Contributor

@github-actions github-actions bot left a comment

clang-tidy made some suggestions

be/test/olap/segcompaction_mow_test.cpp (3 resolved comments)
Contributor

@github-actions github-actions bot left a comment

clang-tidy made some suggestions

be/test/olap/segcompaction_mow_test.cpp (3 resolved comments)
@zhannngchen
Contributor Author

run buildall

@yiguolei yiguolei merged commit 0e694f1 into apache:branch-2.1 Aug 21, 2024
19 of 21 checks passed
zhannngchen added a commit to zhannngchen/incubator-doris that referenced this pull request Aug 22, 2024
@yiguolei yiguolei mentioned this pull request Sep 5, 2024
3 tasks
3 participants