Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Separate null bits from data in columns #1545

Merged
merged 1 commit into from
May 18, 2023
Merged

Separate null bits from data in columns #1545

merged 1 commit into from
May 18, 2023

Conversation

ray6080
Copy link
Contributor

@ray6080 ray6080 commented May 17, 2023

This PR reworks the storage null bits layout of node/rel columns.
We used to inline null bits in each physical page. Now the design shifts to separate null as a sub-column.
Note that in this PR, nulls are stored as bytes, which will be refactored into bits in a separate PR.

Besides this, node copy is simplified a bit: NodeCopyExecutor is simplified, and finally get rid of the semantically redundant InMemColumn and InMemNodeColumn.

The feature of adding a new property with a given default value is disabled on columns.
The correct way to enable this later is to add support for constant columns.

TODO:
"CopyLargeNpyTest" is commented out, as I didn't finish changes in the copier to handle data types whose value size are not aligned with power of 2. I will finish that soon, meanwhile the PR should be reviewable.

@codecov
Copy link

codecov bot commented May 17, 2023

Codecov Report

Patch coverage: 89.93% and project coverage change: -0.10 ⚠️

Comparison is base (996b1e1) 92.00% compared to head (59a3673) 91.90%.

❗ Current head 59a3673 differs from pull request most recent head fd3f585. Consider uploading reports for the commit fd3f585 to get more accurate results

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1545      +/-   ##
==========================================
- Coverage   92.00%   91.90%   -0.10%     
==========================================
  Files         696      692       -4     
  Lines       24864    24718     -146     
==========================================
- Hits        22875    22717     -158     
- Misses       1989     2001      +12     
Impacted Files Coverage Δ
src/include/common/types/types.h 100.00% <ø> (ø)
src/include/storage/copier/npy_reader.h 100.00% <ø> (ø)
src/include/storage/copier/rel_copy_executor.h 100.00% <ø> (ø)
src/include/storage/copier/table_copy_executor.h 100.00% <ø> (ø)
...lude/storage/storage_structure/storage_structure.h 100.00% <ø> (ø)
src/include/storage/wal/wal_record.h 36.61% <ø> (ø)
src/include/storage/wal_replayer_utils.h 100.00% <ø> (ø)
src/main/query_result.cpp 80.39% <50.00%> (+2.90%) ⬆️
src/include/processor/operator/copy/copy_node.h 57.14% <66.66%> (-5.36%) ⬇️
src/include/processor/operator/copy/copy_rel.h 57.14% <66.66%> (-5.36%) ⬇️
... and 23 more

... and 9 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

@ray6080 ray6080 marked this pull request as ready for review May 17, 2023 21:36
@ray6080 ray6080 requested a review from acquamarin May 17, 2023 21:36
@ray6080 ray6080 merged commit d8d5152 into master May 18, 2023
7 checks passed
@ray6080 ray6080 deleted the column-rework branch May 18, 2023 19:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants