Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement copy node rdf #2028

Merged
merged 1 commit into from
Sep 13, 2023
Merged

Implement copy node rdf #2028

merged 1 commit into from
Sep 13, 2023

Conversation

acquamarin
Copy link
Collaborator

No description provided.

@codecov
Copy link

codecov bot commented Sep 13, 2023

Codecov Report

Patch coverage: 93.47% and project coverage change: +0.11% 🎉

Comparison is base (c578381) 89.97% compared to head (f99d35a) 90.08%.
Report is 5 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #2028      +/-   ##
==========================================
+ Coverage   89.97%   90.08%   +0.11%     
==========================================
  Files         909      919      +10     
  Lines       32992    33248     +256     
==========================================
+ Hits        29685    29953     +268     
+ Misses       3307     3295      -12     
Files Changed Coverage Δ
src/include/common/copier_config/copier_config.h 100.00% <ø> (ø)
.../processor/operator/persistent/reader/csv_reader.h 100.00% <ø> (ø)
src/processor/map/map_copy_from.cpp 100.00% <ø> (+0.97%) ⬆️
...rocessor/operator/persistent/reader/csv_reader.cpp 87.50% <ø> (ø)
src/processor/processor.cpp 100.00% <ø> (ø)
src/binder/bind/bind_copy.cpp 86.41% <88.88%> (-0.16%) ⬇️
...rocessor/operator/persistent/reader/rdf_reader.cpp 90.19% <90.19%> (ø)
src/processor/operator/persistent/copy_node.cpp 92.25% <94.87%> (+0.01%) ⬆️
.../include/processor/operator/persistent/copy_node.h 100.00% <100.00%> (ø)
src/include/processor/operator/persistent/reader.h 100.00% <100.00%> (ø)
... and 3 more

... and 39 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@@ -172,6 +193,68 @@ void HashIndexBuilder<T>::flush() {
}
}

template<typename T>
void HashIndexBuilder<T>::rehashSlots(slot_id_t primarySlotId) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should we try to cover this code path?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like the rdf test dataset doesn't trigger the rehash.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we not introduce changes to hash index now? I feel this should be done later in a more careful way.

src/processor/map/map_copy_from.cpp Outdated Show resolved Hide resolved
@@ -86,7 +86,7 @@ void QueryProcessor::decomposePlanIntoTasks(
} break;
case PhysicalOperatorType::READER: {
auto reader = (Reader*)op;
if (reader->getContainsSerial()) {
if (reader->getContainsSerial() || reader->isCopyTurtleFile()) {
Copy link
Contributor

@andyfengHKU andyfengHKU Sep 13, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought we only need to read in SERIAL mode rather than copy in single thread. If we need to execute in single thread mode I'll refactor this check to front-end

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need to execute in single thread mode due to the special copy codepath (append to hashIndex first, then append to columnChunk).

@acquamarin acquamarin merged commit 678055d into master Sep 13, 2023
10 checks passed
@acquamarin acquamarin deleted the rdf-copy branch September 13, 2023 19:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants