Multi Copy for Node Tables #3298

benjaminwinger · 2024-04-16T21:17:29Z

Performance is currently badly bottlenecked by HashIndex::splitSlot which hasn't been optimized using the new caching disk array WriteIterator (used when doing bulk inserts and the hash index is non-empty, i.e., for each additional copy). Currently the second copy performance is roughly 10x worse as a result. I've already written an optimized version that improves it to 2.8x worse, but there's a bug I need to fix and I thought I should get this merged now that it's passing the tests.

I also discovered a bug in DBFileUtils::updatePage, which is mostly unrelated to multi copy: if a WAL page already exists, it won't be marked as dirty, however it's possible for a wal page to not be dirty if it was previously evicted and then read (but not modified), before being updated with DBFileUtils::updatePage (probably reproduced here due to the last of the multi copy tests being quite large and pushing the 64MB buffer pool size used by the tests).

ray6080

LGTM!

src/include/storage/local_storage/local_node_table.h

ray6080 · 2024-04-17T05:31:24Z

src/include/storage/local_storage/local_table.h

 #include "common/vector/value_vector.h"
+#include "storage/store/chunked_node_group.h"


Double check if the include of chunked_node_group.h is needed. I think it should be indirectly included by chunked_node_group_collection.h already.

It is, but I don't like relying on indirect includes, they just make the changes more confusing when things are removed and you have to add unrelated headers (though admittedly in this case it is something that's closely related to the existing include).

src/include/storage/store/node_table.h

src/storage/local_storage/local_table.cpp

src/processor/operator/persistent/node_batch_insert.cpp

ray6080 · 2024-04-17T06:03:39Z

src/processor/operator/persistent/node_batch_insert.cpp

+void NodeBatchInsert::clearToIndex(std::unique_ptr<ChunkedNodeGroup>& nodeGroup,
+    common::offset_t startIndexInGroup) {
+    // Create a new chunked node group and move the unwritten values to it
+    // TODO(bmwinger): Can probably re-use the chunk and shift the values


Or we can let writeToExistingNodeGroup handle writing from the middle of the node group? so we don't need clearToIndex.

Maybe. Do you mean write the last n values to the existing chunk, truncate the group, and continue adding to the existing one and use that when creating a new node group?
In addition to it meaning that even in single-threaded mode the values will not be in their original order (though I'm not sure if we have any guarantees of that anyway), truncation could lead to things like the re-used ListColumnChunk storing extra values from the original data, since ListColumnChunk::finalize doesn't always remove unused values.
But either way it would add a function that needs to be implemented by all the column chunk types and could have subtle side-effects if omitted (but not failures, which makes it easier to miss), which was why I decided against including that in this PR.

src/include/storage/index/hash_index.h

ray6080 · 2024-04-17T06:23:28Z

src/processor/operator/persistent/node_batch_insert.cpp

    auto nodeSharedState =
        ku_dynamic_cast<BatchInsertSharedState*, NodeBatchInsertSharedState*>(sharedState.get());
    if (nodeSharedState->pkType.getLogicalTypeID() != LogicalTypeID::SERIAL) {
        nodeSharedState->initPKIndex(context);
    }
+    // Set initial node group index, which should be the last one available on disk which is not
+    // full, or the next index.
+    auto nodeTable = ku_dynamic_cast<Table*, NodeTable*>(nodeSharedState->table);


hmm this is a bit confusing. what is currentNodeGroupIdx used for?

It's used to track the next node group index to write to. This sets it either to the last non-empty node group, or to the first non-existent one, depending on whether or not there is a last non-empty one.

src/include/storage/store/node_table.h

src/include/storage/store/node_table_data.h

benjaminwinger force-pushed the multi-copy branch from beaf397 to 1d7616d Compare April 16, 2024 21:32

ray6080 approved these changes Apr 17, 2024

View reviewed changes

benjaminwinger force-pushed the multi-copy branch 4 times, most recently from c3b6055 to 7d7c9d5 Compare April 18, 2024 21:34

Enable multi copy for node tables

8250751

benjaminwinger force-pushed the multi-copy branch from 7d7c9d5 to 8250751 Compare April 19, 2024 13:37

benjaminwinger merged commit b41b649 into master Apr 19, 2024
17 checks passed

benjaminwinger deleted the multi-copy branch April 19, 2024 14:12

benjaminwinger mentioned this pull request Apr 19, 2024

Optimize Hash Index slot splitting #3325

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi Copy for Node Tables #3298

Multi Copy for Node Tables #3298

benjaminwinger commented Apr 16, 2024

ray6080 left a comment

ray6080 Apr 17, 2024

benjaminwinger Apr 17, 2024

ray6080 Apr 17, 2024

benjaminwinger Apr 17, 2024

ray6080 Apr 17, 2024

benjaminwinger Apr 17, 2024

		#include "common/vector/value_vector.h"
		#include "storage/store/chunked_node_group.h"

Multi Copy for Node Tables #3298

Multi Copy for Node Tables #3298

Conversation

benjaminwinger commented Apr 16, 2024

ray6080 left a comment

Choose a reason for hiding this comment

ray6080 Apr 17, 2024

Choose a reason for hiding this comment

benjaminwinger Apr 17, 2024

Choose a reason for hiding this comment

ray6080 Apr 17, 2024

Choose a reason for hiding this comment

benjaminwinger Apr 17, 2024

Choose a reason for hiding this comment

ray6080 Apr 17, 2024

Choose a reason for hiding this comment

benjaminwinger Apr 17, 2024

Choose a reason for hiding this comment