Backport the Close Index API refactoring to 6.x #37359

tlrx · 2019-01-11T12:16:28Z

This pull request is a backport to 6.x of the Close Index API refactoring.

It cherry-picks the following commits from master:

3ca885e [Close Index API] Add TransportShardCloseAction for pre-closing verifications ([Close Index API] Add TransportShardCloseAction for pre-closing verifications #36249)
8e5dd20 [Close Index API] Refactor MetaDataIndexStateService ([Close Index API] Refactor MetaDataIndexStateService #36354)
7372529 [Tests] Reduce randomization in CloseWhileRelocatingShardsIT ([Tests] Reduce randomization in CloseWhileRelocatingShardsIT #36694)
103c4d4 [Close Index API] Mark unavailable shard copy as stale during verification ([Close Index API] Mark shard copy as stale if needed during shard verification #36755)
1959388 [Close Index API] Propagate tasks ids between Freeze, Close and Verify([Close Index API] Propagate tasks ids between Freeze, Close and Verify Shard actions #36630)
e149b08 [Close Index API] Add unique UUID to ClusterBlock ([Close Index API] Add unique UUID to ClusterBlock #36775)
dc371ef [Tests] Fix ReopenWhileClosingIT with correct min num shards

The serialization logic changes were adapted during the cherry picks.

The following two commits were needed to adapt the change to 6.x:

ef6ae69 [Close Index API] Adapt MetaDataIndexStateServiceTests after merge
21b7653 [Tests] Adapt CloseIndexIT tests for 6.x

…ications (elastic#36249) This pull request adds the TransportShardCloseAction which is a transport replication action that acquires all index shard permits for its execution. This action will be used in the future by the MetaDataIndexStateService in a new index closing process, where we need to execute some sanity checks before closing an index. The action executes the following verifications on the primary and replicas: * there is no other on going operation active on the shard * the data node holding the shard knows that the index is blocked for writes * the shard's max sequence number is equal to the global checkpoint When the verifications are done and successful, the shard is flushed. Relates elastic#33888

The commit changes how indices are closed in the MetaDataIndexStateService. It now uses a 3 steps process where writes are blocked on indices to be closed, then some verifications are done on shards using the TransportVerifyShardBeforeCloseAction added in elastic#36249, and finally indices states are moved to CLOSE and their routing tables removed. The closing process also takes care of using the pre-7.0 way to close indices if the cluster contains mixed version of nodes and a node does not support the TransportVerifyShardBeforeCloseAction. It also closes unassigned indices. Related to elastic#33888

…#36694)

…ation (elastic#36755) This pull request modifies the TransportVerifyShardBeforeCloseAction so that it marks unavailable shards as stale.

…y Shard actions (elastic#36630) This pull request changes the Freeze Index and Close Index actions so that these actions always requires a Task. The task's id is then propagated from the Freeze action to the Close action, and then to the Verify shard action. This way it is possible to track which Freeze task initiates the closing of an index, and which consecutive verifiy shard are executed for the index closing.

This commit adds a unique id to cluster blocks, so that they can be uniquely identified if needed. This is important for the Close Index API where multiple concurrent closing requests can be executed at the same time. By adding a UUID to the cluster block, we can generate unique "closing block" that can later be verified on shards and then checked again from the cluster state before closing the index. When the verification on shard is done, the closing block is replaced by the regular INDEX_CLOSED_BLOCK instance. If something goes wrong, calling the Open Index API will remove the block. Related to elastic#33888

The test intercepts TransportVerifyShardBeforeCloseAction shard requests, so it needs a minimum of 2 primary shards on 2 different nodes to correctly intercepts requests.

elasticmachine · 2019-01-11T12:16:30Z

Pinging @elastic/es-distributed

…rt to 6.x (#37360) This commit changes the versions in the serialization logic of ClusterBlock after the backport to 6.x of the Close Index API refactoring (#37359).

Before this change, closed indexes were simply not replicated. It was therefore possible to close an index and then decommission a data node without knowing that this data node contained shards of the closed index, potentially leading to data loss. Shards of closed indices were not completely taken into account when balancing the shards within the cluster, or automatically replicated through shard copies, and they were not easily movable from node A to node B using APIs like Cluster Reroute without being fully reopened and closed again. This commit changes the logic executed when closing an index, so that its shards are not just removed and forgotten but are instead reinitialized and reallocated on data nodes using an engine implementation which does not allow searching or indexing, which has a low memory overhead (compared with searchable/indexable opened shards) and which allows shards to be recovered from peer or promoted as primaries when needed. This new closing logic is built on top of the new Close Index API introduced in 6.7.0 (#37359). Some pre-closing sanity checks are executed on the shards before closing them, and closing an index on a 8.0 cluster will reinitialize the index shards and therefore impact the cluster health. Some APIs have been adapted to make them work with closed indices: - Cluster Health API - Cluster Reroute API - Cluster Allocation Explain API - Recovery API - Cat Indices - Cat Shards - Cat Health - Cat Recovery This commit contains all the following changes (most recent first): * c6c42a1 Adapt NoOpEngineTests after #39006 * 3f9993d Wait for shards to be active after closing indices (#38854) * 5e7a428 Adapt the Cluster Health API to closed indices (#39364) * 3e61939 Adapt CloseFollowerIndexIT for replicated closed indices (#38767) * 71f5c34 Recover closed indices after a full cluster restart (#39249) * 4db7fd9 Adapt the Recovery API for closed indices (#38421) * 4fd1bb2 Adapt more tests suites to closed indices (#39186) * 0519016 Add replica to primary promotion test for closed indices (#39110) * b756f6c Test the Cluster Shard Allocation Explain API with closed indices (#38631) * c484c66 Remove index routing table of closed indices in mixed versions clusters (#38955) * 00f1828 Mute CloseFollowerIndexIT.testCloseAndReopenFollowerIndex() * e845b0a Do not schedule Refresh/Translog/GlobalCheckpoint tasks for closed indices (#38329) * cf9a015 Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#38327) * b9becdd Adapt testPendingTasks() for replicated closed indices (#38326) * 02cc730 Allow shards of closed indices to be replicated as regular shards (#38024) * e53a9be Fix compilation error in IndexShardIT after merge with master * cae4155 Relax NoOpEngine constraints (#37413) * 54d110b [RCI] Adapt NoOpEngine to latest FrozenEngine changes * c63fd69 [RCI] Add NoOpEngine for closed indices (#33903) Relates to #33888

Backport support for replicating closed indices (#39499) Before this change, closed indexes were simply not replicated. It was therefore possible to close an index and then decommission a data node without knowing that this data node contained shards of the closed index, potentially leading to data loss. Shards of closed indices were not completely taken into account when balancing the shards within the cluster, or automatically replicated through shard copies, and they were not easily movable from node A to node B using APIs like Cluster Reroute without being fully reopened and closed again. This commit changes the logic executed when closing an index, so that its shards are not just removed and forgotten but are instead reinitialized and reallocated on data nodes using an engine implementation which does not allow searching or indexing, which has a low memory overhead (compared with searchable/indexable opened shards) and which allows shards to be recovered from peer or promoted as primaries when needed. This new closing logic is built on top of the new Close Index API introduced in 6.7.0 (#37359). Some pre-closing sanity checks are executed on the shards before closing them, and closing an index on a 8.0 cluster will reinitialize the index shards and therefore impact the cluster health. Some APIs have been adapted to make them work with closed indices: - Cluster Health API - Cluster Reroute API - Cluster Allocation Explain API - Recovery API - Cat Indices - Cat Shards - Cat Health - Cat Recovery This commit contains all the following changes (most recent first): * c6c42a1 Adapt NoOpEngineTests after #39006 * 3f9993d Wait for shards to be active after closing indices (#38854) * 5e7a428 Adapt the Cluster Health API to closed indices (#39364) * 3e61939 Adapt CloseFollowerIndexIT for replicated closed indices (#38767) * 71f5c34 Recover closed indices after a full cluster restart (#39249) * 4db7fd9 Adapt the Recovery API for closed indices (#38421) * 4fd1bb2 Adapt more tests suites to closed indices (#39186) * 0519016 Add replica to primary promotion test for closed indices (#39110) * b756f6c Test the Cluster Shard Allocation Explain API with closed indices (#38631) * c484c66 Remove index routing table of closed indices in mixed versions clusters (#38955) * 00f1828 Mute CloseFollowerIndexIT.testCloseAndReopenFollowerIndex() * e845b0a Do not schedule Refresh/Translog/GlobalCheckpoint tasks for closed indices (#38329) * cf9a015 Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#38327) * b9becdd Adapt testPendingTasks() for replicated closed indices (#38326) * 02cc730 Allow shards of closed indices to be replicated as regular shards (#38024) * e53a9be Fix compilation error in IndexShardIT after merge with master * cae4155 Relax NoOpEngine constraints (#37413) * 54d110b [RCI] Adapt NoOpEngine to latest FrozenEngine changes * c63fd69 [RCI] Add NoOpEngine for closed indices (#33903) Relates to #33888

tlrx added 9 commits January 11, 2019 10:49

[Close Index API] Adapt MetaDataIndexStateServiceTests after merge

af533cc

[Tests] Reduce randomization in CloseWhileRelocatingShardsIT (elastic…

5b756c3

…#36694)

[Close Index API] Mark unavailable shard copy as stale during verific…

b3ae701

…ation (elastic#36755) This pull request modifies the TransportVerifyShardBeforeCloseAction so that it marks unavailable shards as stale.

[Tests] Fix ReopenWhileClosingIT with correct min num shards

fc41b3a

The test intercepts TransportVerifyShardBeforeCloseAction shard requests, so it needs a minimum of 2 primary shards on 2 different nodes to correctly intercepts requests.

[Tests] Adapt CloseIndexIT tests for 6.x

21b7653

tlrx added :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. >refactoring backport v6.7.0 labels Jan 11, 2019

tlrx requested a review from ywelsch January 11, 2019 12:16

tlrx mentioned this pull request Jan 11, 2019

Update serialization versions for ClusterBlock after backport to 6.x #37360

Merged

tlrx merged commit cedf204 into elastic:6.x Jan 14, 2019

tlrx deleted the backport-close-api-refactoring-to-6.x branch January 14, 2019 12:12

tlrx mentioned this pull request Feb 28, 2019

Add support for replicating closed indices #39499

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backport the Close Index API refactoring to 6.x #37359

Backport the Close Index API refactoring to 6.x #37359

tlrx commented Jan 11, 2019

elasticmachine commented Jan 11, 2019

Backport the Close Index API refactoring to 6.x #37359

Backport the Close Index API refactoring to 6.x #37359

Conversation

tlrx commented Jan 11, 2019

elasticmachine commented Jan 11, 2019