Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update the Flush API documentation #33551

Merged
merged 2 commits into from
Jan 15, 2019
Merged

Update the Flush API documentation #33551

merged 2 commits into from
Jan 15, 2019

Conversation

bleskes
Copy link
Contributor

@bleskes bleskes commented Sep 9, 2018

The semantics of the API changed considerably since the documentation was written.

The main change is to remove references to memory reduction (this is related to refresh).
Instead, flush refers to recovery times. I also removed the references to trimming the translog
as the translog may be required for other purposes (operation history for ops based recovery
and complement ongoing file based recoveries).

Closes #32869

The semantics of the API changed considerably since the documentation was written.

The main change is to remove references to memory reduction (this is related to refresh).
Instead, flush refers to recovery times. I also removed the references to trimming the translog
as the translog may be required for other purposes (operation history for ops based recovery
and complement ongoing file based recoveries).

Closes #32869
@bleskes bleskes added >docs General docs changes :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. labels Sep 9, 2018
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-distributed

Copy link
Member

@jasontedor jasontedor left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left some comments.

default, Elasticsearch uses memory heuristics in order to automatically
trigger flush operations as required in order to clear memory.
flush process of an index makes sure that any data that is currently only
persisted in the <<index-modules-translog,transaction log>> is also perminantly
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

“perminantly” -> “permanently”

persisted in the <<index-modules-translog,transaction log>> is also perminantly
persisted in Lucene. This reduces recovery times as that data doesn't need to be
reindexed from the transaction logs after the Lucene indexed is opened. By
default, Elasticsearch use heuristics in order to automatically
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

“use” -> “uses”

persisted in Lucene. This reduces recovery times as that data doesn't need to be
reindexed from the transaction logs after the Lucene indexed is opened. By
default, Elasticsearch use heuristics in order to automatically
trigger flushes as required and is rare for users to call the API directly.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let’s end the sentence after “required”, drop “and”, add “It” before “is” and add “need to” between “to” and “call”. “It is rare for users to need to call the API directly.”

@ywelsch
Copy link
Contributor

ywelsch commented Jan 11, 2019

@bleskes are you still interested in merging this?

@bleskes
Copy link
Contributor Author

bleskes commented Jan 11, 2019

@ywelsch thanks for the ping. It's on the list but keeps on being pushed down in terms of priority. I'll pick it up.

Copy link
Member

@jasontedor jasontedor left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@bleskes bleskes merged commit 9553bd1 into 6.4 Jan 15, 2019
@bleskes bleskes deleted the bleskes-patch-1 branch January 15, 2019 10:43
bleskes added a commit that referenced this pull request Jan 15, 2019
The semantics of the API changed considerably since the documentation was written.

The main change is to remove references to memory reduction (this is related to refresh).
Instead, flush refers to recovery times. I also removed the references to trimming the translog
as the translog may be required for other purposes (operation history for ops based recovery
and complement ongoing file based recoveries).

Closes #32869
bleskes added a commit that referenced this pull request Jan 15, 2019
The semantics of the API changed considerably since the documentation was written.

The main change is to remove references to memory reduction (this is related to refresh).
Instead, flush refers to recovery times. I also removed the references to trimming the translog
as the translog may be required for other purposes (operation history for ops based recovery
and complement ongoing file based recoveries).

Closes #32869
bleskes added a commit that referenced this pull request Jan 15, 2019
The semantics of the API changed considerably since the documentation was written.

The main change is to remove references to memory reduction (this is related to refresh).
Instead, flush refers to recovery times. I also removed the references to trimming the translog
as the translog may be required for other purposes (operation history for ops based recovery
and complement ongoing file based recoveries).

Closes #32869
jasontedor added a commit to jasontedor/elasticsearch that referenced this pull request Jan 15, 2019
* elastic/master:
  Docs be explicit on how to turn off deprecated auditing (elastic#37316)
  Fix line length for monitor and remove suppressions (elastic#37456)
  Fix IndexShardTestCase.recoverReplica(IndexShard, IndexShard, boolean) (elastic#37414)
  Update the Flush API documentation (elastic#33551)
  [TEST] Muted testDifferentRolesMaintainPathOnRestart
  Remove dead code from ShardSearchStats (elastic#37421)
  Simplify testSendSnapshotSendsOps (elastic#37445)
  SQL: Fix issue with field names containing "." (elastic#37364)
  Restore lost @Inject annotation (elastic#37452)
bleskes added a commit that referenced this pull request Jan 23, 2019
The semantics of the API changed considerably since the documentation was written.

The main change is to remove references to memory reduction (this is related to refresh).
Instead, flush refers to recovery times. I also removed the references to trimming the translog
as the translog may be required for other purposes (operation history for ops based recovery
and complement ongoing file based recoveries).

Closes #32869
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. >docs General docs changes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants