Updating the api chunking proposal to add remainingItemCount #3710

caesarxuchao · 2019-05-14T02:46:45Z

/sig api-machinery

Retrospectively updating the proposal for api-chunking for the remainingItemCount API change introduced in kubernetes/kubernetes#75993.

/assign @smarterclayton
cc @lavalamp

smarterclayton

Adding some detail in use cases of what this is for, how exact it has to be, and why we should add it would be my request.

smarterclayton · 2019-05-14T02:51:58Z

contributors/design-proposals/api-machinery/api-chunking.md

@@ -78,6 +80,7 @@ The server **may** limit the amount of time a continue token is valid for. Clien

 The server **must** support `continue` tokens that are valid across multiple API servers. The server **must** support a mechanism for rolling restart such that continue tokens are valid after one or all API servers have been restarted.

+The `remainingItemCount` returned by the server is the number of subsequent items in the list which are not included in this list response. If the list request contained label or field selectors, then the number of remaining items is unknown and this field will be unset and omitted during serialization. If the list is complete (either because the list request is not a chunking one or because this is the last chunk), then there are no more remaining items and this field is unset and is omitted during serialization. Servers older than v1.15 omit this field in their response to any list request.


Is it required to be exact? What’s the use case?

What happens when I’m halfway through a list?

smarterclayton · 2019-05-14T02:52:17Z

contributors/design-proposals/api-machinery/api-chunking.md

@@ -91,6 +94,8 @@ Implementations that cannot offer consistent ranging (returning a set of results

 For etcd3 the continue token would contain a resource version (the snapshot that we are reading that is consistent across the entire LIST) and the start key for the next set of results. Upon receiving a valid continue token the apiserver would instruct etcd3 to retrieve the set of results at a given resource version, beginning at the provided start key, limited by the maximum number of requests provided by the continue token (or optionally, by a different limit specified by the client). If more results remain after reading up to the limit, the storage should calculate a continue token that would begin at the next possible key, and the continue token set on the returned list.

+etcd3 returns the total number of keys within the range as `response.Count` if the read is a range read. The storage checks if the list request contained label or field selector. If not, the storage calculates the `remainingItemCount` as `response.Count-limit`.


If so, what happens?

k8s-ci-robot · 2019-05-14T02:53:50Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: caesarxuchao
To fully approve this pull request, please assign additional approvers.
We suggest the following additional approver: smarterclayton

If they are not already assigned, you can assign the PR to them by writing /assign @smarterclayton in a comment when ready.

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

contributors/design-proposals/api-machinery/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

caesarxuchao · 2019-05-14T06:02:43Z

@smarterclayton I added use cases and why we needed the field. PTAL.

The remainingItemCount is exact. The doc never suggests that the count is an approximation, so I think readers will get it.

smarterclayton · 2019-05-14T14:45:04Z

The remainingItemCount is exact. The doc never suggests that the count is an approximation, so I think readers will get it.

That's actually my concern, that we enshrine an exact count as part of the API. I would generally want to be as vague as possible to leave open the door for either relaxed semantics (performance) or alternate backends (proxies that can't necessarily know).

smarterclayton · 2019-05-14T14:45:51Z

Also add what feature flag gate this will be under and what release it will be promoted (1.16 is ok for promotion to GA if we get good data in 1.15).

smarterclayton · 2019-05-14T14:53:55Z

contributors/design-proposals/api-machinery/api-chunking.md

+
+For example, assuming there are 1450 pods in the cluster when the client sends a chunking list request for pods, with `limit` set to 500. The first chunked list response contains 500 pods, a `continue` token, and with `metadata.remainingItemCount` set to 950. If the client use the `continue` token to continue listing, the server returns the second chunked list response containing the next 500 pods, another `continue` token, and with `metadata.remainingItemCount` set to 450. Note that the `remainingItemCount` is calculated based on the consistent list taken at the first chunking list request, that is, no matter if pods are created or deleted between the first and the second chunking list requests, the `metadata.remainingItemCount` in the second list response is always set to 450.
+
+The `remainingItemCount` offers a simple and efficient way to get the count of objects of a resource type. For example, to get the total count of all pods in the cluster, simply sends `GET /api/v1/pods?limit=1` to the apiserver. Similarly, one can get the total count of all pods in a namespace. Note that the `remainingItemCount` is set to 0 when the list request contains any label or field selector, so this feature cannot be used to count the number of objects matching specific label or field selectors. Without the `remainingItemCount` feature, one needs to list all pods to get a count.


Since old servers do not support this, and I don't think we can require this value in all possible future implementations, you need to specifically describe to a client how they can tell when the server doesn't support this field, and instruct them how to correctly detect it (as other sections do).

Effectively the algorithm is:

Make a limit request

Observe whether continue is set (if not, you have only one or zero items)

Observe whether remainingItemCount is set (if not, remainingItemCount isn't supported)

smarterclayton · 2019-05-14T14:55:28Z

Hrm, I didn't see a description of the use cases in the form I would expect. I.e. "we want to let people estimate the size of the collection", "we want to make this part of a UI" (I hope it's the former rather than the latter, because we discouraged use of chunking for human UI viewing).

lavalamp · 2019-05-14T15:58:48Z

Yeah, it's the former, specifically we want to estimate when the storage migrator will finish a migration. *From: *Clayton Coleman <notifications@github.com> *Date: *Tue, May 14, 2019 at 7:56 AM *To: *kubernetes/community *Cc: *Daniel Smith, Mention Hrm, I didn't see a description of the use cases in the form I would

…

expect. I.e. "we want to let people estimate the size of the collection", "we want to make this part of a UI" (I hope it's the former rather than the latter, because we discouraged use of chunking for human UI viewing). — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#3710?email_source=notifications&email_token=AAE6BFTIC6WM77AAYMEV7DTPVLHIDA5CNFSM4HMU6A3KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODVLX5LA#issuecomment-492273324>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAE6BFWPBD4GPFOEPSJIHALPVLHIDANCNFSM4HMU6A3A> .

caesarxuchao · 2019-05-14T17:21:03Z

RE. concerns over the "exact count", I added "The intended use of the remainingItemCount is estimating the size of a collection. Clients should not rely on the remainingItemCount to be set or to be exact."

RE. detecting if remainingItemCount is supported, I added instructions.

RE. feature flag, I'll wait for you and Daniel to converge. See Daniel's comment at kubernetes/kubernetes#75993 (comment). Starting the field as beta sounds more useful to me.

RE. use cases, perhaps I didn't understand your request, let me know if the current wording is still too vague.

caesarxuchao · 2019-05-15T17:55:54Z

Added feature flag. @smarterclayton PTAL. Thank you.

fejta-bot · 2019-08-13T18:13:26Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

fejta-bot · 2019-09-12T19:11:58Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

fejta-bot · 2019-10-12T19:57:02Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

k8s-ci-robot · 2019-10-12T19:57:10Z

@fejta-bot: Closed this PR.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-ci-robot assigned smarterclayton May 14, 2019

k8s-ci-robot added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels May 14, 2019

k8s-ci-robot requested review from deads2k and lavalamp May 14, 2019 02:46

k8s-ci-robot added size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. kind/design Categorizes issue or PR as related to design. labels May 14, 2019

caesarxuchao mentioned this pull request May 14, 2019

Adding RemainingItemCount to ListMeta kubernetes/kubernetes#75993

Merged

smarterclayton suggested changes May 14, 2019

View reviewed changes

Updating the api chunking proposal to add remainingItemCount

58aa279

caesarxuchao force-pushed the remaining-item-count branch from 5716c31 to 58aa279 Compare May 14, 2019 06:00

k8s-ci-robot added size/S Denotes a PR that changes 10-29 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels May 14, 2019

smarterclayton reviewed May 14, 2019

View reviewed changes

Chao Xu added 2 commits May 14, 2019 10:21

addressing comments

5e426ff

alpha feature and feature flag

adfa4fe

caesarxuchao mentioned this pull request May 31, 2019

Protecting remainingItemCount behind a feature flag. Also updating the API doc kubernetes/kubernetes#78553

Merged

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 13, 2019

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Sep 12, 2019

k8s-ci-robot closed this Oct 12, 2019

serathius mentioned this pull request Jun 2, 2021

mvcc: push down RangeOptions.limit argv into index tree to reduce memory overhead etcd-io/etcd#11990

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updating the api chunking proposal to add remainingItemCount #3710

Updating the api chunking proposal to add remainingItemCount #3710

caesarxuchao commented May 14, 2019

smarterclayton left a comment

smarterclayton May 14, 2019

smarterclayton May 14, 2019

smarterclayton May 14, 2019

k8s-ci-robot commented May 14, 2019

caesarxuchao commented May 14, 2019

smarterclayton commented May 14, 2019

smarterclayton commented May 14, 2019

smarterclayton May 14, 2019

smarterclayton commented May 14, 2019

lavalamp commented May 14, 2019 via email

caesarxuchao commented May 14, 2019

caesarxuchao commented May 15, 2019

fejta-bot commented Aug 13, 2019

fejta-bot commented Sep 12, 2019

fejta-bot commented Oct 12, 2019

k8s-ci-robot commented Oct 12, 2019

		@@ -78,6 +80,7 @@ The server may limit the amount of time a continue token is valid for. Clien

		The server must support `continue` tokens that are valid across multiple API servers. The server must support a mechanism for rolling restart such that continue tokens are valid after one or all API servers have been restarted.

		The `remainingItemCount` returned by the server is the number of subsequent items in the list which are not included in this list response. If the list request contained label or field selectors, then the number of remaining items is unknown and this field will be unset and omitted during serialization. If the list is complete (either because the list request is not a chunking one or because this is the last chunk), then there are no more remaining items and this field is unset and is omitted during serialization. Servers older than v1.15 omit this field in their response to any list request.

		@@ -91,6 +94,8 @@ Implementations that cannot offer consistent ranging (returning a set of results

		For etcd3 the continue token would contain a resource version (the snapshot that we are reading that is consistent across the entire LIST) and the start key for the next set of results. Upon receiving a valid continue token the apiserver would instruct etcd3 to retrieve the set of results at a given resource version, beginning at the provided start key, limited by the maximum number of requests provided by the continue token (or optionally, by a different limit specified by the client). If more results remain after reading up to the limit, the storage should calculate a continue token that would begin at the next possible key, and the continue token set on the returned list.

		etcd3 returns the total number of keys within the range as `response.Count` if the read is a range read. The storage checks if the list request contained label or field selector. If not, the storage calculates the `remainingItemCount` as `response.Count-limit`.


		For example, assuming there are 1450 pods in the cluster when the client sends a chunking list request for pods, with `limit` set to 500. The first chunked list response contains 500 pods, a `continue` token, and with `metadata.remainingItemCount` set to 950. If the client use the `continue` token to continue listing, the server returns the second chunked list response containing the next 500 pods, another `continue` token, and with `metadata.remainingItemCount` set to 450. Note that the `remainingItemCount` is calculated based on the consistent list taken at the first chunking list request, that is, no matter if pods are created or deleted between the first and the second chunking list requests, the `metadata.remainingItemCount` in the second list response is always set to 450.

		The `remainingItemCount` offers a simple and efficient way to get the count of objects of a resource type. For example, to get the total count of all pods in the cluster, simply sends `GET /api/v1/pods?limit=1` to the apiserver. Similarly, one can get the total count of all pods in a namespace. Note that the `remainingItemCount` is set to 0 when the list request contains any label or field selector, so this feature cannot be used to count the number of objects matching specific label or field selectors. Without the `remainingItemCount` feature, one needs to list all pods to get a count.

Updating the api chunking proposal to add remainingItemCount #3710

Updating the api chunking proposal to add remainingItemCount #3710

Conversation

caesarxuchao commented May 14, 2019

smarterclayton left a comment

Choose a reason for hiding this comment

smarterclayton May 14, 2019

Choose a reason for hiding this comment

smarterclayton May 14, 2019

Choose a reason for hiding this comment

smarterclayton May 14, 2019

Choose a reason for hiding this comment

k8s-ci-robot commented May 14, 2019

caesarxuchao commented May 14, 2019

smarterclayton commented May 14, 2019

smarterclayton commented May 14, 2019

smarterclayton May 14, 2019

Choose a reason for hiding this comment

smarterclayton commented May 14, 2019

lavalamp commented May 14, 2019 via email

caesarxuchao commented May 14, 2019

caesarxuchao commented May 15, 2019

fejta-bot commented Aug 13, 2019

fejta-bot commented Sep 12, 2019

fejta-bot commented Oct 12, 2019

k8s-ci-robot commented Oct 12, 2019