Posting List should return error on read #4958

manishrjain · 2020-03-18T02:23:33Z

What version of Dgraph are you using?

master

Have you tried reproducing the issue with the latest release?

Yes

What is the hardware spec (RAM, OS)?

Thinkpad Linux

Steps to reproduce the issue (command/config used to run Dgraph).

Go into mrjn/incremental branch.
From compose dir, $ go build . && ./compose -a1 -z1 && ./run.sh
Run dgraph increment --alpha localhost:9180 --num=1000
Pick a log about Rolled up Key

alpha1    | I0318 02:13:30.022581      15 mvcc.go:87] Rolled up Key: 00000b636f756e7465722e76616c000000000000000001. Version: 288. Meta: 08

In this case, the key got rolled up at version 288. Any reads below version 288 should return error.

Expected behaviour and actual result.

$ curl "localhost:8180/query?startTs=287" -XPOST -H 'Content-Type: application/graphql+-' -d '{me(func: has(counter.val)) { uid, counter.val }}'

# As per code:
	if readTs < l.minTs {
		return errors.Errorf("readTs: %d less than minTs: %d for key: %q", readTs, l.minTs, l.key)
	}

But, this query does not return an error. Instead, it does not return counter.val. This is a bug.

Running the same query with startTs=288 does return a valid value.

The text was updated successfully, but these errors were encountered:

martinmr · 2020-03-18T18:44:56Z

I ran the debug command with the history flag. This is what I get:

ts: 328 {item}{discard}{complete}
 Uid: 18446744073709551615 Op: 1  Type: INT.  String Value: "162"
 Num uids = 1. Size = 22
 Uid = 18446744073709551615

ts: 326 {item}{delta}
 Uid: 18446744073709551615 Op: 1  Type: INT.  String Value: "161"

ts: 324 {item}{delta}
 Uid: 18446744073709551615 Op: 1  Type: INT.  String Value: "160"

ts: 322 {item}{delta}
 Uid: 18446744073709551615 Op: 1  Type: INT.  String Value: "159"

ts: 320 {item}{delta}
 Uid: 18446744073709551615 Op: 1  Type: INT.  String Value: "158"
...

// continues until the first delta entry setting the count to 1

In my case 328 is the timestamp at which the rollup happen. As you can see there's a discard marker so the earlier versions are ignored.

I think there are two options here:

Do not set the discard marker. This will allow reads below the rollup ts to keep working and it does not affect the reads above because ReadPostingList stops when it sees the first complete posting list. I am not sure how this marker is set or if this is a badger option.
Return an error when reading below the rollup timestamp. There are several reasons why I don't think we should do this.
- Dgraph supports passing a read timestamp to queries so we should honor this whenever possible. The data is there but it's inaccessible because of the discard marker. It's not like we are saving storage right now since all the data is in there anyways.
- There is no efficient way to know if we are below the latest rolled up list. When querying with a timestamp below it, we would need to scan the whole list from the latest version until we find the complete posting list and compare that with the requested timestamp. I don't think we are keeping track of the rollup timestamps in a way that would let us do this efficiently.

@manishrjain any thoughts?

martinmr · 2020-03-18T19:23:47Z

Found the place where the discard is set. I'll try changing the code and seeing what happens with the query that's failing right now.

// SetAt writes a key-value pair at the given timestamp.
func (w *TxnWriter) SetAt(key, val []byte, meta byte, ts uint64) error {
	return w.update(ts, func(txn *badger.Txn) error {
		switch meta {
		case BitCompletePosting, BitEmptyPosting:
			err := txn.SetEntry((&badger.Entry{
				Key:      key,
				Value:    val,
				UserMeta: meta,
			}).WithDiscard())
			if err != nil {
				return err
			}
		default:
			err := txn.SetEntry(&badger.Entry{
				Key:      key,
				Value:    val,
				UserMeta: meta,
			})
			if err != nil {
				return err
			}
		}
		return nil
	})
}

Also, I see that we will save space by only saving one version of the complete list. We are still storing the deltas, though. Do those entries ever get cleaned up?

manishrjain · 2020-03-18T19:28:56Z

As per the original post, we do return an error from posting list when the read is lower than the discard bit. See this:

# As per code:
	if readTs < l.minTs {
		return errors.Errorf("readTs: %d less than minTs: %d for key: %q", readTs, l.minTs, l.key)
	}

The condition in this case was causing most of the errors to be ignored. Verified that reading below the minTs throws an error now. Fixes #4958

manishrjain · 2020-03-18T19:58:40Z

Looks like this is the commit which introduced the issue: 78026df7f1

The error was being ignored and an empty response was being written because the condition in a case statement didn't exclude errors not equal to nil or ErrNoValue. This caused reads below the rollup Ts to succeed with an empty response when they should throw an error. The bug was triggered by running Jepsen tests with incremental rollups enabled. Fixes #4958

manishrjain assigned martinmr Mar 18, 2020

manishrjain added the kind/bug Something is broken. label Mar 18, 2020

martinmr added a commit that referenced this issue Mar 18, 2020

Throw errors returned by retrieveValuesAndFacets

73172d5

The condition in this case was causing most of the errors to be ignored. Verified that reading below the minTs throws an error now. Fixes #4958

martinmr added priority/P0 Critical issue that requires immediate attention. status/accepted We accept to investigate/work on it. labels Mar 18, 2020

martinmr mentioned this issue Mar 18, 2020

Throw errors returned by retrieveValuesAndFacets #4966

Merged

martinmr closed this as completed in #4966 Mar 18, 2020

martinmr mentioned this issue Mar 18, 2020

Cherry-pick "Throw errors returned by retrieveValuesAndFacets" #4968

Merged

martinmr mentioned this issue Mar 18, 2020

Cherry-pick "Throw errors returned by retrieveValuesAndFacets" #4970

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Posting List should return error on read #4958

Posting List should return error on read #4958

manishrjain commented Mar 18, 2020 •

edited

Loading

martinmr commented Mar 18, 2020

martinmr commented Mar 18, 2020

manishrjain commented Mar 18, 2020

manishrjain commented Mar 18, 2020

Posting List should return error on read #4958

Posting List should return error on read #4958

Comments

manishrjain commented Mar 18, 2020 • edited Loading

What version of Dgraph are you using?

Have you tried reproducing the issue with the latest release?

What is the hardware spec (RAM, OS)?

Steps to reproduce the issue (command/config used to run Dgraph).

Expected behaviour and actual result.

martinmr commented Mar 18, 2020

martinmr commented Mar 18, 2020

manishrjain commented Mar 18, 2020

manishrjain commented Mar 18, 2020

manishrjain commented Mar 18, 2020 •

edited

Loading