Cluster support #221

Merged: 5 commits from feature/cluster-support into ovn-org:main on Aug 31, 2021

Conversation

@squeed (Contributor) commented on Aug 25, 2021

Fixes #129

We would like to support OVN raft clusters. This means we need (see the sketch after this list):

  • a model for the special _Server database
  • the ability to monitor tables in that database
  • the ability to reject endpoints that are not the leader
  • to drop and reconnect if our endpoint loses leadership
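
For a sense of what this could look like from the caller's side, here is a minimal sketch. The names used (model.NewDBModel, client.NewOVSDBClient, client.WithEndpoint, and in particular the leader-only option client.WithLeaderOnly) are assumptions based on libovsdb's client API and may not match exactly what this PR ends up exposing:

```go
package main

import (
	"context"
	"log"

	"github.com/ovn-org/libovsdb/client"
	"github.com/ovn-org/libovsdb/model"
)

// NBGlobal is a hypothetical, minimal model used only for illustration.
type NBGlobal struct {
	UUID string `ovsdb:"_uuid"`
}

func main() {
	dbModel, err := model.NewDBModel("OVN_Northbound",
		map[string]model.Model{"NB_Global": &NBGlobal{}})
	if err != nil {
		log.Fatal(err)
	}

	ovn, err := client.NewOVSDBClient(dbModel,
		// List every raft member; the client probes them in turn.
		client.WithEndpoint("tcp:10.0.0.1:6641,tcp:10.0.0.2:6641,tcp:10.0.0.3:6641"),
		// Assumed option from this PR: only accept the raft leader, and
		// drop/reconnect if the current endpoint loses leadership.
		client.WithLeaderOnly(true),
	)
	if err != nil {
		log.Fatal(err)
	}

	if err := ovn.Connect(context.Background()); err != nil {
		log.Fatal(err)
	}
	defer ovn.Disconnect()
}
```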

@squeed (Contributor, Author) commented on Aug 25, 2021

@dave-tucker PTAL. I still need to implement the critical bits, but this is a first pass.

Review thread on client/client.go (outdated):

    db.monitorsMutex.Lock()
    defer db.monitorsMutex.Unlock()
    for id, request := range db.monitors {
    // TODO: should err here just be treated as failure to connect?

A Collaborator replied:

I've not encountered a situation where a monitor RPC failed, but it's plausible it could if we were disconnected due to a network hiccup and ovsdb-server hadn't yet closed the monitor session completely; the error would be that there is already a monitor with the same ID. In that case, trying again once the state has been torn down would be the best way to fix it, I think.

tl;dr I think it's OK to treat this error the same way.
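
As a rough sketch of that control flow (the types and helpers below are stand-ins, not the client's actual code): any error from re-establishing a monitor is surfaced as a connection-level failure, so the usual teardown/reconnect path retries once the server has dropped the stale session.

```go
package client

import (
	"context"
	"fmt"
)

// Stand-in types so the sketch compiles on its own; the real client differs.
type monitorRequest struct{}

type database struct {
	monitors map[string]monitorRequest
}

type conn struct{}

// monitorRPC is a placeholder for the real "monitor" RPC call.
func (c *conn) monitorRPC(ctx context.Context, id string, r monitorRequest) error {
	return nil
}

// restoreMonitors re-creates every known monitor after a (re)connect. Any
// failure (for example "a monitor with the same id already exists" while the
// server still holds the old session) is treated like a failure to connect:
// the caller tears the connection down and retries from scratch.
func (c *conn) restoreMonitors(ctx context.Context, db *database) error {
	for id, request := range db.monitors {
		if err := c.monitorRPC(ctx, id, request); err != nil {
			return fmt.Errorf("monitor %q failed, treating as connection failure: %w", id, err)
		}
	}
	return nil
}
```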

@dave-tucker (Collaborator) left a comment:

Looking good so far, @squeed. Just a couple of comments.

4 resolved review threads on client/client.go (outdated)
@squeed changed the title from "[WIP] Cluster support" to "Cluster support" on Aug 25, 2021
@squeed (Contributor, Author) commented on Aug 25, 2021

@dave-tucker thanks for the review; I'll work on the wording changes.

I've implemented the _Server watch, so that should be good to go now.

1 resolved review thread on client/client.go (outdated)

@dave-tucker (Collaborator) left a comment:

You have a typo to fix (s/betwen/between/g), and the server_model package should be renamed to serverdb; then the lint should pass 🤞

3 resolved review threads on client/client.go (2 outdated)

@dave-tucker (Collaborator) left a comment:

Just had another look through. As db.schema gets updated on re-connection, any read access to it needs to hold an RLock, or the race detector gets upset.
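
A minimal sketch of that pattern, with illustrative field names rather than the client's actual ones: every read of the schema goes through an RLock, and only the reconnect path takes the write lock.

```go
package client

import (
	"sync"

	"github.com/ovn-org/libovsdb/ovsdb"
)

// database holds per-database state; schemaMutex guards schema, which is
// replaced wholesale when the client reconnects and re-fetches the schema.
type database struct {
	schemaMutex sync.RWMutex
	schema      *ovsdb.DatabaseSchema
}

// Schema returns the current schema under a read lock so that readers don't
// race with the writer in the reconnect path.
func (db *database) Schema() *ovsdb.DatabaseSchema {
	db.schemaMutex.RLock()
	defer db.schemaMutex.RUnlock()
	return db.schema
}

// setSchema swaps in a freshly fetched schema after (re)connection.
func (db *database) setSchema(s *ovsdb.DatabaseSchema) {
	db.schemaMutex.Lock()
	defer db.schemaMutex.Unlock()
	db.schema = s
}
```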

@dcbw (Contributor) commented on Aug 29, 2021

FWIW, trying to do this downstream in go-ovn via openshift/ovn-kubernetes#695

@squeed force-pushed the feature/cluster-support branch 2 times, most recently from 454041d to 3ad9102 on August 30, 2021 at 11:27

@squeed (Contributor, Author) commented on Aug 30, 2021

FYI, logger support is currently stuck until klog cuts a new release, since it moves from logr v0.4 to v1.0. So that will have to wait.

Otherwise, @dave-tucker, this is ready to go.

Commits

As a part of ovn-org#129, we need to monitor the special "_Server" database so we can disconnect if a server loses leadership.

That means we need (internal) support for multiple databases / schemas / models on the same endpoint.

This implements that, but doesn't expose it to the end-user.

Signed-off-by: Casey Callendrello <cdc@redhat.com>
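
Purely to illustrate the shape this describes (these are not the PR's actual structs), the per-database state can be keyed by database name so the caller's database and _Server share one connection:

```go
package client

import "sync"

// databaseState is an illustrative stand-in for the per-database
// bookkeeping: each database monitored over the shared connection carries
// its own schema, model and set of monitors.
type databaseState struct {
	schemaMutex   sync.RWMutex
	schema        interface{} // stand-in for the parsed database schema
	modelMutex    sync.RWMutex
	model         interface{} // stand-in for the database model
	monitorsMutex sync.Mutex
	monitors      map[string]interface{} // stand-in for active monitor requests
}

// clientSketch shows the idea: one RPC connection, several databases, for
// example "OVN_Northbound" for the caller plus "_Server" for status tracking.
type clientSketch struct {
	databases map[string]*databaseState // keyed by database name
}
```
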
This adds a model for the special _Server database, whose Database table is used by ovsdb-server itself to report internal status.

Signed-off-by: Casey Callendrello <cdc@redhat.com>
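
For illustration only, a model for that table could look roughly like the following. The field set follows the Database table of the _Server schema, but the generated code in ovsdb/serverdb/model.go may use different types and helpers:

```go
package serverdb

// Database is a sketch of a model for the _Server database's Database table;
// ovsdb-server publishes one row per database it hosts.
type Database struct {
	UUID      string  `ovsdb:"_uuid"`
	Name      string  `ovsdb:"name"`      // e.g. "OVN_Northbound"
	Model     string  `ovsdb:"model"`     // "standalone" or "clustered"
	Connected bool    `ovsdb:"connected"` // server is connected to the database storage
	Leader    bool    `ovsdb:"leader"`    // this server is the raft leader (clustered only)
	Schema    *string `ovsdb:"schema"`    // database schema, if available
	Cid       *string `ovsdb:"cid"`       // cluster ID (clustered only)
	Sid       *string `ovsdb:"sid"`       // server ID (clustered only)
	Index     *int    `ovsdb:"index"`     // applied raft log index (clustered only)
}
```
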
This adds an additional check when testing a potential endpoint: if the endpoint reports itself as not the leader, reject it and move on to the next one.

This needs to be explicitly requested via a specific Option.

Note that this does not monitor for leadership changes.

Signed-off-by: Casey Callendrello <cdc@redhat.com>
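
A sketch of the per-endpoint check this describes, assuming the serverdb.Database model sketched above (illustrative only, not the PR's actual implementation):

```go
package client

import "github.com/ovn-org/libovsdb/ovsdb/serverdb"

// endpointIsLeader reports whether an endpoint should be accepted when the
// leader-only option is set: find the target database among the _Server
// Database rows returned by this endpoint and require leadership for
// clustered databases. Standalone databases are always acceptable.
func endpointIsLeader(rows []serverdb.Database, dbName string) bool {
	for _, row := range rows {
		if row.Name != dbName {
			continue
		}
		if row.Model != "clustered" {
			return true
		}
		return row.Leader
	}
	// The database isn't listed by this server: reject the endpoint and let
	// the caller continue probing the remaining endpoints.
	return false
}
```
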
This sets up a monitor on the _Server/Database meta-table, and
disconnects if leadership is lost.

It also adds some useful locking.

Signed-off-by: Casey Callendrello <cdc@redhat.com>
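
Again only as an illustration rather than the client's real handler, reacting to an update of the _Server/Database row could be as simple as:

```go
package client

import "github.com/ovn-org/libovsdb/ovsdb/serverdb"

// disconnector is the only piece of the client this sketch needs.
type disconnector interface {
	Disconnect()
}

// handleServerRowUpdate reacts to _Server/Database updates: if the row for
// our database reports that this server is no longer the raft leader, drop
// the connection so the reconnect logic can go find the new leader.
func handleServerRowUpdate(c disconnector, dbName string, row serverdb.Database) {
	if row.Name == dbName && row.Model == "clustered" && !row.Leader {
		c.Disconnect()
	}
}
```
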
args is an arbitrary JSON cookie, so we need to handle that rather than assuming it's a string.

Signed-off-by: Casey Callendrello <cdc@redhat.com>
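
One way to handle such a value, shown purely as an illustration: keep the cookie as raw JSON and compare it structurally rather than as a string.

```go
package client

import (
	"encoding/json"
	"reflect"
)

// cookieMatches reports whether a cookie received on the wire (an arbitrary
// JSON value) refers to the same value we sent, by normalizing both sides
// through encoding/json and comparing the results.
func cookieMatches(got json.RawMessage, want interface{}) (bool, error) {
	wantJSON, err := json.Marshal(want)
	if err != nil {
		return false, err
	}
	var a, b interface{}
	if err := json.Unmarshal(got, &a); err != nil {
		return false, err
	}
	if err := json.Unmarshal(wantJSON, &b); err != nil {
		return false, err
	}
	return reflect.DeepEqual(a, b), nil
}
```
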
@dave-tucker (Collaborator) left a comment:

LGTM. Thanks @squeed

@dave-tucker merged commit b98d061 into ovn-org:main on Aug 31, 2021

@coveralls commented:

Pull Request Test Coverage Report for Build 1182187084

  • 213 of 314 (67.83%) changed or added relevant lines in 5 files are covered.
  • 7 unchanged lines in 1 file lost coverage.
  • Overall coverage decreased (-0.4%) to 75.269%

Changes missing coverage (covered lines / changed or added lines):
  • ovsdb/serverdb/model.go: 5 / 11 (45.45%)
  • client/client.go: 201 / 296 (67.91%)

Files with coverage reduction:
  • client/client.go: 7 new missed lines (70.65%)

Totals:
  • Change from base Build 1163233557: -0.4%
  • Covered Lines: 3424
  • Relevant Lines: 4549

💛 - Coveralls

Development

Successfully merging this pull request may close these issues:

  • Feature Request: Support connect to an OVN cluster

4 participants