Revert 14548 #15541

mitake · 2023-03-21T13:06:30Z

Please read https://github.com/etcd-io/etcd/blob/main/CONTRIBUTING.md#contribution-flow.

#14322 was backported to release-3.4 in #14548, but it should be reverted too.

…f-#12271-upstream-release-3.4 Automated cherry pick of etcd-io#12271 on release 3.4

To fix a panic that happens when trying to get ids of etcd members in force new cluster mode, the issue happen if the cluster previously had etcd learner nodes added to the cluster Fixes etcd-io#12285

[Backport 3.4] etcdserver: add ConfChangeAddLearnerNode to the list of config changes

fixes: etcd-io#11954

…#12264-upstream-release-3.4 Automated cherry pick of etcd-io#12264

To prevent arbitrary command invocations. Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

Use golang.org/x/sys/unix for F_OFD_* constants. This fixes the issue that F_OFD_GETLK was defined incorrectly, resulting in bugs such as moby/moby#31182 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

[3.4 backport] pkg/fileutil: fix F_OFD_ constants

Signed-off-by: Sam Batschelet <sbatsche@redhat.com>

vendor: bump gorilla/websocket

This fixes etcd being unable to send any message longer than 64 KB as a notification over the websocket. This was because the older version of grpc-websocket-proxy was used and WithMaxRespBodyBufferSize option wasn't set.

etcdserver: Fix 64 KB websocket notification message limit

…de health check in debug level ref. etcd-io#12677 ref. etcd-io@0b9cfa8

[Backport-3.4] etcdserver/api/etcdhttp: log successful etcd server side health check in debug level

Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

Signed-off-by: Sam Batschelet <sbatsche@redhat.com>

Manual cherry pick of etcd-io#12448 on release 3.4

There are situations where we don't wish to fsync but we do want to write the data. Typically this occurs in clusters where fsync latency (often the result of firmware) transiently spikes. For Kubernetes clusters this causes (many) elections which have knock-on effects such that the API server will transiently fail causing other components fail in turn. By writing the data (buffered and asynchronously flushed, so in most situations the write is fast) and avoiding the fsync we no longer trigger this situation and opportunistically write out the data. Anecdotally: Because the fsync is missing there is the argument that certain types of failure events will cause data corruption or loss, in testing this wasn't seen. If this was to occur the expectation is the member can be readded to a cluster or worst-case restored from a robust persisted snapshot. The etcd members are deployed across isolated racks with different power feeds. An instantaneous failure of all of them simultaneously is unlikely. Testing was usually of the form: * create (Kubernetes) etcd write-churn by creating replicasets of some 1000s of pods * break/fail the leader Failure testing included: * hard node power-off events * disk removal * orderly reboots/shutdown In all cases when the node recovered it was able to rejoin the cluster and synchronize.

When using --unsafe-no-fsync still write out the data

The integration jobs fail with timeouts slightly over 3s, increase this marginally so false failures are less prevalent.

integration: relax leader timeout from 3s to 4s

…tion etcdserver: Fix PeerURL validation

Manual cherry-pick of 9571325 for release-3.4.

etcdserver: fix incorrect metrics generated when clients cancel watches

As go 1.12.2 is what is tested in CI as well as recommended to be built with 1.12.2 we should also pin to this in the go directive version.

It is kind of backport from etcd-io#14124. Signed-off-by: Wei Fu <fuweid89@gmail.com>

[3.4] mvcc: push down RangeOptions.limit argv into index tree to reduce memory overhead

Signed-off-by: Iavael <905853+iavael@users.noreply.github.com>

This formats ipv6 addresses to ensure they can be compared safely Signed-off-by: kidsan <8798449+Kidsan@users.noreply.github.com>

(cherry picked from commit 9c82e8c) Signed-off-by: Wei Fu <fuweid89@gmail.com>

Signed-off-by: Benjamin Wang <wachao@vmware.com>

…30209 [3.4] etctserver: add failpoints walBeforeSync and walAfterSync

Historic capnslog timestamps are in microsecond resolution. We need to match that when we migrate to the zap logger. Signed-off-by: James Blair <mail@jamesblair.net>

Signed-off-by: Benjamin Wang <wachao@vmware.com>

Signed-off-by: James Blair <mail@jamesblair.net>

[3.4] Backport bump to go 1.19.6 and golang.org/x/net to v0.7.0

Mitigates CVE-2023-24532. Signed-off-by: James Blair <mail@jamesblair.net>

[3.4] Backport update to latest go 1.19.7 release

Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

… when sharing the same connection Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

…h starvation Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

…r-3.4 Watch random scheduler 3.4

Signed-off-by: Benjamin Wang <wachao@vmware.com>

[3.4] cleanup the go.mod & go.sum files

Problem: during restore in watchableStore.Restore, synced watchers are moved to unsynced. minRev will be behind since it's not updated when watcher stays synced. Solution: update minRev fixes: etcd-io#15271 Signed-off-by: Bogdan Kanivets <bkanivets@apple.com> Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

[v3.4] Fix issue15271

This reverts commit 0c6e466.

jpbetz and others added 30 commits September 10, 2020 11:07

Merge pull request etcd-io#12280 from jingyih/automated-cherry-pick-o…

dd1b699

…f-#12271-upstream-release-3.4 Automated cherry pick of etcd-io#12271 on release 3.4

etcdserver: add ConfChangeAddLearnerNode to the list of config changes

3019246

To fix a panic that happens when trying to get ids of etcd members in force new cluster mode, the issue happen if the cluster previously had etcd learner nodes added to the cluster Fixes etcd-io#12285

Merge pull request etcd-io#12299 from galal-hussein/fix_panic_34

7e2d426

[Backport 3.4] etcdserver: add ConfChangeAddLearnerNode to the list of config changes

clientv3: get AuthToken automatically when clientConn is ready.

40b7107

fixes: etcd-io#11954

Merge pull request etcd-io#12356 from cfc4n/automated-cherry-pick-of-…

eb0fb0e

…#12264-upstream-release-3.4 Automated cherry pick of etcd-io#12264

tools/etcd-dump-metrics: validate exec cmd args

e3b29b6

To prevent arbitrary command invocations. Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

pkg/netutil: remove unused "iptables" wrapper

a4b43b3

Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

version: 3.4.14

8a03d2e

Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

pkg/fileutil: fix F_OFD_ constants

bea35fd

Use golang.org/x/sys/unix for F_OFD_* constants. This fixes the issue that F_OFD_GETLK was defined incorrectly, resulting in bugs such as moby/moby#31182 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

Merge pull request etcd-io#12551 from kolyshkin/3.4-fix-lock

0880605

[3.4 backport] pkg/fileutil: fix F_OFD_ constants

vendor: bump gorilla/websocket

becc228

Signed-off-by: Sam Batschelet <sbatsche@redhat.com>

Merge pull request etcd-io#12645 from hexfusion/bump-dep

d51c6c6

vendor: bump gorilla/websocket

etcdserver: Fix 64 KB websocket notification message limit

a40f14d

This fixes etcd being unable to send any message longer than 64 KB as a notification over the websocket. This was because the older version of grpc-websocket-proxy was used and WithMaxRespBodyBufferSize option wasn't set.

Merge pull request etcd-io#12402 from vitalif/release-3.4

a1c5f59

etcdserver: Fix 64 KB websocket notification message limit

[Backport-3.4] etcdserver/api/etcdhttp: log successful etcd server si…

f27ef4d

…de health check in debug level ref. etcd-io#12677 ref. etcd-io@0b9cfa8

Merge pull request etcd-io#12679 from chaochn47/backport_3.4_#12677

3be9460

[Backport-3.4] etcdserver/api/etcdhttp: log successful etcd server side health check in debug level

version: 3.4.15

aa71268

Signed-off-by: Gyuho Lee <leegyuho@amazon.com>

server: Added config parameter experimental-warning-apply-duration

9aeabe4

Signed-off-by: Sam Batschelet <sbatsche@redhat.com>

Merge pull request etcd-io#12740 from hexfusion/cp-12448--release-3.4

afd6d8a

Manual cherry pick of etcd-io#12448 on release 3.4

Merge pull request etcd-io#12751 from cwedgwood/nofsyncdowrite

2702f9e

When using --unsafe-no-fsync still write out the data

integration: relax leader timeout from 3s to 4s

c499d9b

The integration jobs fail with timeouts slightly over 3s, increase this marginally so false failures are less prevalent.

Merge pull request etcd-io#12816 from cwedgwood/3.4-relax-gate-timeout

16fe9a8

integration: relax leader timeout from 3s to 4s

Merge pull request etcd-io#12815 from dbavatar/release-3.4-peervalida…

30799c9

…tion etcdserver: Fix PeerURL validation

etcdserver: fix incorrect metrics generated when clients cancel watches

656dc63

Manual cherry-pick of 9571325 for release-3.4.

Merge pull request etcd-io#12803 from cwedgwood/metrics-3.4

82eae92

etcdserver: fix incorrect metrics generated when clients cancel watches

go.mod: Pin go to 1.12 version

ef415e3

As go 1.12.2 is what is tested in CI as well as recommended to be built with 1.12.2 we should also pin to this in the go directive version.

go.sum, go.mod: Run go mod tidy with go 1.12

8557cb2

vendor: Run go mod vendor

b19eb0f

pkpkg/testutil/leak.go: Allowlist created by testing.runTests.func1

91bed2e

fuweid and others added 28 commits January 18, 2023 10:18

mvcc: update ut for Revisions/CountRevisions

931cf9a

It is kind of backport from etcd-io#14124. Signed-off-by: Wei Fu <fuweid89@gmail.com>

Merge pull request etcd-io#15137 from fuweid/backport-11990-to-3.4

e4b1542

[3.4] mvcc: push down RangeOptions.limit argv into index tree to reduce memory overhead

docker: remove nsswitch.conf

d2fc8db

Signed-off-by: Iavael <905853+iavael@users.noreply.github.com>

netutil: consistently format ipv6 addresses

c5347cb

This formats ipv6 addresses to ensure they can be compared safely Signed-off-by: kidsan <8798449+Kidsan@users.noreply.github.com>

server: set multiple concurrentReadTx instances share one txReadBuffer.

2f81586

(cherry picked from commit 9c82e8c) Signed-off-by: Wei Fu <fuweid89@gmail.com>

bump bbolt to v1.3.7 for release-3.4

b4e3ed7

Signed-off-by: Benjamin Wang <wachao@vmware.com>

etctserver: add failpoints walBeforeSync and walAfterSync

109873d

Signed-off-by: Benjamin Wang <wachao@vmware.com>

Merge pull request etcd-io#15265 from ahrtr/3.4_walSync_failpoint_202…

fb7a897

…30209 [3.4] etctserver: add failpoints walBeforeSync and walAfterSync

Fix regression in timestamp resolution

d32dceb

Historic capnslog timestamps are in microsecond resolution. We need to match that when we migrate to the zap logger. Signed-off-by: James Blair <mail@jamesblair.net>

clientv3: correct the nextRev on receving progress notification response

ed529ab

Signed-off-by: Benjamin Wang <wachao@vmware.com>

test: enhance the test case TestV3WatchProgressOnMemberRestart

9c81b86

Signed-off-by: Benjamin Wang <wachao@vmware.com>

bump version to 3.4.24

6d1bfe4

Signed-off-by: Benjamin Wang <wachao@vmware.com>

Bump to go 1.19.6

9570978

Signed-off-by: James Blair <mail@jamesblair.net>

Bump golang.org/x/net to v0.7.0 to address CVE GO-2023-1571.

7318f5d

Signed-off-by: James Blair <mail@jamesblair.net>

Formatted source code for go 1.19.6.

a91bacf

Signed-off-by: James Blair <mail@jamesblair.net>

Merge pull request etcd-io#15333 from jmhbnz/release-3.4

20eee55

[3.4] Backport bump to go 1.19.6 and golang.org/x/net to v0.7.0

Updated go to 1.19.7.

51ea1c0

Mitigates CVE-2023-24532. Signed-off-by: James Blair <mail@jamesblair.net>

Merge pull request etcd-io#15429 from jmhbnz/release-3.4-backport

4cdb91d

[3.4] Backport update to latest go 1.19.7 release

tests: Allow configuring progress notify interval in e2e tests

6025355

Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

test: Test etcd watch stream starvation under high read response load…

e818b5f

… when sharing the same connection Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

server: Switch back to random scheduler to improve resilience to watc…

60e381a

…h starvation Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

Merge pull request etcd-io#15478 from serathius/watch-random-schedule…

08a42e6

…r-3.4 Watch random scheduler 3.4

cleanup the go.mod & go.sum files

7c6b088

Signed-off-by: Benjamin Wang <wachao@vmware.com>

Merge pull request etcd-io#15482 from ahrtr/3.4_gomod_cleanup_20230315

2eabc0b

[3.4] cleanup the go.mod & go.sum files

server: Test watch restore

29ecfc0

Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>

Merge pull request etcd-io#15520 from serathius/fix-issue15271-3.4

46ae7eb

[v3.4] Fix issue15271

Revert "*: handle auth invalid token and old revision errors in watch"

614dcca

This reverts commit 0c6e466.

mitake closed this Mar 21, 2023

mitake deleted the revert-14548 branch March 21, 2023 13:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert 14548 #15541

Revert 14548 #15541

mitake commented Mar 21, 2023

Revert 14548 #15541

Revert 14548 #15541

Conversation

mitake commented Mar 21, 2023