Skip to content
This repository has been archived by the owner on Jul 25, 2022. It is now read-only.

fix ssh issue user => bastion #239

Merged
merged 1 commit into from
Jul 31, 2020
Merged

Conversation

tedteng
Copy link
Contributor

@tedteng tedteng commented Jul 31, 2020

What this PR does / why we need it:

Which issue(s) this PR fixes:
Fixes #238

Special notes for your reviewer:

Release note:

 fix SSH Operation timed out when user => bastion host

@tedteng tedteng requested a review from a team as a code owner July 31, 2020 02:34
@gardener-robot-ci-3 gardener-robot-ci-3 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Jul 31, 2020
@gardener-robot-ci-2 gardener-robot-ci-2 added needs/ok-to-test Needs approval for testing (check PR in detail before setting this label because PR is run on CI/CD) and removed reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) labels Jul 31, 2020
@neo-liang-sap
Copy link
Contributor

thanks @tedteng for this PR, two points here:

  1. Adding ConnectionAttempts from user->bastion is absolutely good, is there any specified reason changing ConnectionAttempts=3 to ConnectionAttempts=2 for bastion->node?

  2. I would suggest maybe a bit more clear in title of fix ssh Operation timed out  #238 and release notes of this PR, e.g. "add retry mechanism during ssh from user to bastion". fix ssh Operation timed out is kind of vague especially recently there are many PRs related to ssh enhancements of gardenctl...

WDYT?

/cc @DockToFuture @dansible

Copy link
Contributor

@DockToFuture DockToFuture left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@gardener-robot gardener-robot added the reviewed/lgtm Has approval for merging label Jul 31, 2020
@tedteng
Copy link
Contributor Author

tedteng commented Jul 31, 2020

thanks @tedteng for this PR, two points here:

  1. Adding ConnectionAttempts from user->bastion is absolutely good, is there any specified reason changing ConnectionAttempts=3 to ConnectionAttempts=2 for bastion->node?

Good questions why 3 or 2 !!!

The logic here, execute SSH when detecting 22 port opened, ideally, ssh should be work without any issue due to 22 port already opened and ssh service running. however, ssh still fails and close exit until ssh session timeout, even 22 port already open sometimes. I am feeling AWS EC2 or ssh service in EC2 has something in there.

Back to the question, 22 opened, but if ssh got issues not able to connect, whatever retry 2 or 3 times will still issue.
You have to wait 2 times or 3 times * ssh session timeout time before executing clean up bastion host and security group. Its killing people patient. as a consequence, people may close/cancel a session directly to re-ssh which cause bastion resource and security group remaining in AWS cloud as an issue.

I am more focusing on how to speed up ssh connection and increase the connection stable. If you want 3 no issue for me.

  1. I would suggest maybe a bit more clear in title of fix ssh Operation timed out  #238 and release notes of this PR, e.g. "add retry mechanism during ssh from user to bastion". fix ssh Operation timed out is kind of vague especially recently there are many PRs related to ssh enhancements of gardenctl...

I think "add retry mechanism during ssh from user to bastion" it more talking about PR #231

As mention above AWS ec2 some times like "blackbox" for ssh, I didn't meet issue when testing and working on PR #231

However, I got an issue today after upgrade to version v0.19.0, not sure whether timing not good form APJ today. I am not able to connect any node when I testing the latest version. that why I raised the new PR to fix the issue.

WDYT?

/cc @DockToFuture @dansible

@neo-liang-sap neo-liang-sap merged commit 1c7d342 into gardener-attic:master Jul 31, 2020
@dansible dansible mentioned this pull request Aug 3, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
needs/ok-to-test Needs approval for testing (check PR in detail before setting this label because PR is run on CI/CD) reviewed/lgtm Has approval for merging
Projects
None yet
Development

Successfully merging this pull request may close these issues.

fix ssh Operation timed out
6 participants