why can't I connect to my ssh server UNLESS I enter eval "$(ssh-agent -s)" first?
I have my own ssh server (on raspberry pi 5, Ubuntu Server 23) but when I try to connect from my PC using key authentication (having password disabled), I get a blank screen. A blinking cursor.
However, once I enter the command eval "$(ssh-agent -s)"
and try ssh again, I successfully login after entering my passphrase. I don't want to issue this command every time. Is that possible?
This does not occur when I have password enabled on the ssh server. Also, ideally, I want to enter my passphrase EVERYTIME I connect to my server, so ideally I don't want it to be stored in cache or something. I want the passphrase to be a lil' password so that other people can't accidentally connect to my server when they use my PC.
The whole point of ssh-agent is to remember your passphrase. If you don't want to do that your problem might be that for some reason ssh client doesn't pick up your key. Try defining it for the host
Also, there's -v flag for ssh. Use it to debug what's going on when it doesn't try to use your key
okay I tried that, using -i to specify private key. I get the same thing: blank / blinking cursor. When I use verbose -v flag, I see that in all cases (using -i, the config file, and originally) it ends with these two lines (after about 50 lines) :
where (etc) is some redacted text. It seems the server is ACCEPTING the key, which is nice. But then it's still a blinking cursor...
Check if it is true. In the server logs.
I'm not sure which logs I can and should check, but when I listen to this:
sudo tail -f /var/log/auth.log
I only get this right after I ctrl+C on my blank / blinking cursor screen. (Did this 3 times in a row.)
Where MY_PUBLIC_IP is redacted. I'm not even sure why my public IP is showing. I connect locally. But ports are forwarded, yes.
Using
sudo journalctl -u sshd -f
does not seem to output anything...That's only part of the handshake. It'd require agent input around that point.
replace passphrase with private key and you're very correct.
passphrases used to login to servers using PasswordAuthentication are not stored in the agent. i might be wrong with technical details on how the private key is actually stored in RAM by the agent, but in the context of ssh passphrases that could be directly used for login to servers, saying the agent stores passphrases is at least a bit misleading.
what you want is:
also an idea:
all depends on the level of security you want to achieve. additional TOTP could improve security too (but beware that some authenticator providers might have "sharing" features which could compromise the TOTP token even before its first use.
FWIW, I've found that the -v flag often doesn't say why it's not using your key, just that it isn't using your key and it has fallen back to password authentication.
It's usually not terribly helpful for figuring out why it's not using your key, just that it's not using your key, which you kind of already know if it's prompting you for a password. lol
Because it's basically axiomatic: ssh uses all keys it knows about. The system can't tell you why it's not using something it doesn't know it should be able to use. You can give a -i for the certificate to check if it doesn't know it because the content is broken or the location.
That said: this doesn't make -v more useful for cases like this, just because there's a reason!
Not OP but everytime I used the verbose output of ssh it didn't help me one bit. Even adding outrageous verbosity I was still quite confused on what step failed and which didn't.
I'm probably just bad at understanding SSH but i don't know it seems like ssh workflow includes many trial and error until it finds a way to connect.
Imo the verbose output of SSH is often not very helpful if you don't know very well ssh in the first place. Obviously it is still worth a shot and a good advice but don't expect ssh to clearly state what is going on :)
Well, you have configuration and flag options to define what is it supposed to be trying to use. What order, I think too. But definitely understanding SSH a little bit will make the log more understandable. As with everything tbh :D
This likely isn't helpful but it isn't meant to be a shitpost. However, I will point out this literature:
SSH, The Secure Shell: The Definitive Guide, 2nd Edition
https://github.com/manish-old/ebooks-2/blob/master/O'Reilly%20-%20SSH%20The%20Secure%20Shell%20The%20Definitive%20Guide-2.pdf
Other commenters clearly know more than me about tbs ssh, so I'll otherwise remain silent.
Maybe ssh can't find the key automatically. What is the path to your private key?
I'm pretty sure I generated it to ~/.ssh/id_rsa which I think the default location. It is also the location shown in the terminal image in my post.
I think some distros disable using RSA by default. Might need to use it explicitly.
As mentioned,
-v
(or-vv
) helps to analyze the situation.My theory is that you already have something providing ssh agent service, but that process is somehow stuck, and when ssh tries to connect it, it doesn't respond to the connect, or it accepts the connection but doesn't actually interact with ssh. Quite possibly ssh doesn't have a timeout for interacting with ssh-agent.
Using
eval $(ssh-agent -s)
starts a new ssh agent and replaces the environment variables in question with the new ones, therefore avoiding the use of the stuck process.If this is the actual problem here, then before running the
eval
,echo $SSH_AUTH_SOCK
would show the path of the existing ssh agent socket. If this is the case, then you can uselsof $SSH_AUTH_SOCK
to see what that process is. Quite possibly it's provided bygnome-keyring-daemon
if you're running Gnome. As to why that process would not be working I don't have ideas.Another way to analyze the problem is
strace -o logfile -f ssh ..
and then check out what is at the end of thelogfile
. If the theory applies, then it would likely be aconnect
call for the ssh-agent.I guess it's worth checking if those names point to the expected binaries, but I also think it would be highly unlikely they would be anything else than just
/usr/bin/ssh
and/usr/bin/ssh-agent
.I didn't really follow the former part, but I can give you this:
strace -o logfile -f ssh -p 8322 pi@192.168.2.223 of when I get blank
Please don't ignore the advice about SSH_AGENT_SOCK. It'll tell yoy what's going on (but not why).
At the end of the log you find:
meaning it's trying to interact with the ssh-agent, but it (finally) doesn't give a response.
Use the
lsof
command to figure out which program is providing the agent service and try to resolve issue that way. If it's not the OpenSSH ssh-agent, then maybe you can disable its ssh-agent functionality and use real ssh-agent in its place..My wild guess is that the program might be trying to interactively verify the use of the key from you, but it is not succeeding in doing that for some reason.
I am not sure I "solved" this but when I add this to my startup script for my terminal (~/.zshrc):
it works then. I am not sure I'm still using the ssh agent, but at least it also does not cache my passphrase/private key
Do you have that file? If not, then
unset SSH_AUTH_SOCK
will work just as well.If it does exist, then I suppose it has good chances of working correctly :).
ssh-add -l
will try to use that socket and list your keys in the service (or list nothing if there are no keys, but it would still work without error).in the past some xserver environments started an ssh-agent for you just in case of, and for some reason i don't remember that was annoying and i disabled it to start my agent in my shell environment as i wanted it.
also a possibility is tharlt there are other agents like the gpg-agent that afaik also handles ssh keys.
but i would also look into $HOME/.ssh/config if there was something configured that matches the hostname, ip, or with wildcards* parts of it, that could interfere with key selection as the .ssh/id_rsa key should IMHO always be tried if key auth is possible and no (matching) key is known to the ssh process, that is unless there already is something configured...
not sure if a system-wide /etc/ssh/ssh_config would interfere there too, maybe have a look there too. as this behaviour seems a bit unexpected if not configured specially to do so.
I am not sure I "solved" this but when I add this to my startup script for my terminal (~/.zshrc):
it works then. I am not sure I'm still using the ssh agent, but at least it also does not cache my passphrase.
you should definitely know what type of authentication you use (my opinion) !! the agent can hold the key forever, so if you are just not asked again when connecting once more, thats what the agent is for. however its only in ram, so stopping the process or rebooting ends that of course. if you didn't reboot meanwhile maybe try unload all keys from it (ssh-add -D, ssh-add -L) and see what the next login is like.
btw: i use ControlMaster /ControlPath (with timeouts) to even reduce the number of passwordless logins and speed things up when running scripts or things like ansible, monitoring via ssh etc. then everything goes through the already open channel and no authentication is needed for the second thing any more, it gets really fast then.
Without the ssh-agent invocation:
what does
ssh-add -L
show?what is the original SSH_AUTH_SOCK value?
what is listening to that? (Use
lsof
)This kind of stuff often happens because there's a ton of terrible advice online about managing ssh-agent - make sure there's none if that baked into your shellrc.
All before issuing the ssh-agent
It's the gnome key ring ssh agent.
It's possible that this has popped up a window asking gor permission / a passphrase / something and you're not seeing that.
Okay, that agent process is running but it looks wedged: multiple connections to the socket seem to be opened, probably your other attempts to use ssh.
The ssh-add output looks like it's responding a bit, however.
I'd use your package manager to work out what owns it and go looking for open bugs in the tool.
(Getting a trace of that process itself would be handy, while you're trying again. There may be a clue in its behaviour.)
The server reaponse seems like the handshake process is close to completing. It's not immediately clear what's up there I'm afraid.
Is this problem a recurring one after a reboot?
If it is it warrants more effort.
If not and you're happy with rhe lack of closure, you can potentially fix this: kill the old agent (watch out to see if it respawns; if it does and that works, fine). If it doesn't, you can (a) remove the socket file (b) launch ssh-agent with the righr flag (
-a $SSH_AGENT_SOCK
iirc) to listen at the same place, then future terminal sessions that inherit the env var will still look in the right place. Unsatisfactory but it'll get you going again.reboot makes no difference. A new terminal gives the symptoms from the start.
I think I found a bad workaround. If I add this script to ~/.zshrc (because I'm not using bash but zsh)
then it works. But I think I'm still using the ssh agent which I actually should not be using. At least it's asking for the passphrase every time, which is nice. Even in the same terminal after ssh logout.
EDIT: The first two lines do the trick as well:
EDIT: If I change this SSH_AUTH_SOCK to ANYTHING else, it also works. So
/run/user/1000/gcr/ssh
does not work. I gave ample permission to this file, so that cannot be the problem. Perhaps BECAUSE this is a file. I think the SSH_AUTH_SOCK should point to a nonexisting file because then it makes temporarily a special file that it needs. Ok I'm just shooting in the dark.Minimise your windows one at a time and check that the gnome keyring hasn't popped up a dialog box sonewhere behind everything else that's asking you if it's okay to proceed.
No unfortunately not... Would've been a real pain.
Have you considered storing your keys unencrypted? In this case ssh doesn't need the agent or a password.
Yes it's not as secure, but for me it's good enough considering my systems at home are not doing anything important. If you have an encrypted home partition it's just as secure when your partition is unmounted.
Search for /run/user/1000/gcr/ssh on the Internet. I'm on my phone and didn't find the solution, but I'm sure you'll find it.
I searched. When I change this variable (path), it works. So in the startup script for my terminal (~/.zshrc) I added this:
Now it works, but I'm not sure why. Anything BUT
/run/user/1000/gcr/ssh
works I thinkplease, it's
eval "$(ssh-agent -s)"
(quotes!)well seems to work without tho
edit: made no difference, but I changed it in the post title.
Just because it works, doesn't mean it's right.
I had a similar construct in my bashrc and forgot the quotes. It didn't throw an error but also didn't work. Took quite a while to find the issue. So personally, I would recommend trying to quote correctly whenever possible.
I was unclear: I did not mean to imply that it will work with it.
It's OT, but I'll clarify since it might be useful for people who find Bash cryptic.
Thing is, roughly speaking:
eval
will evaluate its first argument as Bash codeeval "$(any_command really)"
will run runany_command really
, take its output and then use it as first argument foreval
. So the assumption is thatany_command really
must output a valid Bash code snippet.So what
eval "$(ssh-agent -s)"
really means is, "run ssh-agent -s, collect the output and run it right here, where we are callingeval
. Compare tossh-agent -s | bash
-- this would also run ssh-agent output but it would run it in a new process--a child process of the current process---so the whatever the snippet would be, it would have no way of affecting state of the parent program, which is why it's safer.Aside: The reason we need eval in this case is that we actually need to affect state of the program: that's the whole point. We need to set several environment variables to values that ssh-agent "knows". Without
eval
we would have to "ask" ssh-agent separately for each value (I'm assuming it's not even supported) and then set all these envvars using eg.export
keyword. Usingeval
we let ssh-agent dictate the whole process: which variables are going to be set to what values, with the caveat that if compromised, it could do "evil" stuff like settingPATH
to override common commands with compromised code. etc.So what's the problem with the quotes? The Shell syntax,
foo "$(bar baz)"
will make sure that the thing between quotes isNow without quotes, Bash (as well as POSIX shell) actually have several things they can do with the output (read
man bash
for full list, but keep it for a long rainy evening). Some of it involves substituting eg. values like*
with matching filenames, some of it may involve actually splitting the output to separate arguments based on spaces or other special characters (which can even be different characters depending on current state, seeIFS
and the likes).You can see the difference, if you run eg.
printf '[%s]\n'
instead ofeval
. This printf syntax will simply print all of following arguments on a separate line, adding braces before and after. You can compare(both of these commands should be safe as long as
ssh-agent
is not compromised and as long I have not made any terrible typo)Where is the key? What are the permissions for it?
it got 600 both the private and public key, stored in ~/.ssh/
Often permissions can be an issue. I'd check permission for directory and files and hinr directory for user and group.
Sample permissions here: https://jonasbn.github.io/til/ssh/permissions_on_ssh_folder_and_files.html
Many. Ssh issues I've had have been permissions issues.
yeah I copied the permissions as shown, makes no difference. Maybe the ownership is wrong? Both group and owner is me
what are your ssh config settings: ~/.ssh/config or /etc/ssh/ssh_config
I just added the ~/.ssh/config file on client side:
Same result.
The /etc/ssh/ssh_config is only relevant on the server side, right? Well, here it is.
sshd_config is server side, ssh_config is client side AFAIK
Your config looks pretty tame. Anything interesting in
/etc/ssh/config.d/
?dir does not exist on either side.
yeah I barely changed anything. Just disabled password and changed port AFAIK.
Can you try
killall ssh
on the client, and then try to ssh into the rpi again?tried.
Try running ssh with
-vv
to get a better idea of the problem when no ssh agent is running.I have no idea why but ssh seems to not use keys with different names by default
can you expand on that? What do you mean different names? My PC has of course a different username than the server I'm connecting to. The label name at the end of the key is just a comment, so this is also not what you're referring to, I think.
By default the key is named id.rsa and ssh-client may only load that. Or none at all. Very strange
are you using fish shell?
I have zsh
https://github.com/ohmyzsh/ohmyzsh/tree/master/plugins/ssh-agent
Can you post the result of the
env
command as well? It sounds like your config is very minimal, but the fact that it's looking for a local Unix socket in the strace output is weird.What happens if you do:
And then try to connect?
Hey that works too! Same effect as my previous workaround, that I just posted yesterday.
I do have to repeat this command everytime, so I had to put it into ~/.zshrc so it's executed beforehand in every new terminal.
It still does feel lile a workaround since it 'resets' itself (as I said) with every new terminal.
So, this is set somewhere in your config files, I think. Maybe try:
Just to see where it's being set.
What happens if you run commands on that blinking cursor? E.g. it you run
ls
do you get an output? I've had that happen in the past, don't remember the reason though.also no output
I am not sure I "solved" this but when I add this to my startup script for my terminal (~/.zshrc):
it works then. I am not sure I'm still using the ssh agent, but at least it also does not cache my passphrase (or private key in ram)
The only reason ssh client would "hang" without any output is when it's waiting for external key storage to allow access. It's designed that way to give user some time to approve access to key storage.
It sometimes happen that the installed key storage is broken in a way that it fails to show user modal, for any reason (showing on wrong screen, wrong desktop, wrong activity, wrong framebuffer, ....)
One solution (that you already did) is to change the SSH agent env variable to point to different key storage.
Another would be (if possible) to uninstall the broken key storage if you don't use it. But it is sometimes needed/used by other apps.
It's overall good to notify/open bug on your distro issue tracker to notify that some packages are missconfigured (maybe have missing dependencies) or conflicts with other ones.
Your shell for user pi may be broken. Try adding the shell command to your ssh command explicitly like
ssh pi@host /bin/sh
Or use /bin/bash
@dysprosium ssh agent manages your ssh keys and automatically passes them as an identity when connecting to a server
If you want to connect without it, you can simply pass
-i \
flagokay I tried that, using -i to specify private key. I get the same thing: blank / blinking cursor. When I use verbose -v flag, I see that in BOTH cases (I see about 50 lines) it ends with these two lines:
where (etc) is some redacted text. It seems the server is ACCEPTING the key, which is nice. But then it’s still a blinking cursor…
@dysprosium Mind trying with -vvvv flag and sharing the output instead of -v?
https://www.dropbox.com/scl/fi/4rfg9s81q1a55xoj7ug8y/ssh_verbose.txt?rlkey=0gfiv6h3gitvmgaowduz1i83b&dl=0
edit: fixed link