[Bug 747772] New: iscsi: connection1:0: detected conn error (1020)
https://bugzilla.novell.com/show_bug.cgi?id=747772
https://bugzilla.novell.com/show_bug.cgi?id=747772#c0

Summary: iscsi: connection1:0: detected conn error (1020)
Classification: openSUSE
Product: openSUSE 12.2
Version: Factory
Platform: Other
OS/Version: Other
Status: NEW
Severity: Major
Priority: P5 - None
Component: Basesystem
AssignedTo: bnc-team-screening@forge.provo.novell.com
ReportedBy: per@opensuse.org
QAContact: qa@suse.de
Found By: ---
Blocker: ---

User-Agent: Mozilla/5.0 (X11; Linux i686 on x86_64; rv:8.0.1) Gecko/20100101 Firefox/8.0.1

I'm installing Factory x86_64 over PXE+iSCSI (root on iSCSI). When the system boots the first time after phase 1, I get the message:

connection1:0: detected conn error (1020)

which appears to be a disconnect from the iSCSI target. After lots of trial&error, I ended up removing boot.iscsid-early from the boot-sequence, which got the installation going so I could complete phase2. Only to run into the very same problem again, except later in the boot-up. I decided to remove iscsid from the boot-sequence, and now it works.

Reproducible: Always

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c

kk zhang <kkzhang@novell.com> changed:

What       |Removed                                   |Added
----------------------------------------------------------------------------
CC         |                                          |kkzhang@novell.com
AssignedTo |bnc-team-screening@forge.provo.novell.com |hare@suse.com
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c1

Hannes Reinecke <hare@suse.com> changed:

What       |Removed       |Added
----------------------------------------------------------------------------
CC         |              |hare@suse.com
AssignedTo |hare@suse.com |lduncan@novell.com

--- Comment #1 from Hannes Reinecke <hare@suse.com> 2012-03-13 15:10:33 UTC ---

Lee, can you have a look here?
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c

Lee Duncan <lduncan@novell.com> changed:

What   |Removed |Added
----------------------------------------------------------------------------
Status |NEW     |ASSIGNED
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c2 --- Comment #2 from Lee Duncan <lduncan@novell.com> 2012-06-18 20:05:55 PDT --- checking ...
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c3 --- Comment #3 from Lee Duncan <lduncan@novell.com> 2012-06-19 15:07:28 PDT --- I'd like to see the messages file from the case where both boot.iscsid-early and iscsid are present. Any chance you could reproduce? In the meantime, I'll try to reproduce ...
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c4

--- Comment #4 from Lee Duncan <lduncan@novell.com> 2012-06-19 15:32:12 PDT ---

Looking over the code ...

Disabling both boot.iscsid-early and iscsid is not a good idea. boot.iscsid-early is meant to start iscsid in the case where you've booted using iSCSI, which is the case for you. If you do not start the iSCSI daemon, iscsid, you will not be able to recover from any error, because the daemon manages that.

In addition, boot.iscsid-early makes sure that your login session gets marked as "startup=onboot", if not already so marked. Later, /etc/init.d/iscsi logs into all nodes using "--loginall=automatic", which logs into all nodes except those marked as "onboot". This means that it should skip re-logging into your root node.

Any chance you could re-enable boot.iscsid-early and iscsid, and add "-d 6" to the ARGS in boot.iscsid-early so iscsid debugging is enabled? If I don't hear back from you I'll set up a system where I can try to reproduce this tomorrow.

Note that Factory open-iscsi is the same as open-iscsi in openSUSE 12.1, as far as I can see.
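The node selection described in comment #4 can be illustrated with a small model. This is only a sketch of the behaviour as Lee describes it, not the open-iscsi implementation: the `Node` class, the `mark_onboot` and `select_for_loginall` helpers, and the example target names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical stand-in for an iSCSI node record."""
    target: str
    startup: str  # "manual", "automatic" or "onboot"


def mark_onboot(node: Node) -> None:
    """Model of what boot.iscsid-early is said to do for the boot session."""
    if node.startup != "onboot":
        node.startup = "onboot"


def select_for_loginall(nodes, mode="automatic"):
    """Model of "--loginall=automatic": keep only nodes whose startup matches."""
    return [n for n in nodes if n.startup == mode]


# Assumed example targets; the real names are not given in this thread.
root = Node("iqn.2012-06.example:root", "automatic")
data = Node("iqn.2012-06.example:data", "automatic")

mark_onboot(root)                       # early boot step marks the root session
selected = select_for_loginall([root, data])
print([n.target for n in selected])     # nodes the later init script would log into
```

In this model the root node is skipped by the later login pass, so the session the system booted from is not logged into a second time.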
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c5 --- Comment #5 from Per Jessen <per@opensuse.org> 2012-06-20 05:46:47 UTC --- Hi Lee I'm pretty certain that test-setup is long gone, but I'll see if I can rebuild it.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c6 --- Comment #6 from Per Jessen <per@opensuse.org> 2012-06-20 09:29:56 UTC --- Okay, it looks like my test-system is still available: Hardware: 1UZ, taggart7, 1Gb RAM, dual 2.0GHz, iSCSI root. Software: openSUSE Factory (2012/06/20) x86_64. Procedure: PXE install, ssh+http, iSCSI root.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c7 --- Comment #7 from Per Jessen <per@opensuse.org> 2012-06-21 09:04:31 UTC --- Created an attachment (id=495811) --> (http://bugzilla.novell.com/attachment.cgi?id=495811) serial console capture (In reply to comment #3)
I'd like to see the messages file from the case where both boot.iscsid-early and iscsid are present. Any chance you could reproduce? In the mean time, I'll try to reproduce ...
Hi Lee I've done a clean Factory install. This is a serial capture of the first boot-up after stage1 install. Initially it seems to work fine, but look for "Starting Remount API VFS..." at line 768.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c8 --- Comment #8 from Per Jessen <per@opensuse.org> 2012-06-21 09:22:54 UTC --- (In reply to comment #4)
Any chance you could re-enable boot.iscsid-early and iscsid, and add "-d 6" to the ARGS in boot.iscsid-early so iscsid debugging is enabled?
I tried this, but saw no additional output on the console. I checked the iSCSI filesystem too, but nothing was written to /var/log/messages, presumably because rsyslog isn't started at this point.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c9 --- Comment #9 from Lee Duncan <lduncan@novell.com> 2012-06-27 15:19:56 PDT --- It looks like iSCSI is having a problem. It may be that iscsid is trying to log in to a device that it is already logged into. But all of the I/O after the "detected conn error" seems to fail, which is strange. I will try to reproduce here, but if you get a chance to run wireshark on the iSCSI target network, the capture would be very helpful.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c10

Lee Duncan <lduncan@suse.com> changed:

What         |Removed  |Added
----------------------------------------------------------------------------
Status       |ASSIGNED |NEEDINFO
InfoProvider |         |per@opensuse.org

--- Comment #10 from Lee Duncan <lduncan@suse.com> 2012-09-14 15:56:12 PDT ---

Per: I looked at this some more, and the first error you seem to be getting is on line 769, where it says:
[ 29.418498] connection1:0: detected conn error (1020)
Error 1020 means the target has dropped the connection. The question is why did it do this? What service is being started on line 768, where it says:
Starting Remount API VFS...
I found this reference to a problem with systemd and the remount-api-vfs service: http://lists.freedesktop.org/archives/systemd-devel/2010-November/000750.htm... You didn't happen to have usbfs enabled and in use, did you? I'm guessing something you are starting up is causing your target to disconnect. Is there any message from the target when this happens? And what type of target is it? I can try duplicating your test case, but if your error is target-specific then I might never see the same error.
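The way comment #10 reads the console capture (find the first "detected conn error" line, then look at what was starting just before it) can be sketched as a small script. This is an illustrative sketch only: the `find_conn_errors` helper is hypothetical, the pattern is inferred from the single log line quoted in this thread, and the sample input contains just the two lines quoted above rather than the real attachment.

```python
import re

# Pattern inferred from the one kernel message quoted in this thread (assumption).
CONN_ERROR = re.compile(r"connection(\d+):(\d+): detected conn error \((\d+)\)")


def find_conn_errors(lines):
    """Return (line_number, error_code, previous_line) for each matching line."""
    hits = []
    for index, line in enumerate(lines):
        match = CONN_ERROR.search(line)
        if match:
            previous = lines[index - 1].strip() if index > 0 else ""
            hits.append((index + 1, int(match.group(3)), previous))
    return hits


# Two lines quoted in this thread; a real capture would be read from a file.
capture = [
    "Starting Remount API VFS...",
    "[ 29.418498] connection1:0: detected conn error (1020)",
]

for number, code, previous in find_conn_errors(capture):
    print(f"line {number}: conn error {code}, preceded by: {previous}")
```

Run against a full capture, the line numbers reported would be relative to that file, so they can be compared directly with the 768/769 references in the comment.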
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c11 --- Comment #11 from Per Jessen <per@opensuse.org> 2012-09-15 12:25:53 UTC --- (In reply to comment #10)
Per: I looked at this some more, and the first error you seem to be getting is on line 769, where it says:
[ 29.418498] connection1:0: detected conn error (1020)
Error 1020 means the target has dropped the connection. The question is why did it do this?
I haven't played with this setup for quite some time, but I feel pretty certain that the target didn't drop the connection. There is simply no reason for the target to do so, except maybe the initiator misbehaving.
What service is being started on line 768, where it says:
Starting Remount API VFS...
I don't know what that is - I guess it'll be in the initrd.
I found this reference to a problem with systemd and the remote-api-vfs service: http://lists.freedesktop.org/archives/systemd-devel/2010-November/000750.htm...
You didn't happen to have usbfs enabled and in use, did you?
I doubt it, certainly not explicitly.
I'm guessing something you are starting up is causing your target to disconnect. Is there any message from the target when this happens? And what type of target is it?
I'm using the openSUSE iscsitarget package. I don't know if there are any messages from the target, I'll see what state my test-system is in and see if I can reproduce.
I can try duplicating your test case, but if your error is target-specific then I might never see the same error.
If you've got time, just try to get a system to boot from iSCSI.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c12

Lee Duncan <lduncan@suse.com> changed:

What         |Removed          |Added
----------------------------------------------------------------------------
Status       |NEEDINFO         |ASSIGNED
InfoProvider |per@opensuse.org |

--- Comment #12 from Lee Duncan <lduncan@suse.com> 2012-09-16 07:18:18 PDT ---

I am planning on attempting an iSCSI root installation test for Monday, but in the meantime I tracked down the service that is running when you get an error. The message:
Starting Remount API VFS...
Is from /lib/systemd/systemd-remount-api-vfs. This command simply steps through /etc/fstab and runs "mount DIR -o remount" for directories found.
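The behaviour described in comment #12 can be approximated with a dry-run sketch that only builds the commands and never executes them. This is a simplified model of the description above, not the systemd helper itself, which may filter entries differently; the `remount_commands` function and the sample fstab contents (devices and options) are hypothetical.

```python
def remount_commands(fstab_text):
    """Build one "mount DIR -o remount" command per usable fstab entry."""
    commands = []
    for line in fstab_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # blank lines and commented-out entries are ignored
        fields = line.split()
        if len(fields) < 2:
            continue  # malformed entry
        mount_point = fields[1]
        if not mount_point.startswith("/"):
            continue  # e.g. swap entries have no directory to remount
        commands.append(["mount", mount_point, "-o", "remount"])
    return commands


# Hypothetical fstab contents, used only for illustration.
sample_fstab = """\
# /dev/sda2 / ext4 defaults 1 1
/dev/sda1 swap swap defaults 0 0
proc /proc proc defaults 0 0
sysfs /sys sysfs noauto 0 0
"""

for command in remount_commands(sample_fstab):
    print(" ".join(command))
```

Under this model, an entry that has been commented out in /etc/fstab produces no remount command at all.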
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c13

Hans-Peter Raschke <Hans-Peter.Raschke@sds-oldenburg.de> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC   |        |Hans-Peter.Raschke@sds-oldenburg.de

--- Comment #13 from Hans-Peter Raschke <Hans-Peter.Raschke@sds-oldenburg.de> 2012-09-18 10:34:02 UTC ---

Here is the same Bug for 11.3: https://bugzilla.novell.com/show_bug.cgi?id=656119
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c14 --- Comment #14 from Per Jessen <per@computer.org> 2013-02-24 12:39:59 UTC --- Ping?
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c15 --- Comment #15 from Per Jessen <per@computer.org> 2013-02-24 14:41:31 UTC --- (In reply to comment #12)
I am planning on attempting an iSCSI root installation test for Monday, but in the mean time I tracked down the service that is running when you get an error.
The message:
Starting Remount API VFS
Is from /lib/systemd/systemd-remount-api-vfs. This command simply steps through /etc/fstab and runs "mount DIR -o remount" for directories found.
Hi Duncan If this remount is the culprit, I figured a work-around would be to simply comment out the root filesystems in /etc/fstab. I tried that, didn't work. Just as when I first reported this issue, my work-around of removing boot.iscsid-early and iscsid from the init-sequence worked. When the system has completed boot-up, there is no iscsid running.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c16 --- Comment #16 from Per Jessen <per@computer.org> 2013-02-24 15:40:07 UTC --- Update: starting iscsid manually or with the init-script later on causes no problems. I have re-inserted iscsid into the init-sequence and booting now also works fine.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c17 --- Comment #17 from Per Jessen <per@computer.org> 2013-02-24 15:53:40 UTC --- Update2: I've also re-inserted boot.iscsid-early, no problems.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c18 --- Comment #18 from Lee Duncan <lduncan@suse.com> 2013-02-24 09:42:40 PST --- Per: I'm sorry I let this drop off my radar. So it sounds like it's working now? That's strange (but good, I guess).
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c19 --- Comment #19 from Per Jessen <per@computer.org> 2013-02-25 07:15:31 UTC --- (In reply to comment #18)
Per: I'm sorry I let this drop off my radar. So it sounds like it's working now? That's strange (but good, I guess).
Not really, the problem during the 2nd phase of installation still remains.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c20 --- Comment #20 from Hans-Peter Raschke <Hans-Peter.Raschke@sds-oldenburg.de> 2013-02-26 10:03:07 UTC --- (In reply to comment #19)
(In reply to comment #18)
Per: I'm sorry I let this drop off my radar. So it sounds like it's working now? That's strange (but good, I guess).
Not really, the problem during the 2nd phase of installation still remains.
This update should solve your problem: https://bugzilla.novell.com/show_bug.cgi?id=766300#c20
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c21 --- Comment #21 from Per Jessen <per@computer.org> 2013-02-26 12:14:33 UTC --- (In reply to comment #20)
(In reply to comment #19)
(In reply to comment #18)
Per: I'm sorry I let this drop off my radar. So it sounds like it's working now? That's strange (but good, I guess).
Not really, the problem during the 2nd phase of installation still remains.
This update should solve your problem:
Thanks, I'm sure it will. Amazing that we had two reports covering exactly the same issue :-)
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c22 --- Comment #22 from Per Jessen <per@computer.org> 2013-03-13 17:09:27 UTC --- Hmm, so far I have been having pretty much the same issue with 12.3. After some debugging today where it became clear that iscsid failed to authenticate with the target, I built from the latest source code, and replaced iscsiadm and iscsid. Problem solved. I have repeated the exercise on three systems.
https://bugzilla.novell.com/show_bug.cgi?id=747772 https://bugzilla.novell.com/show_bug.cgi?id=747772#c23

Lee Duncan <lduncan@suse.com> changed:

What       |Removed  |Added
----------------------------------------------------------------------------
Status     |ASSIGNED |RESOLVED
Resolution |         |DUPLICATE

--- Comment #23 from Lee Duncan <lduncan@suse.com> 2013-03-14 14:08:49 PDT ---

I believe this is a duplicate of bnc#766300, which I have reopened. Under that bug, I have submitted openSUSE 12.3 changes. The reason for this regression is that these same changes were submitted for openSUSE Factory 8 weeks ago, but they stalled there. I am following up on pushing those changes through, as well.

Closing this, then, as a duplicate.

Per: there are a few SUSE-specific changes in open-iscsi you will not get using the latest sources, but I'm sure you know that.

*** This bug has been marked as a duplicate of bug 766300 ***
http://bugzilla.novell.com/show_bug.cgi?id=766300