[opensuse] How to start mdadm service in openSUSE Leap 42.2?
Hello:

I have a few md raid1 sets I used in previous openSUSE versions. In my freshly installed Leap 42.2 the raid sets are not assembled automatically at boot, though I have a correct mdadm.conf file. I can manually assemble the sets using mdadm.

# cat /proc/mdstat
cat: /proc/mdstat: No such file or directory

# cat /etc/mdadm.conf
DEVICE containers partitions
ARRAY /dev/md/pc:6 UUID=21316afe:1a4dd0bf:50911056:88042a7c
ARRAY /dev/md/pc:7 UUID=64e23ea9:7dcb9ee2:7bca71bd:248cc5cf

# mdadm -E --scan
ARRAY /dev/md/6 metadata=1.0 UUID=21316afe:1a4dd0bf:50911056:88042a7c name=pc:6
ARRAY /dev/md/7 metadata=1.0 UUID=64e23ea9:7dcb9ee2:7bca71bd:248cc5cf name=pc:7

# mdadm -A /dev/md6 /dev/sdb6 /dev/sdc6
mdadm: /dev/md6 has been started with 2 drives.

# cat /proc/mdstat
Personalities : [raid1]
md6 : active raid1 sdb6[3] sdc6[2]
      20971520 blocks super 1.0 [2/2] [UU]

unused devices: <none>

How can I start the md service in Leap 42.2?

Thanks,

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On Mon, Jan 9, 2017 at 3:38 PM, Istvan Gabor <suseuser04@gmail.hu> wrote:
How can I start md service in Leap 42.2?
MD devices are expected to be assembled incrementally by udev rules. See /usr/lib/udev/rules.d/64-md-raid-assembly.rules. Try running the commands from these rules manually. If the commands work, it probably means the rules are not being applied for some reason, in which case booting with "udev.debug" (and omitting "quiet" to be sure) on the kernel command line may provide some hints. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
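For illustration, one way to exercise those rules for a single array component without rebooting is sketched below (assuming /dev/sdb6 is an array member, as in the original post; udevadm test simulates a uevent and prints which rules matched, and it may actually execute the IMPORT{program} commands, so use it only on a component you are willing to have assembled):

# grep -n mdadm /usr/lib/udev/rules.d/64-md-raid-assembly.rules
# udevadm test --action=add /sys/class/block/sdb6 2>&1 | grep -i -e mdadm -e raid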
On Mon, Jan 9, 2017 at 4:12 PM, Andrei Borzenkov <arvidjaar@gmail.com> wrote:
On Mon, Jan 9, 2017 at 3:38 PM, Istvan Gabor <suseuser04@gmail.hu> wrote:
How can I start md service in Leap 42.2?
MD devices are expected to be assembled incrementally by udev rules. See /usr/lib/udev/rules.d/64-md-raid-assembly.rules. Try running the commands from these rules manually. If the commands work, it probably means the rules are not being applied for some reason, in which case booting with "udev.debug"
This is udev.log-priority=debug, sorry.
(and omitting "quiet" to be sure) on kernel command line may provide some hints.
-- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On Mon, 9 Jan 2017 16:21:34 +0300, Andrei Borzenkov wrote:
On Mon, Jan 9, 2017 at 4:12 PM, Andrei Borzenkov <arvidjaar@gmail.com> wrote:
On Mon, Jan 9, 2017 at 3:38 PM, Istvan Gabor <suseuser04@gmail.hu> wrote:
How can I start md service in Leap 42.2?
MD devices are expected to be assembled incrementally by udev rules. See /usr/lib/udev/rules.d/64-md-raid-assembly.rules. Try running the commands from these rules manually.
I looked at the 64-md-raid-assembly.rules file but I don't know how to run the commands manually. I guess this is the command you mean:

/sbin/mdadm --incremental --export $devnode --offroot ${DEVLINKS}

I don't know what to substitute for $devnode and ${DEVLINKS}. I found that /dev/disk/by-uuid has only a few devices. It should have many more.
If commands work, it probably means for some reason rules are not applied, in which case booting with "udev.debug"
This is udev.log-priority=debug, sorry.
(and omitting "quiet" to be sure) on kernel command line may provide some hints.
OK, I booted with this kernel parameter. What should I look for?

And a very strange thing happened. I ran cfdisk. When I exited cfdisk (without writing anything), a lot of md devices were created, but each has only one disk of its array:

# cat /proc/mdstat
Personalities :
md6 : inactive sdb6[3](S)
      20972752 blocks super 1.0

md7 : inactive sdb7[3](S)
      31455164 blocks super 1.0

unused devices: <none>

/dev/disk/by-uuid has been populated too.

What next?

Thanks,

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
09.01.2017 18:07, Istvan Gabor wrote:
On Mon, 9 Jan 2017 16:21:34 +0300, Andrei Borzenkov wrote:
On Mon, Jan 9, 2017 at 4:12 PM, Andrei Borzenkov <arvidjaar@gmail.com> wrote:
On Mon, Jan 9, 2017 at 3:38 PM, Istvan Gabor <suseuser04@gmail.hu> wrote:
How can I start md service in Leap 42.2?
MD devices are expected to be assembled incrementally by udev rules. See /usr/lib/udev/rules.d/64-md-raid-assembly.rules. Try running the commands from these rules manually.
I looked at 64-md-raid-assembly.rules file but I don't know how to manually run the commands. I guess this is the command you mean:
/sbin/mdadm --incremental --export $devnode --offroot ${DEVLINKS}
I don't know what to take for $devnode and ${DEVLINKS}.
$devnode is the name of the device that is being scanned (i.e. the array component). $DEVLINKS is the list of device aliases (the symlinks udev has created for it).
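As a rough sketch (device names taken from the earlier output; --export makes mdadm print the key=value pairs that udev would import), the rule's command can be reproduced by hand like this:

# udevadm info --query=property --name=/dev/sdb6 | grep '^DEVLINKS'
# /sbin/mdadm --incremental --export /dev/sdb6 --offroot $(udevadm info -q property -n /dev/sdb6 | sed -n 's/^DEVLINKS=//p')

If the component is accepted, mdadm should print MD_* variables; a non-zero exit with no output is what the udev rule would likely see as a failure.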
I found that /dev/disk/by-uuid has only a few devices. It should have much more.
It really sounds like some udev problem.
If commands work, it probably means for some reason rules are not applied, in which case booting with "udev.debug"
This is udev.log-priority=debug, sorry.
(and omitting "quiet" to be sure) on kernel command line may provide some hints.
OK, I booted with this kernel parameter. What to look for?
Upload output of "journalctl -b" somewhere, e.g. http://susepaste.org/
And a very strange thing happened. I ran cfdisk. When I exited cfdisk (without writing anything) a lot of md devices have been created but only having one disk of the arrays:
Well, cfdisk likely triggered a rescan of the devices, which in turn triggered events that udev processed. Why only half of the components were picked up is a good question.
# cat /proc/mdstat Personalities : md6 : inactive sdb6[3](S) 20972752 blocks super 1.0
md7 : inactive sdb7[3](S) 31455164 blocks super 1.0
unused devices: <none>
/dev/disk/by-uuid has been populated too.
What next?
mdadm --examine --scan -vv

in addition to journalctl output would be good.
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
Andrei Borzenkov wrote:
Well, cfdisk likely triggered rescan of devices that in turn triggered events and udev processed them. Why only for half of components is a good question.
It came up with a really weird config - in the config below, md6 and md7 have three drives each, and the only one present is a hot-spare.
# cat /proc/mdstat Personalities : md6 : inactive sdb6[3](S) 20972752 blocks super 1.0
md7 : inactive sdb7[3](S) 31455164 blocks super 1.0
-- Per Jessen, Zürich (-0.2°C) http://www.cloudsuisse.com/ - your owncloud, hosted in Switzerland. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
09.01.2017 21:08, Per Jessen wrote:
Andrei Borzenkov wrote:
Well, cfdisk likely triggered rescan of devices that in turn triggered events and udev processed them. Why only for half of components is a good question.
It came up with a really weird config - in the config below, md6 and md7 have three drives each, and the only one present is a hot-spare.
The [3] is not the total number of components, but rather a unique component identifier. It is also *not* the position in the RAID array. This identifier may change, e.g. after a device replacement. Spare status is correct after "mdadm --incremental" with a single component. So far it looks pretty OK, except that I would expect the second half of each array to be present as well.
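For reference, the component's real slot and state can be read from its superblock rather than from the bracketed number; a small sketch, assuming 1.x metadata as used in this thread:

# mdadm --examine /dev/sdb6 | grep -E 'Device Role|Array State'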
# cat /proc/mdstat Personalities : md6 : inactive sdb6[3](S) 20972752 blocks super 1.0
md7 : inactive sdb7[3](S) 31455164 blocks super 1.0
-- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
Andrei Borzenkov wrote:
09.01.2017 21:08, Per Jessen wrote:
Andrei Borzenkov wrote:
Well, cfdisk likely triggered rescan of devices that in turn triggered events and udev processed them. Why only for half of components is a good question.
It came up with a really weird config - in the config below, md6 and md7 have three drives each, and the only one present is a hot-spare.
The [3] is not total number of components, but rather unique component identifier. It is also *not* position in the RAID array. This identifier may change e.g. after device replacement.
ah, I misread it - actually I was thinking component#, not number of devices :-)
Spare status is correct after "mdadm --incremental" with single component. So far it looks pretty OK except I would expect second half of array to be present as well.
I would have expected the array to be running degraded with one disk missing/failed ? No disks present and one hot-spare still seems weird. -- Per Jessen, Zürich (-0.2°C) http://www.hostsuisse.com/ - virtual servers, made in Switzerland. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On Mon, 09 Jan 2017 20:12:30 +0100, Per Jessen wrote:
Andrei Borzenkov wrote:
09.01.2017 21:08, Per Jessen wrote:
Andrei Borzenkov wrote:
Well, cfdisk likely triggered rescan of devices that in turn triggered events and udev processed them. Why only for half of components is a good question.
Sorry, I omitted that I ran cfdisk only on /dev/sdb. This triggered processing of the sdb devices only, i.e. one half of each array. Once more the whole process, step by step:

# cat /proc/mdstat
cat: /proc/mdstat: No such file or directory

# cfdisk /dev/sdb
QUIT

Many devices are generated.

# cat /proc/mdstat
Personalities :
md7 : inactive sdb7[3](S)
      31455164 blocks super 1.0

md6 : inactive sdb6[3](S)
      20972752 blocks super 1.0

unused devices: <none>

# cfdisk /dev/sdc
QUIT

Many devices are generated again.

# cat /proc/mdstat
Personalities : [raid1]
md7 : active raid1 sdc7[2] sdb7[3]
      31455164 blocks super 1.0 [2/2] [UU]

md6 : active raid1 sdc6[2] sdb6[3]
      20971520 blocks super 1.0 [2/2] [UU]

unused devices: <none>

In reality I have 18 arrays, but I inserted only two here as examples.

Andrei, I sent the required log files to your address.

Thanks,

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
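Apparently exiting cfdisk makes the kernel re-read the disk and generates 'change' uevents, which is what kicks udev into assembling that disk's halves. The same events can be generated directly, which may be handier for testing; a sketch, using the device names from this thread:

# udevadm trigger --action=change --sysname-match='sdb*' --sysname-match='sdc*'

or, per partition (on most current kernels):

# echo change > /sys/class/block/sdb6/uevent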
On Mon, 9 Jan 2017 20:35:16 +0300, Andrei Borzenkov wrote:
$devnode is name of device that is being scanned (i.e. array component). $DEVLINKS is list of device aliases.
I found that /dev/disk/by-uuid has only a few devices. It should have much more.
It really sounds like some udev problem.
If commands work, it probably means for some reason rules are not applied, in which case booting with "udev.debug"
This is udev.log-priority=debug, sorry.
(and omitting "quiet" to be sure) on kernel command line may provide some hints.
OK, I booted with this kernel parameter. What to look for?
Upload output of "journalctl -b" somewhere, e.g. http://susepaste.org/
I tried to upload the output of journalctl -b to susepaste.org but the site hasn't accepted it. I guess it's too much to paste in. The whole log file is ~1 MB and contains 11678 lines. I have sent the log to your email too; I don't know if you've received it.
And a very strange thing happened. I ran cfdisk. When I exited cfdisk (without writing anything) a lot of md devices have been created but only having one disk of the arrays:
Well, cfdisk likely triggered rescan of devices that in turn triggered events and udev processed them. Why only for half of components is a good question.
# cat /proc/mdstat Personalities : md6 : inactive sdb6[3](S) 20972752 blocks super 1.0
md7 : inactive sdb7[3](S) 31455164 blocks super 1.0
unused devices: <none>
/dev/disk/by-uuid has been populated too.
What next?
mdadm --examine --scan -vv
in addition to journalctl output would be good.
I have uploaded it to susepaste.org: http://susepaste.org/02e27f89 Thanks, Istvan -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
10.01.2017 22:32, Istvan Gabor wrote: ...
Upload output of "journalctl -b" somewhere, e.g. http://susepaste.org/
I tried to upload the output from journalctl -b to susepaste.org but the site haven't accepted it. I guess it's too much to paste in. The whole log file is ~1 MB and it contains 11678 lines. I have sent the log to your email too, I don't know if you've received it.
I did; as I answered in another mail, udev debug output was not that useful and there is nothing standing out between good and bad cases. Also some logs are definitely lost. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On Wed, 11 Jan 2017 06:58:21 +0300, Andrei Borzenkov wrote:
10.01.2017 22:32, Istvan Gabor wrote: ...
Upload output of "journalctl -b" somewhere, e.g. http://susepaste.org/
I tried to upload the output from journalctl -b to susepaste.org but the site haven't accepted it. I guess it's too much to paste in. The whole log file is ~1 MB and it contains 11678 lines. I have sent the log to your email too, I don't know if you've received it.
I did; as I answered in another mail, udev debug output was not that useful and there is nothing standing out between good and bad cases. Also some logs are definitely lost.
Hi Andrei,

Was the new log file I sent this morning useful?

I have tried openSUSE 13.2 and it assembles the arrays correctly at boot. I haven't tried Leap 42.1 because I don't have it installed.

Thanks,

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 01/11/2017 03:35 PM, Istvan Gabor wrote:
Hi Andrei,
Was the new log file I sent this morning useful?
I have tried openSUSE 13.2 and it assembles the arrays correctly at boot. I haven't tried Leap 42.1 because I don't have it installed.
Thanks,
Istvan
Istvan,

I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup.

Like I said, this is just a 'thought' since I haven't waded into linux-raid on 42.2 yet and do not know what the underlying udev (should be Upray) involvement in activation and assembly is on this release. It just seems like you would need the kernel-level support active before udev could do its magic (maybe udev can do it all now -- I'll let Andrei fill in the details -- I don't know)
--
David C. Rankin, J.D.,P.E.
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
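For context, a sketch of what that Arch-side hook looks like (a hypothetical /etc/mkinitcpio.conf excerpt; the mdadm_udev hook pulls mdadm and its udev rules into the image):

HOOKS="base udev autodetect modconf block mdadm_udev filesystems keyboard fsck"

# mkinitcpio -p linux    # rebuild the image for the 'linux' preset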
On Thu, 12 Jan 2017 01:04:14 -0600, David C. Rankin wrote:
On 01/11/2017 03:35 PM, Istvan Gabor wrote:
Hi Andrei,
Was the new log file I sent this morning useful?
I have tried openSUSE 13.2 and it assembles the arrays correctly at boot. I haven't tried Leap 42.1 because I don't have it installed.
Thanks,
Istvan
David, thank you.
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup.
Your supposition is correct. My initramfs and YaST setup do not have the arrays. When I install a new system I set up only the minimal requirements for it: I create only one partition, the root (/), and don't set up separate partitions for anything else. My fstab has only this root partition and a swap partition.
Like I said, this is just a 'thought' since I haven't waded into linux-raid on 42.2 yet and do not know what the underlying udev (should be Upray) involvement in activation and assembly is on this release. It just seems like you would need the kernel-level support active before udev could do its magic (maybe udev can do it all now -- I'll let Andrei fill in the details -- I don't know)
If the initramfs/initrd needs to know about the mdadm arrays, it also means that I have to recreate the initramfs whenever I set up a new array or remove one. That doesn't sound good. I can try to activate my arrays and make a new initrd.

Thanks,

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-12 10:03, Istvan Gabor wrote:
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup.
Your supposition is correct. My initram and yast does not have the arrays. When I install a new system I set up only the minimal requirements for it. I install only one partition, the root (/), and don't set up separate partitions. My fstab has only this root partition and a swap partition.
if fstab doesn't mention the array, how would the system know it has to set up the array? Maybe that would trigger initramfs creation. -- Cheers / Saludos, Carlos E. R. (from 42.2 x86_64 "Malachite" at Telcontar)
On Thu, 12 Jan 2017 13:06:49 +0100, Carlos E. R. wrote:
On 2017-01-12 10:03, Istvan Gabor wrote:
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup.
Your supposition is correct. My initram and yast does not have the arrays. When I install a new system I set up only the minimal requirements for it. I install only one partition, the root (/), and don't set up separate partitions. My fstab has only this root partition and a swap partition.
if fstab doesn't mention the array, how would the system know it has to set up the array? Maybe that would trigger initram creation.
fstab only describes which partitions to mount and where. Partition/volume/device recognition doesn't require fstab entry. Cheers, Istvan -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-12 18:01, Istvan Gabor wrote:
On Thu, 12 Jan 2017 13:06:49 +0100, Carlos E. R. wrote:
On 2017-01-12 10:03, Istvan Gabor wrote:
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup.
Your supposition is correct. My initram and yast does not have the arrays. When I install a new system I set up only the minimal requirements for it. I install only one partition, the root (/), and don't set up separate partitions. My fstab has only this root partition and a swap partition.
if fstab doesn't mention the array, how would the system know it has to set up the array? Maybe that would trigger initram creation.
fstab only describes which partitions to mount and where. Partition/volume/device recognition doesn't require fstab entry.
Or array mounts, too:

/dev/md0   /data/raid   xfs   defaults,nofail,relatime   1 3

Maybe I misunderstood you :-?
--
Cheers / Saludos,
Carlos E. R.
(from 42.2 x86_64 "Malachite" at Telcontar)
On Thu, 12 Jan 2017 20:17:49 +0100, Carlos E. R. wrote:
On 2017-01-12 18:01, Istvan Gabor wrote:
On Thu, 12 Jan 2017 13:06:49 +0100, Carlos E. R. wrote:
On 2017-01-12 10:03, Istvan Gabor wrote:
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup.
Your supposition is correct. My initram and yast does not have the arrays. When I install a new system I set up only the minimal requirements for it. I install only one partition, the root (/), and don't set up separate partitions. My fstab has only this root partition and a swap partition.
if fstab doesn't mention the array, how would the system know it has to set up the array? Maybe that would trigger initram creation.
fstab only describes which partitions to mount and where. Partition/volume/device recognition doesn't require fstab entry.
Or array mounts, too:
/dev/md0 /data/raid xfs defaults,nofail,relatime 1 3
Maybe I misunderstood you :-?
raid arrays are partition volumes too. In mdraid system they are designated as /dev/md* devices. Istvan -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On 01/12/2017 03:03 AM, Istvan Gabor wrote:
If the intram/initrd requires to have the mdadm arrays it also means that I have to recreate initram when I set up a new array or remove an array. Doesn't sound good.
It's not that bad. When you add an array to the system, you enable the mdadm hook in /etc/sysconfig and make sure you have your arrays defined in /etc/mdadm.conf. Once that hook is in place, you only need to rebuild the initramfs once; then whatever has replaced SuSEconfig should automatically add it to the rebuild for any future kernel. When you remove the arrays, it's no big deal if the hook is left in place (as long as you change /etc/mdadm.conf and /etc/fstab to reflect the removal). It doesn't hurt to have mdadm check whether there are arrays that need to be assembled even if there are none.
--
David C. Rankin, J.D.,P.E.
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
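On openSUSE the corresponding rebuild step is done with dracut (on 42.x, mkinitrd is a wrapper around it); a minimal sketch, assuming the arrays are already listed in /etc/mdadm.conf:

# dracut --force     # regenerate the initrd for the running kernel (or simply: mkinitrd)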
On 01/13/2017 01:56 AM, David C. Rankin wrote:
It's not that bad. When you add an array to the system, you a enable the mdadm hook in /etc/sysconfig and make sure you have your arrays defined in /etc/mdadm.conf. Once that hook is in place, you only need to rebuild the initramfs once, then whatever is now SuSEconfig, should automatically add that to the rebuild of any future kernel. When you remove the arrays, it's no big deal if the hook is left in place (as long as you change /etc/mdadm.conf and /etc/fstab to reflect the removal). It doesn't hurt to have mdadm check if there are arrays that need to be assembled even if there are none.
Just saw Andrei's post -- looks like opensuse doesn't need mdadm hooks in initramfs -- the problem is elsewhere... Glad I didn't jump right in with a linux-raid 4 disk install :) -- David C. Rankin, J.D.,P.E. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On 13/01/17 08:10, David C. Rankin wrote:
Just saw Andrei's post -- looks like opensuse doesn't need mdadm hooks in initramfs -- the problem is elsewhere... Glad I didn't jump right in with a linux-raid 4 disk install :)
You NEED mdadm in your initramfs (unless you have a v1 mirror). Bear in mind, though, that I have no mdadm.conf that I can find on my system, and it boots off my mirror fine.

The normal sequence of events on a RAID boot is for the initramfs to be loaded, mdadm is called to assemble the arrays, and then a switchroot is done to the "/" mount.

Because Linux can no longer assemble arrays itself, the initramfs is needed for anything other than said v1 mirror - such a mirror looks to the kernel like any other normal file system.

Cheers,
Wol
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
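A quick way to check whether mdadm and the assembly rule actually ended up inside the initramfs the system boots from is lsinitrd, which ships with dracut (the image path is an assumption; adjust it to the installed kernel):

# lsinitrd /boot/initrd-$(uname -r) | grep -E 'mdadm|64-md-raid'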
On 2017-01-13 12:48, Wols Lists wrote:
Because linux can no longer assemble arrays itself, that is why the initramfs is needed for anything other said v1 mirror - said mirror looking to the kernel like any other normal file system.
Mmm. initramfs holds kernel modules.

Just remembered. You have to mark partitions as being of RAID type in order for the raids to be assembled automatically:

/dev/sdd15  2598531072 2762371071  163840000  78.1G fd Linux raid autodetect
--
Cheers / Saludos,
Carlos E. R.
(from 42.2 x86_64 "Malachite" at Telcontar)
On 13/01/17 12:57, Carlos E. R. wrote:
On 2017-01-13 12:48, Wols Lists wrote:
Because linux can no longer assemble arrays itself, that is why the initramfs is needed for anything other said v1 mirror - said mirror looking to the kernel like any other normal file system.
Mmm. initramfs holds kernel modules.
Just remembered. You have to mark partititions to be of raid type in order for the raids to be assembled automatically.
/dev/sdd15 2598531072 2762371071 163840000 78.1G fd Linux raid autodetect
As I said, AUTOMATIC ASSEMBLY NO LONGER WORKS. It's a kernel feature that was removed some time in the 3.x series. "type raid" no longer means anything - all my partitions are type linux (8300 or 8200). And once you invoke mdadm - which you NEED to do for modern kernels - partition type is ignored. So sorry, your advice *was* correct, but times have moved on and it's been rendered obsolete and redundant :-( Cheers, Wol -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
Wols Lists wrote:
On 13/01/17 12:57, Carlos E. R. wrote:
On 2017-01-13 12:48, Wols Lists wrote:
Because linux can no longer assemble arrays itself, that is why the initramfs is needed for anything other said v1 mirror - said mirror looking to the kernel like any other normal file system.
Mmm. initramfs holds kernel modules.
Just remembered. You have to mark partititions to be of raid type in order for the raids to be assembled automatically.
/dev/sdd15 2598531072 2762371071 163840000 78.1G fd Linux raid autodetect
As I said, AUTOMATIC ASSEMBLY NO LONGER WORKS. It's a kernel feature that was removed some time in the 3.x series. "type raid" no longer means anything - all my partitions are type linux (8300 or 8200).
Ditto, we've stopped using 0xfd long ago. -- Per Jessen, Zürich (0.4°C) http://www.cloudsuisse.com/ - your owncloud, hosted in Switzerland. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-13 18:14, Wols Lists wrote:
On 13/01/17 12:57, Carlos E. R. wrote:
On 2017-01-13 12:48, Wols Lists wrote:
Because linux can no longer assemble arrays itself, that is why the initramfs is needed for anything other said v1 mirror - said mirror looking to the kernel like any other normal file system.
Mmm. initramfs holds kernel modules.
Just remembered. You have to mark partititions to be of raid type in order for the raids to be assembled automatically.
/dev/sdd15 2598531072 2762371071 163840000 78.1G fd Linux raid autodetect
As I said, AUTOMATIC ASSEMBLY NO LONGER WORKS. It's a kernel feature that was removed some time in the 3.x series. "type raid" no longer means anything - all my partitions are type linux (8300 or 8200).
And once you invoke mdadm - which you NEED to do for modern kernels - partition type is ignored.
So sorry, your advice *was* correct, but times have moved on and it's been rendered obsolete and redundant :-(
But I don't invoke mdadm, and they mount. Something may invoke it, but it is not me.

2016-12-24 00:54:38+01:00 - Booting the system now
================================================================================
Linux Telcontar 3.12.62-55-desktop #1 SMP PREEMPT Thu Oct 20 08:47:11 UTC 2016 (b0aa9a6) x86_64 x86_64 x86_64 GNU/Linux
<5.6> 2016-12-24 00:54:40 Telcontar rsyslogd - - - [origin software="rsyslogd" swVersion="7.4.7" x-pid="1306" x-info="http://www.rsyslog.com"] start
<3.6> 2016-12-24 00:54:40 Telcontar systemd-modules-load 354 - - Inserted module 'sg'
<3.6> 2016-12-24 00:54:40 Telcontar systemd 1 - - Started Load Kernel Modules.
<3.6> 2016-12-24 00:54:40 Telcontar systemd 1 - - Mounted FUSE Control File System.
<3.6> 2016-12-24 00:54:40 Telcontar systemd 1 - - Mounting Configuration File System...
<0.4> 2016-12-24 00:54:40 Telcontar kernel - - - [ 5.195005] raid6: sse2x1 4964 MB/s
<0.4> 2016-12-24 00:54:40 Telcontar kernel - - - [ 5.217007] raid6: sse2x2 6355 MB/s
<0.4> 2016-12-24 00:54:40 Telcontar kernel - - - [ 5.239009] raid6: sse2x4 8371 MB/s
<0.4> 2016-12-24 00:54:40 Telcontar kernel - - - [ 5.244340] raid6: using algorithm sse2x4 (8371 MB/s)
<0.4> 2016-12-24 00:54:40 Telcontar kernel - - - [ 5.249662] raid6: using ssse3x2 recovery algorithm
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.826761] md: raid6 personality registered for level 6
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.832497] md: raid5 personality registered for level 5
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.838093] md: raid4 personality registered for level 4
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.843735] md/raid:md0: device sdb10 operational as raid disk 1
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.849036] md/raid:md0: device sdc15 operational as raid disk 2
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.854467] md/raid:md0: device sda11 operational as raid disk 0
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.860133] md/raid:md0: allocated 3298kB
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.865329] md/raid:md0: raid level 5 active with 3 out of 3 devices, algorithm 2
<0.7> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.870550] RAID conf printout:
<0.7> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.870551] --- level:5 rd:3 wd:3
<0.7> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.870553] disk 0, o:1, dev:sda11
<0.7> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.870554] disk 1, o:1, dev:sdb10
<0.7> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.870556] disk 2, o:1, dev:sdc15
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.870918] created bitmap (1 pages) for device md0
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.876376] md0: bitmap initialized from disk: read 1 pages, set 0 of 193 bits
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.974409] md0: detected capacity change from 0 to 25777668096
<0.6> 2016-12-24 00:54:40 Telcontar kernel - - - [ 17.997321] Adding 15727612k swap on /dev/sda3. Priority:1 extents:1 across:15727612k FS
<3.6> 2016-12-24 00:55:00 Telcontar systemd 1 - - Starting LSB: mdadmd daemon monitoring MD devices...
<3.6> 2016-12-24 00:55:00 Telcontar echo 3716 - - Starting virus-scanner (amavisd-new):
<3.6> 2016-12-24 00:55:01 Telcontar mdadmd 3717 - - Starting mdadmd ..done

mdadm is called 20 seconds after the array appears on boot. According to the log above, it is the kernel directly that does the assembly.
--
Cheers / Saludos,
Carlos E. R.
(from 42.2 x86_64 "Malachite" at Telcontar)
On 13/01/17 19:24, Carlos E. R. wrote:
<3.6> 2016-12-24 00:55:00 Telcontar systemd 1 - - Starting LSB: mdadmd daemon monitoring MD devices...
<3.6> 2016-12-24 00:55:00 Telcontar echo 3716 - - Starting virus-scanner (amavisd-new):
<3.6> 2016-12-24 00:55:01 Telcontar mdadmd 3717 - - Starting mdadmd ..done
mdadm is called 20 seconds after the array appears on boot. According to the log above, it is the kernel directly who does the assembly.
What kernel? What distro? Oh - and that first line I've quoted says "starting mdadmd monitoring" ... I don't think that's anything to do with starting the arrays, it's firing up the monitor that emails you if anything goes wrong ... Cheers, Wol -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-13 20:43, Wols Lists wrote:
On 13/01/17 19:24, Carlos E. R. wrote:
<3.6> 2016-12-24 00:55:00 Telcontar systemd 1 - - Starting LSB: mdadmd daemon monitoring MD devices... <3.6> 2016-12-24 00:55:00 Telcontar echo 3716 - - Starting virus-scanner (amavisd-new): <3.6> 2016-12-24 00:55:01 Telcontar mdadmd 3717 - - Starting mdadmd ..done
mdadm is called 20 seconds after the array appears on boot. According to the log above, it is the kernel directly who does the assembly.
What kernel? What distro?
The first line in the log printed the kernel version. But I made a mistake, that log is for 13.1.
Oh - and that first line I've quoted says "starting mdadmd monitoring" ... I don't think that's anything to do with starting the arrays, it's firing up the monitor that emails you if anything goes wrong ...
Yes. That's the first appearance of mdadm in the log. The array was mounted earlier.

On Leap 42.2:

2016-12-26 20:39:16+01:00 - Booting the system now
================================================================================
Linux Telcontar 4.4.36-8-default #1 SMP Fri Dec 9 16:18:38 UTC 2016 (3ec5648) x86_64 x86_64 x86_64 GNU/Linux
<5.6> 2016-12-26 20:39:22 Telcontar rsyslogd - - - [origin software="rsyslogd" swVersion="8.4.0" x-pid="1551" x-info="http://www.rsyslog.com"] start
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 0.000000] microcode: CPU0 microcode updated early to revision 0xa0b, date = 2010-09-28
<3.6> 2016-12-26 20:39:22 Telcontar systemd 1 - - Found device /dev/disk/by-label/raid5.
<3.5> 2016-12-26 20:39:22 Telcontar systemd 1 - - data-raid.mount: Directory /data/raid to mount over is not empty, mounting anyway.
<3.6> 2016-12-26 20:39:22 Telcontar systemd 1 - - Mounting /data/raid...
<0.5> 2016-12-26 20:39:22 Telcontar kernel - - - [ 24.758408] REISERFS (device sdc6): Using r5 hash to sort names
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 24.876010] raid6: sse2x1 gen() 4952 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 24.944002] raid6: sse2x1 xor() 4806 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.012010] raid6: sse2x2 gen() 5267 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.080008] raid6: sse2x2 xor() 5807 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.148004] raid6: sse2x4 gen() 8218 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.188327] XFS (sdc8): Ending clean mount
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.216008] raid6: sse2x4 xor() 6689 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.216009] raid6: using algorithm sse2x4 gen() 8218 MB/s
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.216011] raid6: .... xor() 6689 MB/s, rmw enabled
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.216012] raid6: using ssse3x2 recovery algorithm
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.905538] md: raid6 personality registered for level 6
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.905540] md: raid5 personality registered for level 5
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.905540] md: raid4 personality registered for level 4
<0.5> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.932459] audit: type=1400 audit(1482781131.317:37): apparmor="STATUS" operation="profile_load" name="/usr/lib/nagios/plugins/check_icmp" pid=1169 comm="apparmor_parser"
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.936062] md/raid:md0: device sda11 operational as raid disk 0
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.936063] md/raid:md0: device sdc15 operational as raid disk 2
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.936064] md/raid:md0: device sdb10 operational as raid disk 1
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.951834] md/raid:md0: allocated 3316kB
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.953008] md/raid:md0: raid level 5 active with 3 out of 3 devices, algorithm 2
<0.7> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.953008] RAID conf printout:
<0.7> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.953009] --- level:5 rd:3 wd:3
<0.7> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.953010] disk 0, o:1, dev:sda11
<0.7> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.953011] disk 1, o:1, dev:sdb10
<0.7> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.953012] disk 2, o:1, dev:sdc15
<0.5> 2016-12-26 20:39:22 Telcontar kernel - - - [ 25.984767] audit: type=1400 audit(1482781131.369:38): apparmor="STATUS" operation="profile_load" name="/usr/lib/nagios/plugins/check_ide_smart" pid=1187 comm="apparmor_parser"
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 26.037347] created bitmap (1 pages) for device md0
<0.5> 2016-12-26 20:39:22 Telcontar kernel - - - [ 26.057130] audit: type=1400 audit(1482781131.441:39): apparmor="STATUS" operation="profile_load" name="/usr/lib/nagios/plugins/check_load" pid=1193 comm="apparmor_parser"
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 26.062355] md0: bitmap initialized from disk: read 1 pages, set 0 of 193 bits
<0.6> 2016-12-26 20:39:22 Telcontar kernel - - - [ 26.262738] md0: detected capacity change from 0 to 25777668096

Now, I don't think the timing in syslog is correct, though. Everything has the same timestamp; that can't be right either. If we take the above as exact, mounting happens earlier than assembly.

It is still the kernel that does the assembly, seconds into the boot. dmesg has slightly different info:

[ 24.403400] md/raid:md0: device sdb11 operational as raid disk 0
[ 24.403401] md/raid:md0: device sdd15 operational as raid disk 2
[ 24.403403] md/raid:md0: device sdc10 operational as raid disk 1
[ 24.403683] md/raid:md0: allocated 3316kB
[ 24.403729] md/raid:md0: raid level 5 active with 3 out of 3 devices, algorithm 2

So it happens 24 seconds after boot. And the mount:

[ 27.881141] XFS (md0): Mounting V4 Filesystem
[ 35.903167] XFS (md0): Starting recovery (logdev: internal)
[ 37.710756] XFS (md0): Ending recovery (logdev: internal)
[ 38.404876] XFS (dm-0): Mounting V4 Filesystem
[ 38.600968] XFS (dm-0): Starting recovery (logdev: internal)
[ 39.208397] XFS (dm-0): Ending recovery (logdev: internal)
--
Cheers / Saludos,
Carlos E. R.
(from 42.2 x86_64 "Malachite" at Telcontar)
On 13/01/17 20:09, Carlos E. R. wrote:
On 2017-01-13 20:43, Wols Lists wrote:
On 13/01/17 19:24, Carlos E. R. wrote:
<3.6> 2016-12-24 00:55:00 Telcontar systemd 1 - - Starting LSB: mdadmd daemon monitoring MD devices... <3.6> 2016-12-24 00:55:00 Telcontar echo 3716 - - Starting virus-scanner (amavisd-new): <3.6> 2016-12-24 00:55:01 Telcontar mdadmd 3717 - - Starting mdadmd ..done
mdadm is called 20 seconds after the array appears on boot. According to the log above, it is the kernel directly who does the assembly.
What kernel? What distro?

The first line in the log printed the kernel version. But I made a mistake, that log is for 13.1.
Ah. Now that I know what I'm looking for, I can see it ... :-)

But I've been doing a bit of searching trying to find when auto-assembly was deleted. It was deprecated in 2009, but I can't find when it was removed from the kernel. I'm sure it was, however. I'll ask on the raid mailing list, as I've just checked the kernel source and something like it appears to be there - strange ...

(I notice Andrei says it was assembled by udev, which makes sense... there's been a bunch of bugs with mdadm and udev fighting over disks :-)

Cheers,
Wol
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-13 21:23, Wols Lists wrote:
On 13/01/17 20:09, Carlos E. R. wrote:
What kernel? What distro? The first line in the log printed the kernel version. But I made a mistake, that log is for 13.1.
Ah. Now I know what I'm looking for I can see it ... :-)
:-) And then I posted the equivalent thing for 42.2, same hardware.
But I've been doing a bit of searching trying to find when auto-assembly was deleted. It was deprecated in 2009, but I can't find when it was removed from the kernel. I'm sure it was, however. I'll ask on the raid mailing list, as I've just checked the kernel source and something like it appears to be there - strange ...
(I notice Andrei says it was assembled by udev, which makes sense... there's been a bunch of bugs with mdadm and udev fighting over disks :-)
Could be udev, dunno. The log does not say, the entries are printed by the kernel, though. Could udev be telling the kernel what to do? -- Cheers / Saludos, Carlos E. R. (from 13.1 x86_64 "Bottle" (Minas Tirith))
On 14/01/17 02:07, Carlos E. R. wrote:
But I've been doing a bit of searching trying to find when auto-assembly
was deleted. It was deprecated in 2009, but I can't find when it was removed from the kernel. I'm sure it was, however. I'll ask on the raid mailing list, as I've just checked the kernel source and something like it appears to be there - strange ...
(I notice Andrei says it was assembled by udev, which makes sense... there's been a bunch of bugs with mdadm and udev fighting over disks :-)

Could be udev, dunno. The log does not say, the entries are printed by the kernel, though. Could udev be telling the kernel what to do?
Just got the correct answer straight from the horse's mouth :-)

auto-assembly is still there. It only works for 0.9 metadata, though.

0.9 is obsolete - if it breaks, it's unlikely to be fixed. And it has other problems, too. I noticed my (gentoo) kernel supports it, but it's also likely to be disabled by default at some point, if it isn't already, so that's another little push.

So you could still be using it, but you can't upgrade your array to a supported raid without moving off it. Typical kernel modernisation imho :-) - leave old features in until they break but effectively abandon them.

Cheers,
Wol
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-14 11:00, Wols Lists wrote:
On 14/01/17 02:07, Carlos E. R. wrote:
Just got the correct answer straight from the horse's mouth :-)
auto-assembly is still there. It only works for 0.9 metadata, though.
0.9 is obsolete - if it breaks, it's unlikely to be fixed. And it has other problems, too. I noticed my (gentoo) kernel supports it, but it's also likely to be disabled by default at some point, if it isn't already, so that's another little push.
So you could still be using it, but you can't upgrade your array to a supported raid without moving off it. Typical kernel modernisation imho :-) - leave old features in until they break but effectively abandon them.
Ah... How can one find which version one is using? -- Cheers / Saludos, Carlos E. R. (from 42.2 x86_64 "Malachite" at Telcontar)
Carlos E. R. wrote:
On 2017-01-14 11:00, Wols Lists wrote:
On 14/01/17 02:07, Carlos E. R. wrote:
Just got the correct answer straight from the horse's mouth :-)
auto-assembly is still there. It only works for 0.9 metadata, though.
0.9 is obsolete - if it breaks, it's unlikely to be fixed. And it has other problems, too. I noticed my (gentoo) kernel supports it, but it's also likely to be disabled by default at some point, if it isn't already, so that's another little push.
So you could still be using it, but you can't upgrade your array to a supported raid without moving off it. Typical kernel modernisation imho :-) - leave old features in until they break but effectively abandon them.
Ah...
How can one find which version one is using?
# cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4] [raid1]
md127 : active raid1 sdt[0] sdl[1]
      1952982848 blocks super 1.2 [2/2] [UU]
                        ^^^^^^^^^

md1 : active raid5 sda[0] sdd[2] sdc[4] sdb[1]
      5859337728 blocks super 1.2 level 5, 512k chunk, algorithm 2 [4/4] [UUUU]

md0 : active raid5 sdv[3] sdw[4] sdu[8](S) sdx[6] sdh[1] sdj[7](S) sdg[0] sdi[2]
      9764912640 blocks super 1.2 level 5, 512k chunk, algorithm 2 [6/6] [UUUUUU]

If it's not shown, I think it's 0.9, but there is probably some tool that will tell you exactly.
--
Per Jessen, Zürich (0.9°C)
http://www.dns24.ch/ - free dynamic DNS, made in Switzerland.
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-14 15:11, Per Jessen wrote:
Carlos E. R. wrote:
Ah...
How can one find which version one is using?
# cat /proc/mdstat Personalities : [raid6] [raid5] [raid4] [raid1] md127 : active raid1 sdt[0] sdl[1] 1952982848 blocks super 1.2 [2/2] [UU] ^^^^^^^^^
Ah.

Telcontar:~ # cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4]
md0 : active raid5 sdb11[4] sdd15[3] sdc10[1]
      25173504 blocks super 1.0 level 5, 128k chunk, algorithm 2 [3/3] [UUU]
      bitmap: 0/1 pages [0KB], 65536KB chunk

unused devices: <none>
If it's not shown, I think it's 0.9, but there is probably some tool that will tell you exactly.
It is 1.0. It appears to be assembled by the kernel :-? -- Cheers / Saludos, Carlos E. R. (from 42.2 x86_64 "Malachite" at Telcontar)
On 14/01/17 14:11, Per Jessen wrote:
If it's not shown, I think it's 0.9, but there is probably some tool that will tell you exactly.
ashdown anthony # mdadm --detail /dev/md127
/dev/md127:
        Version : 1.2
        ^^^^^^^^^^^^^
  Creation Time : Sun Jul 6 13:03:35 2014
     Raid Level : raid1
     Array Size : 2694204672 (2569.39 GiB 2758.87 GB)
  Used Dev Size : 2694204672 (2569.39 GiB 2758.87 GB)
   Raid Devices : 2
  Total Devices : 2
    Persistence : Superblock is persistent

    Update Time : Sat Jan 14 23:13:56 2017
          State : clean
 Active Devices : 2
Working Devices : 2
 Failed Devices : 0
  Spare Devices : 0

           Name : ashdown:0  (local to host ashdown)
           UUID : 42514e8a:2d127c98:7c2f52fe:60835b32
         Events : 515496

    Number   Major   Minor   RaidDevice State
       0       8       21        0      active sync   /dev/sdb5
       2       8        5        1      active sync   /dev/sda5

Cheers,
Wol
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
Wols Lists wrote:
On 14/01/17 14:11, Per Jessen wrote:
If it's not shown, I think it's 0.9, but there is probably some tool that will tell you exactly.
ashdown anthony # mdadm --detail /dev/md127
Thanks Wol, that's exactly what I had in mind. /Per -- Per Jessen, Zürich (0.2°C) http://www.dns24.ch/ - free dynamic DNS, made in Switzerland. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On Thu, 12 Jan 2017 01:04:14 -0600, David C. Rankin wrote:
On 01/11/2017 03:35 PM, Istvan Gabor wrote:
Hi Andrei,
Was the new log file I sent this morning useful?
I have tried openSUSE 13.2 and it assembles the arrays correctly at boot. I haven't tried Leap 42.1 because I don't have it installed.
Thanks,
Istvan
Istvan,
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch at least, mdadm is added as a hook to the initcpio initramfs creation setup so that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup. Like I said, this is just a 'thought' since I haven't waded into linux-raid on 42.2 yet and do not know what the underlying udev (should be Upray) involvement in activation and assembly is on this release. It just seems like you would need the kernel-level support active before udev could do its magic (maybe udev can do it all now -- I'll let Andrei fill in the details -- I don't know)
There is something that I forgot to mention on the list. I have a SATA Seagate HD which I use for backups. To decrease the debug load I disconnected this drive. When this HD is not attached, the arrays are assembled correctly at boot. I would say there's some problem with that hard drive, but I just ran a smartctl long test on it last Saturday and it passed; no errors were reported. openSUSE 13.1 and 13.2 systems assemble the arrays correctly even when the Seagate drive is attached.

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-12 18:12, Istvan Gabor wrote:
There is something that I forgot to mention in the list.
I have a SATA Seagate HD which use for backups. To decrease debug load I disconnected this drive. When this HD is not attached, the arrays are created correctly at boot. I would say there's some problem with that hard drive, but I've just run a smartctl long test on it last Saturday and it passed, no errors were reported. openSUSE 13.1 and 13.2 systems assemble the arrays correctly even when the Seagate drive is attached.
Perhaps a duplicate UUID or label. -- Cheers / Saludos, Carlos E. R. (from 42.2 x86_64 "Malachite" at Telcontar)
On Thu, 12 Jan 2017 20:19:41 +0100, Carlos E. R. wrote:
On 2017-01-12 18:12, Istvan Gabor wrote:
There is something that I forgot to mention in the list.
I have a SATA Seagate HD which use for backups. To decrease debug load I disconnected this drive. When this HD is not attached, the arrays are created correctly at boot. I would say there's some problem with that hard drive, but I've just run a smartctl long test on it last Saturday and it passed, no errors were reported. openSUSE 13.1 and 13.2 systems assemble the arrays correctly even when the Seagate drive is attached.
Perhaps a duplicate UUID or label.
I doubt it. The Seagate 1 TB HD is independent from the system disks which contain the raid arrays.

I experimented a little bit. I replaced the 1 TB Seagate with a 320 GB Western Digital WD3200AVVS drive. In this case the arrays were assembled correctly at boot. Then I replaced the WD drive with another 1 TB Seagate (ST1000DM003, same model but a different device). With this other Seagate the arrays are not assembled. So it seems that the problem is triggered by some device-specific property.

smartctl -i for the Seagate drives gives:

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.14 (AF)
Device Model:     ST1000DM003-1CH162
Serial Number:    xxxxxxxx
LU WWN Device Id: 5 000c50 06c9a272d
Firmware Version: CC47
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Thu Jan 12 21:05:45 2017 CET

==> WARNING: A firmware update for this drive is available,
see the following Seagate web pages:
http://knowledge.seagate.com/articles/en_US/FAQ/207931en
http://knowledge.seagate.com/articles/en_US/FAQ/223651en

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

Should I try to update the drive's firmware? I have never updated a HD's firmware so far.

The question why oS 13.x systems have no problem with the arrays in the same configuration still remains.

Cheers,

Istvan
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 2017-01-12 21:25, Istvan Gabor wrote:
On Thu, 12 Jan 2017 20:19:41 +0100, Carlos E. R. wrote:
Perhaps a duplicate UUID or label.
I doubt it. The Seagate 1 TB HD is independent from the system disks which contain the raid arrays.
I experimented a little bit. Replaced the 1 TB Seagate with a 320 GB Western Digital WD3200AVVS drive. In this case the arrays were assembled correctly at boot. The I replaced the WD drive with another 1 TB Seagate (ST1000DM003, same model but different device). With this other Seagate the arrays are not assembled. So it seems that problem it triggered by some device specific property.
It does look as if some identifier is duplicated. Check them all. -- Cheers / Saludos, Carlos E. R. (from 42.2 x86_64 "Malachite" at Telcontar)
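A simple way to check for duplicated identifiers across all disks, including the backup drive, is the standard util-linux tooling (a sketch only; look for repeated UUID, PARTUUID or LABEL values):

# lsblk -o NAME,SIZE,TYPE,FSTYPE,LABEL,UUID,PARTUUID
# blkid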
smartctl -i for the Seagate drives gives:
=== START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.14 (AF) Device Model: ST1000DM003-1CH162
Should I try to update the drive's firmware? I never updated a HD's firmware so far.
Probably you should try: http://ask.adaptec.com/app/answers/detail/a_id/17241/~/known-issues-with-sea... Make a backup before the firmware update!
On 2017-01-12 21:25, Istvan Gabor wrote:
==> WARNING: A firmware update for this drive is available, see the following Seagate web pages: http://knowledge.seagate.com/articles/en_US/FAQ/207931en http://knowledge.seagate.com/articles/en_US/FAQ/223651en
SMART support is: Available - device has SMART capability. SMART support is: Enabled
Should I try to update the drive's firmware? I never updated a HD's firmware so far.
Make sure that it is indeed available. In my case, I get this very similar message (is/may):

==> WARNING: A firmware update for this drive may be available,
see the following Seagate web pages:
http://knowledge.seagate.com/articles/en_US/FAQ/207931en
http://knowledge.seagate.com/articles/en_US/FAQ/223651en

and there is no update.

So, read those two articles; if I remember correctly, you have to compare the exact version of your disk with a table in that article or a link, and then you know what the issues are and whether there is an update.
--
Cheers / Saludos,
Carlos E. R.
(from 42.2 x86_64 "Malachite" at Telcontar)
12.01.2017 10:04, David C. Rankin wrote:
On 01/11/2017 03:35 PM, Istvan Gabor wrote:
Hi Andrei,
Was the new log file I sent this morning useful?
I have tried openSUSE 13.2 and it assembles the arrays correctly at boot. I haven't tried Leap 42.1 because I don't have it installed.
Thanks,
Istvan
Istvan,
I'll let Andrei decipher the udev messages, but I do have a thought. On Arch
I answered it off list (as I got the logs off list) - I do not see anything explaining this behavior. It looks like events related to the disk are simply lost, or udev for some reason decides not to apply some rules. Unfortunately udev debug logging is both quite voluminous (so you are always in danger of losing some output) and rather uninformative, making it hard to associate a particular log line with a specific event.
at least, mdadm is added as a hook to the initcpio initramfs creation setup so
You (should) need it only if your boot/hibernate devices are on MD RAID. Otherwise there is no point in adding them to the initrd.
that mdadm is present in your boot image. If OpenSuSE requires something similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup. Like I said, this is just a 'thought' since I haven't waded into linux-raid on 42.2 yet and do not know what the underlying udev (should be Upray) involvement in activation and assembly is on this release. It just seems like you would need the kernel-level support active before udev could do its magic (maybe udev can do it all now -- I'll let Andrei fill in the details -- I don't know)
It actually works fine with three disks and stops working when the fourth disk is added.

Currently I'm afraid I'm out of ideas on how to debug it. At least not without being able to reproduce it or having access to the problem configuration.
--
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org
On 12/01/17 17:24, Andrei Borzenkov wrote:
that mdadm is present in your boot image. If OpenSuSE requires something
similar, then when you moved arrays to your server, perhaps mdadm is missing from your image since it wasn't present when YAST created your setup. Like I said, this is just a 'thought' since I haven't waded into linux-raid on 42.2 yet and do not know what the underlying udev (should be Upray) involvement in activation and assembly is on this release. It just seems like you would need the kernel-level support active before udev could do its magic (maybe udev can do it all now -- I'll let Andrei fill in the details -- I don't know)
It actually works fine with three disks and stops working when the fourth disk is added.
Currently I'm afraid I'm out of ideas how to debug it. At least, without being able to reproduce it or having access to this problem configuration.
Raid 0.9 can have problems accurately identifying disks - I thought this was fixed in 1.0 (the problem doesn't apply to 1.1 and 1.2). You're not getting any errors or warnings about superblocks, are you? What version of mdadm are you using? mdadm 4.0 has just been released - it might well pay to upgrade. Cheers, Wol -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org
On Thu, 12 Jan 2017 20:12:21 +0000, Wols Lists wrote:
Raid 0.9 can have problems accurately identifying disks - I thought this was fixed in 1.0 (the problem doesn't apply to 1.1 and 1.2). You're not getting any errors or warnings about superblocks, are you?
What version of mdadm are you using? mdadm 4.0 has just been released - it might well pay to upgrade.
Thanks. I don't think it is an mdadm problem, for several reasons:
- The arrays are correctly assembled and run well in openSUSE 13.1 and 13.2.
- Running cfdisk on the disks which contain the partitions for the arrays triggers mdadm and array assembly immediately.
- The problem occurs only if a specific type/model of SATA HD is attached.

Thank you again,

Istvan
12.01.2017 00:35, Istvan Gabor wrote:
On Wed, 11 Jan 2017 06:58:21 +0300, Andrei Borzenkov wrote:
10.01.2017 22:32, Istvan Gabor wrote: ...
Upload output of "journalctl -b" somewhere, e.g. http://susepaste.org/
I tried to upload the output from journalctl -b to susepaste.org but the site didn't accept it. I guess it's too much to paste in. The whole log file is ~1 MB and contains 11678 lines. I have sent the log to your email too; I don't know if you've received it.
I did; as I answered in another mail, the udev debug output was not that useful, and nothing stands out between the good and bad cases. Also, some logs are definitely lost.
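(One assumption worth checking here - this is just journald's default behavior, not something confirmed from the logs: journald applies its own rate limiting, which can be switched off in /etc/systemd/journald.conf and the daemon restarted.)

[Journal]
RateLimitInterval=0

# systemctl restart systemd-journald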
Hi Andrei,
Was the new log file I sent this morning useful?
Not really. It does show that udevadm trigger is run a second time after the switch to the real root, so this can be ruled out.
I have tried openSUSE 13.2 and it assembles the arrays correctly at boot.
Well, it used a completely different approach, so it is not comparable.

As I mentioned, one problem is that logs are lost (apparently the journal cannot cope with this volume, or it is started too late). The log for the working case does show traces of the udev event(s) that trigger RAID assembly - but only the second part of them. Unfortunately the kernel appears to rate-limit messages as well; fortunately the Leap kernel appears to include a recent patch to control this on the kernel command line.

Could you once more try both good and bad cases with udev debugging, but now using

udev.log-priority=debug systemd.log_target=kmsg log_buf_len=16M printk.devkmsg=on

The last parameter disables rate limiting in the kernel. We should now hopefully get a full log, and it is enough to just send the output of "dmesg" (as everything should now be in the kernel buffer). Please compress the files; they are expected to be rather large.
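(For reference, this is just the usual way to do it on Leap - adjust if your setup differs: the parameters can be added for a single boot by pressing 'e' in the GRUB menu and appending them to the line starting with "linux", or made persistent via /etc/default/grub. Afterwards the kernel buffer can be captured and compressed directly.)

# vi /etc/default/grub      (append the parameters to GRUB_CMDLINE_LINUX_DEFAULT)
# grub2-mkconfig -o /boot/grub2/grub.cfg
(reboot and reproduce the bad case, then)
# dmesg | xz > dmesg-bad.txt.xz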
14.01.2017 21:31, Andrei Borzenkov wrote:
Could you once more try both good and bad cases with udev debugging, but now using
udev.log-priority=debug systemd.log_target=kmsg log_buf_len=16M printk.devkmsg=on
OK, this was more successful. Now we at least see

[   23.647300] systemd-udevd[623]: IMPORT '/sbin/mdadm --incremental --export /dev/sdc21 --offroot ${DEVLINKS}' /usr/lib/udev/rules.d/64-md-raid-assembly.rules:33
[   26.224213] systemd-udevd[623]: Process '/sbin/mdadm --incremental --export /dev/sdc21 --offroot ${DEVLINKS}' failed with exit code 1.

for all components. This proves that the startup sequence is correct. What you can try next is to insert "-vvv" into the mdadm invocation in the udev rules; something like

ACTION=="add|change", IMPORT{program}="BINDIR/mdadm -vvv --incremental --export $devnode --offroot ${DEVLINKS}"

This will hopefully provide some additional output when mdadm fails, and this output should be logged by udev. Run the bad case with the same kernel parameters. There is no need to run the good case, as we know where it fails now.
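(For the mechanics of that - this is only the standard way to override a packaged rule, nothing specific to mdadm: copy the rule to /etc/udev/rules.d/, where it takes precedence, add -vvv on the IMPORT line there with BINDIR replaced by /sbin, and reload udev. If the arrays are already being assembled in the initrd, the initrd would need to be rebuilt as well.)

# cp /usr/lib/udev/rules.d/64-md-raid-assembly.rules /etc/udev/rules.d/
# vi /etc/udev/rules.d/64-md-raid-assembly.rules    (insert -vvv after mdadm on the IMPORT line)
# udevadm control --reload
# dracut --force     (only if assembly already happens in the initrd)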
16.01.2017 20:01, Andrei Borzenkov wrote:
This proves that startup sequence is correct. What you can try next - insert "-vvv" into mdadm invocation in udev rules; something like
ACTION=="add|change", IMPORT{program}="BINDIR/mdadm -vvv --incremental
s@BINDIR@/sbin@ of course; copied directly from mdadm sources.
--export $devnode --offroot ${DEVLINKS}"
This hopefully will provide some additional output when mdadm fails and this output should be logged by udev. Run bad case with the same kernel parameters. There is no need to run good case as we know where it fails now.
16.01.2017 20:05, Andrei Borzenkov wrote:
16.01.2017 20:01, Andrei Borzenkov wrote:
This proves that startup sequence is correct. What you can try next - insert "-vvv" into mdadm invocation in udev rules; something like
ACTION=="add|change", IMPORT{program}="BINDIR/mdadm -vvv --incremental
s@BINDIR@/sbin@ of course; copied directly from mdadm sources.
--export $devnode --offroot ${DEVLINKS}"
This hopefully will provide some additional output when mdadm fails and this output should be logged by udev. Run bad case with the same kernel parameters. There is no need to run good case as we know where it fails now.
Unfortunately it did not show anything new. We can actually see (as debug output from other rules) that the device does have the correct UUID and MD name expected by the mdadm.conf you showed earlier, yet mdadm simply fails silently, without any explanation.

Sorry, I have to give up. I suggest you open a bug report and assign it to the mdadm maintainer (nfbrown@suse.com).
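(It may also help the bug report to show the failure outside of udev - the device below is just the component name from the earlier log, substitute the real one:)

# /sbin/mdadm -vvv --incremental --export /dev/sdc21 --offroot; echo "exit code: $?"
# mdadm --examine /dev/sdc21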
On 17/01/17 03:36, Andrei Borzenkov wrote:
Unfortunately it did not show anything new. mdadm simply silently fails. We can actually see (as debug output in other rules) that device does have correct UUID and MD name as expected by mdadm.conf you shown earlier. So mdadm simply silently fails without any explanation.
Sorry, I have to give up. I suggest you open bug report and assign it to mdadm maintainer (nfbrown@suse.com).
mdadm is now maintained by Jes Sorensen <Jes.Sorensen@redhat.com>. Neil handed it over at the start of 2016. Put a post on the mailing list and both Jes and Neil will see it (Neil has adopted an "elder statesman" role :-).

Cheers,
Wol
17.01.2017 19:37, Wols Lists wrote:
mdadm is now maintained by Jes Sorensen Jes.Sorensen@redhat.com
Sigh ...

bor@bor-Latitude-E5450:~$ osc maintainer -v Base:System mdadm
bugowner of Base:System/mdadm :
  neilbrown Neil Brown <nfbrown@suse.com>
maintainer of Base:System/mdadm :
  michal-m Michal Marek <mmarek@suse.com>
  msmeissn Marcus Meissner <meissner@suse.com>
  neilbrown Neil Brown <nfbrown@suse.com>
On 17/01/17 16:51, Andrei Borzenkov wrote:
Sigh ...
Are you sure? osc looks to me like a SUSE command, and as a SUSE person, Neil could still be the package maintainer ...
bor@bor-Latitude-E5450:~$ osc maintainer -v Base:System mdadm
bugowner of Base:System/mdadm :
  neilbrown Neil Brown <nfbrown@suse.com>
maintainer of Base:System/mdadm :
  michal-m Michal Marek <mmarek@suse.com>
  msmeissn Marcus Meissner <meissner@suse.com>
  neilbrown Neil Brown <nfbrown@suse.com>
Especially as it doesn't say mdadm, it says "Base:System/mdadm". Jes is at Red Hat, why would he be maintaining a SUSE package?

Cheers,
Wol
participants (7):
- Andrei Borzenkov
- Carlos E. R.
- David C. Rankin
- Florian Gleixner
- Istvan Gabor
- Per Jessen
- Wols Lists