[opensuse] Leap 15.2: Mounting XFS filesystem fails on large RAID partitions
Hi Folks,

I ran into a problem installing 15.2 on a system with a couple of large RAID partitions. I just filed bugid 1174056, but I thought I'd mention it here just in case.

Bug# 1174056: Installation from full Leap 15.2 ISO fails when attempting to mount XFS filesystems on large RAID partitions. Error dialog follows:

  mount -t XFS /dev/sdc1 /mnt
  mount: /export/data: mount(2) system call failed: structure needs cleaning.

After booting without mounting the filesystems, xfs_repair returns:

  xfs_repair -n /dev/sdc1:
  Phase 1 - Find and Verify Superblock...
  Bad Primary Superblock - bad stripe width in Superblock!
  Attempting to find secondary superblock...
  ........ (I didn't wait around, too many dots)

Formatting the same partitions with Ext4 works as expected.

RAID Controller: Avago 3108 MegaRAID
  /dev/sdc1: 267-TB
  /dev/sdd1: 127-TB

I haven't seen this on other systems with similar RAID partitions with Leap 15.1 and lower. Any ideas? mkfs.xfs seems to work, at least it doesn't complain about anything.

Regards, Lew
On Sun, 12 Jul 2020 18:23:05 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
Hi Folks,
I ran into a problem installing 15.2 on a system with a couple of large RAID partitions. I just filed bugid 1174056, but I thought I'd mention it here just in case.
Bug# 1174056:
I'm confused.
Installation from full Leap 15.2 ISO fails when attempting to mount XFS filesystems on large RAID partitions.
Error dialog follows:
mount -t XFS /dev/sdc1 /mnt
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it? Why is the type XFS rather than the expected xfs?
mount: /export/data: mount(2) system call failed: structure needs cleaning.
After booting without mounting the filesystems, xfs_repair returns:
xfs_repair -n /dev/sdc1:
Phase 1 - Find and Verify Superblock... Bad Primary Superblock - bad stripe width in Superblock! Attempting to find secondary superblock... ........ (I didn't wait around, too many dots)
Formatting the same partitions with Ext4 works as expected.
RAID Controller: Avago 3108 MegaRAID /dev/sdc1: 267-TB /dev/sdd1: 127-TB
What RAID type are you using? You don't say. I had guessed RAID1 with two partitions, but now you show they are different sizes?
I haven't seen this on other systems with similar RAID partitions with Leap 15.1 and lower.
Any ideas? mkfs.xfs seems to work, at least it doesn't complain about anything.
Regards, Lew
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it?
I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible. -- Per Jessen, Zürich (21.9°C) http://www.dns24.ch/ - your free DNS host, made in Switzerland.
On 07/13/2020 03:58 AM, Per Jessen wrote:
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it? I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible.
Yes, the RAID controller assembles, in this case, 36 14-TB SAS disks into two volumes: /dev/sdc and /dev/sdd. The volumes are each GPT labeled and partitions created as /dev/sdc1 and /dev/sdd1. mkfs.xfs is then used to create the two filesystems. mkfs.ext4 worked okay, which leads me to think that mkfs.xfs or something in the XFS libraries is broken.

The system is remote (I'm teleworking), but I'll go in today and try a couple of things, like booting a 15.1 rescue ISO to see if it can mount the partitions. If it can't, I'll try the 15.1 mkfs.xfs and see what happens. Obviously 15.2 mkfs.xfs works elsewhere; I installed it a couple of days ago on a high-end gamer laptop and used XFS on its /home partition.

BTW, the 15.2 install on that laptop was the easiest SuSE install, on a laptop, that I've ever done. The special function keys for volume and screen brightness worked, and NetworkManager seamlessly works with wired and WiFi too! I've seen only one small issue, where settings in a konsole window, like font size and background color, aren't persistent. Setting changes don't survive logout/login. Maybe I'm doing something wrong?

Regards, Lew
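For reference, a minimal sketch of the sequence described above, assuming parted is used for the labeling/partitioning step (the thread doesn't say which tool was actually used; device names are as reported here):

  # Label the RAID volume with GPT and create a single partition spanning it
  parted -s /dev/sdc mklabel gpt
  parted -s /dev/sdc mkpart primary 0% 100%

  # Create the XFS filesystem with defaults (mkfs.xfs derives its stripe
  # geometry from the I/O sizes the RAID volume reports), then try to mount it
  mkfs.xfs /dev/sdc1
  mount -t xfs /dev/sdc1 /mnt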
Lew Wolfgang wrote:
On 07/13/2020 03:58 AM, Per Jessen wrote:
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it? I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible.
Yes, the RAID controller assembles, in this case 36 14-TB SAS disks, into two volumes: /dev/sdc and /dev/sdd. The volumes are each GPT labeled and partitions created as /dev/sdc1 and /dev/sdd1. mkfs.xfs is then used to create the two filesystems. mkfs.ext4 worked okay, which leads me to think that mkfs.xfs or something in the XFS libraries is broken.
You do have some pretty sizeable volumes, but while we may not be testing that at openSUSE, somebody will have, I'm sure.
I've seen only one small issue, where settings in a konsole window, like font-size/background-color aren't persistent. Setting changes don't survive logout/login. Maybe I'm doing something wrong?
It has worked for me except for the 1st tab in the console window which remains at the default. I set a bigger font and scheme "Linux Colours". -- Per Jessen, Zürich (26.6°C) http://www.dns24.ch/ - free dynamic DNS, made in Switzerland.
On Mon, 13 Jul 2020 08:15:03 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
On 07/13/2020 03:58 AM, Per Jessen wrote:
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it? I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible.
Yes, the RAID controller assembles, in this case 36 14-TB SAS disks, into two volumes: /dev/sdc and /dev/sdd. The volumes are each GPT labeled and partitions created as /dev/sdc1 and /dev/sdd1. mkfs.xfs is then used to create the two filesystems. mkfs.ext4 worked okay, which leads me to think that mkfs.xfs or something in the XFS libraries is broken.
The system is remote (I'm teleworking), but I'll go in to day and try a couple of things, like booting a 15.1 rescue ISO to see if can mount the partitions. If it can't, I'll try the 15.1 mkfs.xfs and see what happens.
Obviously 15.2 mkfs.xfs works elsewhere, I installed it a couple of days ago on a high-end gamer laptop and used XFS on its /home partition. BTW, the 15.2 install on that laptop was the easiest SuSE install, on a laptop, that I've ever done. The special function keys for volume and screen brightness worked, NetworkManager seamlessly works with wired and WiFi too! I've seen only one small issue, where settings in a konsole window, like font-size/background-color aren't persistent. Setting changes don't survive logout/login. Maybe I'm doing something wrong?
Thanks for the explanation. I'm still a bit confused though. Your bug report is about installation but what you're discussing appears to be a problem creating an xfs filesystem? But you haven't shown any details of that creation. Neither any output nor any arguments supplied to it.
On 07/13/2020 12:32 PM, Dave Howorth wrote:
On Mon, 13 Jul 2020 08:15:03 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it?

On 07/13/2020 03:58 AM, Per Jessen wrote:
I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible.

Yes, the RAID controller assembles, in this case 36 14-TB SAS disks, into two volumes: /dev/sdc and /dev/sdd. The volumes are each GPT labeled and partitions created as /dev/sdc1 and /dev/sdd1. mkfs.xfs is then used to create the two filesystems. mkfs.ext4 worked okay, which leads me to think that mkfs.xfs or something in the XFS libraries is broken.
The system is remote (I'm teleworking), but I'll go in to day and try a couple of things, like booting a 15.1 rescue ISO to see if can mount the partitions. If it can't, I'll try the 15.1 mkfs.xfs and see what happens.
Thanks for the explanation.
I'm still a bit confused though. Your bug report is about installation but what you're discussing appears to be a problem creating an xfs filesystem? But you haven't shown any details of that creation. Neither any output nor any arguments supplied to it.
I kept it short and sweet for the bug report; a failed installation is something you can hang your hat on. I instructed the installation process to create the two large filesystems. The partitioner complained that the "structure needs cleaning". I hit the "ignore" button, but the first boot failed with the mount problem.

After the install failed to boot, I commented out the two fstab entries and booted without mounting the RAID partitions. I then tried to build new filesystems using YaST's partitioner, gparted, and mkfs.xfs. In all cases the failure appeared when trying to mount the just-created filesystems.

Mount returns:

  mount -t XFS /dev/sdc1 /mnt
  mount: /export/data: mount(2) system call failed: structure needs cleaning.

Then, xfs_repair returns:

  Phase 1 - Find and Verify Superblock...
  Bad Primary Superblock - bad stripe width in Superblock!
  Attempting to find secondary superblock...
  ........ (I didn't wait around, too many dots)

This morning, Arvin Schnell (Bugzilla) noticed this in /var/log/messages:

  [ 1361.758237] XFS (sdc1): SB stripe unit sanity check failed
  [ 1361.758315] XFS (sdc1): Metadata corruption detected at xfs_sb_read_verify+0xfe/0x170 [xfs], xfs_sb block 0xffffffffffffffff
  [ 1361.758315] XFS (sdc1): Unmount and run xfs_repair
  [ 1361.758316] XFS (sdc1): First 128 bytes of corrupted metadata buffer:

Same entries for sdd1. Note the 0xffffffffffffffff, an overflow somewhere?

Again, ext4 built without issue.

I'm leaving right now to try some additional things. Further news when I return.

Regards, Lew
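One way to see where the bad geometry comes from (a sketch, not commands from the thread) is to compare what the superblock recorded against what the controller advertises to the block layer:

  # Read-only dump of the stripe fields mkfs.xfs wrote into the superblock
  # ("unit"/"width" are the stripe unit and width in filesystem blocks)
  xfs_db -r -c 'sb 0' -c 'p blocksize unit width' /dev/sdc1

  # What the RAID volume reports to the kernel; mkfs.xfs derives its
  # default sunit/swidth from these values
  cat /sys/block/sdc/queue/minimum_io_size /sys/block/sdc/queue/optimal_io_size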
On 07/13/2020 12:52 PM, Lew Wolfgang wrote:
On 07/13/2020 12:32 PM, Dave Howorth wrote:
On Mon, 13 Jul 2020 08:15:03 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it?

On 07/13/2020 03:58 AM, Per Jessen wrote:
I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible.

Yes, the RAID controller assembles, in this case 36 14-TB SAS disks, into two volumes: /dev/sdc and /dev/sdd. The volumes are each GPT labeled and partitions created as /dev/sdc1 and /dev/sdd1. mkfs.xfs is then used to create the two filesystems. mkfs.ext4 worked okay, which leads me to think that mkfs.xfs or something in the XFS libraries is broken.
The system is remote (I'm teleworking), but I'll go in to day and try a couple of things, like booting a 15.1 rescue ISO to see if can mount the partitions. If it can't, I'll try the 15.1 mkfs.xfs and see what happens.
Thanks for the explanation.
I'm still a bit confused though. Your bug report is about installation but what you're discussing appears to be a problem creating an xfs filesystem? But you haven't shown any details of that creation. Neither any output nor any arguments supplied to it.
I kept it short and sweet for the bug report, a failed installation is something you can hang your hat on. I instructed the installation process to create the two large filesystems. The partitioner complained that the "structure needs cleaning". I hit the "ignore" button, but the first boot failed with the mount problem.
After the install failed to boot, I commented out the two fstab entries and booted without mounting the RAID partitions. I then tried to build new filesystems using YaST's partitioner, gparted, and mkfs.xfs. In all cases the failure appeared when trying to mount the just-created filesystems.
Mount returns:
mount -t XFS /dev/sdc1 /mnt mount: /export/data: mount(2) system call failed: structure needs cleaning.
Then, xfs_repair returns:
Phase 1 - Find and Verify Superblock... Bad Primary Superblock - bad stripe width in Superblock! Attempting to find secondary superblock... ........ (I didn't wait around, too many dots)
This morning, Arvin Schnell (Bugzilla) noticed this in /var/log/messages:
[ 1361.758237] XFS (sdc1): SB stripe unit sanity check failed [ 1361.758315] XFS (sdc1): Metadata corruption detected at xfs_sb_read_verify+0xfe/0x170 [xfs], xfs_sb block 0xffffffffffffffff [ 1361.758315] XFS (sdc1): Unmount and run xfs_repair [ 1361.758316] XFS (sdc1): First 128 bytes of corrupted metadata buffer:
Same entries for sdd1.
Note the 0xffffffffffffffff, an overflow somewhere?
Again, ext4 built without issue.
I'm leaving right now to try some additional things. Further news when I return.
I'm back. The xfs_repair that I started yesterday finished, saying:

  "Sorry, could not find valid secondary superblock"

Today I booted the 15.1 rescue system and determined that mount works! It reported:

  4096 byte physical blocks
  574218043392 blocks for sda1 (drive lettering changed)
  273437163520 blocks for sdb1

Back running 15.2, fdisk reports the same block counts as 15.1.

Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis. This implies to me that the problem is probably in the 15.2 XIS kernel module?

Regards, Lew
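Once the filesystem mounts under 15.1, the geometry that 15.2's mkfs actually recorded can be read back directly (a sketch; the mount point is illustrative and the device lettering differs under the rescue system):

  # sunit/swidth in the xfs_info output are the stripe values the
  # superblock carries, in filesystem blocks
  mount -t xfs /dev/sdc1 /mnt
  xfs_info /mnt
  umount /mnt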
On 07/13/2020 04:21 PM, Lew Wolfgang wrote:
On 07/13/2020 12:52 PM, Lew Wolfgang wrote:
On 07/13/2020 12:32 PM, Dave Howorth wrote:
On Mon, 13 Jul 2020 08:15:03 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
Dave Howorth wrote:
Assuming /dev/sdc1 is part of the RAID, why are you trying to mount just it?

On 07/13/2020 03:58 AM, Per Jessen wrote:
I read /dev/sdc and /dev/sdd to be volumes "backed" by hardware RAID, individual drives not visible.

Yes, the RAID controller assembles, in this case 36 14-TB SAS disks, into two volumes: /dev/sdc and /dev/sdd. The volumes are each GPT labeled and partitions created as /dev/sdc1 and /dev/sdd1. mkfs.xfs is then used to create the two filesystems. mkfs.ext4 worked okay, which leads me to think that mkfs.xfs or something in the XFS libraries is broken.
The system is remote (I'm teleworking), but I'll go in to day and try a couple of things, like booting a 15.1 rescue ISO to see if can mount the partitions. If it can't, I'll try the 15.1 mkfs.xfs and see what happens.
Thanks for the explanation.
I'm still a bit confused though. Your bug report is about installation but what you're discussing appears to be a problem creating an xfs filesystem? But you haven't shown any details of that creation. Neither any output nor any arguments supplied to it.
I kept it short and sweet for the bug report, a failed installation is something you can hang your hat on. I instructed the installation process to create the two large filesystems. The partitioner complained that the "structure needs cleaning". I hit the "ignore" button, but the first boot failed with the mount problem.
After the install failed to boot, I commented out the two fstab entries and booted without mounting the RAID partitions. I then tried to build new filesystems using YaST's partitioner, gparted, and mkfs.xfs. In all cases the failure appeared when trying to mount the just-created filesystems.
Mount returns:
mount -t XFS /dev/sdc1 /mnt mount: /export/data: mount(2) system call failed: structure needs cleaning.
Then, xfs_repair returns:
Phase 1 - Find and Verify Superblock... Bad Primary Superblock - bad stripe width in Superblock! Attempting to find secondary superblock... ........ (I didn't wait around, too many dots)
This morning, Arvin Schnell (Bugzilla) noticed this in /var/log/messages:
[ 1361.758237] XFS (sdc1): SB stripe unit sanity check failed [ 1361.758315] XFS (sdc1): Metadata corruption detected at xfs_sb_read_verify+0xfe/0x170 [xfs], xfs_sb block 0xffffffffffffffff [ 1361.758315] XFS (sdc1): Unmount and run xfs_repair [ 1361.758316] XFS (sdc1): First 128 bytes of corrupted metadata buffer:
Same entries for sdd1.
Note the 0xffffffffffffffff, an overflow somewhere?
Again, ext4 built without issue.
I'm leaving right now to try some additional things. Further news when I return.
I'm back.
The xfs_repair that I started yesterday finished, saying:
"Sorry, could not find valid secondary superblock"
Today I booted the 15.1 rescue system and determined that mount works! It reported:
4096 byte physical blocks 574218043392 blocks for sda1 (drive lettering changed) 273437163520 blocks for sdb1
Back running 15.2, fdisk reports the same block counts as 15.1.
Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis.
This implies to me that the problem is probably in the 15.2 XIS kernel module?
s/xis/xfs, of course.

Regards, Lew
Lew Wolfgang wrote:
Today I booted the 15.1 rescue system and determined that mount works! It reported:
4096 byte physical blocks 574218043392 blocks for sda1 (drive lettering changed) 273437163520 blocks for sdb1
Back running 15.2, fdisk reports the same block counts as 15.1.
Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis.
This implies to me that the problem is probably in the 15.2 XIS kernel module?
I would be tempted to think so too, yes. OTOH, I went and googled it -
https://bugzilla.kernel.org/show_bug.cgi?id=202127 (also large XFS filesystem on hardware RAID).

I only skimmed it quickly, but they seem to be thinking it is a config issue at time of filesystem creation.

--
Per Jessen, Zürich (18.9°C)
http://www.cloudsuisse.com/ - your owncloud, hosted in Switzerland.
On 07/13/2020 11:32 PM, Per Jessen wrote:
Lew Wolfgang wrote:
Today I booted the 15.1 rescue system and determined that mount works! It reported:
4096 byte physical blocks 574218043392 blocks for sda1 (drive lettering changed) 273437163520 blocks for sdb1
Back running 15.2, fdisk reports the same block counts as 15.1.
Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis.
This implies to me that the problem is probably in the 15.2 XIS kernel module? I would be tempted to think so too, yes. OTOH, I went and googled it -
https://bugzilla.kernel.org/show_bug.cgi?id=202127 (also large XFS filesystem on hardware RAID).
I only skimmed it quickly, but they seem to be thinking it is a config issue at time of filesystem creation.
Wow, thanks Per! That sure looks like my problem. It may be a Broadcom controller firmware bug that was revealed with more recent Linux kernels. I'll look into updating the firmware, tomorrow. I just finished 16-oz of sangria with frozen blueberries, and it's 11:52-PM. One should not do root-ish things when drinking wine!

Regards, Lew
On Mon, 13 Jul 2020 23:57:45 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
On 07/13/2020 11:32 PM, Per Jessen wrote:
Lew Wolfgang wrote:
Today I booted the 15.1 rescue system and determined that mount works! It reported:
4096 byte physical blocks 574218043392 blocks for sda1 (drive lettering changed) 273437163520 blocks for sdb1
Back running 15.2, fdisk reports the same block counts as 15.1.
Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis.
This implies to me that the problem is probably in the 15.2 XIS kernel module? I would be tempted to think so too, yes. OTOH, I went and googled it -
https://bugzilla.kernel.org/show_bug.cgi?id=202127 (also large XFS filesystem on hardware RAID).
I only skimmed it quickly, but they seem to be thinking it is a config issue at time of filesystem creation.
Wow, thanks Per! That sure looks like my problem. It may be a Broadcom controller firmware bug that was revealed with more recent Linux kernels. I'll look into updating the firmware, tomorrow. I just finished 16-oz of sangria with frozen blueberries, and it's 11:52-PM. One should not do root-ish things when drinking wine!
Regards, Lew
It appears you may need to add your data and customer opinion weight to daimh's bug report to Broadcom.

It sounds like you can fix the new filesystems by overriding the firmware's buggy values as described in #23 and #24, and if you have any existing filesystems, you'll be able to mount them as described in #30.

That kind of response from Eric and Dave is why I like XFS! :)
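For the record, the shape of those two fixes looks roughly like this (the numbers are placeholders, not values for Lew's arrays; su/sw must match the controller's real geometry, and per xfs(5) the sunit/swidth mount options are given in 512-byte units):

  # New filesystems: override the firmware-reported geometry at mkfs time
  mkfs.xfs -f -d su=256k,sw=16 /dev/sdc1

  # Existing filesystems: mount with explicit stripe values
  # (256 KiB = 512 x 512-byte units; 16 data disks -> swidth = 16 * 512)
  mount -o sunit=512,swidth=8192 /dev/sdc1 /mnt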
On 07/14/2020 02:10 AM, Dave Howorth wrote:
On Mon, 13 Jul 2020 23:57:45 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
On 07/13/2020 11:32 PM, Per Jessen wrote:
Lew Wolfgang wrote:
Today I booted the 15.1 rescue system and determined that mount works! It reported:
4096 byte physical blocks 574218043392 blocks for sda1 (drive lettering changed) 273437163520 blocks for sdb1
Back running 15.2, fdisk reports the same block counts as 15.1.
Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis.
This implies to me that the problem is probably in the 15.2 XIS kernel module? I would be tempted to think so too, yes. OTOH, I went and googled it -
https://bugzilla.kernel.org/show_bug.cgi?id=202127 (also large XFS filesystem on hardware RAID).
I only skimmed it quickly, but they seem to be thinking it is a config issue at time of filesystem creation. Wow, thanks Per! That sure looks like my problem. It may be a Broadcom controller firmware bug that was revealed with more recent Linux kernels. I'll look into updating the firmware, tomorrow. I just finished 16-oz of sangria with frozen blueberries, and it's 11:52-PM. One should not do root-ish things when drinking wine!
Regards, Lew It appears you may need to add your data and customer opinion weight to daimh's bug report to Broadcom.
It sounds like you can fix the new filesystems by overriding the firmware's buggy values as described in #23, #24 and if you have any existing filesystems, you'll be able to mount them as described in #30
That kind of response from Eric and Dave is why I like XFS! :)
I think I'll try updating the firmware first, if applicable. We have dozens of older boxes with petabytes of stuff that may be threatened by Leap 15.2. At least this current box is new and doesn't have any production data on it yet.

Regards, Lew
On 07/14/2020 08:26 AM, Lew Wolfgang wrote:
On 07/14/2020 02:10 AM, Dave Howorth wrote:
On Mon, 13 Jul 2020 23:57:45 -0700 Lew Wolfgang <wolfgang@sweet-haven.com> wrote:
On 07/13/2020 11:32 PM, Per Jessen wrote:
Lew Wolfgang wrote:
Today I booted the 15.1 rescue system and determined that mount works! It reported:
4096 byte physical blocks 574218043392 blocks for sda1 (drive lettering changed) 273437163520 blocks for sdb1
Back running 15.2, fdisk reports the same block counts as 15.1.
Note that 15.1 was able to mount the XFS partitions created by 15.2's mks.xis.
This implies to me that the problem is probably in the 15.2 XIS kernel module? I would be tempted to think so too, yes. OTOH, I went and googled it -
https://bugzilla.kernel.org/show_bug.cgi?id=202127 (also large XFS filesystem on hardware RAID).
I only skimmed it quickly, but they seem to be thinking it is a config issue at time of filesystem creation. Wow, thanks Per! That sure looks like my problem. It may be a Broadcom controller firmware bug that was revealed with more recent Linux kernels. I'll look into updating the firmware, tomorrow. I just finished 16-oz of sangria with frozen blueberries, and it's 11:52-PM. One should not do root-ish things when drinking wine!
Regards, Lew It appears you may need to add your data and customer opinion weight to daimh's bug report to Broadcom.
It sounds like you can fix the new filesystems by overriding the firmware's buggy values as described in #23, #24 and if you have any existing filesystems, you'll be able to mount them as described in #30
That kind of response from Eric and Dave is why I like XFS! :)
I think I'll try updating the firmware first, if applicable. We have dozens of older boxes with petabytes of stuff that may be threatened by Leap 15.2. At least this current box is new and doesn't have any production data on it yet.
I still haven't gotten a confirmation about where/how to update the firmware in this case. It's a Broadcom controller modified by SuperMicro.

But in any case, in chasing down issues with methods suggested by Anthony Iliopoulos (Kernel Team), I discovered that when the RAID volumes in question were created with the Broadcom GUI, a parameter for stripe size was increased beyond the suggested default. This change created the I/O size clash that was flagged by the Leap 15.2 XFS kernel module. I deleted the two volumes, re-created them without changing the defaults, and all is well!

Left as a problem for the near future is that we'll probably see this again when upgrading existing 15.1 servers, but at least we'll be prepared for the unpleasantness and be able to take appropriate action.

Regards, Lew
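A quick way to sanity-check a rebuilt volume before trusting it again (an assumption on my part, not something described in the thread) is mkfs.xfs's dry-run mode, which prints the geometry it would use without writing anything, so the sunit/swidth it now derives from the controller defaults can be inspected first:

  # -N prints the filesystem parameters without creating the filesystem
  mkfs.xfs -N /dev/sdc1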
On 07/15/2020 10:00 PM, Lew Wolfgang wrote:
I still haven't gotten a confirmation about where/how to update the firmware in this case. It's Broadcom controller modified by SuperMicro.
But in any case, in chasing down issues with methods suggested by Anthony Iliopoulos (Kernel Team) I discovered that the RAID volumes in question, when they were created with the Broadcom GUI, a parameter for stripe size was increased beyond the suggested default. This change created the I/O size clash that was flagged by the Leap 15.2 XFS kernel module. I deleted the two volumes, and re-created them without changing the defaults, and all is well!
Left as a problem for the near future is that we'll probably see this again when upgrading existing 15.1 servers, but at least we'll be prepared for the unpleasantness and be able to take appropriate action.
Interesting. I never get to play with large numbers of spares or large arrays above the normal little-office world of 1-3T, but I did try to keep the chunk, stride and stripe of the arrays optimized for the arrays I was messing with:

Chunks: the hidden key to RAID performance
http://www.zdnet.com/article/chunks-the-hidden-key-to-raid-performance/

and

Calculating the stride and stripe width
https://wiki.archlinux.org/index.php/RAID#Calculating_the_stride_and_stripe_...

So what you ran into was an issue setting the stripe too large via the Broadcom GUI when adding the filesystem to your array?

--
David C. Rankin, J.D.,P.E.
On 07/15/2020 08:58 PM, David C. Rankin wrote:
On 07/15/2020 10:00 PM, Lew Wolfgang wrote:
I still haven't gotten a confirmation about where/how to update the firmware in this case. It's Broadcom controller modified by SuperMicro.
But in any case, in chasing down issues with methods suggested by Anthony Iliopoulos (Kernel Team) I discovered that the RAID volumes in question, when they were created with the Broadcom GUI, a parameter for stripe size was increased beyond the suggested default. This change created the I/O size clash that was flagged by the Leap 15.2 XFS kernel module. I deleted the two volumes, and re-created them without changing the defaults, and all is well!
Left as a problem for the near future is that we'll probably see this again when upgrading existing 15.1 servers, but at least we'll be prepared for the unpleasantness and be able to take appropriate action.
Interesting. I never get to play with large number of spares or large arrays above the normal little-office world of 1-3T, but did try and keep the chunk, stride and stripe of the arrays optimized for the arrays I was messing with:
Chunks: the hidden key to RAID performance http://www.zdnet.com/article/chunks-the-hidden-key-to-raid-performance/
and
Calculating the stride and stripe width https://wiki.archlinux.org/index.php/RAID#Calculating_the_stride_and_stripe_...
So what you ran into was an issue setting the stripe too large via the Broadcom Gui when adding the filesystem to your array?
No, it was when the Broadcom GUI was building the array volume itself. It proceeded without apparent error. When mkfs.xfs read the parameters given by the RAID controller for the volume, it used parameters that didn't make sense for optimal and minimum block sizes, or something to that effect. mkfs.xfs finished without error too. The problem manifested when mount tried to mount the filesystem. Apparently the XFS kernel module for 15.2 is more picky about the XFS filesystem parameters and triggered the failure. Mount reported that it couldn't find the primary and secondary superblocks. Leap 15.1 had absolutely no problem with the filesystems; it was just 15.2 being picky about the parameters.

So my future problem will be fixing old XFS filesystems when the time comes soon to update from 15.1 to 15.2. It may involve saving the data elsewhere, removing the volume, recreating it with the right default stripe size, recreating the filesystem on the volume, and restoring the data. Lots of it... Anthony came up with a process to edit the filesystem metadata; I haven't tried that yet. It's a bit scary.

https://bugzilla.suse.com/show_bug.cgi?id=1174056

Regards, Lew
16.07.2020 07:47, Lew Wolfgang пишет:
So my future problem will be fixing old XFS filesystems when the time comes soon to update from 15.1 to 15.2. It may involve saving the data elsewhere, remove the volume, recreate with the right default stripe size, recreate the filesystem on the volume, and restore the data. Lots of it... Anthony came up with a process to edit the filesystem metadata, I haven't tried that yet. It's a bit scary.
If you read the kernel bug report that was mentioned previously to the end, you will see that you can simply mount the XFS filesystem on an older kernel with the correct parameters and they will be persisted to the filesystem. The parameters were initialized incorrectly based on what the device reported; remounting with the correct parameters will update them in the superblock.
Of course using filesystem debugger hammer is always possible, but far more error prone ...
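A sketch of what Andrei describes, with illustrative numbers (the real sunit/swidth would have to be worked out from the array's actual chunk size and data-disk count; the options are in 512-byte units):

  # Under 15.1, or any kernel that still mounts the filesystem, remount
  # with corrected stripe values so they get written back to the superblock
  mount -o sunit=512,swidth=8192 /dev/sdc1 /mnt
  umount /mnt

  # Afterwards a plain mount under 15.2 should pass the sanity check
  mount /dev/sdc1 /mnt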
On 07/15/2020 10:03 PM, Andrei Borzenkov wrote:
16.07.2020 07:47, Lew Wolfgang пишет:
So my future problem will be fixing old XFS filesystems when the time comes soon to update from 15.1 to 15.2. It may involve saving the data elsewhere, remove the volume, recreate with the right default stripe size, recreate the filesystem on the volume, and restore the data. Lots of it... Anthony came up with a process to edit the filesystem metadata, I haven't tried that yet. It's a bit scary.
If you read the kernel bug report that was mentioned previously to the end, you will see that you can simply mount the XFS filesystem on an older kernel with the correct parameters and they will be persisted to the filesystem. The parameters were initialized incorrectly based on what the device reported; remounting with the correct parameters will update them in the superblock.
Right, but I had no idea what the correct parameters should be!

Going forward, we have multiple RAID controller model numbers, with different RAID volume and disk types. Then, how do those numbers translate to the sunit and swidth metadata parameters? It might be safer to start from scratch and let the RAID controller do its thing, correctly this time. I'll have a 10-gigE link to back them up with, so it might not be too bad.

Regards, Lew
On 07/16/2020 12:32 AM, Lew Wolfgang wrote:
Right, but I had no idea what the correct parameters should be!
Going forward. we have multiple RAID controller model numbers, with different RAID volume and disk types. Then, how do those numbers translate to the sunit and swidth metadata parameters? It might be safer to start from scratch and let the RAID controller do its thing, correctly this time. I'll have a 10-gigE link to back them up with, so it might not be too bad.
That's where those two links I provided can help. You will have to figure out what the Broadcom uses for 'chunk' size, but then you can use:

  stride = chunk size / block size
  stripe width = number of data disks * stride

You will have to check, but it looks like your sunit is the stride and the swidth is the stripe width.

I have a couple of 8-port LSI MegaRAID cards. The GUI (if you can call it that) was always fairly easy to deal with. IIRC it used values that were in line with what mdadm did for RAID1.

I always hated having to mess with well-working filesystems. Even replacing failed disks in the arrays, even though very simple, always left you with that feeling of "what if this blows up?". Thankfully it hasn't yet, but that doesn't do away with that feeling altogether :)

--
David C. Rankin, J.D.,P.E.
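A worked example of those formulas with made-up numbers (not Lew's actual chunk size or disk count):

  # Hypothetical array: RAID6 over 18 disks (16 data disks),
  # 256 KiB controller chunk size, 4 KiB filesystem block size
  chunk_kib=256
  block_kib=4
  data_disks=16

  stride=$((chunk_kib / block_kib))       # 64 filesystem blocks per chunk
  stripe_width=$((data_disks * stride))   # 1024 blocks across a full stripe
  echo "stride=$stride stripe_width=$stripe_width"

  # For mkfs.xfs this maps to:  -d su=${chunk_kib}k,sw=${data_disks}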