All the mail mirrored from lore.kernel.org
* mdadm/Software RAID problems
@ 2015-06-19 18:48 Gareth Evans
  2015-06-20  5:56 ` mdadm/Software RAID problems - update Gareth Evans
  2015-06-24 23:21 ` mdadm/Software RAID problems NeilBrown
  0 siblings, 2 replies; 5+ messages in thread
From: Gareth Evans @ 2015-06-19 18:48 UTC
  To: linux-raid

Having failed in my first attempt at setting up encrypted software RAID on Ubuntu 14.04.2, I discovered there seem to be many and various problems with software RAID on Ubuntu and Debian at least.

For example, I have found via testing on KVM and VirtualBox (on an Ubuntu 14.04.2 host, with Ubuntu 14.04.2 and Debian 8.1 guests) that:

1. Encrypted RAID doesn't boot after installation (no volume groups found, with unencrypted raided /boot)
2. Degraded unencrypted RAID doesn't boot (loops on "Intermittently starting RAID arrays..." with a few lines of related output)
3. /etc/initramfs-tools/conf.d/mdadm doesn't seem to exist as the Ubuntu Server Guide suggests it should
4. Adding bootdegraded=true to the GRUB options at boot seems to have no effect
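
For point 4, one way to confirm that the option actually reached the running kernel is to inspect /proc/cmdline. The following is a minimal sketch in Python, not a diagnosis of the failure itself. The helper names kernel_params and bootdegraded_requested are hypothetical, and the parser assumes a simple command line with no quoted values containing spaces.

```python
from pathlib import Path


def kernel_params(cmdline: str) -> dict:
    """Split a kernel command line into a {name: value} mapping.

    Bare flags such as "quiet" map to an empty string.
    Simplification: quoted values containing spaces are not handled.
    """
    params = {}
    for token in cmdline.split():
        name, _, value = token.partition("=")
        params[name] = value
    return params


def bootdegraded_requested(cmdline: str) -> bool:
    """Return True if bootdegraded=true appears on the command line."""
    return kernel_params(cmdline).get("bootdegraded") == "true"


if __name__ == "__main__":
    # /proc/cmdline holds the parameters the running kernel was booted with.
    path = Path("/proc/cmdline")
    if path.exists():
        print("bootdegraded requested:", bootdegraded_requested(path.read_text()))
```

If this reports False after the option was typed at the GRUB menu, the option never reached the kernel. If it reports True, the initramfs scripts received the option and the problem lies in how they act on it.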

The second point is particularly concerning: unencrypted software RAID, which works well enough while all disks are present, appears to fail precisely when it is needed, which rather defeats the point.

I have found lots of bug reports, some still "new" and of "undecided" importance after several years, for various problems, but not many solutions. 

I'm not sure whether the issues lie with upstream mdadm, the distro mdadm packaging, other packages, or perhaps the kernel.

Just FYI, I did consider FreeBSD as an alternative, since it offers ZFS-on-root disk pools and GELI encryption.  After limited testing this seems to work (even with only one disk, if you just want to use it for encryption), but I understand ZFS can be corrupted by hardware problems (e.g. faulty RAM), and FreeBSD doesn't seem to want to open LUKS containers on the fly, such as an encrypted external HD.

I would prefer a Linux solution - any explanations, solutions, tips or advice regarding Linux software RAID would be gratefully received.

With kind regards
Gareth
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in


* mdadm/Software RAID problems - update
  2015-06-19 18:48 mdadm/Software RAID problems Gareth Evans
@ 2015-06-20  5:56 ` Gareth Evans
  2015-06-21 14:21   ` mdadm/Software RAID problems - another update Gareth Evans
  2015-06-24 23:21 ` mdadm/Software RAID problems NeilBrown
  1 sibling, 1 reply; 5+ messages in thread
From: Gareth Evans @ 2015-06-20  5:56 UTC
  To: linux-raid

I should clarify point (2) in my earlier email below. Steps to replicate:

1. Unencrypted RAID 10, 2 x vdi virtual drives on VirtualBox, OS installed and RAID working.
2. Shut down the VM and remove vdi disk2 from VirtualBox settings - boots successfully
3. Shut down and attach a new blank vdi ("disk2_2") - boot fails as per (2) below
4. Power off and remove blank vdi disk - boot fails as per (2) below
5. Power off and re-attach original vdi disk2 (so the 2 originals are now attached) - boot fails as per (2) below

I wonder if I am perhaps failing to do something correctly at step 2 above: mdadm -D /dev/md1 shows disk2 as missing, and mdadm --remove produces a not found error.
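
The degraded state that mdadm -D reports is also visible in /proc/mdstat, where each array's status line carries a field such as [2/1]: two members expected, one active. The following is a minimal sketch in Python of detecting that condition from the file's text. The function name degraded_arrays is hypothetical, and the SAMPLE text is made up for illustration. It is not output from the VM described above.

```python
import re

# Matches the member-count field of an mdstat status line, e.g. "[2/1]".
_COUNTS = re.compile(r"\[(\d+)/(\d+)\]")
# Matches the start of an array stanza, e.g. "md1 : active raid10 ...".
_ARRAY = re.compile(r"^(md\S*)\s*:")


def degraded_arrays(mdstat_text: str) -> dict:
    """Map each degraded array name to (expected, active) member counts."""
    result = {}
    current = None
    for line in mdstat_text.splitlines():
        header = _ARRAY.match(line)
        if header:
            current = header.group(1)
            continue
        counts = _COUNTS.search(line)
        if current and counts:
            expected, active = int(counts.group(1)), int(counts.group(2))
            if active < expected:
                result[current] = (expected, active)
            # Only the first count field after a header belongs to that array.
            current = None
    return result


# Hypothetical sample for illustration only.
SAMPLE = """\
Personalities : [raid10] [raid1]
md1 : active raid10 sda2[0]
      8380416 blocks super 1.2 2 near-copies [2/1] [U_]

md0 : active raid1 sda1[0] sdb1[1]
      523712 blocks super 1.2 [2/2] [UU]

unused devices: <none>
"""

if __name__ == "__main__":
    print(degraded_arrays(SAMPLE))
```

On a live system the same function can be fed the contents of /proc/mdstat. An array listed in the result is running with a missing member, so there is no device left in it for mdadm --remove to act on, which may explain the not found error.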


On Fri, 19 Jun 2015, at 19:48, Gareth Evans wrote:
> Having failed in my first attempt at setting up encrypted software RAID on Ubuntu 14.04.2, I discovered there seem to be many and various problems with software RAID on Ubuntu and Debian at least.
> 
> For example, I have found via testing on KVM and VirtualBox (with and on Ubuntu 14.04.2 and with Debian 8.1) that:
> 
> 1. Encrypted RAID doesn't boot after installation (no volume groups found, with unencrypted raided /boot)
> 2. Degraded unencrypted RAID doesn't boot (loops on "Intermittently starting RAID arrays..." with a few lines of related output)
> 3. /etc/initramfs-tools/conf.d/mdadm doesn't seem to exist as the Ubuntu Server Guide suggests it should
> 4. adding bootdegraded=true to grub options at boot seems to have no effect.
> 
> The second point above seems particularly concerning because it would seem (unencrypted) software RAID (which works well enough while it's working) fails precisely when it is needed, which rather defeats the point.
> 
> I have found lots of bug reports, some still "new" and of "undecided" importance after several years, for various problems, but not many solutions. 
> 
> I'm not sure if the issues lie with upstream or distro-related mdadm or other packages, or perhaps kernel issues?
> 
> Just FYI, I did consider FreeBSD as an alternative which implements ZFS-on-Root disk pools and GELI encryption.  After limited testing this seems to work (even with only one disk if you just want to use it for encryption) but I understand ZFS can be corrupted by hardware problems (eg. faulty RAM), and FreeBSD doesn't seem to want to open LUKS containers on-the-fly, such as an encrypted external HD.  
> 
> I would prefer a Linux solution - any explanations, solutions, tips or advice re linux software RAID would be gratefully received.
> 
> With kind regards
> Gareth

* Re: mdadm/Software RAID problems - another update
  2015-06-20  5:56 ` mdadm/Software RAID problems - update Gareth Evans
@ 2015-06-21 14:21   ` Gareth Evans
  2015-06-24 23:26     ` NeilBrown
  0 siblings, 1 reply; 5+ messages in thread
From: Gareth Evans @ 2015-06-21 14:21 UTC
  To: linux-raid

After a number of tests with VirtualBox this weekend, it seems things work as they should with Fedora 22, but openSUSE 13.2, Debian 8.1 and Ubuntu 14.04.2 all appear to hang on boot when one vdisk is removed and another is attached, even if the new disk is formatted with a "Linux RAID autodetect"-type partition.  CentOS 7 doesn't hang but doesn't have mdadm in its emergency console (I gave up on CentOS after that, as I'm not sure how to add commands, if that's even possible).  Neither the Fedora nor the CentOS emergency console seems to include fdisk, which isn't too helpful.

I'm not sure if that helps but I thought I'd share the results of my testing - please let me know if there's anything further I can do.  This may not be an upstream bug after all - I'm not very experienced with such things but it seems to me that it is and the Fedora developers have fixed it.

With thanks
Gareth

On Sat, 20 Jun 2015, at 06:56, Gareth Evans wrote:
> I should clarify re point (2) in  my earlier email below, steps to replicate:
> 
> 1. Unencrypted RAID 10, 2 x vdi virtual drives on VirtualBox, OS installed and RAID working.
> 2. Shutdown VM and remove vdi disk2 from VirtualBox settings - boots successfully
> 3. Shutdown and attach a new blank vdi ("disk2_2") - boot fails as per (2) below
> 4. Power off and remove blank vdi disk - boot fails as per (2) below
> 5. Power off and re-attach original vdi disk2 (so the 2 originals are now attached) - boot fails as per (2) below
> 
> I wonder if I am perhaps failing to do something correctly at step 2 above, mdadm -D /dev/md1 shows disk2 as missing and mdadm --remove produces a not found error.
> 
> 
> On Fri, 19 Jun 2015, at 19:48, Gareth Evans wrote:
> > Having failed in my first attempt at setting up encrypted software RAID on Ubuntu 14.04.2, I discovered there seem to be many and various problems with software RAID on Ubuntu and Debian at least.
> > 
> > For example, I have found via testing on KVM and VirtualBox (with and on Ubuntu 14.04.2 and with Debian 8.1) that:
> > 
> > 1. Encrypted RAID doesn't boot after installation (no volume groups found, with unencrypted raided /boot)
> > 2. Degraded unencrypted RAID doesn't boot (loops on "Intermittently starting RAID arrays..." with a few lines of related output)
> > 3. /etc/initramfs-tools/conf.d/mdadm doesn't seem to exist as the Ubuntu Server Guide suggests it should
> > 4. adding bootdegraded=true to grub options at boot seems to have no effect.
> > 
> > The second point above seems particularly concerning because it would seem (unencrypted) software RAID (which works well enough while it's working) fails precisely when it is needed, which rather defeats the point.
> > 
> > I have found lots of bug reports, some still "new" and of "undecided" importance after several years, for various problems, but not many solutions. 
> > 
> > I'm not sure if the issues lie with upstream or distro-related mdadm or other packages, or perhaps kernel issues?
> > 
> > Just FYI, I did consider FreeBSD as an alternative which implements ZFS-on-Root disk pools and GELI encryption.  After limited testing this seems to work (even with only one disk if you just want to use it for encryption) but I understand ZFS can be corrupted by hardware problems (eg. faulty RAM), and FreeBSD doesn't seem to want to open LUKS containers on-the-fly, such as an encrypted external HD.  
> > 
> > I would prefer a Linux solution - any explanations, solutions, tips or advice re linux software RAID would be gratefully received.
> > 
> > With kind regards
> > Gareth

* Re: mdadm/Software RAID problems
  2015-06-19 18:48 mdadm/Software RAID problems Gareth Evans
  2015-06-20  5:56 ` mdadm/Software RAID problems - update Gareth Evans
@ 2015-06-24 23:21 ` NeilBrown
  1 sibling, 0 replies; 5+ messages in thread
From: NeilBrown @ 2015-06-24 23:21 UTC
  To: Gareth Evans; +Cc: linux-raid

On Fri, 19 Jun 2015 19:48:56 +0100 Gareth Evans <donotspam@fastmail.fm>
wrote:

> Having failed in my first attempt at setting up encrypted software RAID on Ubuntu 14.04.2, I discovered there seem to be many and various problems with software RAID on Ubuntu and Debian at least.
> 
> For example, I have found via testing on KVM and VirtualBox (with and on Ubuntu 14.04.2 and with Debian 8.1) that:
> 
> 1. Encrypted RAID doesn't boot after installation (no volume groups found, with unencrypted raided /boot)
> 2. Degraded unencrypted RAID doesn't boot (loops on "Intermittently starting RAID arrays..." with a few lines of related output)
> 3. /etc/initramfs-tools/conf.d/mdadm doesn't seem to exist as the Ubuntu Server Guide suggests it should
> 4. adding bootdegraded=true to grub options at boot seems to have no effect.
> 
> The second point above seems particularly concerning because it would seem (unencrypted) software RAID (which works well enough while it's working) fails precisely when it is needed, which rather defeats the point.
> 
> I have found lots of bug reports, some still "new" and of "undecided" importance after several years, for various problems, but not many solutions. 
> 
> I'm not sure if the issues lie with upstream or distro-related mdadm or other packages, or perhaps kernel issues?

Most of the issues you identify lie on the overlap between mdadm and
the boot loader - mkinitramfs or dracut or whatever.
It is a distro problem to make these work together.  As upstream md and
mdadm maintainer I'm happy to work with distro maintainers to resolve
issues.  I can tell you how I think it should work and discuss how it
can work, but I am not in a position to diagnose exactly what is
happening and figure out what to fix.

You really need to follow this up with Debian, though unfortunately I
think the current Debian mdadm maintainer is not able to put in as much
time as he would like.

NeilBrown


> 
> Just FYI, I did consider FreeBSD as an alternative which implements ZFS-on-Root disk pools and GELI encryption.  After limited testing this seems to work (even with only one disk if you just want to use it for encryption) but I understand ZFS can be corrupted by hardware problems (eg. faulty RAM), and FreeBSD doesn't seem to want to open LUKS containers on-the-fly, such as an encrypted external HD.  
> 
> I would prefer a Linux solution - any explanations, solutions, tips or advice re linux software RAID would be gratefully received.
> 
> With kind regards
> Gareth
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in



* Re: mdadm/Software RAID problems - another update
  2015-06-21 14:21   ` mdadm/Software RAID problems - another update Gareth Evans
@ 2015-06-24 23:26     ` NeilBrown
  0 siblings, 0 replies; 5+ messages in thread
From: NeilBrown @ 2015-06-24 23:26 UTC
  To: Gareth Evans; +Cc: linux-raid

On Sun, 21 Jun 2015 15:21:57 +0100 Gareth Evans <donotspam@fastmail.fm>
wrote:

> After a number of tests with VirtualBox this weekend, it seems things work as they should with Fedora 22 but openSUSE 13.2, Debian 8.1 and Ubuntu 14.04.2 all appear to hang on boot when one vdisk is removed and another is attached, even if formatted with a "Linux RAID autodetect"-type partition.  CentOS 7 doesn't hang but doesn't have mdadm in its emergency console (I gave up with CentOS after that as not sure how to add commands, if that's possible...) and neither Fedora nor CentOS emergency consoles seem to include fdisk, which isn't too helpful.  
> 

I cannot comment on the other distros, but I believe that openSUSE
13.2, with all updates installed, handles boot on degraded md arrays
correctly.  As I *am* the mdadm maintainer for that distro, I can put
effort there.

If you have specifics of a configuration on openSUSE-13.2 with latest
updates for mdadm and dracut installed, where it doesn't boot correctly,
then I would definitely like to know about them, either here or at
bugzilla.opensuse.org - whichever suits you.

Thanks,
NeilBrown

