[opensuse-kernel] Moblin kernel merged to FACTORY


Greg KH

19 Jun 2009

22:00

Hi all,

I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

It really wasn't that many changes, the real work is in the configurations.

So, here's what I plan on doing, and it would be great to get some feedback.

For Moblin, we used a PAE kernel as "kernel-default". For FACTORY, I can't do that, and as we moved away from the -legacy to -default naming scheme, I'll change the Moblin images to use kernel-pae.

In the kernel-pae config, I'd like to start changing stuff to reflect the fastboot things we did for Moblin. In the end, we were booting the kernel in less than a second on a tiny netbook, and I see no reason why we can't do the same for FACTORY and all future releases.

To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

I'll also disable a few things that PAE systems should never need (like ISA), and a few other things that are in the Moblin kernel config.

In the end, this means that you can boot without an initrd at all, but we need to move our init script changes over to FACTORY as well to take full advantage of this. There's also some mkinitrd magic I need to figure out so that we don't accidentally create initrd when we don't need them (which is a bug right now.)

Any objections to any of this? Hopefully this will help with both the Moblin releases, which should be in the near future, as well as openSUSE 11.2, which will probably happen afterward.

thanks,

greg k-h

--
To unsubscribe, e-mail: opensuse-kernel+unsubscribe@opensuse.org
For additional commands, e-mail: opensuse-kernel+help@opensuse.org


Greg KH

19 Jun

22:41

On Fri, Jun 19, 2009 at 03:00:46PM -0700, Greg KH wrote:

...

Hi all,

I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

Ah, sorry for the vagueness, this means that I am now using the 2.6.30 kernel for the Moblin builds. I forward ported the 2.6.29 Moblin bits to FACTORY, which is 2.6.30.

Hope that clears up any confusion.

thanks,

greg k-h

Jeff Mahoney

20 Jun

21:07

Greg KH wrote:

...

Hi all,

I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

Cool. The majority of it looks like small fixes and adding the /dev stuff.

...

It really wasn't that many changes, the real work is in the configurations.

So, here's what I plan on doing, and it would be great to get some feedback.

For Moblin, we used a PAE kernel as "kernel-default". For FACTORY, I can't do that, and as we moved away from the -legacy to -default naming scheme, I'll change the Moblin images to use kernel-pae.

In the kernel-pae config, I'd like to start changing stuff to reflect the fastboot things we did for Moblin. In the end, we were booting the kernel in less than a second on a tiny netbook, and I see no reason why we can't do the same for FACTORY and all future releases.

To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

...

I'll also disable a few things that PAE systems should never need (like ISA), and a few other things that are in the Moblin kernel config.

The PAE config already has CONFIG_ISA=n.

...

In the end, this means that you can boot without an initrd at all, but we need to move our init script changes over to FACTORY as well to take full advantage of this. There's also some mkinitrd magic I need to figure out so that we don't accidentally create initrd when we don't need them (which is a bug right now.)

Did you figure out a way to discover when a module is built into the kernel instead of just unavailable?

...

Any objections to any of this? Hopefully this will help with both the Moblin releases, which should be in the near future, as well as openSUSE 11.2, which will probably happen afterward.

Outside of my usual objections, no. This looks like a good win.

-Jeff

--
Jeff Mahoney
SUSE Labs

Thomas Renninger

21 Jun

19:08

On Saturday 20 June 2009 12:00:46 am Greg KH wrote: ...

...

To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

What if built-in drivers break on specific HW? Normal /etc/modprobe.conf blacklisting won't work.

It would be great if the kernel itself took the brokenmodules= boot parameter into account (it is currently interpreted only by linuxrc), so that you can at least still install if something elementary breaks. No idea whether/how this could work out.

Thomas
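As a rough illustration of the idea above (not an existing kernel feature in this thread), the sketch below pulls a `brokenmodules=` list out of a kernel command line string and reports whether a given driver should be skipped. The function names `parse_brokenmodules` and `is_disabled`, the comma-separated list format, and the sample command line are assumptions made for illustration; a real implementation would have to do the equivalent inside the kernel before calling the driver's init function.

```python
def parse_brokenmodules(cmdline):
    """Return the set of module names listed in brokenmodules= options.

    Assumption: the option takes a comma-separated list of module names.
    """
    broken = set()
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        if sep and key == "brokenmodules":
            # '-' and '_' are interchangeable in module names; normalize to '_'.
            broken.update(
                name.strip().replace("-", "_")
                for name in value.split(",")
                if name.strip()
            )
    return broken


def is_disabled(driver, cmdline):
    """True if the driver was named in a brokenmodules= list."""
    return driver.replace("-", "_") in parse_brokenmodules(cmdline)


if __name__ == "__main__":
    # Hypothetical command line, for illustration only.
    cmdline = "root=/dev/sda2 quiet brokenmodules=ata_piix,snd-hda-intel"
    print(is_disabled("snd_hda_intel", cmdline))
    print(is_disabled("ahci", cmdline))
```

The lookup itself is the easy part. The open question raised in the message is where a built-in driver's init path would consult such a list.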

Hannes Reinecke

22 Jun

07:12

Hi all,

Jeff Mahoney wrote:

...

Greg KH wrote:

...
Hi all,

...
I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

Cool. The majority of it looks like small fixes and adding the /dev stuff.

...
It really wasn't that many changes, the real work is in the configurations.

...
So, here's what I plan on doing, and it would be great to get some feedback.

...
For Moblin, we used a PAE kernel as "kernel-default". For FACTORY, I can't do that, and as we moved away from the -legacy to -default naming scheme, I'll change the Moblin images to use kernel-pae.

...
In the kernel-pae config, I'd like to start changing stuff to reflect the fastboot things we did for Moblin. In the end, we were booting the kernel in less than a second on a tiny netbook, and I see no reason why we can't do the same for FACTORY and all future releases.

...
To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go
- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Seeing the amount of trouble we've been running into with built-in modules, I'd rather avoid this exercise again. Building in infrastructure modules is okay in general, and also driver modules which are not expected to change a lot (like the loopback interface or stuff like that). But everything else is bound to cause trouble.

Cheers,

Hannes
--
Dr. Hannes Reinecke                   zSeries & Storage
hare@suse.de                          +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: Markus Rex, HRB 16746 (AG Nürnberg)

Takashi Iwai

07:20

At Mon, 22 Jun 2009 09:12:14 +0200, Hannes Reinecke wrote:

...

Hi all,

Jeff Mahoney wrote:

...
Greg KH wrote:

...
Hi all,

...
I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

Cool. The majority of it looks like small fixes and adding the /dev stuff.

...
It really wasn't that many changes, the real work is in the configurations.

...
So, here's what I plan on doing, and it would be great to get some feedback.

...
For Moblin, we used a PAE kernel as "kernel-default". For FACTORY, I can't do that, and as we moved away from the -legacy to -default naming scheme, I'll change the Moblin images to use kernel-pae.

...
In the kernel-pae config, I'd like to start changing stuff to reflect the fastboot things we did for Moblin. In the end, we were booting the kernel in less than a second on a tiny netbook, and I see no reason why we can't do the same for FACTORY and all future releases.

...
To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go

I thought we still have /boot/vmlinux-$VERSION.gz in each kernel package. I guess this will be kept in future, too, because it's needed for many debug tools.

Takashi

Hannes Reinecke

07:23

Takashi Iwai wrote:

...

At Mon, 22 Jun 2009 09:12:14 +0200, Hannes Reinecke wrote:

...
Hi all,

Jeff Mahoney wrote:

...
Greg KH wrote:

...
Hi all,

I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

Cool. The majority of it looks like small fixes and adding the /dev stuff.

[ ... ]

...
...
To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go

I thought we still have /boot/vmlinux-$VERSION.gz in each kernel package. I guess this will be kept in future, too, because it's needed for many debug tools.

Yes, but when going down that route we would either

- boot from an uncompressed kernel -> longer booting time, or
- keep the bzImage header around somewhere and do the compressing ourselves.

Neither of these approaches is very appealing.

Cheers,

Hannes

Jeff Mahoney

14:46

Hannes Reinecke wrote:

...

Jeff Mahoney wrote:

...
I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go

This is something we discussed briefly a few months ago and the consensus was that there just wasn't enough information in the installation to properly link and assemble the new image. The idea just sort of petered out. I was thinking, though, that with the addition of a few more files, we might be able to make it work. The helpers in .../tools/, setup.bin, and a bit of scripting might be enough, but I haven't looked into it deeply enough to back that up with solid data.

...

- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Wouldn't this also require a build environment? If not, doesn't it run into the same problem that we have now with serially loading the modules?

-Jeff

Greg KH

15:53

On Mon, Jun 22, 2009 at 09:12:14AM +0200, Hannes Reinecke wrote:

...

...
I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go
- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Big problem is that you need the .c files because you can have different code paths built in the file depending on if you are built to be a module or built into the kernel due to #ifdefs :(

thanks,

greg k-h

Greg KH

15:55

On Sun, Jun 21, 2009 at 09:08:22PM +0200, Thomas Renninger wrote:

...

On Saturday 20 June 2009 12:00:46 am Greg KH wrote: ...

...
To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

What if built-in drivers break on specific HW?

Then we fix the problem :)

...

Normal /etc/modprobe.conf blacklisting won't work.

I agree, but if you look at the modules we are building in, they are all so far "common" modules that I do not think have ever been blacklisted.

thanks,

greg k-h

Jean Delvare

16:55

On Monday 22 June 2009, Greg KH wrote:

...

On Sun, Jun 21, 2009 at 09:08:22PM +0200, Thomas Renninger wrote:

...
On Saturday 20 June 2009 12:00:46 am Greg KH wrote: ...

...
To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

What if built-in drivers break on specific HW?

Then we fix the problem :)

...
Normal /etc/modprobe.conf blacklisting won't work.

I agree, but if you look at the modules we are building in, they are all so far "common" modules that I do not think have ever been blacklisted.

You'd be surprised. Please don't underestimate the problem Thomas is pointing you at, it's very real. It doesn't mean we don't want to build these drivers in, but this means that if we do, we need a way to disable them.

If you decide to ignore this problem today, L3 and R&D will remind you about it on a weekly basis for the next 7 years ;)

--
Jean Delvare
Suse L3

Greg KH

17:01

On Mon, Jun 22, 2009 at 06:55:51PM +0200, Jean Delvare wrote:

...

On Monday 22 June 2009, Greg KH wrote:

...
On Sun, Jun 21, 2009 at 09:08:22PM +0200, Thomas Renninger wrote:

...
On Saturday 20 June 2009 12:00:46 am Greg KH wrote: ...

...
To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

What if built-in drivers break on specific HW?

Then we fix the problem :)

...
Normal /etc/modprobe.conf blacklisting won't work.

I agree, but if you look at the modules we are building in, they are all so far "common" modules that I do not think have ever been blacklisted.

You'd be surprised. Please don't underestimate the problem Thomas is pointing you at, it's very real. It doesn't mean we don't want to build these drivers in, but this means that if we do, we need a way to disable them.

Fair enough, I'll work on that.

...

If you decide to ignore this problem today, L3 and R&D will remind you about it on a weekly basis for the next 7 years ;)

Heh. But note that this is not being done (yet) for a product that we provide L3 support for :)

thanks,

greg k-h

Jeff Mahoney

17:38

Greg KH wrote:

...

On Mon, Jun 22, 2009 at 09:12:14AM +0200, Hannes Reinecke wrote:

...
...
I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go
- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Big problem is that you need the .c files because you can have different code paths built in the file depending on if you are built to be a module or built into the kernel due to #ifdefs :(

I know they exist, but what are the valid use cases for doing that and do we need to worry about a lot of them? It seems like the cases can be broken down into a few categories:

* print something
* change a description string
* optimize away things that aren't required when statically linked

A lot of the stupid things are in ISA drivers. I do see one case in usbcore, but even that seems like it should always allow usbcore.nousb and enable nousb for ifndef MODULE.

I do see your point that making assumptions like this could be fragile.

-Jeff

Greg KH

17:47

On Mon, Jun 22, 2009 at 01:38:01PM -0400, Jeff Mahoney wrote:

...

Greg KH wrote:

...
On Mon, Jun 22, 2009 at 09:12:14AM +0200, Hannes Reinecke wrote:

...
...
I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go
- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Big problem is that you need the .c files because you can have different code paths built in the file depending on if you are built to be a module or built into the kernel due to #ifdefs :(

I know they exist, but what are the valid use cases for doing that and do we need to worry about a lot of them? It seems like the cases can be broken down into a few categories:

* print something
* change a description string
* optimize away things that aren't required when statically linked

Also:

- initialize something at a different run level

Now that should be fixed up properly by doing the correct macro, but I have seen it enough that it is common.

...

A lot of the stupid things are in ISA drivers.

Agreed, and we aren't building ISA drivers for "real" systems anymore, thankfully :)

...

I do see one case in usbcore, but even that seems like it should always allow usbcore.nousb and enable nousb for ifndef MODULE.

I do see your point that making assumptions like this could be fragile.

Yeah, it's the odd cases that I worry about here.

thanks,

greg k-h

Greg KH

17:48

On Sat, Jun 20, 2009 at 05:07:05PM -0400, Jeff Mahoney wrote:

...

Greg KH wrote:

...
Hi all,

I just now got the Moblin (2.6.29) kernel merged into the FACTORY kernel, so it should start showing up in the next few builds.

Cool. The majority of it looks like small fixes and adding the /dev stuff.

Yes. There is also some weird init call ordering that I'm not quite sure why it's needed, but it speeds boot up, so I'm not complaining.

...

...
It really wasn't that many changes, the real work is in the configurations.

So, here's what I plan on doing, and it would be great to get some feedback.

For Moblin, we used a PAE kernel as "kernel-default". For FACTORY, I can't do that, and as we moved away from the -legacy to -default naming scheme, I'll change the Moblin images to use kernel-pae.

In the kernel-pae config, I'd like to start changing stuff to reflect the fastboot things we did for Moblin. In the end, we were booting the kernel in less than a second on a tiny netbook, and I see no reason why we can't do the same for FACTORY and all future releases.

To achieve this, I'll start to change the i386/pae and x86-64/default configurations to build a whole raft of drivers into the kernel, which speeds up booting a _lot_ due to the async probing that it allows the kernel to do.

I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

...
I'll also disable a few things that PAE systems should never need (like ISA), and a few other things that are in the Moblin kernel config.

The PAE config already has CONFIG_ISA=n.

Ah, you're right, no wonder my diff didn't show it :)

...

...
In the end, this means that you can boot without an initrd at all, but we need to move our init script changes over to FACTORY as well to take full advantage of this. There's also some mkinitrd magic I need to figure out so that we don't accidentally create initrd when we don't need them (which is a bug right now.)

Did you figure out a way to discover when a module is built into the kernel instead of just unavailable?

No.

thanks,

greg k-h
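One possible way to approach the question above, offered only as a sketch and not something proposed in the thread: the kernel configuration file records whether each option was built in (`=y`), built as a module (`=m`), or left out (`# CONFIG_FOO is not set`). The example below parses text in that format. `classify_config` and the sample symbols are hypothetical, and mapping a module name to its config symbol is deliberately left out because the thread does not describe one.

```python
import re

_SET = re.compile(r"^(CONFIG_\w+)=(.*)$")
_UNSET = re.compile(r"^# (CONFIG_\w+) is not set$")


def classify_config(text):
    """Map each CONFIG_ symbol to 'builtin', 'module', 'unset', or 'value'."""
    result = {}
    for line in text.splitlines():
        line = line.strip()
        m = _SET.match(line)
        if m:
            name, value = m.groups()
            if value == "y":
                result[name] = "builtin"
            elif value == "m":
                result[name] = "module"
            elif value == "n":
                result[name] = "unset"
            else:
                result[name] = "value"  # string or numeric option
            continue
        m = _UNSET.match(line)
        if m:
            result[m.group(1)] = "unset"
    return result


if __name__ == "__main__":
    # Made-up config fragment, for illustration only.
    sample = (
        "CONFIG_USB=y\n"
        "CONFIG_USB_STORAGE=m\n"
        "# CONFIG_ISA is not set\n"
        "CONFIG_HZ=250\n"
    )
    states = classify_config(sample)
    for symbol in ("CONFIG_USB", "CONFIG_USB_STORAGE", "CONFIG_ISA"):
        print(symbol, states.get(symbol, "unset"))
```

A symbol that does not appear at all is treated as unset by the caller here, which is the "just unavailable" case the question distinguishes from "built in".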

Jeff Mahoney

18:15

Greg KH wrote:

...

On Mon, Jun 22, 2009 at 01:38:01PM -0400, Jeff Mahoney wrote:

...
Greg KH wrote:

...
On Mon, Jun 22, 2009 at 09:12:14AM +0200, Hannes Reinecke wrote:

...
...
I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go
- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Big problem is that you need the .c files because you can have different code paths built in the file depending on if you are built to be a module or built into the kernel due to #ifdefs :(

I know they exist, but what are the valid use cases for doing that and do we need to worry about a lot of them? It seems like the cases can be broken down into a few categories:

* print something
* change a description string
* optimize away things that aren't required when statically linked

Also:

- initialize something at a different run level

But that's really just to address dependencies, right? I don't intend to load the modules at the same runlevel where they would have run if normally compiled statically. If we load the linked module after the usual static parts have initialized, then we'll still observe the dependencies.

...

Now that should be fixed up properly by doing the correct macro, but I have seen it enough that it is common.

...
A lot of the stupid things are in ISA drivers.

Agreed, and we aren't building ISA drivers for "real" systems anymore, thankfully :)

...
I do see one case in usbcore, but even that seems like it should always allow usbcore.nousb and enable nousb for ifndef MODULE.

I do see your point that making assumptions like this could be fragile.

Yeah, it's the odd-cases that I worry about here.

Or perhaps a different solution would be to whitelist modules which are known to be safe. Given the number of modules we want to typically link in, this shouldn't be a long list.

-Jeff

Greg KH

20:48

On Mon, Jun 22, 2009 at 02:15:13PM -0400, Jeff Mahoney wrote:

...

Greg KH wrote:

...
On Mon, Jun 22, 2009 at 01:38:01PM -0400, Jeff Mahoney wrote:

...
Greg KH wrote:

...
On Mon, Jun 22, 2009 at 09:12:14AM +0200, Hannes Reinecke wrote:

...
...
I'm still not a fan of this, but in the absence of the ability to link in modules at install time, I guess the gains outweigh the drawbacks.

Why don't we do something about it?

I've already spent some thoughts about it, and come up with two possibilities:

- Link in modules during initrd run. Shouldn't be too hard, after all that's what the kernel does nowadays during building anyway. So just some linker magic and you're done. Drawback is that you'd need an uncompressed kernel to start with, so I'm not sure it's the right way to go
- Implement something like the 'kexec-cache' from Mac OS X. OS X has a 'kexec-cache', which allows preloading some kernel modules during boot. Implementing a similar thing on Linux we could just stuff the preloaded modules into a blob and load this as an additional initrd image. Then we could just call the ->init calls and everything would be dandy. Or that's the hope.

Big problem is that you need the .c files because you can have different code paths built in the file depending on if you are built to be a module or built into the kernel due to #ifdefs :(

I know they exist, but what are the valid use cases for doing that and do we need to worry about a lot of them? It seems like the cases can be broken down into a few categories:

* print something
* change a description string
* optimize away things that aren't required when statically linked

Also:

- initialize something at a different run level

But that's really just to address dependencies, right? I don't intend to load the modules at the same runlevel where they would have run if normally compiled statically. If we load the linked module after the usual static parts have initialized, then we'll still observe the dependencies.

No, it's to tell the kernel exactly when to initialize the code at what part during the init level processing, and link order matters.

Actually, in thinking about it some more, I don't think this is going to work properly for the "fastboot" stuff that we really need. Here's why:

- When drivers are built into the kernel, they are initialized in the order in which the Makefile places them, and we build them in pretty early. This allows the drivers to start up, and do some async stuff while the rest of the kernel initializes.
- If you somehow "link" the modules into the built kernel, you will have to set up a mechanism to call the module_init() calls. The only safe way to do that is at the end of the init cycle. So any async processing that could have happened, will not, as these drivers will be the last things in the boot process now, instead of very early like they used to be.

Now if you could figure out how to insert them into the link order in the boot process in the same sequence as if they were built in, that would be very nice, but I don't see how that would be possible.

So, by adding the modules on to the kernel image, all we would save would be the module load time, which while not insignificant, is not sufficient for the boot times we are needing to achieve here.

thanks,

greg k-h
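The ordering argument above can be made concrete with a toy model. This is a simulation, not kernel code: `run_initcalls`, the level numbers, and the driver names are made up for illustration. Built-in init functions run sorted by init level and then by link order, while anything appended after the build only runs once that whole sequence has finished.

```python
def run_initcalls(builtin, appended):
    """Return init function names in the order a toy kernel would call them.

    builtin:  list of (level, name) in link order; run by level, then link order.
    appended: list of names linked on after the build; run last, in list order.
    """
    # sorted() is stable, so entries at the same level keep their link order.
    order = [name for _level, name in sorted(builtin, key=lambda e: e[0])]
    order.extend(appended)
    return order


if __name__ == "__main__":
    # Illustrative entries only; levels and names are assumptions.
    builtin = [(6, "ahci"), (4, "pci"), (6, "ext3"), (7, "late_setup")]

    # Driver built in: it starts early and can probe while later code runs.
    print(run_initcalls(builtin, []))

    # Same driver appended afterwards: it only starts after everything else.
    without_ahci = [e for e in builtin if e[1] != "ahci"]
    print(run_initcalls(without_ahci, ["ahci"]))
```

In the second call the storage driver moves from near the front of the sequence to the very end, which is the loss of overlap with the rest of initialization that the message describes.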

Michael Matz

23 Jun

11:03

Hi, On Mon, 22 Jun 2009, Greg KH wrote:

But that's really just to address dependencies, right? I don't intend to load the modules at the same runlevel where they would have run if normally compiled statically. If we load the linked module after the usual static parts have initialized, then we'll still observe the dependencies.

No, it's to tell the kernel exactly when to initialize the code at what part during the init level processing, and link order matters.

"exactly when to initialize the code" == "addresses dependencies", isn't it?

Actually, in thinking about it some more, I don't think this is going to work properly for the "fastboot" stuff that we really need. Here's why:

- When drivers are built into the kernel, they are initialized in the order in which the Makefile places them, and we build them in pretty early. This allows the drivers to start up, and do some async stuff while the rest of the kernel initializes.

Excuse me for not being up-to-date wrt. the kernel anymore, but isn't this done via the .init sections?

- If you somehow "link" the modules into the built kernel, you will have to set up a mechanism to call the module_init() calls. The only safe way to do that is at the end of the init cycle. So any async processing that could have happened, will not, as these drivers will be the last things in the boot process now, instead of very early like they used to be.

... Because if it is, then linking modules into the built kernel after the fact isn't going to change this principle. You still have a .initcall section (well, two of them, one for the built kernel, one for the module lump) which the kernel proper would iterate over very early (after determining existence of the second initcall table).

Ciao,
Michael.
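One possible reading of the two-table idea above, as a minimal user-space sketch. Everything here is hypothetical: the names `run_table`, `run_all_initcalls`, the trace buffer and the three sample init functions are invented for illustration, and the sketch deliberately ignores initcall levels and link order within a level.

```c
#include <stddef.h>

typedef int (*initcall_t)(void);

/* Tiny trace buffer so the call order can be inspected afterwards. */
char trace[8];
size_t trace_len;

static int record(char tag)
{
    if (trace_len < sizeof(trace) - 1)
        trace[trace_len++] = tag;
    return 0;
}

/* Illustrative init functions: two built in, one from the module lump. */
int builtin_a_init(void) { return record('a'); }
int builtin_b_init(void) { return record('b'); }
int lump_m_init(void)    { return record('m'); }

/* Call every entry of one table in order; return how many were called. */
size_t run_table(const initcall_t *table, size_t count)
{
    size_t i;

    for (i = 0; i < count; i++)
        table[i]();
    return i;
}

/*
 * Walk the built-in table, then the table of the appended module lump
 * if one was found (lump may be NULL when no lump is present).
 */
size_t run_all_initcalls(const initcall_t *builtin, size_t n_builtin,
                         const initcall_t *lump, size_t n_lump)
{
    size_t called = run_table(builtin, n_builtin);

    if (lump != NULL)
        called += run_table(lump, n_lump);
    return called;
}
```

In this sketch the lump's entries always run after the built-in ones, so where a lump entry lands relative to the built-in initcalls is the part a real design would still have to settle.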

Hannes Reinecke

11:06

Michael Matz wrote:

Hi,

On Mon, 22 Jun 2009, Greg KH wrote:

[ .. ]

Actually, in thinking about it some more, I don't think this is going to work properly for the "fastboot" stuff that we really need. Here's why:

- When drivers are built into the kernel, they are initialized in the order in which the Makefile places them, and we build them in pretty early. This allows the drivers to start up, and do some async stuff while the rest of the kernel initializes.

Excuse me for not being up-to-date wrt. the kernel anymore, but isn't this done via the .init sections?

- If you somehow "link" the modules into the built kernel, you will have to set up a mechanism to call the module_init() calls. The only safe way to do that is at the end of the init cycle. So any async processing that could have happened, will not, as these drivers will be the last things in the boot process now, instead of very early like they used to be.

... Because if it is, then linking modules into the built kernel after the fact isn't going to change this principle. You still have a .initcall section (well, two of them, one for the built kernel, one for the module lump) which the kernel proper would iterate over very early (after determining existence of the second initcall table).

Which is exactly my thoughts. The only valid argument currently against this is the #ifdef MODULE case. One would have to look at the individual cases, but I suspect most of these are leftovers and should be cleaned up anyway.

Cheers,

Hannes
--
Dr. Hannes Reinecke zSeries & Storage
hare@suse.de +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: Markus Rex, HRB 16746 (AG Nürnberg)

Hannes Reinecke

11:14

Hannes Reinecke wrote:

Michael Matz wrote: [ .. ]

fact isn't going to change this principle. You still have a .initcall section (well, two of them, one for the built kernel, one for the module lump) which the kernel proper would iterate over very early (after determining existence of the second initcall table).

Which is exactly my thoughts.

The only valid argument currently against this is the #ifdef MODULE case. One would have to look at the individual cases, but I suspect most of these are leftovers and should be cleaned up anyway.

As suspected. A quick glance at drivers/scsi revealed things like:

drivers/scsi/gdth.c:

    #ifndef MODULE
    __setup("gdth=", option_setup);
    #endif

drivers/scsi/gvp11.c:

    int gvp11_release(struct Scsi_Host *instance)
    {
    #ifdef MODULE
        DMA(instance)->CNTR = 0;
        release_mem_region(ZTWO_PADDR(instance->base), 256);
        free_irq(IRQ_AMIGA_PORTS, instance);
        wd33c93_release();
    #endif
        return 1;
    }

drivers/scsi/ibmmca.c:

    #if defined(MODULE)
    static char *boot_options = NULL;
    module_param(boot_options, charp, 0);
    module_param_array(io_port, int, NULL, 0);
    module_param_array(scsi_id, int, NULL, 0);
    MODULE_LICENSE("GPL");
    #endif

and my all-time favourite:

drivers/scsi/BusLogic.c:

    #ifdef MODULE
    static struct pci_device_id BusLogic_pci_tbl[] __devinitdata = {
        { PCI_VENDOR_ID_BUSLOGIC, PCI_DEVICE_ID_BUSLOGIC_MULTIMASTER,
          PCI_ANY_ID, PCI_ANY_ID, 0, 0, 0},
        { PCI_VENDOR_ID_BUSLOGIC, PCI_DEVICE_ID_BUSLOGIC_MULTIMASTER_NC,
          PCI_ANY_ID, PCI_ANY_ID, 0, 0, 0},
        { PCI_VENDOR_ID_BUSLOGIC, PCI_DEVICE_ID_BUSLOGIC_FLASHPOINT,
          PCI_ANY_ID, PCI_ANY_ID, 0, 0, 0},
        { }
    };
    #endif
    MODULE_DEVICE_TABLE(pci, BusLogic_pci_tbl);

So it's about time to have that cleaned up anyway.

Cheers,

Hannes
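To show why such #ifdef MODULE blocks matter for the "link the module in afterwards" idea, here is a small standalone sketch, loosely modelled on the gvp11_release() pattern above. The names `built_as_module` and `driver_release` and the `hw_state` parameter are hypothetical; in a real kernel build MODULE is set by the build system for modular objects, whereas here it is only defined if the compiler is invoked with -DMODULE.

```c
/*
 * Hypothetical driver fragment, not taken from any real driver.
 * The same source yields different object code depending on whether
 * MODULE was defined when it was compiled.
 */
int built_as_module(void)
{
#ifdef MODULE
    return 1;
#else
    return 0;
#endif
}

int driver_release(int *hw_state)
{
#ifdef MODULE
    *hw_state = 0;      /* teardown exists only in the modular build */
#else
    (void)hw_state;     /* built-in build: nothing to release */
#endif
    return 1;
}
```

Compiled without -DMODULE, `driver_release()` leaves `*hw_state` untouched; compiled with it, the state is reset. An object file built one way cannot later be turned into the other without going back to the source, which is the concern raised earlier in the thread about needing the .c files.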

Greg KH

16:04

On Tue, Jun 23, 2009 at 01:03:48PM +0200, Michael Matz wrote:

Hi,

On Mon, 22 Jun 2009, Greg KH wrote:

But that's really just to address dependencies, right? I don't intend to load the modules at the same runlevel where they would have run if normally compiled statically. If we load the linked module after the usual static parts have initialized, then we'll still observe the dependencies.

No, it's to tell the kernel exactly when to initialize the code at what part during the init level processing, and link order matters.

"exactly when to initialize the code" == "addresses dependencies", isn't it?

No, see below for details.

Actually, in thinking about it some more, I don't think this is going to work properly for the "fastboot" stuff that we really need. Here's why:

- When drivers are built into the kernel, they are initialized in the order in which the Makefile places them, and we build them in pretty early. This allows the drivers to start up, and do some async stuff while the rest of the kernel initializes.

Excuse me for not being up-to-date wrt. the kernel anymore, but isn't this done via the .init sections?

Yes it is, but order within the .init sections matters.

- If you somehow "link" the modules into the built kernel, you will have to set up a mechanism to call the module_init() calls. The only safe way to do that is at the end of the init cycle. So any async processing that could have happened, will not, as these drivers will be the last things in the boot process now, instead of very early like they used to be.

... Because if it is, then linking modules into the built kernel after the fact isn't going to change this principle. You still have a .initcall section (well, two of them, one for the built kernel, one for the module lump) which the kernel proper would iterate over very early (after determining existence of the second initcall table).

We really have 8 different levels of init calls in the kernel these days:

#define pure_initcall(fn)            __define_initcall("0",fn,0)
#define core_initcall(fn)            __define_initcall("1",fn,1)
#define core_initcall_sync(fn)       __define_initcall("1s",fn,1s)
#define postcore_initcall(fn)        __define_initcall("2",fn,2)
#define postcore_initcall_sync(fn)   __define_initcall("2s",fn,2s)
#define arch_initcall(fn)            __define_initcall("3",fn,3)
#define arch_initcall_sync(fn)       __define_initcall("3s",fn,3s)
#define subsys_initcall(fn)          __define_initcall("4",fn,4)
#define subsys_initcall_sync(fn)     __define_initcall("4s",fn,4s)
#define fs_initcall(fn)              __define_initcall("5",fn,5)
#define fs_initcall_sync(fn)         __define_initcall("5s",fn,5s)
#define rootfs_initcall(fn)          __define_initcall("rootfs",fn,rootfs)
#define device_initcall(fn)          __define_initcall("6",fn,6)
#define device_initcall_sync(fn)     __define_initcall("6s",fn,6s)
#define late_initcall(fn)            __define_initcall("7",fn,7)
#define late_initcall_sync(fn)       __define_initcall("7s",fn,7s)

If you build any code as a module, all of these different levels change to the "generic" module_init() call, which runs after all 8 of these levels have run. So you can't work backwards and figure out what level of init call the module really wanted to run at if you only have a .o file.

And then, within the different init call levels, we call the functions in the order in which they are linked into the kernel, which is driven by the Makefile. If you look at some of the recent changes that were made for "fastboot", we reorder the Makefiles to allow some things to run in parallel before others do (like ata drivers very early, before other drivers in the same run level, to take advantage of the slowness of those initialization sequences).

So if you just take the module init sections and run them some time after all of the above sections run, then you don't get the same speedups that we need.

If you look at the startup boot graphs, this is seen quite well, with the ATA drives taking a long time to start up, all the while the rest of the kernel is running along, initializing other things. If you move the ata drivers to the end of the init sequence, then the whole kernel waits for that hardware to start up, wasting almost a full second.

Remember, we are talking about a whole kernel boot time of less than a second right now, so optimizations like this are essential to get there.

hope this helps explain things a bit better,

greg k-h
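The ordering rule described in this message (first by initcall level, then by link order within a level) can be sketched in user space as follows. This is an illustration only, not how the kernel implements it: the struct, the function names and the `AFTER_ALL_LEVELS` constant are all assumptions, with `AFTER_ALL_LEVELS` standing in for "module init code that runs once levels 0 to 7 are finished".

```c
#include <stdlib.h>
#include <string.h>

/* Stand-in for module init code that runs after levels 0..7 are done. */
#define AFTER_ALL_LEVELS 8

struct initcall_entry {
    int level;          /* initcall level, 0 (pure) .. 7 (late) */
    int link_order;     /* position within the level, set by link order */
    const char *name;   /* illustrative label only */
};

static int cmp_entry(const void *a, const void *b)
{
    const struct initcall_entry *x = a;
    const struct initcall_entry *y = b;

    if (x->level != y->level)
        return x->level < y->level ? -1 : 1;
    if (x->link_order != y->link_order)
        return x->link_order < y->link_order ? -1 : 1;
    return 0;
}

/* Arrange entries in the order they would run: level, then link order. */
void order_initcalls(struct initcall_entry *e, size_t n)
{
    qsort(e, n, sizeof(*e), cmp_entry);
}

/* Index at which `name` runs in an ordered array, or -1 if absent. */
int run_position(const struct initcall_entry *e, size_t n, const char *name)
{
    size_t i;

    for (i = 0; i < n; i++)
        if (strcmp(e[i].name, name) == 0)
            return (int)i;
    return -1;
}
```

For example, with made-up entries "pci" at level 4, "ata" and "net" at level 6 (ata linked first) and "late" at level 7, "ata" runs second. If the same "ata" entry is tagged `AFTER_ALL_LEVELS` instead, it runs last, which is the loss of early start that the message describes.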