This is a minimal initrd for usage with Hocus. UUID based mounts are unreliable when used with conjuction with OverlayBD as most obd images don't have an disk UUID or have the same UUID. This initramfs is designed to mount virtio-blk disks by their serial number. To mount an rootfs please use:
mountdevice=SERIAL=<disk_serial> mount_target=/
Up to 32 boot time mounts are supported:
mountdevice=SERIAL=<disk1_serial> mount_target=/
mountdevice=SERIAL=<disk2_serial> mount_target=/mount2
...
mountdevice=SERIAL=<diskN_serial> mount_target=/mountN
Disks are mounted in the order specified on the kernel cmdline, if a mountpoint does not exist then the target directory including required parent directories will be created. Mount options are customisable:
mountdevice=SERIAL=<disk_serial> mount_target=<mount_target> mountflags=<optional_mount_options> mountfstype=<mount_filesystem_type>
By specifying mountfstype on the command line the OS will boot much faster as the initrd will avoid scanning the disk for the filesystem type.
To build this initrd please run ./docker-build.sh
This is a very minimalistic initramfs implementation for booting Linux systems. It has nearly no features, but is very small and very fast. It is written purely in C, but uses only parts of the standard library.
There are three primary use cases:
- It is designed for systems where an initramfs is typically not
necessary (block device drivers + root file system compiled into the
kernel, no separate /usr file system), but where an initramfs is
required for microcode upgrades. Instead of having to use a full
initramfs, which is larger (more time spent in the boot loader
loading it) and slower (because it does more),
tiny-initramfswill add next to no overhead. - In cases where
UUID-based boot is wanted not a full initramfs. - In systems with a split
/usrfile system, it is necessary to mount that in the initramfs already, else subtle problems may occur. If/usrresides on a simple block device already known to the kernel (without user space helpers such as udev),tiny-initramfsprovides a mechanism with very little overhead to mount it before the system is started.
- Simplicity: the implementation is really simple and very linear. It's most likely easier to understand than other initramfs implementations. The entire program is less about 2500 LoC, and that includes the License headers in the files.
- Size: the implementation is really small (see below).
- Speed: there is no noticeable performance penalty, because very little is done before execution is handed over to the operating system proper.
- Supports kernel-named devices, for example
root=/dev/sda1. - Supports
root=0xMAJMIN. - Supports
root=UUID=...for ext2, ext3, ext4, xfs and btrfs. - Supports parsing
/etc/fstabto determine if a separate/usrpartition exists and mounting that - as long as the entry there follows the same rule as theroot=parameter (kernel device name, orUUID=entry for a select number of filesystems). - Supports the
root=,rootflags=,rootfstype=,rootdelay=,rootwait,ro,rwandinit=parameters. - Default timeout of 180 seconds to wait for the root device to appear
(starts after the
rootdelay=delay is over), after which a kernel panic is caused; ifrootwaitis specified it will wait indefinitely. (Recompilation is necessary for a larger timeout.) - Supports mounting NFSv4 file systems with
sec=systhat do not use the idampper, i.e. use raw UIDs/GIDs. For the/usrfile system the standard/etc/fstabentries are interpreted, for the root file system one should useroot=/dev/nfsand thenfsroot=parameter (as documented in the kernel documentation). The network configuration needs to be specified via theip=kernel command line parameter. - Very trivial module loading support (no automatic dependency resolution).
When compiled on an x86_64 system with the default -Og compiler flags,
statically linked against dietlibc 0.33~cvs20120325-6, the binary
stripped and the resulting initramfs (without any modules added)
compressed with gzip -9, the images produced are between 9 kiB and
14 kiB, depending on the feature set selected.
Using musl instead of dietlibc adds between 1.8 and 2.4 kiB to the
resulting initrd.img size (depending on the feature set).
Using glibc instead of dietlibc adds around 310 kiB to the resulting initrd image and is not recommended (although it will work).
Adding modules to the initramfs will increase the size, and many block device and file system drivers are 100s of kiB in size. On the other hand, the kernel would be larger if they were compiled in, so the actual amount of space lost due to using modules is quite a bit smaller.
- The kernel should have the necessary block device and file system
drivers built-in that are required to access the root and
/usrfile systems. Warning: this is not true for most default kernels of mainstream distributions, as they require a full initramfs to load the modules required to mount the root file system. - If the necessary drivers are not built into the kernel, there is limited support for loading modules from within the initramfs, see below for details.
- The kernel must have
CONFIG_DEVTMPFSbuilt-in, because this implementation assumes the kernel will just create the devices by itself. (This is true for most distribution kernels.) - NFSv4 requires at least kernel 3.5 on both server and client (in
order for raw UIDs/GIDs to work) and requires built-in kernel
support for network autoconfiguration (
CONFIG_IP_PNPand for DHCP support alsoCONFIG_IP_PNP_DHCP) as well as built-in kernel support for NFSv4 (CONFIG_NFS_FSas well asCONFIG_NFS_V4).
tiny-initramfsdoes not supportPARTUUID=for mounting the root or/usrfile systems. It also doesn't support symlinks created by udev (such as/dev/disk/by-label/...). Only the kernel names themselves, such as/dev/sda1, as well asUUID=and hexadecimal device numbers (0xMAJMIN, e.g.0x801) are supported.- NFSv2/NFSv3 are currently not supported.
- When booting from USB storage you should always use
UUID=, because device names are not necessarily stable. /usron a FUSE file system, as they require user space helpers running to be able to mount. Generally speaking, any file system that can't be mounted with just a trivialmountsyscall, but requires a userspace helper, will not work.- Any complex storage setup, such as LVM, encryption, iSCSI, etc. Basically, only things that the kernel provides devices for out of the box (potentially with additional kernel parameters) is supported.
If your setup falls into one of these cases, please use a full
initramfs instead of tiny-initramfs. It is not meant to replace
those, but provide a light-weight solution in cases where the
complexities of a full initramfs are unnecessary.
- Since the initramfs is supposed to be small,
fsckwill not be executed bytiny-initramfs. For the/file system this is perfectly fine, as most distributions support checking the root file system at boot outside of the initramfs. But this doesn't work for/usr, because e.g.e2fsckwill not check a mounted file system other than the root file system; and e.g. systemd passes-Mtofsckby default for non-root file systems, so mounted file systems are excluded anyway. It shouldn't be too difficult to special-case/usrhere as well, but that work needs to be done if a file system check at boot is to be performed for/usrwithtiny-initramfs. (Note thate2fsckplus the required libraries are about 2.5 MiB in size, so havingfsckpresent in the initramfs image is not in the scope oftiny-initramfs, because it would remove all the advantages.) - You need to make sure that a split-
/usrfile system is remounted read-write if therooption is passed on the kernel command line (becausetiny-initramfswill also mount/usrread-only then); otherwise/usrwill remain read-only after boot. If you use systemd as your init system, or e.g. the newest Debian initscripts (2.88dsf-59.3or higher) in conjunction with sysvinit, this should work.tiny-initramfsitself doesn't care about which init system is used, but the init system must be able to cope with the state thattiny-initramfsleaves the/usrfile system in. This may require changes to some scripts. - If
/usris a bind mount in/etc/fstab, this will currently fail, even though it should be supportable. (It's on the TODO list, as long as that doesn't require yet another file system.) - Overlay-type file systems for
/usrare untested, but they should work if they are compiled into the kernel. What is not supported are overlay-type file systems for/and/or if something has to be done prior to mounting these file systems (such as creating a directory, or mounting an additional tmpfs or similar). - Booting from kernel-assembled RAID arrays (via
md=...) should work, but is untested. Don't combine this withUUID=, though, astiny-initrdcurrently does not check if a block device has array metadata, so it could falsely identify a member device (instead of the entire array) when usingUUID=in some cases. But for arrays that are assembled by the kernel viamd=...the device name is known anyway (typically/dev/md0), so this shouldn't be an issue. - NFS support is not thoroughly tested.
- Host names are unsupported for NFS mounts, only IP addresses work.
- While this is supposed to be portable, this has only been tested on x86_64 (amd64). Since low-level kernel syscalls are performed, there may be some issues on other architectures. Please report those if they are present.
Install an alternative libc implementation that's designed for embedded
use cases, such as musl or
dietlibc. This is strictly speaking not
required, as the default glibc will also work, but then the binary size
of the resulting binary will be far larger. If tiny-initramfs doesn't
work with your favorite libc implementation, please report this, so
that it may be fixed.
Find out the compile command required to use your C library. For
example, with dietlibc it's "diet gcc", with musl it's musl-gcc.
Use
./configure CC="diet gcc"
make
to compile the tiny_initramfs binary and
make initrd.img
to auto-create the initramfs image. Replace "diet gcc" with the
appropriate command for your libc implementation.
Note that if you specify CFLAGS (potentially via your build system)
you should take care to specify -Os and not to specify any debug
(-g) options, as those tend to increase the binary size quite a bit.
./configure will warn you about it, but it not abort in that case,
because the binary will work. Likewise, if you don't use an alternative
libc implementation but glibc, ./configure will warn you about it,
because that will increase the binary size by a factor of 10 to 20.
You may specify multiple options to enable/disable certain features in the initramfs. Specifically, you can disable UUID mounting support (enabled by default), and you can enable NFSv4 support (disabled by default). A list of possible options is displayed when using
./configure --help
The initramfs creation is really simple, you may also do so manually:
- create an empty directory, let's call it
initramfs/ - copy
tiny_initramfsto the directory, call itinit(you can call it something else, but then you also need to pass therdinit=/newnameoption to the kernel command line) - strip the binary to reduce it's size (optional)
- create the
dev,procandtargetsubdirectories - cpio the directory and compress it
The following commands do just that:
mkdir initramfs
cp tiny_initramfs initramfs/init
strip initramfs/init
mkdir initramfs/dev initramfs/proc initramfs/target
cd initramfs ; find . | cpio -o --quiet -R 0:0 -H newc | gzip > ../initrd.img
With this there's now a (kernel-independent) initramfs image that may be used to boot the system. Note that as of now there is no integration with distributions, so configuring the boot loader etc. has to be done manually.
There is limited support for loading modules if --enable-modules is
specified during the configure invocation. To use this feature, one
needs to create a file /modules in the initramfs image that is of the
following format:
/file.ko options
The modules should not be in a sub-directory, because the directory containing them will not be cleaned-up by tiny-initramfs after mounting the root file system. (Loading the modules will work though.)
For example, the virtio block device driver virtio_blk requires some
additional modules to work. Using modprobe one may find out which:
$ /sbin/modprobe --all --ignore-install --quiet --show-depends virtio_blk
insmod /lib/modules/[...]/kernel/drivers/virtio/virtio.ko
insmod /lib/modules/[...]/kernel/drivers/virtio/virtio_ring.ko
insmod /lib/modules/[...]/kernel/drivers/block/virtio_blk.ko
It turns out that this is not quite sufficient, because the
virtio_pci driver is also required for virtio_blk to work (the
driver loads without virtio_pci, but doesn't work), so one may use:
$ /sbin/modprobe --all --ignore-install --quiet --show-depends virtio_blk virtio_pci
insmod /lib/modules/3.16.0-4-amd64/kernel/drivers/virtio/virtio.ko
insmod /lib/modules/3.16.0-4-amd64/kernel/drivers/virtio/virtio_ring.ko
insmod /lib/modules/3.16.0-4-amd64/kernel/drivers/block/virtio_blk.ko
insmod /lib/modules/3.16.0-4-amd64/kernel/drivers/virtio/virtio.ko
insmod /lib/modules/3.16.0-4-amd64/kernel/drivers/virtio/virtio_ring.ko
insmod /lib/modules/3.16.0-4-amd64/kernel/drivers/virtio/virtio_pci.ko
(Note that in case soft dependencies are treated via install lines,
these have to be resolved manually. This is typically not the case for
drivers needed within initramfs, because other implementations also
suffer from the same issue. install is deprecated anyway according to
the manual page of modprobe.d.)
One may then copy these drivers to the initramfs image, and then add a module file with the following contents:
/virtio.ko
/virtio_ring.ko
/virtio_blk.ko
/virtio.ko
/virtio_ring.ko
/virtio_pci.ko
The order is important, because dependency resolution is not performed by tiny-initramfs, it has to be done while creating the initramfs image. Duplicate entries are not a problem, because they will silently be ignored (but you may remove duplicate entries if you don't change the order otherwise).
Options may be specified when followed by a space (tab characters not supported), for example:
/libata.ko noacpi
If an error occurs, tiny-initramfs will print an error message indicating the problem and then sleep for 10s before exiting. This is because exiting will cause a kernel panic, but typical kernel traces are so large that they replace the entire screen contents on a standard terminal, so that the original message is not visible anymore. The 10s delay allows the user to see what the problem is.
Additionally, one may use the --enable-debug flag of ./configure to
make initrd.img verbose (while increasing the size a bit). This makes
debugging easier, especially if the system hangs at a certain point.
When compiled with that option, tiny-initramfs will print the contents
of /proc/self/mountinfo and sleep for 5s after mounting the root (and
potentially /usr) file systems before executing init.
The design of tiny-initramfs is as minimalistic as possible. The
buffered I/O functions from the stdio.h standard library are
completely avoided, because they can increase the code size quite a
bit, depending on the libc implementation. At one point an own
minimalistic buffered I/O routine is implemented (much smaller than the
full standard library linked in).
Dynamic allocations are avoided and buffers on the stack are used. Code that properly handles dynamic allocations tends to be longer, so this reduces code size. The buffers are sized generously (there are not that many buffers that the amount of RAM used is a concern just yet, even for small systems), so that no real flexibility is sacrificed.
None of this is extremely performance-critical (it is going to be quite fast regardless, because very little is done compared to even just running a shell), so no algorithm is optimized for speed directly. For example, the mount option parser table is somewhat compressed to reduce the code size (negation and recursive variants of mount options are not repeated), to the point where further reduction would likely sacrifice the readability of the code. Execution speed is achieved by doing very little, not by micro-optimizing algorithms.
Sometimes it is necessary to reimplement certain libc functions because
using the libc variants increase the image size too much. For example,
using inet_ntoa and inet_aton (to convert between ASCII to binary
representations of IP addresses) from the musl C library will cause
initramfs images (after compression) to be an additional 5 KiB larger
as compared to the own implementation.
Of course, changes that reduce the current code size even further (as long as the code remains readable) are very welcome.
There is no goal of adding too many additional features here, because any additional feature is going to increase the binary size, and this is supposed to be minimalistic and not a replacement for a full initramfs. If you need advanced features, please use an already existing solution. That said, there are a couple of things that might be interesting regardless:
- Minimalistic NFSv2/3 mounting support (akin to the current NFSv4 code).
- Maybe support host name lookups for NFS mounts? (Probably not going to happen, as an own DNS resolver will likely increase the image size by too much - and is likely going to be rather complicated.)
- Support
UUID=for more filesystems, as long as they are really simple. Currently, the implementation checks the magic bytes of a given file system on the each device, and then compares the UUID at the right position in the file system metadata. (Seedevices.cfor details on how this is implemented for the currently supported file systems.) - Support for excluding MD/RAID/... devices when probing for UUIDs of file systems.
Note that the goal is to keep the initrd.img size smaller than 16 KiB
on all platforms, so a cutoff of 15 KiB is used on x86_64, to leave
room for different assembly code sizes etc., at least when used in a
minimal configuration. Therefore, some features (such as UUID= and
NFSv4 support) are compile-time optional.
Note: any features missing from tiny-initramfs that would be required in a space-constrained environment (i.e. mainly embedded), where it was designed for, stand an excellent chance of being included later, at least compile-time optional. Please make your case if you are missing something.
- bind mounts for /usr
- clean up the code a bit.
- go through all messages printed and make sure they are uniform in style