authors | state | discussion |
---|---|---|
Jason King <[email protected]> |
draft |
Today, when a SmartOS system boots (whether it be standalone, a Triton compute
node, or a Triton headnode), the filesystem/smartdc
service
attempts to identify which zpool visible to the system is the 'system' zpool
(the zpool containing the /var, /cores, /opt, etc. filesystems as well as the
zpool containing all of the zone filesystems). The service accomplishes this by
importing all zpools that are present and using the last zpool whose root
dataset contains a file named .system_pool
. It then sets the SMF property
config/zpool
in the smartdc/init
sevice to the name of the system zpool.
With the advent of encrypted compute nodes (aka EDAR / RFD 77), this presents
some issues. Most notably, one cannot reliably determine the system pool until
the keys for the root dataset of each zpool has been loaded. This also can
prevent the sdc-factoryreset
utility from working reliabily -- it relies on
setting a property on ${SYS_POOL}/var
to determine if the system zpool should
be destroyed, but it cannot determine which zpool is the system pool until
the pool's key has been loaded. If the ebox or yubikey protecting the ebox
for the pool has already been destroyed or reset, the pool cannot be unlocked
automatically, and the node factory reset will fail. This can be worked around
but it provides a less than ideal operator experience.
Since zfs properties are always available once a zpool has been imported (regardless of the encryption status), we propose moving this setting to a zfs property on the root dataset of a zpool.
The proposal here is to migrate this setting/flag into a zfs property on the
root dataset tentatively named smartdc:system_pool
. During system setup,
this property will be set on the system zpool in addition to creating
/.system_pool
(for backwards compatability).
When the filesystem/smartdc
service runs during boot, it will import all of
the pools present on the system unless the zpools
boot parameter is
given, then only the pools listed there are imported (i.e. no change here).
For each pool, if the smartdc:system_pool
property is present, the last
pool with the property set will be used for the system pool. In addition, if
multiple pools have the smartdc:system_pool
property set, a warning will
be written to the console to alert the operator.
If no pool is found with the property, the service will fall back to looking
for .system_pool
. Once found, the filesystem/smartdc
service will set the
smartdc:system_pool
property for forward compatability. If the system pool
is not found via searching for .system_pool
, we error out as we do today.
Note that we don't plan to create new zpools with the .system_pool
file,
but existing pool will continue to work with the described mechanism.
This design does mean that one cannot boot a SmartOS/Triton CN using a PI older than the PI version used to install the system. However, even if not explicitly stated, this is effectively the case today -- any instances where it does work are by accident than design. More specifically, when we create the zpool during setup (for either SmartOS or Triton), we've generally not disabled new zpool features. As a result, there is no guarantee an older PI will be able to import and use a zpool created on a new PI version. To be extra clear, this applies to new installs. The aforementioned forward compatibility mechanism described should still allow existing installs to revert back to earlier PIs as desired by the operator.
With this, the factory reset check (which looks at ${SYS_POOL}/var
) can be
proceed prior to unlocking of any encrypted zpools.
Once the system pool has been identified, we can unlock it as necessary.
Additionally, kbmd can then use the config/zpool
SMF property to determine
the system zpool instead of having to be informed of the value during boot
(since it has to unlock the system pool if the node is encrypted, kbmd must
start before the SMF property has been set).
The current proposal doesn't specify a value for smartdc:system_pool
-- it
just looks for its presence. If we want to have a definitive value here, the
question is what should it be -- 'yes/no', a GUID, something else? There
could be advantages to tieing the value to something on the system (e.g. the
smbios uuid). Any such behavior would need to consider things such as
chassis swaps (i.e. what would the procedure be to update the value) and
encrypted compute nodes.
An additional possibility might be to allow the zpool GUID to be optionally
specified in the boot parameters. When present, that value (and only that
value) is used to locate the system pool. All pools (or the ones in the
zpools
boot parameter when precent) could still be imported. However it
seems like the zpools
boot parameter can handle the same functionality, so
it seems unlikely an additional (even if optional) property is necessary.