- ULP Development

An update on system integration
by Simon Lees 17 Aug '21

17 Aug '21

Hi All, Here is a bit of an update on some of the system integration tasks I have been working on. 1. I have created a new utility ulp_buildid (open to suggestions on the name), this solves the problem of when we have multiple choices for which live patch to apply how do we choose. This utility takes a pid and libname and returns the NT_GNU_BUILD_ID. It is a long time since I have written more then a few lines of C at a time and back then I was working on a pretty old C compiler so any feedback and constructive criticism is more then welcome. https://github.com/SUSE/libpulp/pull/34 2. I also created a tool called ulp_apply which does a similar role to the dispatcher lua script, currently it takes a lib name and .ulp file and applies the patch to all running programs. Now that I have ulp_buildid I can drop the need for passing in the .ulp file ./ulp-apply "/usr/lib64/libcrypto.so.1.1" "/usr/lib64/openssl-1_1-livepatches/libcrypto_livepatch1.ulp" A work in progress version can be found here https://github.com/simotek/libpulp/blob/tools/tools/ulp_apply At some point we need to decide whether we move forward with this bash script or the dispatcher lua script. 3. As a debugging script I created a very simple script ulp_pids which will give you the pid and executable name of each process with libpulp loaded. 4. I created an experimental package using multibuild to try and build live patches in the simplest way possible. Using this approach all you would need to do is add the respective versions to the _multibuild file. However currently it doesn't work as obs only finds the latest version, I will chase this up with the obs team to see if what i'm trying to do is possible. Other things to note here is the use of Supplements: (libopenssl1_1 and libpulp-tools) which means if you have the repository with this package enabled it will automatically be installed if openssl and libpulp-tools are on the system. It also calls ulp_apply in the %posttrans section with a temporary file as a guard to ensure that live patches are only applied once per library. ## Whats Next ## Currently ulp_reverse takes a .ulp file as a parameter but the "ulp" program only provides us with the .so file that has been patched. So I either need to modify ulp_reverse to take the .so file as a parameter or modify "ulp" to also list metadata files or do something like ulp_buildid to get such info. Another thing I need to decide is whether to add a parameter to ulp_dump to just return the build_id or whether I just parse the full output in whichever script we end up using. I will probably also consider doing something similar for "ulp" just to return the list of live patches as thats all my script will need, primarily to assess whether I need to reverse an existing live patch at the start of the update. Once this is done we should have a fully functioning system. Cheers -- Simon Lees (Simotek) http://simotek.net Emergency Update Team keybase.io/simotek SUSE Linux Adelaide Australia, UTC+10:30 GPG Fingerprint: 5B87 DB9D 88DC F606 E489 CEC5 0922 C246 02F0 014B

1 0

backtraces, unwinding and changing consistency model
by Michael Matz 26 Apr '21

26 Apr '21

Hello, if people follow the git repo of ulp they might have noticed that the consistency model of checking library entries and exits was disabled. This is to describe the background for that (if for nothing else than us not forgetting the reasons :-) ). So, the consistency model basically answers the question "is this livepatch currently safe to activate for that thread?". There are many methods that can be designed for this (and it includes the trivial "yes!" model), but fairly from the beginning we settled on a model that would track library entries and exits, and only consider live-patch application safe if the library affected by the live patch was not currently active (on the call stack). There are certain conditions and fact that need to be taken into account for this to work: * for library entry tracking you need exit tracking * you need to do the tracking right from process start, even without any live patches loaded or active; otherwise you might see unbalancend entry/exits and the library might be considered non-entered even when it's entered at patch application tyime * we want the tracking to be per-thread, otherwise one thread might block application of the live patch for the whole process indefinitely We do the entry tracking by function entry point redirection (for exported functions). That's easy. Exit tracking is harder. Without much toolchain help we can't redirect the return path, so we resorted to frame stealing: * in the entry tracker we modify the original return address (on stack or return register) to point to the exit tracker, and store the original return address at $place; the exit tracker then restores things and ultimately returns to the original return address Now, there are further considerations for $place: * we can't modify the upward stack: they might hold function arguments * we can't use downward stack: it will be considered changable by the called functions * we can't allocate our own space on stack: function arguments to the original callee are relative to the stack pointer (or frame pointer derived from it), so changing the stack pointer would invalidate that. Even if we consider addresses of stack slots for argument passing to be undefined we still would need to copy the incoming arguments to a new place, with the problem that the entry tracker doesn't know _how much_ to copy (in the general case, e.g. variadic functions, only the caller knows how much space the arguments needed on the stack). So, we can't have $place be on the stack. But it needs to be per-thread. So, there's only one possibility: thread-local storage in one or another way. That's indeed the solution taken by libpulp, it stores the original return address into some TLS space and all was good. Well, except that backtraces don't work then. libpulp took precaution to mark the stolen frames as not backtracable (i.e. as stop points for back traces), and at least you would see in gdb that the exit tracker was the top-most frame and be reminded that something special was going on. But backtraces aren't only a pure debug facility. They are used for frame unwinding while throwing exceptions and for pthread cancellation, so backtraces not working over shared library borders also mean exceptions and thread cancellation not working over shared library borders. That's of course not acceptable. So, ideally we would fix backtraces to do work with the entry/exit tracking. Turns out that this isn't entirely trivial. During the process of unwinding there is one thing that needs to be done for each unwound frame, amongst other things: given the current frame and register state, get the return address of the calling frame. This requires certain pieces of information about the frames, for which there are multiple possiblities: windows uses standardized code layout plus custom info pieces per function, linux most often uses DWARF unwind information to describe frame layout and return address (arm and aarch64 uses some more compact form). The important thing to know is that the unwinder has some internal state of the current frame and when trying to get the return address for it, it essentially is given a symbolic expression (on linux a DWARF expression) that the unwinder interprets to calculate the value. (One example of such symbolic expression would be: "add 8 to the frame pointer; that's the place containing the current return address"). This expression comes from the program itself (e.g. in the .eh_frame section), so the program itself can specify how the unwinder needs to calculate things according to how the program was produced. So, for the stolen frame the return address is stored at some TLS place, so that's what the DWARF expression for the return address needs to say. Luckily there are DWARF operations that specify TLS addresses: DW_OP_GNU_push_tls_address and DW_OP_form_tls_address, i.e. a GNU extension or DWARF v3, so we can describe the situation we have in the unwind information. Very and extremely unfortunately our normal unwinder (in libgcc_s) doesn't support this operation in its expression interpreter :-/ What's worse, its structure is basically this: switch (dwarf->opcode) { ... nothing with tls ... default: abort(); } i.e. whenever the current unwinder would hit onto a frame that used the DWARF TLS opcodes it would abort the program. That's even worse than unwinding not working for one thread. Now, we could fix that unwinder (which we'll do). That leaves an unknown number of other unwinders in the world: they might be statically linked variants of the libgcc_s unwinder, or they might be completely different unwinders from unknown sources; but it's reasonable to assume that they don't implement the TLS opcodes either (no matter if they abort on unknown opcodes or not). Either way, simply using the dwarf TLS opcodes in our libraries exposes us to potential and unknown instability due to non-support in the unwinders, something that live patching is supposed to protect against ;-) We have no real solution to the above; a few can be imagined though: * fix all unwinders to accept TLS opcodes * make $place not be TLS (seems impossible, but who knows?) * don't steal frames: exit tracking then needs to be done different, e.g. by toolchain improvements to also be able to redirect the return paths of functions * don't do entry/exit tracking at all For now we resort to the last of these, ditch entry/exit tracking. This gives us a much weaker consistency model, and hence more care needs to be applied when constructing a live patch. We deem that to be okay for now. We want to make it so that a live patch can request a certain consistency model, so that in the future a entry/exit tracker, or something completely different, can be (re)implemented and enlarge the set of possible live patches. (One obvious alternative consistency checker would be one looking at backtraces itself to see if problematic functions are currently active). There's an advantage to not doing library entry/exit tracking: performance. The indirection through the tracking code, in particular the need to use TLS for the original return address and the "we're-in-that-lib" flag, are not cheap. Together with the fact that lib entry/exit has to be tracked right from process start makes the performance impact for userspace live patching noticable (not terribly so, but measurable). Obviously without such tracking we don't pay that cost at all. At least something ;-) Ciao, Michael.

1 0

Attempt to build a "hello world" live patch
by Libor Pechacek 16 Mar '21

16 Mar '21

Hello, I've tried to build a "hello world" demo live patch for a simple application. I focused on minimal code and applied a "cargo-cult" approach. First, I built an applicaiton with a shared library. ------------8<------------ main.c: #include <unistd.h> extern void workload(); int main(void) { int counter; for(counter = 0; counter < 30; counter++) { workload(); sleep(2); } } workload.c: #include <stdio.h> void workload(void) { printf("Hello World!\n"); } Compiled as: $ gcc -shared -fPIC -fpatchable-function-entry=40,38 -I/data/src/libpulp/include -o libworkload.so /data/src/libpulp/lib/trm.S workload.c $ /data/src/libpulp/tools/ulp_post libworkload.so $ gcc -Wl,-rpath,. -L. -o main main.c -lworkload And it works: $ ./main Hello World! Hello World! ^C ------------>8------------ Then I prepared a patch for workload(). ------------8<------------ workload_patch.c: #include <stdio.h> void workload_modernized(void) { printf("hello, world\n"); } libworkload_patch.dsc: /data/src/libpulp/demo/libworkload_patch.so @/data/src/libpulp/demo/libworkload.so workload:workload_modernized Compiled as: $ gcc -shared -fPIC -o libworkload_patch.so workload_patch.c $ /data/src/libpulp/tools/ulp_packer libworkload_patch.dsc libworkload_patch.ulp ------------>8------------ At this point, I believe I have all the bits in place and I want to try live patching of 'main'. ------------8<------------ $ LD_PRELOAD=/data/src/libpulp/lib/.libs/libpulp.so ./main & [1] 27910 libpulp loaded... Hello World! $ /data/src/libpulp/tools/ulp_trigger "$(pidof main)" /data/src/libpulp/demo/libworkload_patch.ulp ulp: to be patched object (/data/src/libpulp/demo/libworkload.so) not loaded. ------------>8------------ What am I doing wrong, apart from the uneducated approach, that ulp_trigger complaints about the missing libworkload.so? Libor Side notes: - It is unclear how to build live patches. README.md contains the high-level overview but not concrete steps or a pointer to a "how to". - ulp_packer help is wrong. It says "packer <descr> <.so> [.ulp]" while it's now "packer <descr> <.ulp>" - The role of ulp_post is unclear in the process. There is a clue in the commit log that introduces it but it was beyong my current knowledge level. - I've inferred compiler parameters and command usage from what I saw in "make check" output. - ulp_trigger says nothing in case libpulp.so is not preloaded. I suggest that it prints some diagnostic message. -- Libor Pechacek SUSE Labs Remember to have fun...

3 9

[ulp-devel] Re: Live patching MC at LPC2020?
by Michael Matz 01 Jul '20

01 Jul '20

Hello, On Tue, 30 Jun 2020, Josh Poimboeuf wrote: > On Thu, Jun 25, 2020 at 08:59:43AM +0200, Petr Mladek wrote: > > On Tue 2020-03-31 16:52:04, Joe Lawrence wrote: > > It seems that there is interest into sharing/discussing some topics. > > The question is whether is has to be under the LPC even umbrella. > > > > Advantages of LPC: > > > > + well defined date > > + more attendees (ARM people, Steven Rostedt ;-) > > + access to some powerful video conference tool > > + access to another LPC content > > + support for the conference in the long term > > > > > > Advantages of self-organized event: > > > > + less paperwork? > > + cheaper? > > + only interested people invited > > + date after summer holidays > > + more time for the discussion > > > > I am in the favor of self organized event. For me, LPC is much less > > interesting without the personal contact and hallway conversations. > > All the LPC date is not ideal for me. > > I'd prefer LPC proper, as it would be easier (infrastructure is already > taken care of) and more inclusive (in the past we often got good > feedback from outside the direct livepatch community). And it's only > $50 US. Yeah, I think having more attendees would be valuable. > But to be honest I have doubts about the usefulness of any online > conference, so either way may be equally useless ;-) Indeed. OTOH our fellow openSUSE guys seemed to have held a fairly successful virtual summit earlier this year with some in-development virtual-conferencing software that does more than just video/audio streaming ( https://requiredmagic.com/roundtable/ ). In particular the possibility to have interactive Q&A with the presenter during and after the allotted talk slot seemed to have made up for hallway discussions. Maybe that's a viable way. (I personally wasn't at the virtual summit to know for sure, but the experience report from some of them was quite positive; I know nothing about the conferencing software of LPC) Ciao, Michael. -- To unsubscribe, e-mail: ulp-devel+unsubscribe(a)opensuse.org To contact the owner, e-mail: ulp-devel+owner(a)opensuse.org

1 0

[ulp-devel] Info on KLP architecture
by Libor Pechacek 07 Apr '20

07 Apr '20

Hello, I've been asked to share information on the architecture of kernel live patching userspace infrastructure so that it can serve as a model for the ULP implementation. The description is intentionally high-level for the sake of keeping the text short. More technical details can be provided in areas of your interest. The main areas covered below are 1) Building patches in Build Service 2) Binding patches to the respective RPM packages 3) Patching upon package installation 4) KLP tool Building patches in Build Service --------------------------------- To state the obvious, live patching patches *old* packages. I.e. a fix for the kernel goes out both as a live patch RPM and the rebuilt kernel RPM. Kernel live patches bind deeply into kernel code, so we do build the live patch against the previously released kernel package. However, Build Service keeps only the latest RPM for build and throws away old packages. For that reason, live patches are built against so-called "maintenance projects" which hold the historical sources and binary RPMs. Binding patches to the respective RPM packages ---------------------------------------------- Now that we have a kernel module, we've got to tell the packaging system to install it only for the corresponding kernel. Mind that there may be multiple kernel packages installed on the system and one of the kernels is running. The binding used to be done via version+release numbers but that required some special tweaks in Build Service. Recently, we've changed that to (GIT) source hash binding, assuming that all binaries generated from the same sources will be equal[1]. The kernel package provides "kernel-<flavor>-srchash-<hash>" symbol and the live patch requires the same. The benefit is that the source hash is known at package submission time. In addition, there is RPM "Supplements: packageand(kernel-<flavor>-srchash-<hash>:kernel-livepatch-tools)", which is a zypper specific way to pull the live patch into the system when both the kernel package and kernel-livepatch-tools are being installed. Patching upon package installation ---------------------------------- RPM post-install script triggers `rpm-helper` that tries to load the live patch into the running kernel if applicable. Then it refreshes initrd to include the live patch so that it can be loaded upon (unplanned?) reboot. `rpm-helper` script is packaged in kernel-livepatch-tools. KLP tool -------- There is a system inspection tool called `klp`. The tool allows the user to check what patches are loaded into the kernel, which vulnerabilities are fixed by the patch and what is the system status. In the past, it also performed some maintenance tasks, which, however, are no longer necessary thanks to the improved kernel infrastructure. I hope that the above information provides insight into the inner workings of the kernel live patching and can serve as a starting point for the ULP implementation. Libor [1] Yes, I know this assumption broke many times in the past but we are still surviving. -- Libor Pechacek SUSE Labs Remember to have fun... -- To unsubscribe, e-mail: ulp-devel+unsubscribe(a)opensuse.org To contact the owner, e-mail: ulp-devel+owner(a)opensuse.org

1 0

[ulp-devel] Re: Live patching MC at LPC2020?
by Joe Lawrence 31 Mar '20

31 Mar '20

On Fri, Mar 27, 2020 at 02:20:52PM +0100, Jiri Kosina wrote: > Hi everybody, > > oh well, it sounds a bit awkward to be talking about any conference plans > for this year given how the corona things are untangling in the world, but > LPC planning committee has issued (a) statement about Covid-19 (b) call > for papers (as originally planned) nevertheless. Please see: > > https://linuxplumbersconf.org/ > https://linuxplumbersconf.org/event/7/abstracts/ > > for details. > > Under the asumption that this Covid nuisance is over by that time and > travel is possible (and safe) again -- do we want to eventually submit a > livepatching miniconf proposal again? > > I believe there are still kernel related topics on our plate (like revised > handling of the modules that has been agreed on in Lisbon and Petr has > started to work on, the C parsing effort by Nicolai, etc), and at the same > time I'd really like to include the new kids on the block too -- the > userspace livepatching folks (CCing those I know for sure are working on > it). > Hi Jiri, First off, I hope everyone is riding out COVID-19 as well as possible, considering all that's happening. As for LPC mini-conf topics, I'd be interested in (at least): - Petr's per-object livepatch POC - klp-convert status - objtool hacking - Nicolai's klp-ccp status - arch update (arm64, etc) > So, please if you have any opinion one way or the other, please speak up. > Depending on the feedback, I will be fine handling the logistics of the > miniconf submission as last year (together with Josh I guess?) unless > someone else wants to step up and volunter himself :) > > (*) which is totally unclear, yes -- for example goverment in my country > has been talking for border closure lasting for 1+ years ... but it > all depends on how things develop of course). Hmm, all good points. Some conferences have gone virtual to cope with necessary cancellations, but who knows what things will look like even at the end of August. Perhaps we can still do something remotely if the conditions dictate it. But my vote would be yes, and let's see what topics interest folks. Regards, -- Joe -- To unsubscribe, e-mail: ulp-devel+unsubscribe(a)opensuse.org To contact the owner, e-mail: ulp-devel+owner(a)opensuse.org

1 0

[ulp-devel] Re: Live patching MC at LPC2020?
by Michael Matz 30 Mar '20

30 Mar '20

Hello, On Fri, 27 Mar 2020, Jiri Kosina wrote: > oh well, it sounds a bit awkward to be talking about any conference plans > for this year given how the corona things are untangling in the world, but > LPC planning committee has issued (a) statement about Covid-19 (b) call > for papers (as originally planned) nevertheless. Please see: > > https://linuxplumbersconf.org/ > https://linuxplumbersconf.org/event/7/abstracts/ > > for details. > > Under the asumption that this Covid nuisance is over by that time and > travel is possible (and safe) again -- do we want to eventually submit a > livepatching miniconf proposal again? I think that's very optimistic, but in case the conference really can happen I think it'd be useful to have such mini-conference (from the userspace perspective) Ciao, Michael. -- To unsubscribe, e-mail: ulp-devel+unsubscribe(a)opensuse.org To contact the owner, e-mail: ulp-devel+owner(a)opensuse.org

1 0

[ulp-devel] Lame test
by Michael Matz 18 Nov '19

18 Nov '19

-- To unsubscribe, e-mail: ulp-devel+unsubscribe(a)opensuse.org To contact the owner, e-mail: ulp-devel+owner(a)opensuse.org

1 0