Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kernel symbols are updated without required capabilities #4325

Open
oshaked1 opened this issue Sep 24, 2024 · 8 comments · May be fixed by #4464
Open

Kernel symbols are updated without required capabilities #4325

oshaked1 opened this issue Sep 24, 2024 · 8 comments · May be fixed by #4464
Assignees
Labels
Milestone

Comments

@oshaked1
Copy link
Contributor

Description

The kernel symbol table utility relies on CAP_SYSLOG in order to be able to read from /proc/kallsyms and get actual addresses (otherwise addresses are zeroed-out).

When NewKernelSymbolTable is called when initializing tracee, the correct capability is held. But later calls to helpers like GetSymbolByName do not acquire CAP_SYSLOG, even though the helper may end up reading /proc/kallsyms again.

This causes lookups for symbols that were not initially added to t.requiredKsyms to produce unusable results (and no error is generated). An example for such a lookup occurs when fetching the address for attaching a kprobe belonging to an event that is needed as a dependency (and not selected directly).

The simplest way to reproduce this error is by deleting dist/signatures/builtin.so (because some signatures select events that list CAP_SYSLOG as a dependency), and running tracee without any arguments:

$ sudo dist/tracee > /dev/null
{"level":"warn","ts":1727174815.8476572,"msg":"libbpf: prog 'trace_load_elf_phdrs': failed to create kprobe '+0x0' perf event: Invalid argument"}

As can be seen, this program which is an (unrequired) dependency of sched_process_exec, is being loaded to address 0 and fails.

@geyslan
Copy link
Member

geyslan commented Sep 24, 2024

I was able to reproduce it:

  1. rm dist/signatures/builtin.so
  2. sudo dist/tracee -o none
{"level":"warn","ts":1727195441.2103937,"msg":"libbpf: prog 'trace_load_elf_phdrs': failed to create kprobe '+0x0' perf event: Invalid argument"}

@oshaked1 are you working on solving this? LMK.

@oshaked1
Copy link
Contributor Author

I could solve it by acquiring the capability in the refresh method instead of relying on the callers to acquire it. Do you think that would be a good solution?

@geyslan
Copy link
Member

geyslan commented Sep 25, 2024

I could solve it by acquiring the capability in the refresh method instead of relying on the callers to acquire it. Do you think that would be a good solution?

I think we opted to not acquire caps in inner functions since they can be called from other outers acquires, is that correct @NDStrahilevitz?

@NDStrahilevitz
Copy link
Collaborator

I could solve it by acquiring the capability in the refresh method instead of relying on the callers to acquire it. Do you think that would be a good solution?

I think we opted to not acquire caps in inner functions since they can be called for other outers acquires, is that correct @NDStrahilevitz?

I'm sorry @geyslan, I'm not sure I got what you mean. Could you rephrase?

Anyway, AFAIK events that require continuous use of ksyms declare the CAP_SYSLOG in their dependency such that it is included in the base ring. So before adding the capability inside the struct (which I would rather not do so that it is not coupled to the capabilities mechanism) we should:

  1. Confirm that all events which require SYSLOG declare that capability
  2. The dependency declaration does indeed add the capability to the base ring
  3. We have no inbetween calls (before adding the cap to the base ring and after the initial capability drop)

@NDStrahilevitz
Copy link
Collaborator

@oshaked1 as for the usecase you've described, searching the symbol of a kprobe attachment), I find it likely that we can add a capability ring increase around that code as it has a clearly defined scope.

@geyslan
Copy link
Member

geyslan commented Sep 25, 2024

I'm sorry @geyslan, I'm not sure I got what you mean. Could you rephrase?

Acquiring caps in between caps calls.

But I believe you already answered with:

which I would rather not do so that it is not coupled to the capabilities mechanism

We have no inbetween calls (before adding the cap to the base ring and after the initial capability drop)

@oshaked1
Copy link
Contributor Author

@oshaked1 as for the usecase you've described, searching the symbol of a kprobe attachment), I find it likely that we can add a capability ring increase around that code as it has a clearly defined scope.

Yes it should be straightforward to do so. We should also add a disclaimer in the docstrings of the lookup functions because the capability requirement is unintuitive considering the symbol table was created prior to performing lookups.

@yanivagman yanivagman added this to the v0.24.0 milestone Dec 11, 2024
@oshaked1 oshaked1 linked a pull request Dec 25, 2024 that will close this issue
@yanivagman yanivagman linked a pull request Dec 29, 2024 that will close this issue
@NDStrahilevitz
Copy link
Collaborator

@oshaked1 Assigned you as your PR closes this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants