Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

libpcap: added raw filters #563

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

chemag
Copy link

@chemag chemag commented Feb 28, 2017

The goal of supporting raw filters* is to provide libpcap/tcpdump support
for generic BPF insns, including those that are not-supported by libpcap
(e.g., the BPF_MOD/BPF_XOR ops in Linux, or any of the multiple ancillary
loads in linux). It also allows testing new kernel extensions to the
BPF ISA without having to modify libpcap/tcpdump.

We provide support by modifying pcap_compile() so that it first checks
for raw filters. This works for expressions appended in the command line,
and for expressions read from a file ("-F" option). Filters starting
with an integer and a valid separator (',' or '\n') are considered raw.
All other filters are considered (traditional) expressions.

We also make sure that filters compiled from raw filters are left for the
kernel to validate (added "skip_validate" to pcap_t).

*Raw filters are those generated by tcpdump -ddd. i.e.

$ ./tcpdump -ddd -i eth0 icmp
6
40 0 0 12
21 0 3 2048
48 0 0 23
21 0 1 1
6 0 0 65535
6 0 0 0

We also support replacing the new line characters with commas, as to
make possible to have inline filters.

$ ./tcpdump -ddd -i eth0 icmp |tr '\n' ','
6,40 0 0 12,21 0 3 2048,48 0 0 23,21 0 1 1,6 0 0 65535,6 0 0 0,

Some examples:
$ ./tcpdump -nn -i eth0 "6,40 0 0 12,21 0 3 2048,48 0 0 23,21 0 1 1,6 0 0 65535,6 0 0 0,"
$ ./tcpdump -nn -i eth0 -F ~/bpf/icmp.2.bpfraw

@guyharris
Copy link
Member

That's a bit of a big Git comment. Should we discard everything starting with "*Raw filters are those generated by tcpdump -ddd. i.e.", and change the first paragraph to read

The goal of supporting raw filters (those generated by tcpdump -ddd) ...

@chemag
Copy link
Author

chemag commented Feb 28, 2017

I like large comments (it's a doc, after all), but please feel free to remove what you decide

gencode.c Outdated
@@ -50,6 +50,10 @@

#endif /* _WIN32 */

#if defined(BSD) || (defined (__APPLE__) && defined(__MACH__))
#include <net/bpf.h>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So why does this need to be included? We're dealing with raw filters, i.e. raw BPF machine code, so there shouldn't be any need to look at the definitions for the kernel's BPF interpreter.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(Leftover from the previous patch).

Done

@guyharris
Copy link
Member

Is there any reason to run raw filters through the optimizer? Presumably if a user specifies a raw filter, they want exactly that chunk of BPF machine code to be used.

@guyharris
Copy link
Member

Documentation for the user would belong in a man page; documentation for libpcap developers would belong in a comment in the code.

@chemag
Copy link
Author

chemag commented Feb 28, 2017

Documentation for the user would belong in a man page; documentation for libpcap developers would belong in a comment in the code.

I removed all the test notes.

@chemag
Copy link
Author

chemag commented Feb 28, 2017

Is there any reason to run raw filters through the optimizer? Presumably if a user specifies a raw filter, they want exactly that chunk of BPF machine code to be used.

AFAICT, we don't run the optimizer when we call pcap_compile_raw(). In that case, pcap_compile() returns before calling the optimizer

@infrastation
Copy link
Member

Please note this pull request failed to build on MacOS.

gencode.c Outdated

if (sscanf(bpf_string, "%hu%c", &bpf_len, &sp) != 2 ||
(sp != separator1 && sp != separator2) ||
bpf_len > BPF_MAXINSNS || bpf_len == 0) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's not a syntax error.

In addition, it requires that BPF_MAXINSNS be available in a header file, which it might or might not be, and the header file depends on the OS. Remember, this has to compile on, at minimum, various versions of:

  • various Linux distributions
  • various BSDs, including Darwin
  • Solaris
  • AIX
  • HP-UX

and the header file on the system on which it's compiled might, or might not, match the system on which it's running.

If we're going to impose a limit on the number of instructions (the main reason for which would be to keep from allocating a huge blob of memory and eating up most of the address space or backing store), if we're going to use an unsigned short, we might as well just impose a limit of 65535 instructions and let the kernel complain if it has a lower limit.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Your suggestion is ''s/BPF_MAXINSNS/65535/', right?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that my kernel (3.13.0) defines BPF_MAXINSNS to be 4096. I doubt there is an issue with moving from 4096 to 64k (raw filters are only used from tcpdump clients).

Another question: do you prefer me harcoding the 64k constant, or something like:

#ifndef BPF_MAXINSNS
#define BPF_MAXINSNS 65536
#endif

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did the ifndef code. Also used 4096 instead of 64k (that's what the kernel defines).

@guyharris
Copy link
Member

Also used 4096 instead of 64k (that's what the kernel defines).

Actually, the kernel defines it as 512, as do the kernel and the kernel and the kernel and the kernel. The kernel also appears to do so, although Oracle's version may have diverged from the last OpenSolaris version.

Unfortunately, the kernel isn't open-source, so I can't post a URL. The kernel-mode driver also defines it as 512.

Oh, and the kernel doesn't have BPF, so it doesn't define it as anything.

Translation: there's no such thing as "the kernel" in the context of libpcap; there are a number of kernels it deals with - that's the whole point of libpcap; it hides, as best it can, the variety of packet capture mechanisms, so that code can be written to run on several different OSes with a minimum of platform-specific #ifdefs.

Note also that any one of those kernels might change the value in the future.

So my inclination is not to pay attention to what any particular kernel happens to choose, and not to make an effort to try to find the appropriate header on various different platforms. Just pick an arbitrary maximum, and give it a name other than BPF_MAXINSNS, to emphasize that it's a libpcap limit rather than any particular OS's limit.

@chemag
Copy link
Author

chemag commented Mar 1, 2017

Actually, the kernel defines it as 512, as do the kernel and the kernel and the kernel and the kernel ...

My bad. There's only one kernel for me :)

So my inclination is not to pay attention to what any particular kernel happens to choose, and not to make an effort to try to find the appropriate header on various different platforms. Just pick an arbitrary maximum, and give it a name other than BPF_MAXINSNS, to emphasize that it's a libpcap limit rather than any particular OS's limit.

Done

@chemag
Copy link
Author

chemag commented Mar 3, 2017

Ping

1 similar comment
@chemag
Copy link
Author

chemag commented Mar 10, 2017

Ping

@chemag
Copy link
Author

chemag commented Mar 28, 2017

One more ping...

@mcr
Copy link
Member

mcr commented Apr 26, 2019

if there is still interest, please rebase, thank you!

@chemag
Copy link
Author

chemag commented Apr 29, 2019

if there is still interest, please rebase, thank you!

Done.

Tested again, and added a "Tested:" section to the patch comment.

Thanks!

@chemag
Copy link
Author

chemag commented Apr 29, 2019

Also, per a previous comment from Guy, I moved a chunk of the comment to the pcap_compile man page

gencode.c Outdated Show resolved Hide resolved
gencode.c Outdated Show resolved Hide resolved
gencode.c Outdated Show resolved Hide resolved
gencode.c Outdated Show resolved Hide resolved
gencode.c Outdated Show resolved Hide resolved
The goal of supporting raw filters* is to provide libpcap/tcpdump support
for generic BPF insns, including those that are not-supported by libpcap
(e.g., the BPF_MOD/BPF_XOR ops in Linux, or any of the multiple ancillary
loads in linux). It also allows testing new kernel extensions to the
BPF ISA without having to modify libpcap/tcpdump.

We provide support by modifying pcap_compile() so that it first checks
for raw filters. This works for expressions appended in the command line,
and for expressions read from a file ("-F" option). Filters starting
with an integer and a valid separator (',' or '\n') are considered raw.
All other filters are considered (traditional) expressions.

We also make sure that filters compiled from raw filters are left for the
kernel to validate (added "skip_validate" to pcap_t).

Tested:

Ping 8.8.8.8 from another terminal. Then run:

```
$ ./tcpdump -n -i eno1 icmp
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eno1, link-type EN10MB (Ethernet), snapshot length 262144 bytes
11:44:29.556006 IP 1.2.3.4 > 8.8.8.8: ICMP echo request, id 16624, seq 1, length 64
11:44:29.589406 IP 8.8.8.8 > 1.2.3.4: ICMP echo reply, id 16624, seq 1, length 64
11:44:30.557593 IP 1.2.3.4 > 8.8.8.8: ICMP echo request, id 16624, seq 2, length 64
11:44:30.590918 IP 8.8.8.8 > 1.2.3.4: ICMP echo reply, id 16624, seq 2, length 64
...
```

OK, this seems to be working. Now let's try a raw filter:

```
$ ./tcpdump -ddd -i eth0 icmp |tr '\n' ','
6,40 0 0 12,21 0 3 2048,48 0 0 23,21 0 1 1,6 0 0 65535,6 0 0 0,

$ ./tcpdump -n -i eno1 "6,40 0 0 12,21 0 3 2048,48 0 0 23,21 0 1 1,6 0 0 262144,6 0 0 0,"
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eno1, link-type EN10MB (Ethernet), snapshot length 262144 bytes
11:45:04.820334 IP 1.2.3.4 > 8.8.8.8: ICMP echo request, id 16715, seq 1, length 64
11:45:04.853608 IP 8.8.8.8 > 1.2.3.4: ICMP echo reply, id 16715, seq 1, length 64
11:45:05.821841 IP 1.2.3.4 > 8.8.8.8: ICMP echo request, id 16715, seq 2, length 64
11:45:05.855157 IP 8.8.8.8 > 1.2.3.4: ICMP echo reply, id 16715, seq 2, length 64
...
```
@infrastation
Copy link
Member

On one hand, merging these changes into libpcap would improve consistency in tcpdump: since it can print the compiled bytecode with -ddd, it would be reasonable if it could parse and use this compiled bytecode (as iptables -I INPUT -m bpf --bytecode does), and for that libpcap (but not the programs that use libpcap, as Guy explained) would have to recognize it.

On the other hand, the "ddd" format has a pitfall, in that it does not specify for which DLT the expression was compiled. This, for example, creates the space for things quietly going wrong when the end user compiles a filter using the popular DLT_EN10MB type, whereas in the iptables context above the bytecode is applied to what effectively is DLT_RAW.

So perhaps this change and cBPF savefile address related, but distinct use cases, and the only coordination between the two should be that the complete API in the end makes as much sense as possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

Successfully merging this pull request may close these issues.

4 participants