From patchwork Thu Sep 27 23:26:47 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Joe Stringer X-Patchwork-Id: 975957 Return-Path: X-Original-To: patchwork-incoming-netdev@ozlabs.org Delivered-To: patchwork-incoming-netdev@ozlabs.org Authentication-Results: ozlabs.org; spf=none (mailfrom) smtp.mailfrom=vger.kernel.org (client-ip=209.132.180.67; helo=vger.kernel.org; envelope-from=netdev-owner@vger.kernel.org; receiver=) Authentication-Results: ozlabs.org; dmarc=none (p=none dis=none) header.from=wand.net.nz Authentication-Results: ozlabs.org; dkim=fail reason="signature verification failed" (2048-bit key; unprotected) header.d=gmail.com header.i=@gmail.com header.b="pLz1xriE"; dkim-atps=neutral Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by ozlabs.org (Postfix) with ESMTP id 42LrZ11kvkz9s1c for ; Fri, 28 Sep 2018 09:27:20 +1000 (AEST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727407AbeI1Fru (ORCPT ); Fri, 28 Sep 2018 01:47:50 -0400 Received: from mail-yw1-f66.google.com ([209.85.161.66]:43127 "EHLO mail-yw1-f66.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725889AbeI1Frt (ORCPT ); Fri, 28 Sep 2018 01:47:49 -0400 Received: by mail-yw1-f66.google.com with SMTP id k66-v6so1838071ywa.10 for ; Thu, 27 Sep 2018 16:27:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:from:to:cc:subject:date:message-id; bh=4U9PSxfEFsa/YOXFNClDF8rOCu/XpQfO08/V7rVhLcI=; b=pLz1xriEGhbrHRyHfzR9ZL50SMhAk03xqX2fzpryv0GBoJBBp/h+ZpLzqsj3OvgOgt C7AW596XX8WjSVwvvq4C4ENPrTw9SWd5QGWFY9oV7/KjBnXPQiz+KgduDq8lndgKWext kRs+HbMShH1+yZfuMWLCagn0bcWgqNhRhMYzYu5YYn0KVhD+UmioDr7xR+2srzgXuwfr bxaxKCyr47eolPa9QnRTbsmLIK+ULTbXj74I2TH7FCJzyOlDiUutIYoMolfrg6PBF9lN 37WwJKEX7Qg6YVsjhS7ygjjDCpUjl1yCqmVSonA/GzWFSb8PRTH/CWUxSqaX3aeys5cK TSBQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:from:to:cc:subject:date:message-id; bh=4U9PSxfEFsa/YOXFNClDF8rOCu/XpQfO08/V7rVhLcI=; b=g5IEnEUGX8c6IAUUYq7XGacT5LnwoJuDUz9IWSS/0xYSKNQoOWJixBfvQOlQZv2R+t CdRF3UoK8cV9qNssOJ5HWhPhYr/hmw9NSDRCgTHtbhb1uKMJCYRN5gmaKhdz9iTn4n3R g0kE3X+NbNGq/10mDUO4B8YlxdPWFK7C3l6S3b01CfFaCYPCReNhBuhmX6lhWj8eFWTy 9WlkfJXC5GwSEjeZQibbMyW7VKiOHiUJQT8ffQoqLvK2r6vGP5uGoQWCH5WDMvuvpij9 23Qw4dwqBmjHFhidMR6jWlhGOBYAio2+SvlNvZkFdJVL/cogZmFHBmuEq9WHBGD/B8Hm 7Hww== X-Gm-Message-State: ABuFfoiM7nJCGkJRXv95SbpI1YUYauGLYDA67zN02QlyGLDx8efWTHJq PhBzKwrVTpGsIjZYIHk0hss= X-Google-Smtp-Source: ACcGV60/aFpDE9Qp7G78Hpa/nj8heEiVbMcP26o1mclJsGnZQLjOkzKvoE6iUtWrlYjMJYgqRUeCvA== X-Received: by 2002:a81:1153:: with SMTP id 80-v6mr7148482ywr.302.1538090824164; Thu, 27 Sep 2018 16:27:04 -0700 (PDT) Received: from localhost.localdomain ([99.0.85.34]) by smtp.gmail.com with ESMTPSA id w6-v6sm4810717ywg.3.2018.09.27.16.27.02 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 27 Sep 2018 16:27:03 -0700 (PDT) From: Joe Stringer To: daniel@iogearbox.net Cc: netdev@vger.kernel.org, ast@kernel.org, john.fastabend@gmail.com, tgraf@suug.ch, kafai@fb.com, nitin.hande@gmail.com, mauricio.vasquez@polito.it Subject: [PATCHv3 bpf-next 00/12] Add socket lookup support Date: Thu, 27 Sep 2018 16:26:47 -0700 Message-Id: <20180927232659.14348-1-joe@wand.net.nz> X-Mailer: git-send-email 2.17.1 Sender: netdev-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org This series proposes a new helper for the BPF API which allows BPF programs to perform lookups for sockets in a network namespace. This would allow programs to determine early on in processing whether the stack is expecting to receive the packet, and perform some action (eg drop, forward somewhere) based on this information. The series is structured roughly into: * Misc refactor * Add the socket pointer type * Add reference tracking to ensure that socket references are freed * Extend the BPF API to add sk_lookup_xxx() / sk_release() functions * Add tests/documentation The helper proposed in this series includes a parameter for a tuple which must be filled in by the caller to determine the socket to look up. The simplest case would be filling with the contents of the packet, ie mapping the packet's 5-tuple into the parameter. In common cases, it may alternatively be useful to reverse the direction of the tuple and perform a lookup, to find the socket that initiates this connection; and if the BPF program ever performs a form of IP address translation, it may further be useful to be able to look up arbitrary tuples that are not based upon the packet, but instead based on state held in BPF maps or hardcoded in the BPF program. Currently, access into the socket's fields are limited to those which are otherwise already accessible, and are restricted to read-only access. Changes since v2: * New patch: "selftests/bpf: Generalize dummy program types". This enables adding verifier tests for socket lookup with tail calls. * Define the semantics of the new helpers more clearly in uAPI header. * Fix release of caller_net when netns is not specified. * Use skb->sk to find caller net when skb->dev is unavailable. * Fix build with !CONFIG_NET. * Replace ptr_id defensive coding when releasing reference state with an internal error (-EFAULT). * Remove flags argument to sk_release(). * Add several new assembly tests suggested by Daniel. * Add a few new C tests. * Fix typo in verifier error message. Changes since v1: * Limit netns_id field to 32 bits * Reuse reg_type_mismatch() in more places * Reduce the number of passes at convert_ctx_access() * Replace ptr_id defensive coding when releasing reference state with an internal error (-EFAULT) * Rework 'struct bpf_sock_tuple' to allow passing a packet pointer * Allow direct packet access from helper * Fix compile error with CONFIG_IPV6 enabled * Improve commit messages Changes since RFC: * Split up sk_lookup() into sk_lookup_tcp(), sk_lookup_udp(). * Only take references on the socket when necessary. * Make sk_release() only free the socket reference in this case. * Fix some runtime reference leaks: * Disallow BPF_LD_[ABS|IND] instructions while holding a reference. * Disallow bpf_tail_call() while holding a reference. * Prevent the same instruction being used for reference and other pointer type. * Simplify locating copies of a reference during helper calls by caching the pointer id from the caller. * Fix kbuild compilation warnings with particular configs. * Improve code comments describing the new verifier pieces. * Testing courtesy of Nitin This tree is also available at: https://github.com/joestringer/linux/commits/submit/sk-lookup-v3 Joe Stringer (12): bpf: Add iterator for spilled registers bpf: Simplify ptr_min_max_vals adjustment bpf: Generalize ptr_or_null regs check bpf: Add PTR_TO_SOCKET verifier type bpf: Macrofy stack state copy bpf: Add reference tracking to verifier bpf: Add helper to retrieve socket in BPF selftests/bpf: Generalize dummy program types selftests/bpf: Add tests for reference tracking libbpf: Support loading individual progs selftests/bpf: Add C tests for reference tracking Documentation: Describe bpf reference tracking Documentation/networking/filter.txt | 64 ++ include/linux/bpf.h | 34 + include/linux/bpf_verifier.h | 37 +- include/uapi/linux/bpf.h | 93 ++- kernel/bpf/verifier.c | 594 +++++++++++++--- net/core/filter.c | 181 ++++- tools/include/uapi/linux/bpf.h | 93 ++- tools/lib/bpf/libbpf.c | 4 +- tools/lib/bpf/libbpf.h | 3 + tools/testing/selftests/bpf/Makefile | 2 +- tools/testing/selftests/bpf/bpf_helpers.h | 12 + tools/testing/selftests/bpf/test_progs.c | 38 + .../selftests/bpf/test_sk_lookup_kern.c | 180 +++++ tools/testing/selftests/bpf/test_verifier.c | 670 +++++++++++++++++- 14 files changed, 1858 insertions(+), 147 deletions(-) create mode 100644 tools/testing/selftests/bpf/test_sk_lookup_kern.c