From patchwork Tue Nov 5 15:58:41 2013
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Christophe Gouault
X-Patchwork-Id: 288567
X-Patchwork-Delegate: davem@davemloft.net
Return-Path:
X-Original-To: patchwork-incoming@ozlabs.org
Delivered-To: patchwork-incoming@ozlabs.org
Received: from vger.kernel.org (vger.kernel.org [209.132.180.67])
	by ozlabs.org (Postfix) with ESMTP id 9AE802C0185
	for ; Wed, 6 Nov 2013 02:59:01 +1100 (EST)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1754848Ab3KEP64 (ORCPT ); Tue, 5 Nov 2013 10:58:56 -0500
Received: from 33.106-14-84.ripe.coltfrance.com ([84.14.106.33]:55608
	"EHLO proxy.6wind.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
	with ESMTP id S1752744Ab3KEP6z (ORCPT );
	Tue, 5 Nov 2013 10:58:55 -0500
Received: from bliss.dev.6wind.com (unknown [10.16.0.36])
	by proxy.6wind.com (Postfix) with ESMTPS id 600DA282B2;
	Tue, 5 Nov 2013 16:58:53 +0100 (CET)
Received: from gouault by bliss.dev.6wind.com with local (Exim 4.80)
	(envelope-from ) id 1Vdj1o-0006Cu-Ls; Tue, 05 Nov 2013 16:58:52 +0100
From: Christophe Gouault
To: christophe.gouault@6wind.com, Steffen Klassert, "David S. Miller"
Cc: Herbert Xu, netdev@vger.kernel.org, Saurabh Mohan, Sergei Shtylyov
Subject: [PATCH net v2] vti: fix spd lookup: match plaintext pkt, not ipsec pkt
Date: Tue, 5 Nov 2013 16:58:41 +0100
Message-Id: <1383667121-23798-1-git-send-email-christophe.gouault@6wind.com>
X-Mailer: git-send-email 1.7.10.4
In-Reply-To: <1383646612-30103-1-git-send-email-christophe.gouault@6wind.com>
References: <1383646612-30103-1-git-send-email-christophe.gouault@6wind.com>
Sender: netdev-owner@vger.kernel.org
Precedence: bulk
List-ID:
X-Mailing-List: netdev@vger.kernel.org

The vti interface inbound and outbound SPD lookups are based on the
ipsec packet instead of the plaintext packet.
Not only is it counterintuitive, it also restricts vti interfaces
to a single policy (whose selector must match the tunnel local and
remote addresses).

The policy selector is supposed to match the plaintext packet, before
encryption or after decryption.

This patch performs the SPD lookup based on the plaintext packet. It
allows several policies to be bound to the vti interface (via a mark
equal to the vti interface okey).

It remains possible to apply the same policy to all packets entering
the vti interface, by setting an any-to-any selector (src 0.0.0.0/0
dst 0.0.0.0/0 proto any mark OKEY).

Signed-off-by: Christophe Gouault
---
v2:
- Fixed comment style
- Checked with checkpatch.pl and sparse
---
 net/ipv4/ip_vti.c | 29 ++++++++++++++++++++++++++++-
 1 file changed, 28 insertions(+), 1 deletion(-)

diff --git a/net/ipv4/ip_vti.c b/net/ipv4/ip_vti.c
index 6e87f85..bcd85be 100644
--- a/net/ipv4/ip_vti.c
+++ b/net/ipv4/ip_vti.c
@@ -126,6 +126,7 @@ static int vti_rcv(struct sk_buff *skb)
 	if (tunnel != NULL) {
 		struct pcpu_tstats *tstats;
 		u32 oldmark = skb->mark;
+		u16 netoff = skb_network_header(skb) - skb->data;
 		int ret;


@@ -133,7 +134,13 @@ static int vti_rcv(struct sk_buff *skb)
 		 * only match policies with this mark.
 		 */
 		skb->mark = be32_to_cpu(tunnel->parms.o_key);
+		/* The packet is decrypted, but not yet decapsulated.
+		 * Temporarily make network_header point to the inner header
+		 * for policy check.
+		 */
+		skb_reset_network_header(skb);
 		ret = xfrm4_policy_check(NULL, XFRM_POLICY_IN, skb);
+		skb_set_network_header(skb, netoff);
 		skb->mark = oldmark;
 		if (!ret)
 			return -1;
@@ -166,6 +173,8 @@ static netdev_tx_t vti_tunnel_xmit(struct sk_buff *skb, struct net_device *dev)
 	struct iphdr *old_iph = ip_hdr(skb);
 	__be32 dst = tiph->daddr;
 	struct flowi4 fl4;
+	struct flowi fl;
+	u32 oldmark = skb->mark;
 	int err;

 	if (skb->protocol != htons(ETH_P_IP))
@@ -173,17 +182,35 @@ static netdev_tx_t vti_tunnel_xmit(struct sk_buff *skb, struct net_device *dev)

 	tos = old_iph->tos;

+	/* SPD lookup: we must provide a dst_entry to xfrm_lookup, normally the
+	 * route to the final destination. However this route is a route via
+	 * the vti interface. Now vti interfaces typically have the NOXFRM
+	 * flag, hence xfrm_lookup would bypass IPsec.
+	 *
+	 * Therefore, we feed xfrm_lookup with a route to the vti tunnel remote
+	 * endpoint instead.
+	 */
 	memset(&fl4, 0, sizeof(fl4));
 	flowi4_init_output(&fl4, tunnel->parms.link,
 			   be32_to_cpu(tunnel->parms.o_key), RT_TOS(tos),
 			   RT_SCOPE_UNIVERSE,
 			   IPPROTO_IPIP, 0,
 			   dst, tiph->saddr, 0, 0);
-	rt = ip_route_output_key(dev_net(dev), &fl4);
+	rt = __ip_route_output_key(tunnel->net, &fl4);
 	if (IS_ERR(rt)) {
 		dev->stats.tx_carrier_errors++;
 		goto tx_error_icmp;
 	}
+
+	memset(&fl, 0, sizeof(fl));
+	/* Temporarily mark the skb with the tunnel o_key, to look up
+	 * for a policy with this mark, matching the plaintext traffic.
+	 */
+	skb->mark = be32_to_cpu(tunnel->parms.o_key);
+	__xfrm_decode_session(skb, &fl, AF_INET, 0);
+	skb->mark = oldmark;
+	rt = (struct rtable *)xfrm_lookup(tunnel->net, &rt->dst, &fl, NULL, 0);
+
 	/* if there is no transform then this tunnel is not functional.
 	 * Or if the xfrm is not mode tunnel.
 	 */