Message ID: 1235525270.2604.483.camel@ymzhang
State: RFC, archived
Delegated to: David Miller
On Wed, 25 Feb 2009 09:27:49 +0800 "Zhang, Yanmin" <yanmin_zhang@linux.intel.com> wrote:

> Subject: hand off skb list to other cpu to submit to upper layer
> From: Zhang Yanmin <yanmin.zhang@linux.intel.com>
>
> Recently, I am investigating an ip_forward performance issue with a 10G IXGBE NIC.
> I run the test on 2 machines. Every machine has 2 10G NICs. The 1st one sends
> packets by pktgen. The 2nd receives the packets from one NIC and forwards them out
> from the 2nd NIC. As the NICs support multi-queue, I bind the queues to different logical
> cpus of different physical cpus while considering cache sharing carefully.
>
> Compared with the sending speed on the 1st machine, the forward speed is not good, only
> about 60% of the sending speed. As a matter of fact, the IXGBE driver starts NAPI when an
> interrupt arrives. When ip_forward=1, the receiver collects a packet and forwards it out
> immediately. So although IXGBE collects packets with NAPI, the forwarding really has much
> impact on collection. As IXGBE runs very fast, it drops packets quickly. The better approach
> for the receiving cpu is to do nothing but collect packets.
>
> Currently the kernel has backlog to support a similar capability, but process_backlog still
> runs on the receiving cpu. I enhance backlog by adding a new input_pkt_alien_queue to
> softnet_data. The receiving cpu collects packets and links them into an skb list, then delivers
> the list to the input_pkt_alien_queue of the other cpu. process_backlog picks up the skb list
> from input_pkt_alien_queue when input_pkt_queue is empty.
>
> A NIC driver could use this capability with the steps below in its NAPI RX cleanup function.
> 1) Initialize a local var struct sk_buff_head skb_head;
> 2) In the packet collection loop, just call netif_rx_queue or __skb_queue_tail(skb_head, skb)
>    to add the skb to the list;
> 3) Before exiting, call raise_netif_irq to submit the skb list to the specific cpu.
>
> Enlarge /proc/sys/net/core/netdev_max_backlog and netdev_budget before testing.
>
> I tested my patch on top of 2.6.28.5. The improvement is about 43%.
>
> Signed-off-by: Zhang Yanmin <yanmin.zhang@linux.intel.com>
>
> ---

You can't safely put packets on another CPU's queue without adding a spinlock.
And if you add the spinlock, you drop the performance back down for your
device and all the other devices. Also, you will end up reordering packets,
which hurts single-stream TCP performance.

Is this all because the hardware doesn't do MSI-X, or are you testing only
a single flow?

--
To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
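[The three driver-side steps in the patch description above might look like the following sketch of a NAPI RX cleanup routine. netif_rx_queue() and raise_netif_irq() are the interfaces proposed by the patch under discussion; the surrounding names (example_clean_rx_irq, example_next_rx_skb, target_cpu) are illustrative only, not real driver code.]

```c
/* Sketch only: how a NAPI rx cleanup routine might use the proposed
 * interfaces.  netif_rx_queue()/raise_netif_irq() come from the patch
 * being discussed; everything else here is hypothetical. */
static int example_clean_rx_irq(struct napi_struct *napi, int budget)
{
	struct sk_buff_head skb_head;	/* step 1: local skb list */
	struct sk_buff *skb;
	int work = 0;

	__skb_queue_head_init(&skb_head);

	while (work < budget && (skb = example_next_rx_skb()) != NULL) {
		/* step 2: queue locally instead of netif_receive_skb();
		 * no lock needed, the list is on our stack */
		netif_rx_queue(skb, &skb_head);
		work++;
	}

	/* step 3: hand the whole list to the configured remote cpu */
	raise_netif_irq(target_cpu, &skb_head);

	return work;
}
```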
On Tue, 2009-02-24 at 18:11 -0800, Stephen Hemminger wrote:
> On Wed, 25 Feb 2009 09:27:49 +0800
> "Zhang, Yanmin" <yanmin_zhang@linux.intel.com> wrote:
> > Subject: hand off skb list to other cpu to submit to upper layer

Thanks for your comments.

> You can't safely put packets on another CPU queue without adding a spinlock.

input_pkt_alien_queue is a struct sk_buff_head, which has a spinlock. We use
that lock to protect the queue.

> And if you add the spinlock, you drop the performance back down for your
> device and all the other devices.

My testing shows a 43% improvement. As multi-core machines are becoming
popular, we can allocate some cores for packet collection only.

I use the spinlock carefully. The delivery cpu locks it only when input_pkt_queue
is empty, and just merges the list into input_pkt_queue. Later skb dequeues needn't
hold the spinlock. On the other hand, the original receiving cpu dispatches a batch
of skbs (64 packets with the IXGBE default) while holding the lock once.

> Also, you will end up reordering
> packets which hurts single stream TCP performance.

Would you like to elaborate on the scenario? Do you mean multi-queue
also hurts single-stream TCP performance when we bind the multi-queue interrupts
to different cpus?

> Is this all because the hardware doesn't do MSI-X

IXGBE supports MSI-X and I enable it when testing. The receiver has 16 queues,
so 16 irq numbers. I bind 2 irq numbers per logical cpu of one physical cpu.

> or are you testing only
> a single flow.

What does a single flow mean here? One sender? I do start one sender for testing
because I couldn't get enough hardware.

In addition, my patch doesn't change the old interface, so there would be no
performance hurt to old drivers.

yanmin
On Wed, 25 Feb 2009 10:35:43 +0800 "Zhang, Yanmin" <yanmin_zhang@linux.intel.com> wrote:

> > You can't safely put packets on another CPU queue without adding a spinlock.
> input_pkt_alien_queue is a struct sk_buff_head which has a spinlock. We use
> that lock to protect the queue.

I was reading netif_rx_queue() and you have it using __skb_queue_tail(), which
has no locking.

> > or are you testing only
> > a single flow.
> What does a single flow mean here? One sender? I do start one sender for testing
> because I couldn't get enough hardware.

Multiple receive queues only give a performance gain if the packets are being
sent with different SRC/DST address pairs. That is how the hardware is supposed
to break them into queues.

Reordering is what happens when packets that are sent as [ 0, 1, 2, 3, 4 ]
get received as [ 0, 1, 4, 3, 2 ] because your receive processing happened on
different CPUs. You really need to test this with some program like 'iperf' to
see the effect it has on TCP. Older Juniper routers used to have hardware that
did this and it caused performance loss. Do some google searches and you will
see it is an active research topic whether reordering is okay or not. Existing
multiqueue is safe because it doesn't reorder inside a single flow; it only
changes order between flows: [ A1, A2, B1, B2 ] => [ A1, B1, A2, B2 ]

> In addition, my patch doesn't change the old interface, so there would be no
> performance hurt to old drivers.
>
> yanmin

Isn't this a problem:

> +int netif_rx_queue(struct sk_buff *skb, struct sk_buff_head *skb_queue)
> {
> 	struct softnet_data *queue;
> 	unsigned long flags;
> +	int this_cpu;
>
> 	/* if netpoll wants it, pretend we never saw it */
> 	if (netpoll_rx(skb))
> @@ -1943,24 +1946,31 @@ int netif_rx(struct sk_buff *skb)
> 	if (!skb->tstamp.tv64)
> 		net_timestamp(skb);
>
> +	if (skb_queue)
> +		this_cpu = 0;
> +	else
> +		this_cpu = 1;

Why bother with a special boolean? Instead just test for skb_queue != NULL.

> +
> 	/*
> 	 * The code is rearranged so that the path is the most
> 	 * short when CPU is congested, but is still operating.
> 	 */
> 	local_irq_save(flags);
> +
> 	queue = &__get_cpu_var(softnet_data);
> +	if (!skb_queue)
> +		skb_queue = &queue->input_pkt_queue;
>
> 	__get_cpu_var(netdev_rx_stat).total++;
> -	if (queue->input_pkt_queue.qlen <= netdev_max_backlog) {
> -		if (queue->input_pkt_queue.qlen) {
> -enqueue:
> -			__skb_queue_tail(&queue->input_pkt_queue, skb);
> -			local_irq_restore(flags);
> -			return NET_RX_SUCCESS;
> +
> +	if (skb_queue->qlen <= netdev_max_backlog) {
> +		if (!skb_queue->qlen && this_cpu) {
> +			napi_schedule(&queue->backlog);
> 		}

Won't this break if skb_queue is NULL (non NAPI case)?
On Tue, 2009-02-24 at 21:18 -0800, Stephen Hemminger wrote:
> > input_pkt_alien_queue is a struct sk_buff_head which has a spinlock. We use
> > that lock to protect the queue.
>
> I was reading netif_rx_queue() and you have it using __skb_queue_tail() which
> has no locking.

Sorry, I need to add some comments to function netif_rx_queue. Parameter
skb_queue points to a local var, or is NULL. If it points to a local var, just
like in function ixgbe_clean_rx_irq of IXGBE, we needn't protect it when using
__skb_queue_tail to add a new skb. If skb_queue is NULL, the line below

	skb_queue = &queue->input_pkt_queue;

makes it point to the local input_pkt_queue, which is protected by
local_irq_save.

> Multiple receive queues only give a performance gain if the packets are being
> sent with different SRC/DST address pairs. That is how the hardware is supposed
> to break them into queues.

Thanks for your explanation.

> Reordering is what happens when packets that are sent as [ 0, 1, 2, 3, 4 ]
> get received as [ 0, 1, 4, 3, 2 ] because your receive processing happened on
> different CPUs. Existing multiqueue is safe because it doesn't reorder inside
> a single flow; it only changes order between flows:
> [ A1, A2, B1, B2 ] => [ A1, B1, A2, B2 ]

Thanks. Your explanation is very clear. My patch might cause reordering, but
very rarely, because reordering only happens when there is a failover in
function raise_netif_irq. Perhaps I need to replace the failover with just
dropping the packet? I will try iperf.

> Isn't this a problem:
> > +	if (skb_queue)
> > +		this_cpu = 0;
> > +	else
> > +		this_cpu = 1;
>
> Why bother with a special boolean? Instead just test for skb_queue != NULL.

Var this_cpu is used for napi_schedule later. Although the logic has no
problem, this_cpu seems confusing. Let me check if there is a better way for
the late napi_schedule.

> > 	queue = &__get_cpu_var(softnet_data);
> > +	if (!skb_queue)
> > +		skb_queue = &queue->input_pkt_queue;

When skb_queue is NULL, we redirect it to queue->input_pkt_queue.

> > +	if (skb_queue->qlen <= netdev_max_backlog) {
> > +		if (!skb_queue->qlen && this_cpu) {
> > +			napi_schedule(&queue->backlog);
> > 		}
>
> Won't this break if skb_queue is NULL (non NAPI case)?

So skb_queue isn't NULL here. Another idea is just to delete function
netif_rx_queue; drivers could use __skb_queue_tail directly. The difference is
that netif_rx_queue has a queue length check while __skb_queue_tail hasn't. But
mostly skb_queue is far smaller than queue->input_pkt_queue.qlen and
queue->input_pkt_alien_queue.qlen.

Yanmin
Zhang, Yanmin <yanmin_zhang@linux.intel.com> wrote:
> So although IXGBE collects packets with NAPI, the forwarding really has much
> impact on collection. As IXGBE runs very fast, it drops packets quickly. The
> better approach for the receiving cpu is to do nothing but collect packets.

This doesn't make sense. With multiqueue RX, every core should be
working to receive its fraction of the traffic and forward it
out. So you shouldn't have any idle cores to begin with. The fact
that you do means that multiqueue RX hasn't maximised its utility,
so you should tackle that instead of trying to redirect traffic away
from the cores that are receiving.

Of course for NICs that don't support multiqueue RX, or where the
number of RX queues is less than the number of cores, a scheme
like yours may be useful.

Cheers,
On Wed, 2009-02-25 at 14:36 +0800, Herbert Xu wrote:
> This doesn't make sense. With multiqueue RX, every core should be
> working to receive its fraction of the traffic and forwarding them
> out.

Thanks for your comments.

I never said the core can't receive and forward packets at the same time. I
mean the performance isn't good.

> So you shouldn't have any idle cores to begin with. The fact
> that you do means that multiqueue RX hasn't maximised its utility,
> so you should tackle that instead of trying redirect traffic away
> from the cores that are receiving.

From Stephen's explanation, the packets are being sent with different SRC/DST
address pairs, by which the hardware delivers packets to different queues. We
couldn't expect the NIC to always put packets into the queues evenly.

The behavior is that IXGBE is very fast and the cpu can't collect packets in
time if it collects packets and forwards them at the same time. That causes
IXGBE to drop packets.

> Of course for NICs that don't support multiqueue RX, or where the
> number of RX queues is less than the number of cores, then a scheme
> like yours may be useful.

The IXGBE NIC does support a large number of RX queues. By default, it creates
CPU_NUM queues. But the performance is not good when we bind queues to cpus
evenly. One reason is cache miss/ping-pong.

The forwarder machine has 2 physical cpus and every cpu has 8 logical threads.
All 8 logical cpus share the last level cache. With my ip_forward testing
driven by pktgen, binding queues to the 8 logical cpus of one physical cpu
gives about 40% improvement over binding queues to all 16 logical cpus. So the
optimization scenario just needs the IXGBE driver to create 8 queues.

If the machines have a couple of NICs and every NIC has CPU_NUM queues,
binding them evenly might cause more cache miss/ping-pong. I didn't test the
multiple receiving NICs scenario as I couldn't get enough hardware.

Yanmin
From: "Zhang, Yanmin" <yanmin_zhang@linux.intel.com>
Date: Wed, 25 Feb 2009 15:20:23 +0800

> If the machines have a couple of NICs and every NIC has CPU_NUM queues,
> binding them evenly might cause more cache miss/ping-pong. I didn't test the
> multiple receiving NICs scenario as I couldn't get enough hardware.

In the net-next-2.6 tree, since we mark incoming packets with
skb_record_rx_queue() properly, we'll make a more favorable choice of
TX queue.

You may want to figure out why that isn't behaving well in your case.

I don't think we should do any kind of software spreading for such
capable hardware; it defeats the whole point of supporting the
multiqueue features.
On Tue, 2009-02-24 at 23:31 -0800, David Miller wrote:
> In the net-next-2.6 tree, since we mark incoming packets with
> skb_record_rx_queue() properly, we'll make a more favorable choice of
> TX queue.

Thanks for your pointer. I cloned the net-next-2.6 tree. skb_record_rx_queue
is a smart idea to implement auto TX selection.

There is no NIC multi-queue standard or RFC available; at least I didn't find
one by google. Both the new skb_record_rx_queue and the current kernel have an
assumption on multi-queue. The assumption is that it's best to send out packets
from the TX queue with the same number as the RX queue if the received packets
are related to the outgoing packets. Put more directly, we should send packets
on the same cpu on which we receive them. The starting point is that this could
reduce skb and data cache misses.

With a slow NIC, the assumption is right. But with a high-speed NIC, especially
a 10G NIC, the assumption seems not ok. Here is a simple calculation with real
testing data on a Nehalem machine and a Bensley machine. There are 2 machines
with the testing driven by pktgen.

	       send packets
  Machine A ==============> Machine B
	    <==============
	   forward pkts back

With the Nehalem machines, I can get 4 million pps (packets per second), and
every packet consists of 60 bytes. So the speed is about 240MBytes/s. Nehalem
has 2 sockets and every socket has 4 cores and 8 logical cpus. All 8 logical
cpus share the 8MBytes last level cache. That means every physical cpu receives
120MBytes per second, which is 15 times the last level cache size.

With the Bensley machine, I can get 1.2M pps, or 72MBytes/s. That machine has
2 sockets and every socket has a quad-core cpu. Every dual-core pair shares a
6MByte last level cache. That means every dual-core pair gets 18MBytes per
second, which is 3 times the last level cache size.

So with both Bensley and Nehalem, the cache is flushed very quickly in 10G NIC
testing. Some other kinds of machines might have bigger caches. For example, my
Montvale Itanium has 2 sockets, and every socket has a quad-core cpu plus
multi-threading. Every dual-core pair shares a 12MByte last level cache. But
the cache is still flushed at least twice per second.

If we check NIC drivers, we can find that drivers touch very limited fields of
sk_buff when collecting packets from the NIC. It is said 20G and 30G NICs are
in production. So with a high-speed 10G NIC, the old assumption seems not to
hold.

On the other hand, which part causes the most cache footprint and cache
misses? I don't think the drivers do, because the receiving cpu only touches
some fields of sk_buff before sending it to the upper layer. My patch throws
packets to a specific cpu controlled by configuration, which doesn't cause much
cache ping-pong. After the receiving cpu throws packets to the 2nd cpu, it
doesn't need them again. The 2nd cpu has cache misses, but they don't cause
cache ping-pong.

My patch doesn't always disagree with skb_record_rx_queue.
1) It can be configured by the admin;
2) We can call skb_record_rx_queue or similar functions on the 2nd cpu (the
real cpu that processes the packets in process_backlog), so later on the cache
footprint won't be wasted when forwarding packets out.

> You may want to figure out why that isn't behaving well in your
> case.

I did check the kernel, including slab tuning (I tried slab/slub/slqb and use
slub now), and instrumented the IXGBE driver. Besides careful
multi-queue/interrupt binding, another way is just to use my patch to improve
speed by more than 40% on both Nehalem and Bensley.

> I don't think we should do any kind of software spreading for such
> capable hardware; it defeats the whole point of supporting the
> multiqueue features.

There is no NIC multi-queue standard or RFC. Jesse is worried that we might
allocate free cores for packet collection while a real environment keeps all
cpus busy. I added more pressure on the sending machine, and got better
performance on the forwarding machine, and the forwarding machine's cpus are
busier than before; some logical cpus' idle time is near 0. But I only have a
couple of 10G NICs, and couldn't add enough pressure to make all cpus busy.

Thanks again for your comments and patience.

Yanmin
From: "Zhang, Yanmin" <yanmin_zhang@linux.intel.com>
Date: Wed, 04 Mar 2009 17:27:48 +0800

> Both the new skb_record_rx_queue and the current kernel have an
> assumption on multi-queue. The assumption is that it's best to send
> out packets from the TX queue with the same number as the RX queue if
> the received packets are related to the outgoing packets. Put more
> directly, we should send packets on the same cpu on which we receive
> them. The starting point is that this could reduce skb and data cache
> misses.

We have to use the same TX queue for all packets of the same
connection flow (same src/dst IP addresses and ports), otherwise
we introduce reordering.

Herbert brought this up, now I have explicitly brought this up,
and you cannot ignore this issue.

You must not knowingly reorder packets, and using different TX
queues for packets within the same flow does that.
On Wed, 2009-03-04 at 01:39 -0800, David Miller wrote:
> From: "Zhang, Yanmin" <yanmin_zhang@linux.intel.com>
> Date: Wed, 04 Mar 2009 17:27:48 +0800
>
> > Both the new skb_record_rx_queue and the current kernel make an
> > assumption about multi-queue: that it is best to send packets out
> > on the TX queue with the same number as the RX queue, if the
> > received packets are related to the outgoing ones. Put more
> > directly, we need to send packets on the same cpu on which we
> > receive them. The starting point is that this reduces skb and data
> > cache misses.
>
> We have to use the same TX queue for all packets of the same
> connection flow (same src/dst IP address and ports); otherwise
> we introduce reordering.
> Herbert brought this up, now I have explicitly brought it up,
> and you cannot ignore this issue.

Thanks. Stephen Hemminger brought it up and explained what reordering
is. I answered in a reply (sorry for not being clear) that mostly we
need to spread packets among RX/TX in a 1:1 or N:1 mapping. For
example, all packets received from RX 8 will always be spread to TX 0.

> You must not knowingly reorder packets, and using different TX
> queues for packets within the same flow does that.

Thanks for your explanation, which is really consistent with Stephen's.
On Thu, 2009-03-05 at 09:04 +0800, Zhang, Yanmin wrote:
> On Wed, 2009-03-04 at 01:39 -0800, David Miller wrote:
> > From: "Zhang, Yanmin" <yanmin_zhang@linux.intel.com>
> > Date: Wed, 04 Mar 2009 17:27:48 +0800
> >
> > > Both the new skb_record_rx_queue and the current kernel make an
> > > assumption about multi-queue: that it is best to send packets out
> > > on the TX queue with the same number as the RX queue, if the
> > > received packets are related to the outgoing ones. Put more
> > > directly, we need to send packets on the same cpu on which we
> > > receive them. The starting point is that this reduces skb and
> > > data cache misses.
> >
> > We have to use the same TX queue for all packets of the same
> > connection flow (same src/dst IP address and ports); otherwise
> > we introduce reordering.
> > Herbert brought this up, now I have explicitly brought it up,
> > and you cannot ignore this issue.
>
> Thanks. Stephen Hemminger brought it up and explained what reordering
> is. I answered in a reply (sorry for not being clear) that mostly we
> need to spread packets among RX/TX in a 1:1 or N:1 mapping. For
> example, all packets received from RX 8 will always be spread to
> TX 0.

To make it clearer, I used a 1:1 mapping binding when running the tests
on Bensley (4*2 cores) and Nehalem (2*4*2 logical cpus), so there is no
reordering issue. I also worked out a new patch for the failover path
that just drops packets when qlen is bigger than netdev_max_backlog, so
the failover path won't cause reordering.

> > You must not knowingly reorder packets, and using different TX
> > queues for packets within the same flow does that.
> Thanks for your explanation, which is really consistent with
> Stephen's.
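[Editor's note] The 1:1 or N:1 RX-to-TX mapping described above can be pictured as a fixed, admin-configured table. This is a hypothetical user-space sketch (names and the example table are invented for illustration): since the table never changes at runtime and every flow always arrives on one RX queue, all packets of a flow leave on one TX queue and no reordering is introduced.

```c
#include <assert.h>
#include <stdint.h>

#define NUM_RXQ 8

/* Example mapping table: entries 0..6 are 1:1; RX queues 6 and 7 both
 * map to TX 0, showing an N:1 case. In a real setup this would be
 * filled in from configuration. */
static const uint16_t rx_to_tx[NUM_RXQ] = { 0, 1, 2, 3, 4, 5, 0, 0 };

/* Deterministic per-RX-queue TX selection: no per-packet variation,
 * hence no reordering within a flow. */
static uint16_t map_txq(uint16_t rxq)
{
        return rx_to_tx[rxq % NUM_RXQ];
}
```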
2009/3/5, Zhang, Yanmin <yanmin_zhang@linux.intel.com>:
> On Thu, 2009-03-05 at 09:04 +0800, Zhang, Yanmin wrote:
> > On Wed, 2009-03-04 at 01:39 -0800, David Miller wrote:
> > > From: "Zhang, Yanmin" <yanmin_zhang@linux.intel.com>
> > > Date: Wed, 04 Mar 2009 17:27:48 +0800
> > >
> > > > Both the new skb_record_rx_queue and the current kernel make an
> > > > assumption about multi-queue: that it is best to send packets
> > > > out on the TX queue with the same number as the RX queue, if
> > > > the received packets are related to the outgoing ones. Put more
> > > > directly, we need to send packets on the same cpu on which we
> > > > receive them. The starting point is that this reduces skb and
> > > > data cache misses.
> > >
> > > We have to use the same TX queue for all packets of the same
> > > connection flow (same src/dst IP address and ports); otherwise
> > > we introduce reordering.
> > > Herbert brought this up, now I have explicitly brought it up,
> > > and you cannot ignore this issue.
> >
> > Thanks. Stephen Hemminger brought it up and explained what
> > reordering is. I answered in a reply (sorry for not being clear)
> > that mostly we need to spread packets among RX/TX in a 1:1 or N:1
> > mapping. For example, all packets received from RX 8 will always be
> > spread to TX 0.
>
> To make it clearer, I used a 1:1 mapping binding when running the
> tests on Bensley (4*2 cores) and Nehalem (2*4*2 logical cpus), so
> there is no reordering issue. I also worked out a new patch for the
> failover path that just drops packets when qlen is bigger than
> netdev_max_backlog, so the failover path won't cause reordering.
>

We have not seen this problem in our testing.

We do keep the skb processing on the same CPU from RX to TX.
This is done by setting affinity for queues and using a custom
select_queue.
+static u16 select_queue(struct net_device *dev, struct sk_buff *skb)
+{
+	if (dev->real_num_tx_queues && skb_rx_queue_recorded(skb))
+		return skb_get_rx_queue(skb) % dev->real_num_tx_queues;
+
+	return smp_processor_id() % dev->real_num_tx_queues;
+}
+

The hash-based default for selecting the TX queue generates an uneven
spread that is hard to follow with correct affinity.

We have not been able to generate quite as much traffic from the
sender.

Sender: (64 byte pkts)
eth5      4.5 k bit/s      3 pps  1233.9 M bit/s  2.632 M pps

Router:
eth0  1077.2 M bit/s  2.298 M pps     1.7 k bit/s      1 pps
eth1      744   bit/s      1 pps  1076.3 M bit/s  2.296 M pps

I'm not sure I like the proposed concept, since it decouples RX
processing from receiving. There is no point collecting lots of
packets just to drop them later in the qdisc. In fact this is bad for
performance; we just consume cpu for nothing. It is important to have
as strong a correlation as possible between RX and TX so we don't
receive more packets than we can handle. Better to drop on the
interface.

We might start thinking of a way for userland to set the policy for
multiqueue mapping.

Cheers,
Jens Låås

> > > You must not knowingly reorder packets, and using different TX
> > > queues for packets within the same flow does that.
> > Thanks for your explanation, which is really consistent with
> > Stephen's.
On Thu, 2009-03-05 at 08:32 +0100, Jens Låås wrote:
> 2009/3/5, Zhang, Yanmin <yanmin_zhang@linux.intel.com>:
> > On Thu, 2009-03-05 at 09:04 +0800, Zhang, Yanmin wrote:
> > > On Wed, 2009-03-04 at 01:39 -0800, David Miller wrote:
> > > > From: "Zhang, Yanmin" <yanmin_zhang@linux.intel.com>
> > > > Date: Wed, 04 Mar 2009 17:27:48 +0800
> > > >
> > > > > Both the new skb_record_rx_queue and the current kernel make
> > > > > an assumption about multi-queue: that it is best to send
> > > > > packets out on the TX queue with the same number as the RX
> > > > > queue, if the received packets are related to the outgoing
> > > > > ones. Put more directly, we need to send packets on the same
> > > > > cpu on which we receive them. The starting point is that this
> > > > > reduces skb and data cache misses.
> > > >
> > > > We have to use the same TX queue for all packets of the same
> > > > connection flow (same src/dst IP address and ports); otherwise
> > > > we introduce reordering.
> > > > Herbert brought this up, now I have explicitly brought it up,
> > > > and you cannot ignore this issue.
> > >
> > > Thanks. Stephen Hemminger brought it up and explained what
> > > reordering is. I answered in a reply (sorry for not being clear)
> > > that mostly we need to spread packets among RX/TX in a 1:1 or N:1
> > > mapping. For example, all packets received from RX 8 will always
> > > be spread to TX 0.
> >
> > To make it clearer, I used a 1:1 mapping binding when running the
> > tests on Bensley (4*2 cores) and Nehalem (2*4*2 logical cpus), so
> > there is no reordering issue. I also worked out a new patch for the
> > failover path that just drops packets when qlen is bigger than
> > netdev_max_backlog, so the failover path won't cause reordering.
> >
>
> We have not seen this problem in our testing.

Thanks for your valuable input. We need more data on high-speed NICs.

> We do keep the skb processing on the same CPU from RX to TX.

That's the usual approach.
I did so when I began to investigate why the forwarding speed is far
slower than the sending speed with 10G NICs.

> This is done by setting affinity for queues and using a custom
> select_queue.
>
> +static u16 select_queue(struct net_device *dev, struct sk_buff *skb)
> +{
> +	if (dev->real_num_tx_queues && skb_rx_queue_recorded(skb))
> +		return skb_get_rx_queue(skb) % dev->real_num_tx_queues;
> +
> +	return smp_processor_id() % dev->real_num_tx_queues;
> +}
> +

Yes, with this function, and with every NIC having CPU_NUM queues, an
skb is processed by the same cpu from RX to TX.

> The hash-based default for selecting the TX queue generates an uneven
> spread that is hard to follow with correct affinity.
>
> We have not been able to generate quite as much traffic from the
> sender.

pktgen in the latest kernel supports multiple threads on the same
device. If you start just one thread, the speed is limited. Could you
try 4 or 8 threads? Perhaps the speed could double.

> Sender: (64 byte pkts)
> eth5      4.5 k bit/s      3 pps  1233.9 M bit/s  2.632 M pps

I'm a little confused by the data. Do the first 2 columns mean IN and
the last 2 mean OUT? What kind of NICs and machines are these? How big
is the last-level cache of the cpus?

> Router:
> eth0  1077.2 M bit/s  2.298 M pps     1.7 k bit/s      1 pps
> eth1      744   bit/s      1 pps  1076.3 M bit/s  2.296 M pps

The forwarding speed is quite close to the sending speed of the
sender. It seems your machine doesn't need my patch. In my original
case the sending speed was 1.4M pps with careful cpu binding
considering cpu cache sharing. With my patch, the result becomes 2M pps
while the sending speed is 2.36M pps. The NICs I am using are not the
latest.

> I'm not sure I like the proposed concept, since it decouples RX
> processing from receiving.
> There is no point collecting lots of packets just to drop them later
> in the qdisc.
> In fact this is bad for performance; we just consume cpu for nothing.
Yes, if the skb-processing cpu is very busy and we choose to drop skbs
there instead of in the driver or NIC hardware, performance might be
worse. A small change to my patch and the driver could reduce that
possibility: check qlen before collecting the 64 packets (assuming the
driver collects 64 packets per NAPI loop). If qlen is larger than
netdev_max_backlog, the driver could just return without any real
collection. We need data to judge whether this is good or bad.

> It is important to have as strong a correlation as possible between
> RX and TX so we don't receive more packets than we can handle. Better
> to drop on the interface.

With the small change above, the interface would drop packets.

> We might start thinking of a way for userland to set the policy for
> multiqueue mapping.

I also think so.

I did more testing with different slab allocators, as slab has a big
impact on performance. SLQB behaves very differently from SLUB. It
seems SLQB (try2) needs improved NUMA allocation/free. At least, I use
slub_min_objects=64 and slub_max_order=6 to get the best result on my
machine.

Thanks for your comments.

> > > > You must not knowingly reorder packets, and using different TX
> > > > queues for packets within the same flow does that.
> > > Thanks for your explanation, which is really consistent with
> > > Stephen's.
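[Editor's note] The early-drop idea — check the target backlog length before pulling a NAPI budget off the NIC, so drops happen at the interface rather than after collection — can be sketched in user space. Everything here (names, the budget of 64, the backlog limit standing in for netdev_max_backlog) is a hypothetical illustration, not driver code:

```c
#include <assert.h>
#include <stddef.h>

#define NAPI_BUDGET  64    /* packets collected per NAPI poll */
#define MAX_BACKLOG  300   /* stand-in for netdev_max_backlog */

/* Simulated NAPI poll: if the target cpu's queue is already over the
 * limit, return without collecting, leaving the packets in the RX
 * ring so the NIC drops them instead of the backlog. Otherwise
 * pretend to drain a full budget. Returns packets collected. */
static int napi_poll_sketch(size_t target_qlen, size_t *collected)
{
        if (target_qlen > MAX_BACKLOG) {
                *collected = 0;   /* nothing pulled off the NIC */
                return 0;
        }
        *collected = NAPI_BUDGET;
        return NAPI_BUDGET;
}
```

The design point is where cpu time is spent: dropping before collection costs almost nothing, while dropping after collection wastes the whole per-packet receive path.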
--- linux-2.6.29-rc2/include/linux/netdevice.h	2009-01-20 14:20:45.000000000 +0800
+++ linux-2.6.29-rc2_napi_rcv/include/linux/netdevice.h	2009-02-23 13:32:48.000000000 +0800
@@ -1119,6 +1119,9 @@ static inline int unregister_gifconf(uns
 /*
  * Incoming packets are placed on per-cpu queues so that
  * no locking is needed.
+ * To speed up fast networks, incoming packets may sometimes be placed
+ * on another cpu's queue. Use input_pkt_alien_queue.lock to
+ * protect input_pkt_alien_queue.
  */
 struct softnet_data
 {
@@ -1127,6 +1130,7 @@ struct softnet_data
 	struct list_head	poll_list;
 	struct sk_buff		*completion_queue;
 
+	struct sk_buff_head	input_pkt_alien_queue;
 	struct napi_struct	backlog;
 };
 
@@ -1368,6 +1372,10 @@ extern void dev_kfree_skb_irq(struct sk_
 extern void		dev_kfree_skb_any(struct sk_buff *skb);
 
 #define HAVE_NETIF_RX 1
+extern int		netif_rx_queue(struct sk_buff *skb,
+				       struct sk_buff_head *skb_queue);
+extern int		raise_netif_irq(int cpu,
+					struct sk_buff_head *skb_queue);
 extern int		netif_rx(struct sk_buff *skb);
 extern int		netif_rx_ni(struct sk_buff *skb);
 #define HAVE_NETIF_RECEIVE_SKB 1
--- linux-2.6.29-rc2/net/core/dev.c	2009-01-20 14:20:45.000000000 +0800
+++ linux-2.6.29-rc2_napi_rcv/net/core/dev.c	2009-02-24 13:53:02.000000000 +0800
@@ -1917,8 +1917,10 @@ DEFINE_PER_CPU(struct netif_rx_stats, ne
 
 /**
- *	netif_rx	-	post buffer to the network code
+ *	netif_rx_queue	-	post buffer to the network code
  *	@skb: buffer to post
+ *	@skb_queue: the queue to keep the skb. It may be NULL or point
+ *	to a local variable.
  *
  *	This function receives a packet from a device driver and queues it for
  *	the upper (protocol) levels to process.  It always succeeds.  The buffer
@@ -1931,10 +1933,11 @@ DEFINE_PER_CPU(struct netif_rx_stats, ne
  *
  */
 
-int netif_rx(struct sk_buff *skb)
+int netif_rx_queue(struct sk_buff *skb, struct sk_buff_head *skb_queue)
 {
 	struct softnet_data *queue;
 	unsigned long flags;
+	int this_cpu;
 
 	/* if netpoll wants it, pretend we never saw it */
 	if (netpoll_rx(skb))
@@ -1943,24 +1946,31 @@ int netif_rx(struct sk_buff *skb)
 	if (!skb->tstamp.tv64)
 		net_timestamp(skb);
 
+	if (skb_queue)
+		this_cpu = 0;
+	else
+		this_cpu = 1;
+
 	/*
 	 * The code is rearranged so that the path is the most
 	 * short when CPU is congested, but is still operating.
 	 */
 	local_irq_save(flags);
+
 	queue = &__get_cpu_var(softnet_data);
+	if (!skb_queue)
+		skb_queue = &queue->input_pkt_queue;
 
 	__get_cpu_var(netdev_rx_stat).total++;
-	if (queue->input_pkt_queue.qlen <= netdev_max_backlog) {
-		if (queue->input_pkt_queue.qlen) {
-enqueue:
-			__skb_queue_tail(&queue->input_pkt_queue, skb);
-			local_irq_restore(flags);
-			return NET_RX_SUCCESS;
+
+	if (skb_queue->qlen <= netdev_max_backlog) {
+		if (!skb_queue->qlen && this_cpu) {
+			napi_schedule(&queue->backlog);
 		}
-		napi_schedule(&queue->backlog);
-		goto enqueue;
+		__skb_queue_tail(skb_queue, skb);
+		local_irq_restore(flags);
+		return NET_RX_SUCCESS;
 	}
 
 	__get_cpu_var(netdev_rx_stat).dropped++;
@@ -1970,6 +1980,11 @@ enqueue:
 	return NET_RX_DROP;
 }
 
+int netif_rx(struct sk_buff *skb)
+{
+	return netif_rx_queue(skb, NULL);
+}
+
 int netif_rx_ni(struct sk_buff *skb)
 {
 	int err;
@@ -1985,6 +2000,79 @@ int netif_rx_ni(struct sk_buff *skb)
 
 EXPORT_SYMBOL(netif_rx_ni);
 
+static void net_drop_skb(struct sk_buff_head *skb_queue)
+{
+	struct sk_buff *skb = __skb_dequeue(skb_queue);
+
+	while (skb) {
+		__get_cpu_var(netdev_rx_stat).dropped++;
+		kfree_skb(skb);
+		skb = __skb_dequeue(skb_queue);
+	}
+}
+
+static void net_napi_backlog(void *data)
+{
+	struct softnet_data *queue = &__get_cpu_var(softnet_data);
+
+	napi_schedule(&queue->backlog);
+	kfree(data);
+}
+
+int raise_netif_irq(int cpu, struct sk_buff_head *skb_queue)
+{
+	unsigned long flags;
+	struct softnet_data *queue;
+
+	if (skb_queue_empty(skb_queue))
+		return 0;
+
+	if ((unsigned)cpu < nr_cpu_ids &&
+	    cpu_online(cpu) &&
+	    cpu != smp_processor_id()) {
+
+		struct call_single_data *data;
+
+		queue = &per_cpu(softnet_data, cpu);
+
+		if (queue->input_pkt_alien_queue.qlen > netdev_max_backlog)
+			goto failover;
+
+		data = kmalloc(sizeof(struct call_single_data), GFP_ATOMIC);
+		if (!data)
+			goto failover;
+
+		spin_lock_irqsave(&queue->input_pkt_alien_queue.lock, flags);
+		skb_queue_splice_tail_init(skb_queue,
+					   &queue->input_pkt_alien_queue);
+		spin_unlock_irqrestore(&queue->input_pkt_alien_queue.lock,
+				       flags);
+
+		data->func = net_napi_backlog;
+		data->info = data;
+		data->flags = 0;
+
+		__smp_call_function_single(cpu, data);
+
+		return 0;
+	}
+
+failover:
+	/* If cpu is offline, we queue the skbs back to the queue on the
+	 * current cpu */
+	queue = &__get_cpu_var(softnet_data);
+	if (queue->input_pkt_queue.qlen + skb_queue->qlen <=
+	    netdev_max_backlog) {
+		local_irq_save(flags);
+		skb_queue_splice_tail_init(skb_queue, &queue->input_pkt_queue);
+		napi_schedule(&queue->backlog);
+		local_irq_restore(flags);
+	} else {
+		net_drop_skb(skb_queue);
+	}
+
+	return 1;
+}
+
 static void net_tx_action(struct softirq_action *h)
 {
 	struct softnet_data *sd = &__get_cpu_var(softnet_data);
@@ -2324,6 +2412,13 @@ static void flush_backlog(void *arg)
 	struct net_device *dev = arg;
 	struct softnet_data *queue = &__get_cpu_var(softnet_data);
 	struct sk_buff *skb, *tmp;
+	unsigned long flags;
+
+	spin_lock_irqsave(&queue->input_pkt_alien_queue.lock, flags);
+	skb_queue_splice_tail_init(&queue->input_pkt_alien_queue,
+				   &queue->input_pkt_queue);
+	spin_unlock_irqrestore(&queue->input_pkt_alien_queue.lock, flags);
 
 	skb_queue_walk_safe(&queue->input_pkt_queue, skb, tmp)
 		if (skb->dev == dev) {
@@ -2575,9 +2670,19 @@ static int process_backlog(struct napi_s
 		local_irq_disable();
 		skb = __skb_dequeue(&queue->input_pkt_queue);
 		if (!skb) {
-			__napi_complete(napi);
-			local_irq_enable();
-			break;
+			if (!skb_queue_empty(&queue->input_pkt_alien_queue)) {
+				spin_lock(&queue->input_pkt_alien_queue.lock);
+				skb_queue_splice_tail_init(
+					&queue->input_pkt_alien_queue,
+					&queue->input_pkt_queue);
+				spin_unlock(&queue->input_pkt_alien_queue.lock);
+
+				skb = __skb_dequeue(&queue->input_pkt_queue);
+			} else {
+				__napi_complete(napi);
+				local_irq_enable();
+				break;
+			}
 		}
 		local_irq_enable();
@@ -4966,6 +5071,11 @@ static int dev_cpu_callback(struct notif
 	local_irq_enable();
 
 	/* Process offline CPU's input_pkt_queue */
+	spin_lock(&oldsd->input_pkt_alien_queue.lock);
+	skb_queue_splice_tail_init(&oldsd->input_pkt_alien_queue,
+				   &oldsd->input_pkt_queue);
+	spin_unlock(&oldsd->input_pkt_alien_queue.lock);
+
 	while ((skb = __skb_dequeue(&oldsd->input_pkt_queue)))
 		netif_rx(skb);
 
@@ -5165,10 +5275,13 @@ static int __init net_dev_init(void)
 		struct softnet_data *queue;
 
 		queue = &per_cpu(softnet_data, i);
+
 		skb_queue_head_init(&queue->input_pkt_queue);
 		queue->completion_queue = NULL;
 		INIT_LIST_HEAD(&queue->poll_list);
 
+		skb_queue_head_init(&queue->input_pkt_alien_queue);
+
 		queue->backlog.poll = process_backlog;
 		queue->backlog.weight = weight_p;
 		queue->backlog.gro_list = NULL;
@@ -5227,7 +5340,9 @@ EXPORT_SYMBOL(netdev_boot_setup_check);
 EXPORT_SYMBOL(netdev_set_master);
 EXPORT_SYMBOL(netdev_state_change);
 EXPORT_SYMBOL(netif_receive_skb);
+EXPORT_SYMBOL(netif_rx_queue);
 EXPORT_SYMBOL(netif_rx);
+EXPORT_SYMBOL(raise_netif_irq);
 EXPORT_SYMBOL(register_gifconf);
 EXPORT_SYMBOL(register_netdevice);
 EXPORT_SYMBOL(register_netdevice_notifier);
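[Editor's note] The handoff flow in the patch — the RX cpu queues skbs on a private list (netif_rx_queue with a local head), splices the whole list onto the target cpu's alien queue in one locked operation (raise_netif_irq), and the target cpu's process_backlog later splices the alien queue into its own input queue — can be simulated in user space. This is a minimal sketch under stated assumptions: queues are modeled as bare counters because only the splice accounting is being illustrated, and all names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

struct queue {
        size_t qlen;    /* stands in for a struct sk_buff_head */
};

/* RX cpu, collection loop: append one packet to the private list
 * (no lock needed; the list is local to this cpu) */
static void local_enqueue(struct queue *local)
{
        local->qlen++;
}

/* RX cpu, end of NAPI poll: hand the whole list over in one shot,
 * like skb_queue_splice_tail_init() under the alien queue's lock */
static void splice_to_alien(struct queue *local, struct queue *alien)
{
        alien->qlen += local->qlen;
        local->qlen = 0;        /* local list is re-initialized */
}

/* target cpu: process_backlog refills its input queue from the alien
 * queue once input_pkt_queue runs empty */
static void backlog_refill(struct queue *alien, struct queue *input)
{
        input->qlen += alien->qlen;
        alien->qlen = 0;
}
```

The design point mirrors the patch: the cross-cpu lock is taken once per NAPI poll for a whole batch, not once per packet, which keeps the handoff cheap on the receiving cpu.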