Message ID | 20201030014910.2738809-2-vladimir.oltean@nxp.com |
---|---|
State | Changes Requested |
Delegated to: | David Miller |
Headers | show |
Series | Generic TX reallocation for DSA | expand |
Context | Check | Description |
---|---|---|
jkicinski/cover_letter | success | Link |
jkicinski/fixes_present | success | Link |
jkicinski/patch_count | success | Link |
jkicinski/tree_selection | success | Clearly marked for net-next |
jkicinski/subject_prefix | success | Link |
jkicinski/source_inline | success | Was 0 now: 0 |
jkicinski/verify_signedoff | success | Link |
jkicinski/module_param | success | Was 0 now: 0 |
jkicinski/build_32bit | success | Errors and warnings before: 0 this patch: 0 |
jkicinski/kdoc | success | Errors and warnings before: 0 this patch: 0 |
jkicinski/verify_fixes | success | Link |
jkicinski/checkpatch | success | total: 0 errors, 0 warnings, 0 checks, 63 lines checked |
jkicinski/build_allmodconfig_warn | success | Errors and warnings before: 0 this patch: 0 |
jkicinski/header_inline | success | Link |
jkicinski/stable | success | Stable not CCed |
On Fri Oct 30 2020, Vladimir Oltean wrote: > At the moment, taggers are left with the task of ensuring that the skb > headers are writable (which they aren't, if the frames were cloned for > TX timestamping, for flooding by the bridge, etc), and that there is > enough space in the skb data area for the DSA tag to be pushed. > > Moreover, the life of tail taggers is even harder, because they need to > ensure that short frames have enough padding, a problem that normal > taggers don't have. > > The principle of the DSA framework is that everything except for the > most intimate hardware specifics (like in this case, the actual packing > of the DSA tag bits) should be done inside the core, to avoid having > code paths that are very rarely tested. > > So provide a TX reallocation procedure that should cover the known needs > of DSA today. > > Note that this patch also gives the network stack a good hint about the > headroom/tailroom it's going to need. Up till now it wasn't doing that. > So the reallocation procedure should really be there only for the > exceptional cases, and for cloned packets which need to be unshared. > The tx_reallocs counter should prove that. > > Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> > Tested-by: Christian Eggers <ceggers@arri.de> # For tail taggers only Tested-by: Kurt Kanzenbach <kurt@linutronix.de> I'll wait with the hellcreek series until this is merged. Thanks, Kurt
On Fri, 30 Oct 2020 03:48:59 +0200 Vladimir Oltean wrote: > @@ -567,6 +591,17 @@ static netdev_tx_t dsa_slave_xmit(struct sk_buff *skb, struct net_device *dev) > */ > dsa_skb_tx_timestamp(p, skb); > > + if (dsa_realloc_skb(skb, dev)) { > + kfree_skb(skb); dev_kfree_skb_any()? > + return NETDEV_TX_OK; > + }
On Sat, Oct 31, 2020 at 06:00:43PM -0700, Jakub Kicinski wrote: > On Fri, 30 Oct 2020 03:48:59 +0200 Vladimir Oltean wrote: > > @@ -567,6 +591,17 @@ static netdev_tx_t dsa_slave_xmit(struct sk_buff *skb, struct net_device *dev) > > */ > > dsa_skb_tx_timestamp(p, skb); > > > > + if (dsa_realloc_skb(skb, dev)) { > > + kfree_skb(skb); > > dev_kfree_skb_any()? Just showing my ignorance, but where does the hardirq context come from?
On Sun, Nov 01, 2020 at 03:14:34AM +0200, Vladimir Oltean wrote: > On Sat, Oct 31, 2020 at 06:00:43PM -0700, Jakub Kicinski wrote: > > On Fri, 30 Oct 2020 03:48:59 +0200 Vladimir Oltean wrote: > > > @@ -567,6 +591,17 @@ static netdev_tx_t dsa_slave_xmit(struct sk_buff *skb, struct net_device *dev) > > > */ > > > dsa_skb_tx_timestamp(p, skb); > > > > > > + if (dsa_realloc_skb(skb, dev)) { > > > + kfree_skb(skb); > > > > dev_kfree_skb_any()? > > Just showing my ignorance, but where does the hardirq context come from? I mean I do see that netpoll_send_udp requires IRQs disabled, but is that the only reason why all drivers need to assume hardirq context in .xmit, or are there more?
On Sun, 1 Nov 2020 03:37:28 +0200 Vladimir Oltean wrote: > On Sun, Nov 01, 2020 at 03:14:34AM +0200, Vladimir Oltean wrote: > > On Sat, Oct 31, 2020 at 06:00:43PM -0700, Jakub Kicinski wrote: > > > On Fri, 30 Oct 2020 03:48:59 +0200 Vladimir Oltean wrote: > > > > @@ -567,6 +591,17 @@ static netdev_tx_t dsa_slave_xmit(struct sk_buff *skb, struct net_device *dev) > > > > */ > > > > dsa_skb_tx_timestamp(p, skb); > > > > > > > > + if (dsa_realloc_skb(skb, dev)) { > > > > + kfree_skb(skb); > > > > > > dev_kfree_skb_any()? > > > > Just showing my ignorance, but where does the hardirq context come from? > > I mean I do see that netpoll_send_udp requires IRQs disabled, but is > that the only reason why all drivers need to assume hardirq context in > .xmit, or are there more? netpoll is the only one that comes to my mind, maybe others know more..
diff --git a/net/dsa/slave.c b/net/dsa/slave.c index 3bc5ca40c9fb..10be715cf462 100644 --- a/net/dsa/slave.c +++ b/net/dsa/slave.c @@ -548,6 +548,30 @@ netdev_tx_t dsa_enqueue_skb(struct sk_buff *skb, struct net_device *dev) } EXPORT_SYMBOL_GPL(dsa_enqueue_skb); +static int dsa_realloc_skb(struct sk_buff *skb, struct net_device *dev) +{ + int needed_headroom = dev->needed_headroom; + int needed_tailroom = dev->needed_tailroom; + + /* For tail taggers, we need to pad short frames ourselves, to ensure + * that the tail tag does not fail at its role of being at the end of + * the packet, once the master interface pads the frame. Account for + * that pad length here, and pad later. + */ + if (unlikely(needed_tailroom && skb->len < ETH_ZLEN)) + needed_tailroom += ETH_ZLEN - skb->len; + /* skb_headroom() returns unsigned int... */ + needed_headroom = max_t(int, needed_headroom - skb_headroom(skb), 0); + needed_tailroom = max_t(int, needed_tailroom - skb_tailroom(skb), 0); + + if (likely(!needed_headroom && !needed_tailroom && !skb_cloned(skb))) + /* No reallocation needed, yay! */ + return 0; + + return pskb_expand_head(skb, needed_headroom, needed_tailroom, + GFP_ATOMIC); +} + static netdev_tx_t dsa_slave_xmit(struct sk_buff *skb, struct net_device *dev) { struct dsa_slave_priv *p = netdev_priv(dev); @@ -567,6 +591,17 @@ static netdev_tx_t dsa_slave_xmit(struct sk_buff *skb, struct net_device *dev) */ dsa_skb_tx_timestamp(p, skb); + if (dsa_realloc_skb(skb, dev)) { + kfree_skb(skb); + return NETDEV_TX_OK; + } + + /* needed_tailroom should still be 'warm' in the cache line from + * dsa_realloc_skb(), which has also ensured that padding is safe. + */ + if (dev->needed_tailroom) + eth_skb_pad(skb); + /* Transmit function may have to reallocate the original SKB, * in which case it must have freed it. Only free it here on error. */ @@ -1791,6 +1826,16 @@ int dsa_slave_create(struct dsa_port *port) slave_dev->netdev_ops = &dsa_slave_netdev_ops; if (ds->ops->port_max_mtu) slave_dev->max_mtu = ds->ops->port_max_mtu(ds, port->index); + if (cpu_dp->tag_ops->tail_tag) + slave_dev->needed_tailroom = cpu_dp->tag_ops->overhead; + else + slave_dev->needed_headroom = cpu_dp->tag_ops->overhead; + /* Try to save one extra realloc later in the TX path (in the master) + * by also inheriting the master's needed headroom and tailroom. + * The 8021q driver also does this. + */ + slave_dev->needed_headroom += master->needed_headroom; + slave_dev->needed_tailroom += master->needed_tailroom; SET_NETDEV_DEVTYPE(slave_dev, &dsa_type); netdev_for_each_tx_queue(slave_dev, dsa_slave_set_lockdep_class_one,