From patchwork Fri Aug 31 21:26:23 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Keith Busch X-Patchwork-Id: 964702 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Authentication-Results: ozlabs.org; spf=none (mailfrom) smtp.mailfrom=vger.kernel.org (client-ip=209.132.180.67; helo=vger.kernel.org; envelope-from=linux-pci-owner@vger.kernel.org; receiver=) Authentication-Results: ozlabs.org; dmarc=fail (p=none dis=none) header.from=intel.com Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by ozlabs.org (Postfix) with ESMTP id 422C7n2QN1z9s4V for ; Sat, 1 Sep 2018 07:25:25 +1000 (AEST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727282AbeIABem (ORCPT ); Fri, 31 Aug 2018 21:34:42 -0400 Received: from mga12.intel.com ([192.55.52.136]:14083 "EHLO mga12.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726987AbeIABem (ORCPT ); Fri, 31 Aug 2018 21:34:42 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga001.jf.intel.com ([10.7.209.18]) by fmsmga106.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 31 Aug 2018 14:25:23 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.53,313,1531810800"; d="scan'208";a="86663792" Received: from unknown (HELO localhost.lm.intel.com) ([10.232.112.44]) by orsmga001.jf.intel.com with ESMTP; 31 Aug 2018 14:25:22 -0700 From: Keith Busch To: Linux PCI , Bjorn Helgaas Cc: Benjamin Herrenschmidt , Sinan Kaya , Thomas Tai , poza@codeaurora.org, Lukas Wunner , Keith Busch Subject: [PATCH 00/16] PCI, error handling and hot plug Date: Fri, 31 Aug 2018 15:26:23 -0600 Message-Id: <20180831212639.10196-1-keith.busch@intel.com> X-Mailer: git-send-email 2.13.6 Sender: linux-pci-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-pci@vger.kernel.org This series is all about error handling and hot plug. It should get the AER fatal handling much like how it was before where we don't always tear down and re-enumerate the topology, but even better in taht it won't trigger hotplug events to the extent possibe to suppress link events that occur from the secondary bus reset. Ports that do not implement an out-of-band presence detection mechanism may still have their device removed on the DPC and AER error handling, but at least error handling for non-hot pluggable ports won't experience removals anymore. Some of these are simple bug fixes, some are more complicated. Patch 11 fixes a long-standing issue where a RP detected fatal error never restored the downstream bridges, making recovery of end devices below those switches bound to fail after a secondary bus reset resets the bridge's config memory and bus windows. Comments welcomed! Keith Busch (15): PCI: Fix pci_reset_bus PCI/AER: Remove dead code PCI/ERR: Use slot reset if available PCI/ERR: Handle fatal error recovery PCI/ERR: Remove devices on recovery failure PCI/ERR: Always use the first downstream port PCI/ERR: Simplify broadcast callouts PCI/ERR: Report current recovery status for udev PCI/portdrv: Provide pci error callbacks PCI/portdrv: Restore pci state on slot reset PCI/pciehp: Fix powerfault detection order PCI/pciehp: Implement error handling callbacks PCI/pciehp: Ignore link events during DPC event PCI/DPC: Wait for reset complete PCI: Unify device inaccessible Lukas Wunner (1): PCI: Simplify disconnected marking drivers/pci/hotplug/pciehp.h | 1 + drivers/pci/hotplug/pciehp_core.c | 51 +++++++ drivers/pci/hotplug/pciehp_hpc.c | 34 +++-- drivers/pci/hotplug/pciehp_pci.c | 9 +- drivers/pci/pci.c | 32 ++++- drivers/pci/pci.h | 12 +- drivers/pci/pcie/aer.c | 15 +- drivers/pci/pcie/dpc.c | 40 ++++-- drivers/pci/pcie/err.c | 293 ++++++++++++++------------------------ drivers/pci/pcie/portdrv.h | 6 + drivers/pci/pcie/portdrv_pci.c | 37 ++++- include/linux/pci.h | 6 + 12 files changed, 299 insertions(+), 237 deletions(-)