From patchwork Mon Apr 29 20:01:14 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Asmaa Mnebhi X-Patchwork-Id: 1929157 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=lists.ubuntu.com (client-ip=185.125.189.65; helo=lists.ubuntu.com; envelope-from=kernel-team-bounces@lists.ubuntu.com; receiver=patchwork.ozlabs.org) Received: from lists.ubuntu.com (lists.ubuntu.com [185.125.189.65]) (using TLSv1.2 with cipher ECDHE-ECDSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4VSvPG3NYTz23t4 for ; Tue, 30 Apr 2024 06:02:09 +1000 (AEST) Received: from localhost ([127.0.0.1] helo=lists.ubuntu.com) by lists.ubuntu.com with esmtp (Exim 4.86_2) (envelope-from ) id 1s1XCD-0002pS-BW; Mon, 29 Apr 2024 20:01:53 +0000 Received: from mail-bn8nam12on2075.outbound.protection.outlook.com ([40.107.237.75] helo=NAM12-BN8-obe.outbound.protection.outlook.com) by lists.ubuntu.com with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.86_2) (envelope-from ) id 1s1XCB-0002p5-Kj for kernel-team@lists.ubuntu.com; Mon, 29 Apr 2024 20:01:51 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=NsYGn0Zh+pZlplAtGW8R/zCFGm8HQNHzdSNa9hFntoy+Vgf3BEAIU/lq45OHpBvxs8cfvB+LCdLSVA+71rVXmFo8WMHGrugnPA4raH+H+eGgOrvVPjb7BkBRcaJL2RWaObTpbbnS25Q8GQMf5sEm90OiGcyQ0Xy1OFl4sEWOZYZxeBPjF1m945Gu4JMzyT2OiY4KQRYrh95vYRyzNMeosqofRktIT41L0BxBgxzGPK/APbLhGH9wFjhDvFxDadF5b/doJJLFmpmmRF+cMEhq5Nr4PHIT7p1AGDQInXWjFkSJXRvnWx0/ffBNIM0/Uw12FHeLgxhXOGdPgIEdDIeKTg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=NfoWB7gybwD4MLTd1Zv2SLrXuirLv8AoRXlVhlVKPpw=; b=SkF89sDlZgw9o38Reua4qM2l7Kgh5rg9xarJZRcbFnku/E80BwvzYlW57HBqPmrWBjD/lc1JBrtUHQK3iblFrhCU3MHrsrS8GjO0avvAIKvG1APmZoxOHn6+dKNvpZsFFX4n2PncZgvQHbq+TXzRL9qFog91hDRS6hpHDqMiwwObpILIN3rYuqWQA/ylIwzcCdPmz0HPk3dKKyTh1m1BK/Z6WFuB0xQklaPEovXZ1IFbil5ajxcpv5W0+lsYRduM5RYnPzrJ+mfdvDC0ivZGDGM/o7r8NpOjQVoXzR0c3bm7NeqjJNiMFqnV6sxz+RKxWWwpLJk7bFUS3OsG8ZvgQg== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 216.228.117.161) smtp.rcpttodomain=lists.ubuntu.com smtp.mailfrom=nvidia.com; dmarc=pass (p=reject sp=reject pct=100) action=none header.from=nvidia.com; dkim=none (message not signed); arc=none (0) Received: from CH0PR03CA0413.namprd03.prod.outlook.com (2603:10b6:610:11b::11) by SA1PR12MB7342.namprd12.prod.outlook.com (2603:10b6:806:2b3::9) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7519.34; Mon, 29 Apr 2024 20:01:47 +0000 Received: from CH3PEPF00000015.namprd21.prod.outlook.com (2603:10b6:610:11b:cafe::12) by CH0PR03CA0413.outlook.office365.com (2603:10b6:610:11b::11) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7519.35 via Frontend Transport; Mon, 29 Apr 2024 20:01:47 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 216.228.117.161) smtp.mailfrom=nvidia.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=nvidia.com; Received-SPF: Pass (protection.outlook.com: domain of nvidia.com designates 216.228.117.161 as permitted sender) receiver=protection.outlook.com; client-ip=216.228.117.161; helo=mail.nvidia.com; pr=C Received: from mail.nvidia.com (216.228.117.161) by CH3PEPF00000015.mail.protection.outlook.com (10.167.244.120) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7519.0 via Frontend Transport; Mon, 29 Apr 2024 20:01:46 +0000 Received: from rnnvmail202.nvidia.com (10.129.68.7) by mail.nvidia.com (10.129.200.67) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.4; Mon, 29 Apr 2024 13:01:18 -0700 Received: from rnnvmail205.nvidia.com (10.129.68.10) by rnnvmail202.nvidia.com (10.129.68.7) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.4; Mon, 29 Apr 2024 13:01:17 -0700 Received: from vdi.nvidia.com (10.127.8.14) by mail.nvidia.com (10.129.68.10) with Microsoft SMTP Server id 15.2.1544.4 via Frontend Transport; Mon, 29 Apr 2024 13:01:17 -0700 From: Asmaa Mnebhi To: Subject: [SRU][J:linux-bluefield][PATCH v1 0/1] UBUNTU: SAUCE: mlxbf-gige: Vitesse PHY stuck in a bad state during reboot test Date: Mon, 29 Apr 2024 16:01:14 -0400 Message-ID: <20240429200115.29252-1-asmaa@nvidia.com> X-Mailer: git-send-email 2.30.1 MIME-Version: 1.0 X-NV-OnPremToCloud: ExternallySecured X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: CH3PEPF00000015:EE_|SA1PR12MB7342:EE_ X-MS-Office365-Filtering-Correlation-Id: 6d597bf6-2d31-493b-81db-08dc68873357 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; ARA:13230031|1800799015|376005|82310400014|36860700004; X-Microsoft-Antispam-Message-Info: D+1bzPn6LA0TTSLSY9O4cF2NBrplmuppkNPQIcodxqu81d6Hgn3dkNW12EklosPB6uDt6234SylgfbzhFX0cVY9SsfkI3klFqUiqghoyMFz2O758Nig2HqTilHPjSAcoZVx5vXRzGBQ6vzy1BmevSPy3ftjrTC2bY7GKqFDzXTHwWOcZtfE6Jh2q7YMED8VV5TW95SZBcUQCXncf7vrB0/dyTUeDyHse9BtJ05L/Nzh3uhFuARqsjZ5d4QGmniWjvuJF5PC8pv6x4JHY1rYjAQa3G8a1WJCpMAs3XXQlrjXpKHl2JXfGhT49Ju5Ivn4iDOJXLScwhofqsJlhPacfCqCs2P1nyLhqZbyk21EPngj51KUDXSwb4V9yH/4tbJ6Y8Eu0PbLXQDw0oAtxR2Btq8bF4kJggFRx1WfgryZ7F3i6N8SU/tRtOIiGZIl8YUyHB9eSl4pToo1M/IMP4h3ljFbw3M1RkNHacUcU7xEiahWSSgUo4Uchrsb8f+TcR8mPtVpo8Oynl65Awc3/VNYTp8x7RbhXf6sMfCl544tQ95BZJbDze/XAZ7rTeP8162J1QEsTGu90ng/eaWJjpZP+aQ8+BN3FqMFYyVSGr4ooIsNPD9y8lm9kynqsPnRTzAAtsTCGunCfapTSb8ZfP2BMby9q1hKNH/DNyMS94MkmQsxBOwCbU2CGf/si4M5p3maYNKfNA+J/75BvCFi7C35eLW0GHlywiBkWetoVc1TzL2DnaD8EjbgxKaWqJBbhCsaoMxAmwtE/YDhRT0y57d9galChw/jVdhUlBXq5StmId0U4+TU2MIaN92OS1vZgbl9obSe18wOdVm+ZNFc6vBCgALVz9acC6v3NFmO2OODViRggzBX0QAgEZwu9YV4Z1Kp+474sorEgvzfCXKh6NSFi44BE17oseJ5HNCVY0NyXq+eOdfVCw3pcE3My/7mUBkkHwmpXquV9wsy+eSgtI6kU/vWoGy+VmcF9xWHfitOXR3gpgr8tQCzfYdMGPlo47DhwYICe8gIM1VMOI7cbJBHwgXnpyrAnQistuvYiQwjfr8xUeFdEZ/9y3mTervuKssAgBgyGT+zNm/43/l3ulVgRobT2i3LdObw6yzi066pb/vTYJOEp2DKwkqeQ/Lt/m+8dQQtzKQeYvMqgmZFNh3GjZDgCH8HMYWX1SQQj5gijTMM64nssY8QWofDQb6NTrebYh8sIfNyTsF5kCXVpJFOV/vw95Tw9xem42TFQ57Cskpt0olKFtOGRBI4L+z3D3mRIDskCl7Hhb3UvZWUMH6H/pGi3E3+gTcEWuCHmKofKvMjPjehZGsj+TDCe9SS4wtFHwQhEwRcid+yH80cbxRJjd9c7EBYzgz0y+nSmltReojJI3hwi0vesgf2jBo+IfK99 X-Forefront-Antispam-Report: CIP:216.228.117.161; CTRY:US; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:mail.nvidia.com; PTR:dc6edge2.nvidia.com; CAT:NONE; SFS:(13230031)(1800799015)(376005)(82310400014)(36860700004); DIR:OUT; SFP:1101; X-OriginatorOrg: Nvidia.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 29 Apr 2024 20:01:46.9608 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 6d597bf6-2d31-493b-81db-08dc68873357 X-MS-Exchange-CrossTenant-Id: 43083d15-7273-40c1-b7db-39efd9ccc17a X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=43083d15-7273-40c1-b7db-39efd9ccc17a; Ip=[216.228.117.161]; Helo=[mail.nvidia.com] X-MS-Exchange-CrossTenant-AuthSource: CH3PEPF00000015.namprd21.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: SA1PR12MB7342 Received-SPF: softfail client-ip=40.107.237.75; envelope-from=asmaa@nvidia.com; helo=NAM12-BN8-obe.outbound.protection.outlook.com X-BeenThere: kernel-team@lists.ubuntu.com X-Mailman-Version: 2.1.20 Precedence: list List-Id: Kernel team discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Asmaa Mnebhi Errors-To: kernel-team-bounces@lists.ubuntu.com Sender: "kernel-team" BugLink: https://bugs.launchpad.net/bugs/2062384 SRU Justification: [Impact] During the QA reboot test, the BF3 Vitesse PHY gets stuck in a bad state, resulting in no ip provisioning. The only way to recover is to powercycle. We might have found a software workaround to avoid getting in this state in the first place: suspend the PHY during graceful shutdown. Suspend the PHY = Power down = set bit 11 to 1 in reg 0 of the PHY. This WA passed 1800 reboots on QA's setup. [Fix] * During reboot, the mlxbf_gige_shutdown() function makes a call to phy_stop(). phy_stop() calls phy_suspend(). * Certain Linux PHY drivers, like the Vitesse PHY, don't support suspend() to power down the PHY during shutdown. * Our Hardware also does not toggle the hard reset signal of the PHY during reboot. * Hence, when the PHY is in a bad state, it stays in its bad state until powercycle. * We have found a way to prevent the PHY from entering this bad state by suspending the PHY in the case of reboot. [Test Case] * do the reboot test (at least 2000 reboots): run 'reboot' from linux. * Check that the oob_net0 interface is up and the ip is assigned. * please note that if the the OOB doesn't get an ip, try reloading the driver (rmmod/modprobe). it that solves the issue, that would be a different bug. In the bug at stake, nothing recovers the OOB ip except power cycle. [Regression Potential] * Make sure the redfish DHCP is still working during the reboot test * Make sure the OOB gets an ip [Other] These changes were made both in the mlxbf-gige driver and UEFI