From patchwork Fri Aug 11 03:10:11 2017
X-Patchwork-Submitter: Alan Modra
X-Patchwork-Id: 800393
Date: Fri, 11 Aug 2017 12:40:11 +0930
From: Alan Modra
To: Segher Boessenkool
Cc: gcc-patches@gcc.gnu.org, dje.gcc@gmail.com
Subject: [RS6000] PR 80938, Don't emit eh_frame for regs that don't need saving
Message-ID: <20170811031011.GO16312@bubble.grove.modra.org>
In-Reply-To: <20170810133924.GA13471@gate.crashing.org>

When using out-of-line register saves or store multiple, it is
possible to save some registers unnecessarily; for example, one reg in
the saved block might be unused.  We don't need to emit eh_frame info
for those registers: doing so just bloats eh_frame, and it can also
result in an ICE when shrink-wrap gives multiple paths through the
function that save different sets of registers.  All the join points
need to have identical eh_frame register save state.

This patch reverts the previous fix for PR80938, "Use SAVE_MULTIPLE
only if we restore what it saves (PR80938)", and instead fixes the PR
by correcting the eh_frame info.  The change to rs6000_savres_strategy
is an optimization, but note that it hides the underlying problem in
the PR testcase.

Bootstrapped and regression tested on powerpc64-linux (-m32 too) and
powerpc64le-linux, with
https://gcc.gnu.org/ml/gcc-patches/2017-08/msg00774.html and
https://gcc.gnu.org/ml/gcc-patches/2017-08/msg00775.html applied.
OK to apply?

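To illustrate the idea in the description above, here is a minimal
standalone C sketch.  It is not GCC code: the function names and the
needs_save array are hypothetical stand-ins for save_reg_p and the
rs6000 frame info, and it ignores the special cases the patch handles
(fixed regs, r0 standing in for LR, CR2).  It models a store multiple
that writes every GPR from the first saved reg up to r31, while unwind
info is recorded only for the regs that actually need saving.

```c
#include <assert.h>
#include <stdbool.h>

/* Standalone model, not GCC code.  All names are hypothetical.  */

#define LAST_GPR 31

/* Count the GPRs in [first, LAST_GPR] that really need saving.  */
int
count_needed (int first, const bool needs_save[32])
{
  assert (first >= 0 && first <= LAST_GPR + 1);
  int count = 0;
  for (int i = first; i <= LAST_GPR; i++)
    if (needs_save[i])
      count++;
  return count;
}

/* Model of the "count <= 1" check: a block store is only worth
   using when more than one reg needs saving.  */
bool
use_store_multiple (int first, const bool needs_save[32])
{
  return count_needed (first, needs_save) > 1;
}

/* A store multiple writes every GPR from FIRST to r31, but unwind
   info is wanted only for the regs that need saving.  Store those
   register numbers in CFI_REGS and return how many there are.  */
int
cfi_regs_for_store_multiple (int first, const bool needs_save[32],
                             int cfi_regs[32])
{
  assert (first >= 0 && first <= LAST_GPR + 1);
  int n = 0;
  for (int i = first; i <= LAST_GPR; i++)
    if (needs_save[i])
      cfi_regs[n++] = i;
  return n;
}
```

For instance, with only r30 marked in needs_save and a first reg of
30, count_needed returns 1 and use_store_multiple returns false, which
corresponds to the ABI_V4 pic reg case mentioned in the patch comment.
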
        PR target/80938
        * config/rs6000/rs6000.c (rs6000_savres_strategy): Revert 2017-08-09.
        Don't use store multiple if only one reg needs saving.
        (rs6000_frame_related): Don't emit eh_frame for regs that
        don't need saving.
        (rs6000_emit_epilogue): Likewise.

diff --git a/gcc/config/rs6000/rs6000.c b/gcc/config/rs6000/rs6000.c
index 2070648..abc55bd 100644
--- a/gcc/config/rs6000/rs6000.c
+++ b/gcc/config/rs6000/rs6000.c
@@ -24432,20 +24432,37 @@ rs6000_savres_strategy (rs6000_stack_t *info,
            && flag_shrink_wrap_separate
            && optimize_function_for_speed_p (cfun)))
     {
-      /* Prefer store multiple for saves over out-of-line routines,
-         since the store-multiple instruction will always be smaller.  */
-      strategy |= SAVE_INLINE_GPRS | SAVE_MULTIPLE;
-
-      /* The situation is more complicated with load multiple.  We'd
-         prefer to use the out-of-line routines for restores, since the
-         "exit" out-of-line routines can handle the restore of LR and the
-         frame teardown.  However if doesn't make sense to use the
-         out-of-line routine if that is the only reason we'd need to save
-         LR, and we can't use the "exit" out-of-line gpr restore if we
-         have saved some fprs; In those cases it is advantageous to use
-         load multiple when available.  */
-      if (info->first_fp_reg_save != 64 || !lr_save_p)
-        strategy |= REST_INLINE_GPRS | REST_MULTIPLE;
+      int count;
+
+      for (count = 0, i = info->first_gp_reg_save; i < 32; i++)
+        if (save_reg_p (i))
+          count += 1;
+
+      if (count <= 1)
+        /* Don't use store multiple if only one reg needs to be
+           saved.  This can occur for example when the ABI_V4 pic reg
+           (r30) needs to be saved to make calls, but r31 is not
+           used.  */
+        strategy |= SAVE_INLINE_GPRS | REST_INLINE_GPRS;
+      else
+        {
+          /* Prefer store multiple for saves over out-of-line
+             routines, since the store-multiple instruction will
+             always be smaller.  */
+          strategy |= SAVE_INLINE_GPRS | SAVE_MULTIPLE;
+
+          /* The situation is more complicated with load multiple.
+             We'd prefer to use the out-of-line routines for restores,
+             since the "exit" out-of-line routines can handle the
+             restore of LR and the frame teardown.  However if doesn't
+             make sense to use the out-of-line routine if that is the
+             only reason we'd need to save LR, and we can't use the
+             "exit" out-of-line gpr restore if we have saved some
+             fprs; In those cases it is advantageous to use load
+             multiple when available.  */
+          if (info->first_fp_reg_save != 64 || !lr_save_p)
+            strategy |= REST_INLINE_GPRS | REST_MULTIPLE;
+        }
     }

   /* Using the "exit" out-of-line routine does not improve code size
@@ -24454,21 +24471,6 @@ rs6000_savres_strategy (rs6000_stack_t *info,
   else if (!lr_save_p && info->first_gp_reg_save > 29)
     strategy |= SAVE_INLINE_GPRS | REST_INLINE_GPRS;

-  /* We can only use save multiple if we need to save all the registers from
-     first_gp_reg_save.  Otherwise, the CFI gets messed up (we save some
-     register we do not restore).  */
-  if (strategy & SAVE_MULTIPLE)
-    {
-      int i;
-
-      for (i = info->first_gp_reg_save; i < 32; i++)
-        if (fixed_reg_p (i) || !save_reg_p (i))
-          {
-            strategy &= ~SAVE_MULTIPLE;
-            break;
-          }
-    }
-
   /* Don't ever restore fixed regs.  */
   if ((strategy & (REST_INLINE_GPRS | REST_MULTIPLE)) != REST_INLINE_GPRS)
     for (i = info->first_gp_reg_save; i < 32; i++)
@@ -25681,9 +25683,15 @@ rs6000_frame_related (rtx_insn *insn, rtx reg, HOST_WIDE_INT val,
                  register save functions, or store multiple, then omit
                  eh_frame info for any user-defined global regs.  If
                  eh_frame info is supplied, frame unwinding will
-                 restore a user reg.  */
+                 restore a user reg.  Also omit eh_frame info for any
+                 reg we don't need to save, as that bloats eh_frame
+                 and can cause problems with shrink wrapping.  Saves
+                 of r0 are actually saving LR, so don't omit those.  */
               if (!REG_P (SET_SRC (set))
-                  || !fixed_reg_p (REGNO (SET_SRC (set))))
+                  || REGNO (SET_SRC (set)) == 0
+                  || REGNO (SET_SRC (set)) == CR2_REGNO
+                  || (!fixed_reg_p (REGNO (SET_SRC (set)))
+                      && save_reg_p (REGNO (SET_SRC (set)))))
                 RTX_FRAME_RELATED_P (set) = 1;
             }
           RTX_FRAME_RELATED_P (insn) = 1;
@@ -25720,9 +25728,13 @@ rs6000_frame_related (rtx_insn *insn, rtx reg, HOST_WIDE_INT val,
           set = simplify_replace_rtx (set, reg2, repl2);
           XVECEXP (pat, 0, i) = set;

-          /* Omit eh_frame info for any user-defined global regs.  */
+          /* Omit eh_frame info for any user-defined global regs or
+             regs that don't need to be saved.  */
           if (!REG_P (SET_SRC (set))
-              || !fixed_reg_p (REGNO (SET_SRC (set))))
+              || REGNO (SET_SRC (set)) == 0
+              || REGNO (SET_SRC (set)) == CR2_REGNO
+              || (!fixed_reg_p (REGNO (SET_SRC (set)))
+                  && save_reg_p (REGNO (SET_SRC (set)))))
             RTX_FRAME_RELATED_P (set) = 1;
         }
     }
@@ -27945,7 +27957,8 @@ rs6000_emit_epilogue (int sibcall)
           RTVEC_ELT (p, j++)
             = gen_frame_load (reg,
                               frame_reg_rtx, info->gp_save_offset + reg_size * i);
-          if (flag_shrink_wrap)
+          if (flag_shrink_wrap
+              && save_reg_p (info->first_gp_reg_save + i))
             cfa_restores = alloc_reg_note (REG_CFA_RESTORE, reg, cfa_restores);
         }
       for (i = 0; info->first_altivec_reg_save + i <= LAST_ALTIVEC_REGNO; i++)
@@ -27954,7 +27967,8 @@ rs6000_emit_epilogue (int sibcall)
           RTVEC_ELT (p, j++)
             = gen_frame_load (reg,
                               frame_reg_rtx, info->altivec_save_offset + 16 * i);
-          if (flag_shrink_wrap)
+          if (flag_shrink_wrap
+              && save_reg_p (info->first_altivec_reg_save + i))
             cfa_restores = alloc_reg_note (REG_CFA_RESTORE, reg, cfa_restores);
         }
       for (i = 0; info->first_fp_reg_save + i <= 63; i++)
@@ -27964,7 +27978,8 @@ rs6000_emit_epilogue (int sibcall)
                                  info->first_fp_reg_save + i);
           RTVEC_ELT (p, j++)
             = gen_frame_load (reg, frame_reg_rtx, info->fp_save_offset + 8 * i);
-          if (flag_shrink_wrap)
+          if (flag_shrink_wrap
+              && save_reg_p (info->first_fp_reg_save + i))
             cfa_restores = alloc_reg_note (REG_CFA_RESTORE, reg, cfa_restores);
         }
       RTVEC_ELT (p, j++)
@@ -28085,7 +28100,8 @@ rs6000_emit_epilogue (int sibcall)
             && (flag_shrink_wrap
                 || (offset_below_red_zone_p
                     (info->altivec_save_offset
-                     + 16 * (i - info->first_altivec_reg_save)))))
+                     + 16 * (i - info->first_altivec_reg_save))))
+            && save_reg_p (i))
           {
             rtx reg = gen_rtx_REG (V4SImode, i);
             cfa_restores = alloc_reg_note (REG_CFA_RESTORE, reg, cfa_restores);
@@ -28297,7 +28313,8 @@ rs6000_emit_epilogue (int sibcall)
       for (i = info->first_altivec_reg_save; i <= LAST_ALTIVEC_REGNO; ++i)
         if (((strategy & REST_INLINE_VRS) == 0
              || (info->vrsave_mask & ALTIVEC_REG_BIT (i)) != 0)
-            && (DEFAULT_ABI == ABI_V4 || flag_shrink_wrap))
+            && (DEFAULT_ABI == ABI_V4 || flag_shrink_wrap)
+            && save_reg_p (i))
           {
             rtx reg = gen_rtx_REG (V4SImode, i);
             cfa_restores = alloc_reg_note (REG_CFA_RESTORE, reg, cfa_restores);
@@ -28643,7 +28660,8 @@ rs6000_emit_epilogue (int sibcall)

               RTVEC_ELT (p, elt++)
                 = gen_frame_load (reg, sp_reg_rtx, info->fp_save_offset + 8 * i);
-              if (flag_shrink_wrap)
+              if (flag_shrink_wrap
+                  && save_reg_p (info->first_fp_reg_save + i))
                 cfa_restores = alloc_reg_note (REG_CFA_RESTORE, reg, cfa_restores);
             }