From patchwork Thu Mar 21 15:57:30 2013 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Peter Lieven X-Patchwork-Id: 229745 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@bilbo.ozlabs.org Received: from lists.gnu.org (lists.gnu.org [208.118.235.17]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (Client did not present a certificate) by ozlabs.org (Postfix) with ESMTPS id A35002C012A for ; Fri, 22 Mar 2013 03:02:32 +1100 (EST) Received: from localhost ([::1]:52751 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UIhwk-0004lT-Sl for incoming@patchwork.ozlabs.org; Thu, 21 Mar 2013 12:02:30 -0400 Received: from eggs.gnu.org ([208.118.235.92]:38539) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UIhsQ-0006fv-In for qemu-devel@nongnu.org; Thu, 21 Mar 2013 11:58:12 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1UIhsN-0002it-Bm for qemu-devel@nongnu.org; Thu, 21 Mar 2013 11:58:02 -0400 Received: from [2a02:248:0:30:223:aeff:fefe:7f1c] (port=33560 helo=dns.kamp-intra.net) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UIhsN-0002in-4u for qemu-devel@nongnu.org; Thu, 21 Mar 2013 11:57:59 -0400 Received: from lieven-pc.kamp-intra.net (lieven-pc.kamp-intra.net [172.21.12.60]) by dns.kamp-intra.net (Postfix) with ESMTP id 8CC2B206AC; Thu, 21 Mar 2013 16:57:58 +0100 (CET) Received: by lieven-pc.kamp-intra.net (Postfix, from userid 1000) id 40C9C5FCAE; Thu, 21 Mar 2013 16:58:09 +0100 (CET) From: Peter Lieven To: qemu-devel@nongnu.org Date: Thu, 21 Mar 2013 16:57:30 +0100 Message-Id: <1363881457-14814-3-git-send-email-pl@kamp.de> X-Mailer: git-send-email 1.7.9.5 In-Reply-To: <1363881457-14814-1-git-send-email-pl@kamp.de> References: <1363881457-14814-1-git-send-email-pl@kamp.de> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 2a02:248:0:30:223:aeff:fefe:7f1c Cc: kwolf@redhat.com, pbonzini@redhat.com, Peter Lieven Subject: [Qemu-devel] [PATCHv3 2/9] cutils: add a function to find non-zero content in a buffer X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org this adds buffer_find_nonzero_offset() which is a SSE2/Altives optimized function that searches for non-zero content in a buffer. due to the optimizations used in the function there are restrictions on buffer address and search length. the function can_use_buffer_find_nonzero_content() can be used to check if the function can be used safely. Signed-off-by: Peter Lieven --- include/qemu-common.h | 3 +++ util/cutils.c | 50 +++++++++++++++++++++++++++++++++++++++++++++++++ 2 files changed, 53 insertions(+) diff --git a/include/qemu-common.h b/include/qemu-common.h index e76ade3..ebbaf71 100644 --- a/include/qemu-common.h +++ b/include/qemu-common.h @@ -362,6 +362,9 @@ size_t qemu_iovec_from_buf(QEMUIOVector *qiov, size_t offset, size_t qemu_iovec_memset(QEMUIOVector *qiov, size_t offset, int fillc, size_t bytes); +#define BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR 8 +inline bool can_use_buffer_find_nonzero_offset(const void *buf, size_t len); +inline size_t buffer_find_nonzero_offset(const void *buf, size_t len); bool buffer_is_zero(const void *buf, size_t len); void qemu_progress_init(int enabled, float min_skip); diff --git a/util/cutils.c b/util/cutils.c index 1439da4..6d079ac 100644 --- a/util/cutils.c +++ b/util/cutils.c @@ -143,6 +143,56 @@ int qemu_fdatasync(int fd) } /* + * Searches for an area with non-zero content in a buffer + * + * Attention! The len must be a multiple of 8 * sizeof(VECTYPE) + * and addr must be a multiple of sizeof(VECTYPE) due to + * restriction of optimizations in this function. + * + * can_use_buffer_find_nonzero_offset() can be used to check + * these requirements. + * + * The return value is the offset of the non-zero area rounded + * down to 8 * sizeof(VECTYPE). If the buffer is all zero + * the return value is equal to len. + */ + +inline size_t buffer_find_nonzero_offset(const void *buf, size_t len) +{ + VECTYPE *p = (VECTYPE *)buf; + VECTYPE zero = ZERO_SPLAT; + size_t i; + + if (*((const long *) buf)) { + return 0; + } + + for (i = 0; i < len / sizeof(VECTYPE); + i += BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR) { + VECTYPE tmp0 = p[i + 0] | p[i + 1]; + VECTYPE tmp1 = p[i + 2] | p[i + 3]; + VECTYPE tmp2 = p[i + 4] | p[i + 5]; + VECTYPE tmp3 = p[i + 6] | p[i + 7]; + VECTYPE tmp01 = tmp0 | tmp1; + VECTYPE tmp23 = tmp2 | tmp3; + if (!ALL_EQ(tmp01 | tmp23, zero)) { + break; + } + } + return i * sizeof(VECTYPE); +} + +inline bool can_use_buffer_find_nonzero_offset(const void *buf, size_t len) +{ + if (len % (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR + * sizeof(VECTYPE)) == 0 + && ((uintptr_t) buf) % sizeof(VECTYPE) == 0) { + return true; + } + return false; +} + +/* * Checks if a buffer is all zeroes * * Attention! The len must be a multiple of 4 * sizeof(long) due to