diff options
author | Andy Polyakov <appro@openssl.org> | 2015-01-05 23:40:10 +0100 |
---|---|---|
committer | Andy Polyakov <appro@openssl.org> | 2015-01-13 21:40:14 +0100 |
commit | b3d7294976c58e0e05d0ee44a0e7c9c3b8515e05 (patch) | |
tree | 308a62f20067be1911f844c93a8326e27a9dee86 /crypto/modes/asm | |
parent | a5a412350daa8f49b90323ec2a99fee499fc5b6d (diff) | |
download | openssl-b3d7294976c58e0e05d0ee44a0e7c9c3b8515e05.tar.gz |
Add Broadwell performance results.
Reviewed-by: Emilia Käsper <emilia@openssl.org>
Diffstat (limited to 'crypto/modes/asm')
-rw-r--r-- | crypto/modes/asm/aesni-gcm-x86_64.pl | 5 | ||||
-rw-r--r-- | crypto/modes/asm/ghash-x86_64.pl | 4 |
2 files changed, 7 insertions, 2 deletions
diff --git a/crypto/modes/asm/aesni-gcm-x86_64.pl b/crypto/modes/asm/aesni-gcm-x86_64.pl index cfc856cf35..7e4e04ea25 100644 --- a/crypto/modes/asm/aesni-gcm-x86_64.pl +++ b/crypto/modes/asm/aesni-gcm-x86_64.pl @@ -22,7 +22,10 @@ # [1] and [2], with MOVBE twist suggested by Ilya Albrekht and Max # Locktyukhin of Intel Corp. who verified that it reduces shuffles # pressure with notable relative improvement, achieving 1.0 cycle per -# byte processed with 128-bit key on Haswell processor. +# byte processed with 128-bit key on Haswell processor, and 0.74 - +# on Broadwell. [Mentioned results are raw profiled measurements for +# favourable packet size, one divisible by 96. Applications using the +# EVP interface will observe a few percent worse performance.] # # [1] http://rt.openssl.org/Ticket/Display.html?id=2900&user=guest&pass=guest # [2] http://www.intel.com/content/dam/www/public/us/en/documents/software-support/enabling-high-performance-gcm.pdf diff --git a/crypto/modes/asm/ghash-x86_64.pl b/crypto/modes/asm/ghash-x86_64.pl index ce7d1cb8ba..6e656ca13b 100644 --- a/crypto/modes/asm/ghash-x86_64.pl +++ b/crypto/modes/asm/ghash-x86_64.pl @@ -63,6 +63,7 @@ # Sandy Bridge 1.80(+8%) # Ivy Bridge 1.80(+7%) # Haswell 0.55(+93%) (if system doesn't support AVX) +# Broadwell 0.45(+110%)(if system doesn't support AVX) # Bulldozer 1.49(+27%) # Silvermont 2.88(+13%) @@ -73,7 +74,8 @@ # CPUs such as Sandy and Ivy Bridge can execute it, the code performs # sub-optimally in comparison to above mentioned version. But thanks # to Ilya Albrekht and Max Locktyukhin of Intel Corp. we knew that -# it performs in 0.41 cycles per byte on Haswell processor. +# it performs in 0.41 cycles per byte on Haswell processor, and in +# 0.29 on Broadwell. # # [1] http://rt.openssl.org/Ticket/Display.html?id=2900&user=guest&pass=guest |