Glitch-Free Implementation of Masking in Modern FPGAs

Size: px

Start display at page:

Download "Glitch-Free Implementation of Masking in Modern FPGAs"

Linda Fleming
5 years ago
Views:

1 Glitch-Free Imementation of Masking in Modern FPGAs Amir Moradi and Oliver Mischke Horst Görtz Institute for IT Security, Ruhr University Bochum, Germany {moradi, Abstract Due to the propagation of the glitches in combinational circuits side-channel leakage of the masked S-boxes realized in hardware is a known issue. Our contribution in this paper is to adopt a masked AES S-box circuit according to the FPGA resources in order to avoid the glitches. Our design is suitable for the 5, 6, and 7 FPGA series of Xilinx although our practical investigations are performed using a Virtex-5 chip. In short, compared to the original design synthesized by automatic tools while requiring the same area (slice count) our design reduces power consumption, critical path delay, and more importantly the side-channel leakage. In our practical investigations we could not recover any first-order leakage of our design using up to 50 million traces. However, since the targeted S-box realizes a first-order boolean masking, the second-order leakage could be revealed using around 25 million measurements. I. INTRODUCTION With the increasing pervasion of cryptogray in more and more embedded systems to protect either the intellectual property of a vendor or to preserve privacy by allowing secure communications, the need of secure imementations of cryptograic primitives like AES is at an all-time high. These imementations should not only be resistant to classical attacks but also be protected against side-channel attacks like power analysis [11], [12]. Countermeasures against power analysis attacks in hardware can be realized on multie levels. However, if the target atform is an FPGA, the algorithmic-level countermeasures are mainly the possible choices. Masking of sensitive values is one of the most considered solutions, and several schemes have already been published. These options include multiicative [2], [10], additive [3], [7], [20], or relatively recent affine [9] masking schemes. The problem of masking in hardware could not yet be solved by these schemes. Several attacks have been published, e.g., [13], [15], which exoit a remaining first-order leakage in the designs. The reason for the remaining leakage namely glitches in the combinational circuits is well known to the community. A coue of new schemes have been proposed to solve this issue by creating glitch-resistant imementations. The notable ones are the threshold imementation (TI) [17], [18], [19] and a new proposal based on a mixture of multi-party computation (MPC) and Shamir secret sharing [22], [23]. However, making a correct TI of most algorithms is very challenging. So far only the Noekeon [8] and the PRESENT [5] S-boxes could be successfully imemented [19], [21]. The MPC scheme has not been practically evaluated yet, but because of the proposed design of the inversion, the area and speed overheads of a single S-box computation are quite large. In this work we try not to create a glitch-resistant imementation but instead try to avoid causing any glitches. The target of our imementation is the Virtex-5 LX-50 FPGA of the readily available side-channel evaluation atform SASEBO- GII [1]. For this we take the very compact masked S-box by Canright-Batina [7] and manually map the combinational functions to the resources of our target atform. By efficiently using special enable signals in each FPGA Look-Up-Table (), we can suppress any glitches at the outputs by enabling them only sequentially. We have evaluated different versions of our design including a fully pipelined one achieving a very high clock frequency. Note that although our design has been initially optimized to the 6-Input architecture of the Xilinx Virtex-5 FPGA, the same architecture is used in their newer Series 6 and 7 FPGAs which allows using the same design on these recent atforms. When evaluating the side-channel leakage of our final design, contrary to the original S-box imementation our design did not show any first-order leakage by analyzing 50 million measurements. Since the scheme only imements a first-order masking, a second-order attack is expected to be successful, which is practically confirmed using a very high amount of 25 million measurements. In the next section we briefly describe the reasons why we have selected the Canright-Batina masked S-box as the basis of our imementation. Moreover, we introduce the Xilinx architecture and how we have used it to eliminate glitches. Section III gives an overview of our S-box design and names the imementation profiles used in the evaluation whose results are depicted in Section IV. Finally, Section V concludes this article. II. TARGETS In the following we will first give a short summary of the recent masked S-box designs and state why we have chosen the one of Canright and Batina as basis for our modifications to create a glitch-free version. Afterwards we will describe the architecture of the Xilinx 6-Input and how we use it to minimize the possible leakage. A. Masked AES S-box As stated previously the currently known glitch-resistant schemes come with some drawbacks. Threshold imementation has been shown to be quite effective when using small

2 Optimized xor/sq/scl/ mul input mask output mask 4 S Fig. 1. Masked GF(2 8 ) Inverter by Canright-Batina (taken from [15]) S-boxes [21], but because of the large S-box size of AES up to now no expressions could be found to rewrite the AES S-box using this scheme. Note that the imementation reported in [16] has been made by masking the multiiers of a tower-field imementation of the AES S-box which could not follow the requirements of the threshold imementation. At CHES 2011 a mixture of Shamir secret sharing scheme and multi-party computation was introduced [22]. While it has not been practically evaluated yet, it is clear that the hardware resource requirements are quite high. Furthermore, because of the sequential way of computing the inversion of the S-box a large number of clock cycles are necessary to compute only one S-box output. All these predicted area and time overheads may hinder its practical feasibility. Instead of focusing on glitch resistance in this article we try to avoid any glitches at the FPGA s at all. From the more traditional currently known masking schemes the one of Canright-Batina [7] uses an additive masking and imements the S-box in a tower-field approach using carefully chosen normal bases to minimize the circuit size. It is based on the area-optimized S-box by Canright [6], and it is still supposed to be the most compact design available. While it was claimed to be perfectly secure by the definition of [3], it was shown in [15] that because of glitches in the circuit there still exists an exoitable first-order leakage. Figure 1 shows an overview of the GF(2 8 ) inverter design omitting the towerfield conversions. The GF(2 4 ) inverter is imemented using the same design the only difference being that the inversion in GF(2 2 ) is also merged to this module. The authors of the original design were kind enough to supy the HDL source code online 1 which we used as basis for our modifications detailed in the following. B. Xilinx FPGA Resources When not using dedicated hardware blocks like Multiiers/DSPs, a combinational logic circuit in an FPGA is usually imemented by means of many-to-one Look-Up Tables. Their general design is as a number of single-bit storage elements whose values are initialized during the configuration of the FPGA by the bitstream. The inputs of the control the setting of internal multiexers thereby choosing which stored 1 Fig. 2. Two possible s in Virtex-5: 6-input, 32-bit Shift- Register [25] bit value is available at the output of the. As exame, considering the 6-to-1 of the Xilinx Series 5, 6, and 7 FPGAs, the imementation of this is realized as two 5-to-1 s and a multiexer as can be seen in Fig. 2. Each of these 5-to-1 s themselves can again be seen as two 4-to-1 s and a multiexer and so on. In our device under test, the Xilinx Virtex-5 LX50 FPGA mounted on a SASEBO-GII Board, each slice consists of four 6 and four single-bit flip-flops. The 6, as depicted in Fig. 2, can be hardinstanced in two different configurations. As 6_1 any combinational function having up to 6 input signals and one output signal can be imemented. Using the in a 5_2 configuration allows providing two output signals from the 5 inputs but only if these 5 inputs are the same for both internal 5-to-1 s, i.e., the inputs must be shared. Glitches at the output of a happen since the input signals arrive at different instances of time because of the routing specification in the device. In order to avoid this the output of the must be hold stable until all input signals have arrived. We achieve this by using one of the input signals as an active low enable signal, i.e., in our case as long as this input signal is set to logic 1, the output will always be logic 0 no matter the values of the other input signals. Here it is important to choose the correct input signal as enable carefully. Let us consider choosing the input I5 in Fig. 2 as the enable signal. While the output of the _6 will actually not change during the transition period of the other input signals, there will still be glitches at the output of one of the internal _5 instances. We therefore have to choose the input signal which controls the very first multiexer stage so that toggles at the select signals of the following multiexers do not cause any glitches. Although the details of the internal architecture of the FPGA resources are not publicly available, this input signal can be identified by looking at the architecture of the SRLC32E depicted in Fig. 2. It is a special mode of operation for s in some slices of Xilinx FPGAs that realizes a shift register. In this mode the content of the storage cells 2

3 a al al bl bl MUL.SCL 2x2 p an Q1 en1 en2 en3 Q0 b m n al al MUL.SCL 2x2 bl bl p mb Q1 Q0 en1 en2 en3 al al MUL.SCL 2x2 bl bl p mn Q1 Q0 en1 en2 en3 en3 c1 cst c3 af8 c2 c4 cst c5 a c6 c7 b en2 c8 m4 en4 mn csm csm cl b a ch n m m2 GF_INV_8 (masked) d GF_INV_4 an mb Q1 mn cst1 e q cst0 cm1 em cm0 d p Q0 dn en6 en7 en8 en9 en10 o1 e al al MUL.SCL 2x2 bl bl p Q1 p Q0 al al MUL.SCL 2x2 bl bl p Q1 dn Q0 al al MUL.SCL 2x2 bl bl p Q1 q Q0 al al MUL.SCL 2x2 bl bl p Q1 em QH QL n m o0 Q0 m4 m5 en1 en2 en3 en4 en5 en6 en7 en8 en9 en10 en11 en15 Fig. 3. Design of our full-custom optimized S-box (inversion part only) can be changed in a serial fashion during the operation of the FPGA. By using the inputs as select lines, the length of the shift register can be set dynamically. Since the all zero input sets the length to 1 bit, and switching the I0 input signal to logic 1 increases the length to 2 bits, i.e., choosing the neighboring cell, the I0 signal must control the very first multiexer stage. Therefore, I0 is the correct choice for the enable signal. Note that since the synthesizer permutes the input signals (and accordingly changes the configuration) to optimize the routing, by special constraints [24] one has to keep the PIN positions of the hardinstanced s locked. III. OUR DESIGN The detailed structure of our design is given by Fig. 3. Omitting the tower-field conversion, 15 stages are required to perform the full inversion in GF(2 8 ). We give performance figures for 6 different imementation profiles, from the original unmodified design to our optimized one with or without pipelining stages and when the special enable signals to minimize glitches in the circuit are used or not. The imementation profiles of the S-box are as follows: 1) The original HDL code optimized by the ISE synthesizer 2) The original HDL but avoiding any optimizations or trimming by the synthesizer, i.e., one per gate to keep all hierarchy levels 3) Our modified design using hardinstanced s, all enable signals always 0, no pipeline registers 4) Our modified design without pipelining but activating each stage sequentially by the enable signals 5) Our modified design using pipelining to hinder glitch propagation, but all enable signals always 0 6) Our modified design using both pipelining to hinder the glitch propagation and using the enable signals to avoid glitches in the circuit 3

4 In Profiles 1, 2, and 3 the imementations are pure combinational functions where at each clock the full S-box is computed at once. Glitches in the first stage therefore are passed through the whole S-box generating a highly glitching circuit until all signals get stable. Therefore, we do not consider Profiles 1 and 2 in our side-channel evaluations (Section IV). Profile 4 avoids this issue. Here only one stage is activated in each clock cycle, thereby not only hindering the propagation of glitches but also not causing any glitches at all. That is because the input signals of the next stage are stable when they are activated in the following clock cycle. The downside of this profile is the apparent non-practicality. One needs 15 clock cycles to compute a single S-box output while the inputs must be hold stable. In order to make matters worse one would need to spend another 15 clock cycles to deactivate each stage in the reverse order before the next S- box computation can begin. In Profile 5 the pipelining stages hinder the glitch propagation. On the other hand, keeping all enable signals at 0 glitches will still occur at the outputs of each stage. Finally, in the last Profile 6 we combine both the pipelining to avoid any glitch propagation and the use of the active-low enable signals to cometely shut down glitches at the outputs. In order to reach our goal in a straightforward way one would need to i) first disable all s, ii) clock every second pipelining registers after enabling their corresponding s, iii) disable all s again, iv) clock the other half of pipelining registers having their corresponding s enabled and so on. This means that only every four clock cycles a new S-box input can be feed into the circuit, and it leads to a latency of 30 clock cycles from input to output. This is necessary because if one would simy merge clocking every second register and disabling the connected stage at the same time, the routing of the signals would determine whether the disable signal arrives at the first or if other inputs arrive earlier, which the later causes glitches at the output. To avoid this issue we can use the special way the clock signal is routed in the FPGA. The clock is routed on special dedicated paths to each switch box separately to avoid race problems in synchronous circuits. However, the output signals need to first go back to the corresponding slice s switch box and from there travel to the destination inputs where more switch boxes might be passed. Therefore, a transition, e.g., low-to-high, on the clock signal arrives at the registers and s of each slice earlier than the other signals. Therefore, by tying our active-low enable signals to the clock signal the gets deactivated at each rising clock edge before the new inputs arrive. At the falling edge of the clock the gets active and provides the output signal to the next flip-flop stage where it will be stored at the next rising edge. This way the pipelining registers can be active at every clock cycle and no glitches will occur. Please note that the maximum clock frequency in this case cannot be faster than twice the longest critical path delay of the S-box circuit. In order to provide clk/ en output data in 0 output (i) 0 output (ii) 0 Fig. 4. inputs (i) inputs (ii) Signal timings on inputs and outputs inputs (iii) a better understanding Fig. 4 showcases the different signal timings. Also, the performance results of each imementation profile for only the inversion module of the S-box is given in Table I. TABLE I SYNTHESIS RESULTS FOR ALL PROFILES (INVERSION ONLY) Profile Max. Freq. #s #FFs Latency Throughput (#clocks) (16 Inv. /s) MHz MHz MHz MHz MHz (pipe d) MHz (pipe d) IV. EVALUATION We used a SASEBO-GII [1] board as the target atform to examine the side-channel leakage of our designs. Different profiles of our design were imemented on the Virtex-5 (XC5VLX50) FPGA embedded on the target board, and the power consumption traces were collected using a LeCroy WP715Zi 1.5GHz oscilloscope at the saming rate of 1GS/s. Since our design emoys a very few number of s in the target FPGA, and the number of toggles in each clock cycle is restricted, the peak-to-peak amitude of the signal in the power traces was quite low. Therefore, we measured the power traces by means of a 1Ω resistor in the VDD path, a DC blocker, a passive probe and an amifier. Furthermore, we restricted the bandwidth of the measurements (on the oscilloscope) to 20MHz to eliminate the electrical noise while our designs run by a stable 3MHz oscillator. We made an exemary architecture where one AddRound- Key module (128-bit) and one instance of the targeted S- box exist. The 128-bit (masked) input is ed by a 128- bit secret key, and the result is sequentially given to the S- box module one byte per clock cycle. The method we used to examine the side-channel leakage of our targeted designs is a correlation collision attack [15]. It examines the firstorder leakage of one circuit instance that is used in different time instances. Therefore, it perfectly suits to our exemary architecture since the targeted S-box instance is shared for all SubBytes transformations. The target masked S-box [7] uses two different mask bytes per input byte, i.e., a random byte to mask an input byte and another random byte as the mask of S-box output. Therefore, we provided two random values for each input byte, and gave the above mentioned architecture the masked inputs and the corresponding masks. In other words, in each run of the circuit two independent 128-bit random values 4

5 Voltage [mv] Voltage [mv] 6 2 Time [µs] Time [µs] (c) (c) Fig. 5. Profile 3: evaluation results a same trace, attack result using traces, (c) over the number of traces. Fig. 6. Profile 5: evaluation results a same trace, attack result using traces, (c) over the number of traces. SubBytes transformations. These 31 clock cycles are clearly distinguishable in Fig. 6 which shows a same power trace of this design. As an interesting point, compared to that of Profile 3 (Fig. 5) the power consumption of Profile 5 is reduced though it needs 15 more clock cycles to finish all SubBytes transformations. In order to perform a successful attack on this design and recover the desired secret, we required to collect much more traces compared to Profile 3, i.e., This is due to preventing the glitch propagations, which control the datadependent leakage and consequently is harder to detect. The same attack scheme with the same target as in Profile 3 was performed. As shown in Fig. 6, there is still a first-order leakage. This shows that controlling the propagation of the glitches is effective to significantly reduce the side-channel leakage, but it does not cometely prevent it, as we need about traces to see the desired leakage (Fig. 6(c)). The last design we considered for evaluation is Profile 6, where by a soisticated control over the enable signals the glitches are prevented. The level of power consumption of this design, as shown by Fig. 7, is roughly the same2 as that of Profile 5. In order to perform the attacks, we measured traces of this design. Performing the same attack as before led to the unsuccessful result which is depicted in Fig. 7. In fact, it shows that preventing the glitches significantly helps resisting against the first-order attacks. However, this design should have second-order leakage because of its as input and output masks are provided for the aforementioned circuit. For comparison purposes we start our evaluations by Profile 3 to have a reference as a design where glitches are not controlled and can be propagated. Please note that we omitted the evaluation results of Profiles 1 and 2 since there is no control over the glitches, and they have the same side-channel leakage as that of Profile 3. A same power trace of this design is shown in Fig. 5. Sixteen clock cycles related to the sixteen S-box computations are clearly distinguishable. We measured traces, and performed a correlation collision attack considering two aintext bytes which are processed consecutively by the targeted S-box instance. Note that this attack, similar to the most of the side-channel collision attacks, recovers a relation between the targeted secret key bytes. In case of our targets (like the linear collision attack on AES [4]) the attack searches for the difference of the key bytes corresponding to the two targeted aintext bytes. The result of this attack is depicted in Fig. 5 and Fig. 5(c) showing the simicity of recovering the secret, i.e., traces, when the glitches in the masked S-box are not controlled. Profile 5 is the next S-box design we evaluated. As mentioned in Section III, this design does not avoid the glitches, but it prevents their propagation to the next circuit stages. Since this design provides a pipeline with 15 stages, sequentially giving the 16 key-whitened aintext bytes to this S-box instance leads to requiring 31 clock cycles to compute all the 2 Indeed, 5 it is slightly lower because of the glitch prevention.

Voltage [mv] 6 5 0 5 Time [µs] 13 18 i.e., a multivariate attack. This is out of the evaluation criteria we have considered in this paper.

(Virtex-5 onward) in a size-optimized manner. Compared to a design created by an automatic synthesizer this led to the same number of s and slight decrease of the operation frequency.

6 Voltage [mv] Time [µs] i.e., a multivariate attack. This is out of the evaluation criteria we have considered in this paper. However, we believe that combining leakages of different time instances leads to increasing the noise factor and most likely provides not a better result than the univariate second-order attack whose result is shown here. V. C ONCLUSIONS In this work we have taken the highly optimized for ASICs very compact masked S-box by Canright and Batina, and ported it to use the available resources of the current Xilinx FPGA Series (Virtex-5 onward) in a size-optimized manner. Compared to a design created by an automatic synthesizer this led to the same number of s and slight decrease of the operation frequency. We could also, as already pointed out in [15], confirm the still available first-order leakage of this S-box design when imemented in a straightforward manner. Since this leakage was caused by glitches in the circuit, we have first eliminated the glitches by acing enable signals in each used, so that no output is propagated while the inputs are not stable. By combining this solution together with pipelining stages and utilizing the special way how the clock signals are routed for the enable signals, we could create an imementation which operates at an extremely high clock frequency while showing absolutely no first-order leakage by means of 50 million power consumption measurements. While not specifically focusing on this, we also achieved a quite high resistance against univariate second-order attacks. In this case 25 million traces is the threshold after which the secrets become slowly distinguishable using the very soisticated attacks of [14]. We should emasize a comparison between our results and those of a threshold imementation of AES reported in [16] and [14]. Although their imementation atform is different to ours, their scheme required roughly the same number of traces the secondorder leakage to be exoited while the area overhead of their design excluding all the internal PRNGs is much higher than our optimized one. In order to allow further study of our design and to use it in real apications the HDL source code of our masked S-box design is available online at (c) (d) Fig. 7. Profile 6: evaluation results a same trace, attack result using traces, (c) attack result on squared mean-free traces, (d) over the number of traces. underlying first-order masking scheme. In order to check this issue we performed the same attack, i.e., correlation collision attack, but using the second-order moments. That is, as illustrated in [14], in a correlation collision attack one can emoy the variance traces of the measurements instead of the averages to examine the second-order moments. It is, in fact, the same as squaring the mean-free traces and then performing a correlation collision attack [14]. We performed this preprocessing step prior to the same correlation collision attack as before, and the result is presented by Fig. 7(c). As expected, the second-order leakage is available, and can be used to reveal the desired secret using around measurements (see Fig. 7(d)). We should mention that we considered only the univariate attacks, i.e., first-order and zero-offset second-order. Because of the pipeline architecture of our design the leakages relevant to the one S-box computation are distributed over 15 clock cycles. Therefore, one may perform a second-order attack by combining the leakages appearing at different clock cycles, ACKNOWLEDGMENT In this project O. Mischke has been part-financed by the European Union, Investing in your future, European Regional Development Fund. R EFERENCES [1] Side-channel attack standard evaluation board (sasebo). Further information are available via html. [2] M.-L. Akkar and C. Giraud. An Imementation of DES and AES, Secure against Some Attacks. In CHES 2001, volume 2162 of LNCS, pages Springer, [3] J. Blömer, J. Guajardo, and V. Krummel. Provably Secure Masking of AES. In SAC 2004, volume 3357 of LNCS, pages Springer,

7 [4] A. Bogdanov. Multie-Differential Side-Channel Collision Attacks on AES. In CHES 2008, volume 5154 of LNCS, pages Springer, [5] A. Bogdanov, G. Leander, L. Knudsen, C. Paar, A. Poschmann, M. Robshaw, Y. Seurin, and C. Vikkelsoe. PRESENT - An Ultra-Lightweight Block Cier. In CHES 2007, number 4727 in LNCS, pages Springer, [6] D. Canright. A Very Compact S-Box for AES. In CHES 2005, volume 3659 of LNCS, pages Springer, The HDL specification is available at the author s official webpage pub/index.html. [7] D. Canright and L. Batina. A Very Compact "Perfectly Masked" S- Box for AES. In ACNS 2008, volume 5037 of LNCS, pages Springer, the corrected version at Cryptology eprint Archive, Report 2009/011 [8] J. Daemen, M. Peeters, G. Assche, and V. Rijmen. Nessie proposal: NOEKEON. Submitted as an NESSIE Candidate Algorithm, http: // [9] L. Genelle, E. Prouff, and M. Quisquater. Thwarting Higher-Order Side Channel Analysis with Additive and Multiicative Maskings. In CHES 2011, volume 6917 of LNCS, pages Springer, [10] J. D. Golić and C. Tymen. Multiicative Masking and Power Analysis of AES. In CHES 2002, volume 2523 of LNCS, pages Springer, [11] P. C. Kocher, J. Jaffe, and B. Jun. Differential Power Analysis. In CRYPTO 1999, volume 1666 of LNCS, pages Springer, [12] S. Mangard, E. Oswald, and T. Popp. Power Analysis Attacks: Revealing the Secrets of Smart Cards. Springer, [13] S. Mangard, N. Pramstaller, and E. Oswald. Successfully Attacking Masked AES Hardware Imementations. In CHES 2005, volume 3659 of LNCS, pages Springer, [14] A. Moradi. Statistical Tools Flavor Side-Channel Collision Attacks. In EUROCRYPT 2012, volume 7237 of LNCS, pages Springer, [15] A. Moradi, O. Mischke, and T. Eisenbarth. Correlation-Enhanced Power Analysis Collision Attack. In CHES 2010, volume 6225 of LNCS, pages Springer, the extended version at Cryptology eprint Archive, Report 2010/297 [16] A. Moradi, A. Poschmann, S. Ling, C. Paar, and H. Wang. Pushing the Limits: A Very Compact and a Threshold Imementation of AES. In EUROCRYPT 2011, volume 6632 of LNCS, pages Springer, [17] S. Nikova, C. Rechberger, and V. Rijmen. Threshold Imementations Against Side-Channel Attacks and Glitches. In ICICS 2006, volume 4307 of LNCS, pages Springer, [18] S. Nikova, V. Rijmen, and M. Schläffer. Secure Hardware Imementations of Non-Linear Functions in the Presence of Glitches. In ICISC 2008, volume 5461 of LNCS, pages Springer, [19] S. Nikova, V. Rijmen, and M. Schläffer. Secure Hardware Imementation of Nonlinear Functions in the Presence of Glitches. J. Cryptology, 24(2): , [20] E. Oswald, S. Mangard, N. Pramstaller, and V. Rijmen. A Side-Channel Analysis Resistant Description of the AES S-Box. In FSE 2005, volume 3557 of LNCS, pages Springer, [21] A. Poschmann, A. Moradi, K. Khoo, C.-W. Lim, H. Wang, and S. Ling. Side-Channel Resistant Crypto for Less than 2, 300 GE. J. Cryptology, 24(2): , [22] E. Prouff and T. Roche. Higher-Order Glitches Free Imementation of the AES Using Secure Multi-party Computation Protocols. In CHES 2011, volume 6917 of LNCS, pages Springer, [23] A. Shamir. How to Share a Secret. Commun. ACM, 22(11): , [24] Xilinx. Constraints Guide. Available via xilinx10/books/docs/cgd/cgd.pdf, [25] Xilinx. Virtex-5 Libraries Guide for HDL Designs. Available via virtex5_hdl.pdf, September

Time-Memory Trade-Offs for Side-Channel Resistant Implementations of Block Ciphers. Praveen Vadnala

Time-Memory Trade-Offs for Side-Channel Resistant Implementations of Block Ciphers Praveen Vadnala Differential Power Analysis Implementations of cryptographic systems leak Leaks from bit 1 and bit 0 are