Academia.eduAcademia.edu

Modular multiplication

description473 papers
group27 followers
lightbulbAbout this topic
Modular multiplication is a mathematical operation that computes the product of two integers and then reduces the result modulo a third integer. This operation is fundamental in number theory and is widely used in cryptography, computer science, and algorithm design to manage large numbers efficiently.
lightbulbAbout this topic
Modular multiplication is a mathematical operation that computes the product of two integers and then reduces the result modulo a third integer. This operation is fundamental in number theory and is widely used in cryptography, computer science, and algorithm design to manage large numbers efficiently.

Key research themes

1. What hardware architectures optimize Montgomery modular multiplication for cryptographic applications?

This research area focuses on designing and implementing efficient hardware architectures for Montgomery modular multiplication, which is a critical operation in RSA and other cryptosystems. The investigations aim at optimizing parameters such as area, latency, clock cycles, and throughput on FPGA and ASIC platforms, balancing trade-offs between resource utilization and performance. These efforts are crucial because Montgomery multiplication directly impacts encryption/decryption speed and hardware feasibility for large bit-length operands.

Key finding: Presented a Verilog HDL prototype of iterative Montgomery modular multiplication with a 2048-bit operand size on an Altera FPGA, achieving best performance measured by area-time squared (AT^2) metric with 2048 clock cycles... Read more
Key finding: Proposed two architectures based on modified serial Montgomery algorithm optimized for 2048-bit RSA operands, achieving latency of 2048 clock cycles and area of 19010 logic cells in one design variant, and 36864 clock cycles... Read more
Key finding: Developed generic, reconfigurable architectures using carry save adders (CSAs) including a novel four-to-two CSA with extra registers to lower critical path delays in Montgomery multipliers scaled to 512-2048 bits.... Read more
Key finding: Introduced a one-dimensional bit-parallel, semi-systolic Montgomery multiplier optimized for GF(2^m) fields targeting resource-constrained IoT edge nodes, achieving significant delay reduction while marginally lowering area... Read more
Key finding: Presented an FPGA systolic array implementation of Montgomery multiplication over GF(2^m) for field sizes m=160-193, achieving a clock frequency independent of operand size and supporting irreducible polynomials beyond... Read more

2. How do specialized arithmetic methods and number systems improve modular multiplication performance?

This theme addresses algorithmic techniques such as Residue Number System (RNS), specific modular reduction methods, new recoding methods, and approximate computations to accelerate modular multiplication beyond classical approaches. These methods enhance parallelism, reduce carry propagation and hardware complexity, and optimize performance in both software and hardware settings. Such developments also contribute to post-quantum cryptography and digital signature schemes by enabling efficient modular arithmetic over large integers.

Key finding: Proposed an approximate modular multiplication method based on RNS that substitutes expensive modulo operations by rapid bit shifts and low-bit extraction, achieving average time savings of 75% and area reductions of 80%... Read more
Key finding: Introduced a 4-moduli RNS set offering maximum dynamic range for even n, along with an efficient, memory-less reverse converter design based on new CRT-I and mixed-radix conversion (MRC). Evaluated modular multiplication... Read more
Key finding: Presented reorganized RNS Montgomery exponentiation algorithms with operations and precomputations structured to reduce costly multiplications, complemented by tailored arithmetic cell architectures exploiting pipelining and... Read more
Key finding: Developed a direct modular multiplication algorithm utilizing redundant digit division to match Montgomery multiplication speed without requiring residue domain transformations, thus eliminating transformation overhead.... Read more
Key finding: Proposed a new radix-2 Montgomery multiplication algorithm modified with a novel recoding technique based on bitwise subtraction operator to simplify and speed up critical loop computations. Implemented with carry save... Read more

3. Which modular reduction methods and operand scanning algorithms enable fast modular multiplication in cryptosystems?

This research area explores modular reduction techniques and operand scanning methods such as interleaved modular multiplication, sign detection-based optimizations, Barrett and Montgomery reductions, and special prime modulus selections to accelerate modular multiplication. These methods lower computational complexity by reducing division operations, enabling effective hardware and software implementations for high-security parameter sizes. The underlying goal is to minimize latency in modular arithmetic computations underpinning public key cryptography.

Key finding: Identified four new special modulus classes based on Mersenne and Quasi-Mersenne primes enabling folding of modular reduction into efficient addition and shift operations within the Modified Barrett algorithm. Achieved... Read more
Key finding: Introduced a 2-bits-per-iteration interleaved modular multiplication architecture that replaces costly magnitude comparisons with efficient sign detection to determine modular reduction steps. For 1024-bit operands, the... Read more
Key finding: Advocated separation of multiplication and modular reduction to optimize reduction independently, and presented methods focusing on speeding up the costly modular reduction stage for 500-bit large integers. Emphasized the... Read more
Key finding: Proposed a novel modular multiplication algorithm (HFFM) and an interleaved multi-pipelined hardware architecture tailored for the special prime moduli used in Supersingular Isogeny Diffie-Hellman (SIDH) cryptosystems. The... Read more
Key finding: Developed a modular multiplication algorithm using precomputations with a constant modulus to reduce processor computations in software implementations. This approach speeds up modular multiplications by a factor of 1.5 to 2... Read more

All papers in Modular multiplication

Logical cryptanalysis has been introduced by Massacci and Marraro as a general framework for encoding properties of crypto-algorithms into SAT problems, with the aim of generating SAT benchmarks that are controllable and that share the... more
We study the general nature of the group generated by the symmetries of regular n-sided polygon D n , described as a simple non-abelian finite group. The two symmetry elements r (rotation) and f (reflection) generates the group D n , and... more
This paper discusses several Montgomery multiplication algorithms, two of which h a ve been proposed before. We describe three additional algorithms, and analyze in detail the space and time requirements of all ve methods. These... more
As the use of electronic voting systems and e-commerce systems increases, the efficient batch verification of digital signatures becomes more and more important. In this paper, we first propose a new method to identify bad signatures in... more
In this paper we highlight the benefits of using genus 2 curves in public-key cryptography. Compared to the standardized genus 1 curves, or elliptic curves, arithmetic on genus 2 curves is typically more involved but allows us to work... more
Two’s complement overflow detection is traditionally performed by evaluating the carry into and out of the sign bit after full binary addition, or equivalently by comparing operand and result signs. While exact and efficient in hardware,... more
It is widely acknowledged that efficient modular multiplication is a key to high-performance implementation of public-key cryptography, be it classical RSA, Diffie-Hellman, or (hyper-) elliptic curve algorithms. In the recent decade,... more
Our aim is to study and implement an AM RECEIVER based on Super heterodyne principle virtually used in all modern radio and television receivers.This mainly involves the use ofheterodyning. The signal from the antenna is filtered... more
Most implementations of the modular exponentiation, M E mod N , computation in cryptographic algorithms employ Montgomery multiplication, ABR -1 mod N , instead of modular multiplication, AB mod N , even the former requires some... more
We &scribe a cryptographic library for the Motorola DSP56000 that provides harahre speed yet softwcue&xibility. The library includes modular arithmetic, DES, message digest and other methods. Montgomery modular reduction to give a very... more
The study sought and found algorithms for count of all odd composites under or up to a natural number x . The significance of the algorithm of count of odd composites is that it affords use of the prime number equation (or prime counting... more
Rivest-Shamir-Adleman (RSA) cryptosystem uses modular multiplication for encryption and decryption. So, performance of RSA can be drastically improved by optimizing modular multiplication. This paper proposes a new parallel, high-radix... more
Resumen: Este artículo sugiere diversas alternativas para la implementación en hardware del operador Multiplicación Modular. Se propone una función de costo para evaluar las alternativas de diseño y se les compara con el objeto de... more
In order to prevent the SPA (Simple Power Analysis) attack against modular exponentiation algorithms, a multiply-always implementation is generally used. Witteman et al. introduced in [14] a new cross-correlation power analysis attack... more
Improving software algorithms are not easy task, especially for increasing operating speed, and reducing complexity. Different algorithms implemented in cryptosystems used the exponentiation modular arithmetic, however, they suffer very... more
Logical cryptanalysis has been introduced by Massacci and Marraro as a general framework for encoding properties of crypto-algorithms into SAT problems, with the aim of generating SAT benchmarks that are controllable and that share the... more
Montgomery multiplication is one of the fundamental operations used in cryptographic algorithms, such as RSA and Elliptic Curve Cryptosystems. At CHES 1999, Tenca and Koç introduced a nowclassical architecture for implementing Montgomery... more
This paper investigates performance and energy characteristics of software algorithms for long integer arithmetic. We analyze and compare the number of RISC-like processor instructions (e.g. singleprecision multiplication, addition, load,... more
The paper examines the question of how learning multiple tasks interacts with neural architectures and the flow of information through those architectures. It approaches the question by using the idealization of an artificial neural... more
In this paper, new structures that implement the RSA cryptographic algorithm are presented. The core of these architectures is the modular exponential operation based on a modified Montgomery modular multiplier, where the operations of... more
The new modular multiplier structures proposed in this paper are based on a short precision magnitude comparison instead of the full magnitude comparison operation. Another feature of these structures is that the comparison operations are... more
‫ـ‬ ‫ـــ‬ Finding multiplicative inverse (Modular Inversion) operation is the most time-consuming operation in Elliptic Curve Crypto-system (ECC) operations which affects the performance of ECC. Moreover, several factors that affect the... more
This paper describes a hardware architecture for modular multiplication operation which is efficient for bit-lengths suitable for both commonly used types of Public Key Cryptography (PKC) i.e. ECC and RSA Cryptosystems. The challenge of... more
This paper describes an efficient FPGA implementation for modular multiplication in the finite field GF(2 m) that is suitable for implementing Elliptic Curve Cryptosystems. We have developed a systolic array implementation of a Montgomery... more
This paper describes a hardware implementation of an arithmetic processor which is efficient for bit-lengths suitable for both commonly used types of Public Key Cryptography (PKC), i.e., Elliptic Curve (EC) and RSA Cryptosystems.... more
This paper describes a hardware implementation of an arithmetic processor which is efficient for elliptic curve (EC) cryptosystems, which are becoming increasingly popular as an alternative for public key cryptosystems based on factoring.... more
We introduce a set of four twisted Edwards curves that satisfy common security requirements and allow for fast implementations of scalar multiplication on 8, 16, and 32-bit processors. Our curves are defined by an equation of the form −x... more
In last years, many modulus sets in the Residue Number System (RNS) for increasing Dynamic Range (DR) and parallelism are presented. Hence, for reach these purposes a new 5-Moduli Set for even n, and its efficient reverse converter design... more
In last years, many modulus sets in the Residue Number System (RNS) for increasing Dynamic Range (DR) and parallelism are presented. Hence, for reach these purposes a new 5-Moduli Set for even n, and its efficient reverse converter design... more
In order to prevent the SPA (Simple Power Analysis) attack against modular exponentiation algorithms, a multiply-always implementation is generally used. Witteman et al. introduced in [14] a new cross-correlation power analysis attack... more
In order to prevent the SPA (Simple Power Analysis) attack against modular exponentiation algorithms, a multiply-always implementation is generally used. Witteman et al. introduced in [14] a new cross-correlation power analysis attack... more
We consider the problem of exactly computing the number of integers in a Chinese Remainder Representation (crr) whose pseudorank does not equal the rank. We call this number the census. The rank is key in developing crr-intrinsic methods... more
We consider a speciÿc class of satisÿability (SAT) problems, the conjunctions of (nested) equivalencies (CoE). It is well known that CNF (conjunctive normal form) translations of CoE formulas are hard for branching and resolution... more
Contents vii 5.4.2 A branch and cut framework. 5.4.3 Numerical experiments. 5.4.4 Cutting planes 5.4.5 A note on solving SAT problems via MAX2SAT 5.5 A semidefinite relaxation of the SAT problem 5.5.1 A sufficient condition for... more
This paper describes a differential electromagnetic analysis attack performed on a hardware implementation of an elliptic curve cryptosystem. We describe the use of the distance of mean test. The number of measurements needed to get a... more
This paper presents a scalable hardware implementation of both commonly used public key cryptosystems, RSA and Elliptic Curve Cryptosystem (ECC) on the same platform. The introduced hardware accelerator features a design which can be... more
This article discusses some of the practical issues of studying the basic principles of the four-stroke diesel engine operation. Four-stroke cycle is logical and intuitive. All four obvious steps necessary to implement the full working... more
Arithmetic operations are generally slowest operations in digital design which is the bottleneck in most of the systems. Optimizing adder circuits provides faster performance in arithmetic circuits. Field Programmable Gate Arrays (FPGA)... more
Download research papers for free!