High Speed Serial Serial Multiplier
…
9 pages
Sign up for access to the world's latest research
Abstract
The International Review on Computers and Software (IRECOS) is a publication of the Praise Worthy Prize S.r.l.. The Review is published monthly, appearing on the last day of every month.
Key takeaways
AI
AI
- The proposed HS-SS multiplier enhances speed by operating on both clock edges.
- Power and area reductions of 13.6% to 21.1% versus existing designs are achieved.
- High-speed multiplication is critical for DSP applications including audio and navigation.
- Partial product generation and reduction significantly influence overall multiplier performance.
- The HS-SS multiplier successfully integrates into an Orthogonal Frequency Division Multiplexing (OFDM) system.







Related papers
Multiplication is major concern in computations like signal processing, audio video applications. Performing multiplication on Fixed and floating point data is a more time consuming action and necessitates huge quantity of giving out time. By civilizing the speed of multiplication job overall speed of the system can be boosted. The Holdup in the floating point multiplication procedure is the multiplication of mantissas which wants 53*53 bit integer multiplier aimed at double precision floating point numbers, Vedic and Canonic Signed digit set of rules exists to associated parameters like speed, complication of routing, pipelining, source required on FPGA. The comparison showed that Signed Digit Algorithm and Booth Multiplication Algorithm is better than Vedic algorithm in terms of speed and capitals required on spartan3 FPGA.
IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2011
A novel approach of designing serial-serial hybrid multiplier is proposed for applications with high data sampling rate (4 GHz). The conventional way of partial product formation is revamped. Our proposed technique effectively forms the entire partial product matrix in just sampling cycles for an multiplication instead of at least 2 cycles in the conventional serial-serial multipliers. It achieves a high bit sampling rate by replacing conventional full adders and 5:3 counters with asynchronous 1's counters so that the critical path is limited to only an AND gate and a D flip-flop (DFF). The use of 1's counter to column compress the partial products preliminarily reduces the height of the partial product matrix from to log 2 + 1, resulting in a significant complexity reduction of the resultant adder tree. The proposed hybrid column compressed multiplier consists of a serial-serial data accumulation unit and a parallel carry save adder (CSA) array that occupies approximately 35% and 58% less silicon area than the full CSA array multiplier with operands of wordlength 32 32 and 64 64, respectively. The post-layout simulation results based on 90-nm seven metal single poly CMOS process technology shows that our 64 64 multiplier dissipates 39% less average power at a sampling rate of 4 GHz, and has only 11% additional delay penalty to complete a multiplication compared to the conventional fully parallel CSA array multiplier. Index Terms-Binary multiplication, on-chip serial-link bus architecture, on-the-fly accumulation, parallel multipliers, serial-serial and serial-parallel multiplier.
Terms of both latency and power Digit-serial implementation styles are best suited for implementation of digital signal processing systems which require moderate sampling rates. Digit-serial architectures obtain using traditional unfolding techniques cannot be pipelined beyond a certain level because of the presence of feedback loops. In this paper, an alternative approach for the design of digit-serial architectures is presented based on a novel design methodology. This methodology permits bit-level pipelining of the digit-serial architectures by moving all feedback loops to the last stage of the design. This enables bit-level pipelining of digit-serial architectures, thereby achieving sample speeds close to corresponding bit-parallel multipliers with lower area. This increased sample speed can be traded with reduction in power supply voltage resulting in significant reduction in power consumption. The proposed approach is applied to the design of various multipliers which form the backbone of digital signal processing computations. The results show that for transformed multipliers with smaller digit sizes (_4), the singly-redundant multiplier consumes the least power, and for larger digit sizes, the type-I multiplier consumes the least power. It is also found that the optimum digit size for least power consumption in type-I and type-III multipliers is _p2W, where W represents the word length. Among the bit-level pipelined digit-serial multipliers, it is found that the redundant multiplier offers the best choice in consumption The proposed digit-serial multipliers consume on average 20% lower power than the traditional digit-serial architectures for the non pipelined case and about 5–15 times lower power for the bit-level pipelined case.
In the field of embedded electronics the information of circuits (processors and other integrated circuits) is vital. In order for such connection to be established a common protocol must be assigned to them one of such is the serial communication protocol. Serial communication is common method of transmitting data between a computer and a peripheral device such as a programmable instrument or even another computer. Serial communication transmits data one bit at a time, sequentially, over a single communication line to a receiver. Serial is also a most popular communication protocol that is used by many devices for instrumentation; Numerous devices also come with an RS232 based port that is based on this protocol. However, because of the rapid evolution of this protocol its counter part the parallel communication protocol is not commonly used because the standards that are available with serial communication can easily achieve the original purpose of a parallel communication protocol which is primarily a high relative throughput. Arduino is an open-source platform used for building electronics projects. Arduino consists of both a physical programmable circuit board (often referred to as a microcontroller) and a piece of software, or IDE (Integrated Development Environment) that runs on your computer, used to write and upload computer code to the physical board. This paper has the aim to look at serial communication and how Arduino utilizes such protocol in creating complex and vital embedded electronics. This paper will achieve this by giving and overview on Arduino, serial communication and then briefly explain how Arduino utilises such feature.
Electronics Letters, 1998
IEEE Communications Magazine, 2001
The architecture and critical circuit design issues for high-speed serial data links operating in excess of 1 Gb/s are described. Trade-offs in power vs. performance are presented for SONET/SDH transceivers and backplane transceivers for Infiniband or similar standards.
Proceedings of the Tenth International Conference on Microelectronics (Cat. No.98EX186)
In this paper, a new design of serial-paiallel multipliers based on the Modified Booth algorithm is proposed. The proposed multiplier implementation on a PCI initiztorrnarget Interface Card (where a single chip design, XC4010E, is used to implement fi PCI Local Bus interface, revision 2.1, protocol and timing compliance) is demonsterated. The flexible design of the PCI-interface can be easily adapted for specific interface requirements. This interface card can be used for different DSP applications. The multiplier is compared with one proposed recently in terms of speed and hardware, and is shown to be faster (for large multipliers, it tends to provide double speed), and having a lower area x time complexity. An implementation using a FPGA-based PCI-interface Card have been made for an 8x8 multiplier. The design was tested using simulation software.
2014 International Conference on Signal Processing and Integrated Networks (SPIN), 2014
The paper proposes a systematic design methodology for bit-serial multiplication. The proposed approach is a modified method for performing traditional multiplication. This paper presents a general technique for NxN bit-serial multiplication used in signal processing. HDL implementation and simulation of 4x4 bit-serial multiplier is discussed. Synthesis is performed using Xilinx ISE with Virtex-4 ML402 FPGA board.
Multiplication or repeated addition is the basic operation used in both Mathematics and Science. The speed of multiplier determines the speed of all Digital Signal Processors. This paper describes four multipliers that is Modified Booth Multiplier, Vedic Multiplier (Urdhva Tiryakbhyam Sutra), Wallace Multiplier and Dadda Multiplier. In Modified Booth Multiplier Algorithm ‘0’ is appended to the right of LSB and then three bits starting from the LSB are grouped as a set to decide the partial product. So as to increase its speed, desk calculators are used that perform the operation of shifting faster. Urdhva Tiryakbhyam Sutra is performed by two multiplication techniques that is straight above multiplication and diagonal multiplication. Then finally, their sum is taken. Here reduction of multi bit multiplication to single bit multiplication takes place followed by the process of addition. Since there is only one step generation of partial product, carry propagation from LSB to MSB is reduced. There are three steps in which both Wallace and Dadda Multiplier work. They are partial product formation, reduction of partial products formed to two rows and addition of these two rows using carry propagation adder. Wallace Multiplier reduces all possible partial products and so has a smaller carry propagating adder whereas Dadda Multiplier performs only minimum necessary reduction and thus has a larger carry propagating adder. But as far as speed is concerned Dadda Multiplier is faster than Wallace Multiplier. Keywords- Modified Booth Multiplier, Vedic Multiplier, Wallace Multiplier, Dadda Multiplier, partial products.
International Journal of Emerging Trends in Engineering Research, 2023
The I/O (Input Output) module conveys the information between I/O device and processor. I/O devices are majorly of two types: Parallel I/O and Serial I/O. Parallel I/O performs multiple I/O operations simultaneously. Due to this speed and higher bandwidths are achieved, but the usage of parallel I/O devices is decreasing as time progresses because it involves complex design due to the usage of multiple wires for the transmission hence only limited to usage in shorter distances. It also uses a greater number of pins compared to serial I/O for the same number of data bits which makes its usage problematic in higher level devices. Serial I/O transmits individual data bits sequentially. It uses lesser number of lines for data transmission thereby reducing the design complexity. Since, the data transmission is sequential the signal delay increases. Thus, this project aims to develop a protocol which achieves High Speed Serial I/O which helps to increase the data rate from Mbps to Gbps, decrease the design complexity, to design hardware using fewer number of pins on PCB and reduce signal delay to maximum extent possible.
References (27)
- R.F. Lyon, Two's compliment pipeline multipliers, IEEE Trans Commun., Vol .COM-24 no 4 pp 418-425, April 1976.
- R.Gnanasekaran, On a bit-serial input and bit serial output multiplier IEEE Trans. Comput Vol C-32 no.9 pp 878 -880 Sept 1983.
- L.Dadda, On Serial-input multipliers for two's complement functions IEEE Trans. Comput Vol 38 no 9, pp 1341-1345 Sept 1989.
- N.Kanapoulos, A bit serial architecture for digital signal processing, IEEE Trans. Circuits Sysf. Vol. CAS -32 no 3, pp 289- 291 ,1985
- A.D.Booth,A.D, A signed binary multiplication technique , Quarterly J.Mechan, Appl. Math. Vol 4, no 2, pp 236 -240, Aug 1951
- R.Menon and D.Radhakrishanan, High Performance 5:2 compressor CMOS 4-2 and 5-2 compressor for fast arithmetic circuits IEEE Trans. Circuits Systs. I Reg Papers Vol 51, no 10, pp 1985 -1997, Oct 2004.
- A.Aggoun, A.Ashur and M.K.Ibrahim, Area-time efficient serialserial multipliers, in Proc. IEEE Conf. Circuits Syst. (ISCAS), Geneva, Switzerland, 2000, pp. 585-588.
- K.Z.Pekmestzi, P.Kalivas and N.Moshopoulos, Long unsigned number systolic serial multipliers and squarers, IEEE Trans. Circuits Syst. II, Brief Papers, Vol. 48, no. 3, pp. 316-321, Mar. 2001.
- O.Nibouche, A.Bouridarie and M.Nibouche, New architectures for serial-serial multiplication, in Proc. IEEE Conf. Circuits Syst. (ISCAS), Sydney, Australia, 2001, Vol. 2, pp. 705-708.
- M.R.Meher, C.C.Jong and C.H.Chang, A High Bit Rate Serial- Serial Multiplier With On-the-Fly Accumulation by Asynchronous Counters, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Vol. 19, no. 10, pp no,1733-1745 October 2011.
- P.Denyer and D.Renshaw, VLSI Signal Processing: A Bit-Serial Approach, Addison-Wesley, 1985.
- S.Sunder, F.El-Guibaly and A.Antoniou, Two's-complement fast serial-parallel multiplier, in Proc. Inst. Elect. Eng.-Circuits Devices Syst., Vol. 142, 1995.
- I.Hussian and C.Prasanna Raj, Design and implementation of low power and high speed autocorrelator and CORDIC Architecture for OFDM, IACSIT International Journal of Engineering and Technology, Vol. 3, No. 3, 2011.
- H.Saleh, A.H.Khalil, M.A.Ashour and A.Salama, Novel serial- parallel multipliers, IEEE Proc-Circuits Devices Syst., Vol. 148, no. 4, pp. 183-189, Aug. 2001.
- Sriadibhatla Sridevi, Appsani V. V. S. Chowdary, Low Power Pilpelined FIR Filter with Enhanced Row Bypassing Multiplier, (2011) International Journal on Communications Antenna and Propagation (IRECAP), 1 (1), pp. 132-135.
- Ghandi F. Manasra, Mais S. Ibrahim, Yaser H. Abdallah, Bit Error Rate Reduction Using Spatial-Temporal Diversity by Employing OFDM Techniques, (2011) International Journal on Communications Antenna and Propagation (IRECAP), 1 (4), pp. 353-365.
- Mattera, D., Tanda, M., Preamble-based synchronization for OFDM/OQAM systems, (2011) European Signal Processing Conference, pp. 1598-1602.
- Mattera, D., Tanda, M., Data-aided synchronization for OFDM/OQAM systems, (2012) Signal Processing, 92 (9), pp. 2284-2292.
- Mattera, D., Tanda, M., A new method for blind synchronization for OFDM/OQAM systems, (2011) ISPA 2011 -7th International Symposium on Image and Signal Processing and Analysis, art. no. 6046578, pp. 46-51.
- Mattera, D., Tanda, M., Blind symbol timing and CFO estimation for OFDM/OQAM systems, (2013) IEEE Transactions on Wireless Communications, 12 (1), art. no. 6397549, pp. 268-277.
- Mattera, D., Tanda, M., Bellanger, M., Frequency-spreading implementation of OFDM/OQAM systems, (2012) Proceedings of the International Symposium on Wireless Communication Systems, art. no. 6328353, pp. 176-180.
- Improving Search Results Through Reducing Replica in User Profile by P. Srinivasan, K. Batri 2444 An Evaluation of the Movie Song Browser System Among IT and Non-IT Users by Munauwarah, Nazlena Mohamad Ali, Hyowon Lee 2453 An Access Control Model of Web Services Based on Multifactor Trust Management by R. Joseph Manoj, A. Chandrasekhar 2460 Performance Evaluation of the Hearing Impaired Speech Recognition in Noisy Environment by C. Jeyalakshmi, V. Krishnamurthi, A. Revathy 2467 SEVALERPS a New EX-ANTE Multi-Criteria Method for ERP Selection by Abdelilah Khaled, Mohammed Abdou Janati-Idrissi 2477
- A Novel Expert System in Hospital Location Analysis with the Aid of Adaptive Artificial Bee Colony (AABC) by K. Janaki, N. Radhakrishnan 2486 Design of High Speed Serial-Serial Multiplier for OFDM Applications by N. Saravanakumar, A. Nirmal Kumar, K. N. Vijeyakumar, M. K. Ananda Moorthy 2495 Feature Based Image Retrieval Using Fused Sift and Surf Features by V. Vijayarajan, M. Dinakaran 2500 A New Multibiometric Identification Method Based on a Decision Tree and a Parallel Processing Strategy by Kamel Aizi, Mohamed Ouslim 2507
- Computed Tomography Images Restoration Using Anisotropic Diffusion Regularization by Faouzi Benzarti, Hamid Amiri 2515 Secure Medical Image Retrieval Using Dynamic Binary Encoded Watermark by A. Umaamaheshvari, K. Thanushkodi 2521
- Microarray Gene Expression and Multiclass Cancer Classification Using Improved PSO Based Evolutionary Fuzzy ELM Classifier with ICGA Gene Selection by T. Karthikeyan, R. Balakrishnan 2532 Comparative Analysis of Intrusion Detection System with Mining by S. Vinila Jinny, J. Jayakumari 2540 Enhanced Distributed Text Document Clustering Based on Semantics by J. E. Judith, J. Jayakumari 2545 Abstracting and Indexing Information: Cambridge Scientific Abstracts (CSA/CIG) Academic Search Complete (EBSCO Information Services) Elsevier Bibliographic Database -SCOPUS Index Copernicus (Journal Master List): Impact Factor 6.14
- Autorizzazione del Tribunale di Napoli n. 59 del 30/06/2006
- R E P R I N T
FAQs
AI
What performance improvements does the HS-SS multiplier offer over traditional designs?add
The HS-SS multiplier shows a delay reduction compared to Lyon's, conventional, Kanapaulous, and Meher designs, achieving up to 21.1% area reduction and 13.6% power savings.
How does the HS-SS multiplier handle partial product generation efficiently?add
It uses a Cycle Tracker for term generation, producing 2n+1 terms per cycle, minimizing the number of gates required for partial products.
What techniques contribute to the reduced latency in the HS-SS multiplier?add
The design employs Dadda's methodology for partial product reduction, and operates on both clock edges to enhance speed.
Which specific applications benefit from the HS-SS multiplier implementation?add
The proposed multiplier is verified within an auto-correlator unit for Orthogonal Frequency Division Multiplexing, showcasing its applicability in DSP tasks.
In what way does the HS-SS multiplier increase throughput?add
By employing pipelining and parallel processing of inputs, the multiplier outputs one bit of the result per clock cycle, thus enhancing overall throughput.
saravana kumar