Power comparison of flow-graph and distributed arithmetic based DCT architectures

Martin Kuhlmann, Keshab K. Parhi

Research output: Contribution to journalConference articlepeer-review

13 Scopus citations


The Discrete Cosine Transform (DCT) is widely used in image and video compression systems. Two popular approaches to implementation of DCT algorithms include use of distributed arithmetic and flow-graphs based on fast algorithms. The Distributed arithmetic architectures (DAA) have been widely used in many system implementations, due to their low latency and area requirements. However, no systematic study of power, area and latency tradeoffs of the DAA and the FGA have been studied. This paper presents a systematic study of area, latency and power consumption of these two alternate architectures. It is concluded that the flow-graph architecture consumes about 39% less power compared to the distributed arithmetic architecture, at the expenses of 28% more area and a 3.75 times increase in latency. Alternatively, by reducing the level of pipelining in the flowgraph architecture the implementation consumes 13% less power, at the expenses of 20% more area and a tow times increase in latency. These results have been obtained by estimating the power consumption on actual layouts with effects of parasitic capacitance included as opposed to estimation of power consumption on schematics.

Original languageEnglish (US)
Pages (from-to)1214-1219
Number of pages6
JournalConference Record of the Asilomar Conference on Signals, Systems and Computers
StatePublished - Dec 1 1998
EventProceedings of the 1998 32nd Asilomar Conference on Signals, Systems & Computers. Part 1 (of 2) - Pacific Grove, CA, USA
Duration: Nov 1 1998Nov 4 1998


Dive into the research topics of 'Power comparison of flow-graph and distributed arithmetic based DCT architectures'. Together they form a unique fingerprint.

Cite this