VLSI UNIVERSE: Clock gating

The dynamic power associated with any circuit is related to the amount of switching activity and the total capacitive load. In digital VLSI designs, the most frequently switching element are clock elements (buffers and other gates used to transport clock signal to all the synchronous elements in the design). In some of the designs, clock switching power may be contributing as high as 50% of the total power. Power being a very critical aspect, we need to make efforts to reduce this. Any effort that can be made to save the clock elements toggling can help in reducing the total power by a significant amount. Clock gating is one of the techniques used to save the dynamic power of clock elements in the design.

Principle behind clock gating: The principle behind clock gating is to stop the clock of those sequential elements whose data is not toggling. RTL level code talks only about data transfer. It may have some condition wherein a flip-flop will not toggle its output if that condition is met. Figure 1 below shows such a condition. In it, FF1's output will remain stable as long as EN = 0. On the right hand side, its equivalent circuit is provided, wherein EN has been translated into an AND gate in the clock path.This is a very simplistic version of what modern-day synthesis tools do to implement clock gating.

Figure 1: Clock gating implementation

Implications of clock gating: The implementation of clock gating, as expected, is not so simple. There are multiple things to be taken into account, some of which are:

Timing of enable (EN) signal: The gating of clock can cause a glitch in clock, if not taken care of by architectural implementation. Clock gating checks discusses what all needs to be taken care of as regards timing in clock gating implementation.

Area/power/latency trade-off: As is shown in figure 1, clock gating transfers a data-path logic into clock path. This can increase overall clock latency. Also, area penalty can be there, if the area of clock gating structure is more. Power can also increase, instead of decreasing, if only 1-2 flops' structure is replaced by clock gating (depending upon the switching power of clock gating structure vs those inside flip-flop). Normally, a bunch of flops with similar EN condition are chosen, and a common clock gating is inserted for those, thereby minimizing area and power penalties.

Also read:

Clock gating interview questions

8 comments:

JN Raju12 May 2020 at 21:00
i think clock gates would save on area too, as many muxes before the flops are avoided,
generally there is a min bandwidth in synthesis, usually 3, below which syntheis tool will not insert clock gating,
Unknown16 November 2020 at 05:21
Hi,

Is that possible to reduce the duty cycle of clock from 50% to 30% or even less than that? This in-turn reduce the duration of time for which the transistor is being ON and consecutively leads to dynamic power saving. Correct me if am wrong.
Thanks in advance
Anonymous3 February 2021 at 08:01
Nicely explained. Short and crisp.
Anonymous13 June 2023 at 04:53
why we are using and gate while doing clock gating
Semiconductor Company in India30 March 2025 at 01:29
Nice Blog. Thanks for sharing

Exploring the Role of Semiconductors in IoT and Smart Devices

What are the common approaches to ASIC Verification?

The Impact of ASICs on the Evolution of Satellite Technology

Understanding the differences between clock gating and power gating

Understanding the Chip Design Process

Thanks for your valuable inputs/feedbacks. :-)

Clock gating - basics

8 comments: