

# CSE460: VLSI Design

## Practice problems

More problems will be added to this doc as we progress through the semester.

### Introduction to VLSI Design, History and Timeline

1. What is Moore's law? Briefly explain using appropriate diagrams.
2. Suppose, you are building a digital system that has 4 inputs ( $x_1, x_2, x_3, x_4$ ) and you want to produce the output 1 only if the combination of the inputs are 0, 1, 0, 1, respectively. Design a circuit that implements the system using logic gates.
3. Design the simplest circuit that has three inputs,  $x_1, x_2$ , and  $x_3$ , which produces an output value of 1 whenever two or more of the input variables have the value 1; otherwise, the output has to be 0. (Using karnaugh maps)
4. Design a circuit with output  $f$  and inputs  $x_1, x_0, y_1$ , and  $y_0$ . Let  $X = x_1x_0$  and  $Y = y_1y_0$  represent two 2-digit binary numbers. The output  $f$  should be 1 if the numbers represented by  $X$  and  $Y$  are equal. Otherwise,  $f$  should be 0.
5. Practice designing different logic circuits using logic gates for arbitrary logic functions.

### Introduction to CMOS technology and CMOS circuits

1. Why do we use impure (doped) silicon as our substrate for building MOS transistors?
2. Explain the following general structure of a CMOS logic gate, mentioning the different behaviours for different combinations of the networks:



3. Design CMOS compound gates that implements the following functions:
  - a.  $Y = (A \cdot B + C \cdot D) \cdot E$
  - b.  $Y = (A + B) \cdot (C + D)$
  - c.  $Y = (A \cdot B \cdot C) + D$
  - d.  $Y = \overline{A \cdot (B + C) \cdot D}$

4. What is signal strength? Explain the behavior of MOS transistors as pass transistors while mentioning which transistors produce which type of strong and weak signals.
5. Explain restoring and non-restoring properties of a device with the help of different kinds of CMOS tristate devices.
6. Explain how the complexity, cost and area requirement for a 2:1 MUX can be reduced by orders of magnitude by going from gate-level implementation to CMOS transmission gate/CMOS tristate inverter implementation.
7. Design a 2:1 MUX using CMOS tristate inverters and explain its working principle.
8. Design a positive level triggered D latch and explain its working principle.
9. Design a positive edge triggered D flip flop and explain its working principle using a suitable timing diagram.
10. Practice designing CMOS logic gates for arbitrary logic functions.

## Pass Transistor

1. Give an expression for the output voltage for the pass transistor networks:



2. Suppose  $V_{DD} = 1.2 \text{ V}$  and  $V_t = 0.4 \text{ V}$ . Determine  $V_{out}$  for the following cases:
  - a)  $V_{in} = 0 \text{ V} \rightarrow 0\text{V}$
  - b)  $V_{in} = 0.6 \text{ V} \rightarrow 0.6\text{V}$
  - c)  $V_{in} = 0.9 \text{ V} \rightarrow$
  - d)  $V_{in} = 1.2 \text{ V}$ .



3. From the following figure, find the value of the voltages at points X and Y if  $V_{SS} = 0$  V and  $V_{tp} = -0.3$  V. Why do X and Y differ?



4. From the following figure, find the voltages at each of the nodes, A, B, C, D, E and F. Assume,  $V_{dd} = 5$  V,  $Gnd = 0$  V,  $V_{tn} = 0.5$  V and  $|V_{tp}| = 1.5$  V.



5. From the following figures, find the voltages at each of the nodes, A, B, C, D, E. Assume,  $V_{dd} = 1$  V,  $Gnd = 0$  V,  $V_{tn} = 0.2$  V and  $|V_{tp}| = 0.2$  V.



- 6.

7. What is a sequential circuit? Give a few practical examples of finite state machines.
8. Using a block diagram, explain the working mechanism of a general sequential circuit.
9. What is the difference between a Moore type FSM and a Mealy type FSM?

## Finite State Machines

1. Derive the Moore type state diagram, state table, state assigned table and the final circuit for the following systems:
  - a. A modulo-6 up counter (i.e. a counter that counts 0,1,2,3,4,5,0,1,2,...).
    - i. It should have a 1-bit input  $w$ , a clock signal and a multi-bit output  $z$ .
    - ii. The machine should increase its count whenever  $w = 1$ , and hold its previous count otherwise.
    - iii. At each counting step the machine should output the value of the count in binary.
  - b. A machine that detects the following pattern in its input ( $w$ ) : 1, 0, 1.
    - i. Repeating and overlapping sequences should be detected.
    - ii. The machine should generate the output  $z = 1$  if in the past 3 clock cycles 101 pattern was detected, and  $z = 0$  otherwise.
  - c. A machine that has the following state table:

| Present state | Next state |         | Output<br>$z$ |
|---------------|------------|---------|---------------|
|               | $w = 0$    | $w = 1$ |               |
| A             | A          | B       | 0             |
| B             | A          | C       | 0             |
| C             | A          | D       | 0             |
| D             | A          | D       | 1             |

- d. An FSM that has an input  $w$  and an output  $z$ . The machine has to generate  $z = 1$  when the previous four values of  $w$  were 1001 or 1111; otherwise,  $z = 0$ . Overlapping input patterns are allowed. An example of the desired behavior is:

$w : 010111100110011111$

$z : 000000100100010011$

1. Explain different types of encoding schemes for FSMs mentioning their advantages and disadvantages.
2. Practice designing Moore type state diagrams for different scenarios.
3. How one-hot encoded sequential circuits can be faster than moore/mealy type sequential circuits?

## VLSI Physical Design-I

1. What is the difference between floor-planning and placement?
2. The graph below (nodes *a-f*) can be optimally partitioned using the Kernighan-Lin algorithm. Perform the first pass of the algorithm. The dotted line represents the initial partitioning. Assume all nodes have the same weight and all edges have the same priority.



Note: Clearly describe each step of the algorithm. Also, show the resulting partitioning (after one pass) in graphical form.

3. The circuit below (nodes 1-6) can be optimally partitioned using the Kernighan-Lin algorithm. Transform the following circuit to its equivalent graph representation (**gates to nodes**) and perform the first iteration (swap) of the algorithm and compare the cut cost with the initial one. The dotted line represents the initial partitioning with block A (1,2,6) and B (3,4,5).



4. The graph below (nodes *a-f*) can be optimally partitioned using the **Kernighan-Lin** algorithm. Perform the first pass of the algorithm. The dotted line represents the initial

partitioning. Assume the numbers in the figure are the weights of the corresponding connections.



[Hint: Initial cut cost =  $3+1+2+4+6 = 16$ ,  
 $E_c(a) = 3$ ,  $E_{nc}(a) = 6$ ,  
 $D(a) = 3-6 = -3$ ,  
 $D(b) = (3+1)-2 = 2$ ,  
 $c(a,b) = 3$ ,  
 $g(a,b) = D(a) + D(b) - 2c(a,b) = -3 + 2 - 2 \cdot (3) = -7$ ]

5. What are the different types of routing?
6. What are the goals of Clock Tree Synthesis (CTS)?

## VLSI Physical Design-II( Lee's Maze Algorithm)

1. Find the shortest routing path from Source (S) to Target (T) using Lee's Maze algorithm. Find the memory requirements for the calculation. Dark regions are obstacles or components.



2. i. Find the shortest path for each target using Lee's maze routing.  
ii. Calculate the total memory usage.



## Delay

1. What are the origins of delay in CMOS circuits?
2. What are the major two delays that characterize combinational logic circuits?
3. Define tpdf, tpdr, tcdf, and tcdr using a suitable timing diagram.
4. Describe how one can compute the delay using transient response.
5. Come up with the equivalent nMOS & pMOS RC circuits for the RC delay model.
6. Sketch a 3-input NAND gate with transistor widths chosen to achieve effective rise and fall resistance equal to that of a unit inverter ( $R$ ). Annotate the gate with its gate and diffusion capacitances. Assume all diffusion nodes are contacted. Then sketch equivalent circuits for the falling output transition and for the worst-case rising output transition.
7. Estimate tpdf, tpdr, tcdf, and tcdr for the 3-input NAND gate from the previous problem if the output is loaded with  $h$  number of identical NAND gates.
8. Compute the Elmore delay for  $V_{out}$  in the 2nd order RC system:



9. Estimate tpd for a unit inverter driving  $m$  identical unit inverters.
10. Repeat the previous problem if the driver is  $w$  times unit size.  
If a unit transistor has  $R = 10 \text{ k}\Omega$  and  $C = 0.1 \text{ fF}$  in a 65 nm process, compute the delay, in picoseconds, of the inverter shown with a fanout of  $h = 4$ . Repeat the problem for  $h = 10$ .



11. Exercises 4.1 4.2, 4.3, 4.4, 4.5, 4.6, 4.9, 4.18, 4.19.
12. Calculate all the propagation and contamination delays for a 4-input NOR gate if it drives two inverter and a 3-input NAND gate. Assume all diffusion nodes are contacted and worst case rise and fall resistances are equal to that of a unit inverter ( $R$ ). [Draw the circuit with gate symbols at first and then, CMOS architecture to derive RC equivalent circuit]
13. Draw the simplified RC network and determine the expression for the Elmore delay,  $t_{pd}$  for node Y.



## Power

1. A digital system-on-chip in a 1 V 65 nm process (with 50 nm drawn channel lengths and  $\lambda = 25$  nm) has 1 billion transistors, of which 50 million are in logic gates and the remainder in memory arrays. The average logic transistor width is  $12 \lambda$  and the average memory transistor width is  $4 \lambda$ . The memory arrays are divided into banks and only the necessary bank is activated so the memory activity factor is 0.02. The static CMOS logic gates have an average activity factor of 0.1. Assume each transistor contributes 1 fF/ $\mu\text{m}$  of gate capacitance and 0.8 fF/ $\mu\text{m}$  of diffusion capacitance. Neglect wire capacitance for now (though it could account for a large fraction of total power). Estimate the switching power when operating at 1 GHz.
2. Consider the system-on-chip from the previous problem. Subthreshold leakage for OFF devices is 100 nA/ $\mu\text{m}$  for low-threshold devices and 10 nA/ $\mu\text{m}$  for high-threshold devices. Gate leakage is 5 nA/ $\mu\text{m}$ . Junction leakage is negligible. Memories use low-leakage devices everywhere. Logic uses low-leakage devices in all but 5% of the paths that are most critical for performance. Estimate the static power consumption.
3. You are synthesizing a chip composed of random logic with an average activity factor of 0.1. You use a standard cell process with an average switching capacitance of 450

pF/mm<sup>2</sup>. Estimate the dynamic power consumption of your chip if it has an area of 70 mm<sup>2</sup> and runs at 450 MHz at VDD = 0.9 V.

4. You are manufacturing a chip composed of random logic and memory units with average activity factors of 0.1 & 0.05 respectively. You use a standard cell process with an average switching capacitance of 450 pF/mm<sup>2</sup>. Estimate the dynamic power consumption of your chip if it has an area of 200 mm<sup>2</sup> of which 60% is occupied by the logic circuit and runs at 450 MHz at VDD = 0.9 V.
5. You are considering lowering VDD to try to save power in a static CMOS gate. You will also scale V<sub>t</sub> proportionally to maintain performance. Will dynamic power consumption go up or down?

Determine the activity factor for the signal shown. The clock rate is 1 GHz.

