



# CS & IT ENGINEERING

COMPUTER ORGANIZATION  
AND ARCHITECTURE



## IO Organization

Lecture No.- 03

By- Vishvadeep Gothi sir



# Recap of Previous Lecture



**Topic**

Peripheral Device

**Topic**

IO vs Memory Buses

**Topic**

Memory Mapped IO vs IO Mapped IO

**Topic**

Asynchronous Data Transfer

**Topic**

Modes of Transfer

# Topics to be Covered



**Topic**

DMA

**Topic**

Cycle Stealing



## Topic : DMA



- Enables data transfer between I/O and memory without CPU intervention
- Need a hardware: DMAC<sup>ontroller</sup>



## Topic : DMA





## Topic : DMA

1. Starting Address mem address from where data transfer should start

2. Data Count No. of bytes or words to be transferred  
 for byte addressable mem.  
 for word addressable mem.



|            | Initially | After 1 Byte | After 2 Byte | After 3 Byte | After 4 Byte |         |
|------------|-----------|--------------|--------------|--------------|--------------|---------|
| Start add. | 1000      | 1001         | 1002         | 1003         | 1004         | ...     |
| Data count | 200       | 199          | 198          | 197          | 196          | - - - 0 |

when data count becomes 0 then transfer stops

Data count =   4 bits

max value of data count =  $(1111)_2 = (15)_{10}$

max no. of bytes transferred with one "initializat" of DMA = 15

---

⇒ DMAC is a special purpose processor, because it generates address and control signals for data transfer between I/O & mem.

---

→ During DMA transfer CPU's status (Reg. values) will be as it is.

when DMA uses buses for data transfer then CPU can perform  
only those operations which do not require system buses.  
which means CPU will be blocked mostly.



## Topic : Modes of DMA Transfer

when CPU gives control of the buses to DMA and it takes back control.



## Topic : Modes of DMA Transfer

- 1. Burst Mode
- 2. Cycle Stealing
- 3. Interleaving DMA





## Topic : Modes of DMA Transfer



### Burst Mode :

when CPU gives control of the buses to DMAC, then a block of data is transferred.

1 block  $\Rightarrow$  512B to 4KB



## Topic : Modes of DMA Transfer

### Cycle Stealing :

slow I/o devices takes time to prepare data internally . Till then <sup>CPU keeps</sup> control of buses -  
when data is ready , then DMAC steals CPU cycles and takes control of buses to transfer prepared data to mem -



## Topic : Modes of DMA Transfer

### Interleaving DMA:

CPU gives control of the buses to DMA when CPU does not need it.  
ex:- during ALU operation, inst<sup>n</sup> decode.

Time ^ CPU is blocked due to DMA is almost zero.  
for which

Time needed to prepare data in I/O =  $t_x$  ← depends on I/O speed  
— || — transfer data to mem =  $t_y$  ← depends on mem speed

$$\% \text{ of time CPU is blocked due to DMA} = \frac{t_y}{t_x + t_y} * 100\% \quad (\text{burst mode})$$

---

$$|| - \frac{t_y}{t_x} = \frac{t_y}{t_x} * 100\% \quad (\text{cycle stealing})$$

#Q. Consider a device operating on 1MBPS speed and transferring the data to memory using cycle stealing mode of DMA. If it takes 2 microseconds to transfer 16 bytes data to memory when it is ready/prepared. Then percentage of time CPU is blocked due to DMA is?

$$\begin{aligned} t_y &= 2 \mu\text{sec} \\ t_{yc} &= 16 \mu\text{sec} \end{aligned}$$

$$\begin{aligned} \text{for } 1\text{MB, preparat' time} &= 1\text{sec} \\ \text{for } 16\text{B} &\quad \frac{1}{16} = \frac{1\text{sec}}{1\text{MB}} * 16\text{B} \\ &= 16 \mu\text{sec} \end{aligned}$$

---


$$\begin{aligned} \% \text{ of time CPU blocked due to DMA} &= \frac{2 \mu\text{s}}{16 \mu\text{s}} * 100\% \\ &= 12.5\% \end{aligned}$$

Ques) % of time CPU blocked = 20%

2 MBPS

cycle stealing

Data to prepare at once = 8B

$$t_y = \text{_____ usec ?}$$

for 2MB, time = 1sec

$$\text{for } 8B, \text{ time} = \frac{1\text{sec}}{2\text{MB}} * 8B \\ = 4\text{usec}$$

$$20\% = \frac{t_y}{4\text{usec}} * 100\%$$

$$t_y = \frac{20}{25} \\ = 0.8\text{ usec}$$

Ques)  $t_y = 10 \text{ msec}$

% of time CPU blocked = 5%

$t_x = \underline{\hspace{2cm}} \text{ us}$  ?.

Cycle stealing

Sol'n

$$5\% = \frac{10 \text{ usec}}{t_x} * 100\%$$

$$t_x = \frac{1000 \text{ usec}}{5}$$

$$= 200 \text{ usec}$$

Ques)

In this quest' if data prepared at once is 4 bytes then speed of I/O is ?.

Sol'n

$$\begin{aligned} \text{In } 200 \text{ usec, data prepare} &= 4B \\ \text{in } 1 \text{ sec, } \frac{1}{\text{---}} &= \frac{4B}{200 \times 10^{-6} \text{ sec}} \\ &= 2 * 10^4 B/\text{sec} \\ &\Rightarrow 20 \text{ KB/sec} \end{aligned}$$

Ans = 80000

#Q. Consider a computer system with DMA support. The DMA module is transferring one 8-bit character in one CPU cycle from a device to memory through cycle stealing at regular intervals. Consider a 2 MHz processor. If 0.5% processor cycles are used for DMA, the data transfer rate of the device is bits per second?

$$t_y = 1 \text{ cycle} = \frac{1}{2 \text{ MHz}} = 0.5 \mu\text{sec}$$

$$0.5\% = \frac{0.5 \mu\text{sec}}{t_x} * 10^6 \%$$

$$t_x = 100 \mu\text{sec}$$

in 100  $\mu\text{sec}$ , data = 8 bits

$$\text{in } 1 \text{ sec, } \frac{8 \text{ bits}}{100 * 10^6 \text{ sec}} = 80000 \text{ bits/sec}$$

#Q. On a non-pipelined sequential processor, a program segment, which is the part of the interrupt service routine, is given to transfer 500 bytes from an I/O device to memory.

Initialize the address register — 1 ] 2  
Initialize the count to 500 — 1  
LOOP: Load a byte from device — 2  
Store in memory at address given by address register — 2 }  $7 * 500$   
Increment the address register — 1  
Decrement the count — 1  
If count !=0 go to LOOP — 1

*500 times* ← [

Assume that each statement in this program is equivalent to a machine instruction which takes one clock cycle to execute if it is a non-load/store instruction. The load-store instructions take two clock cycles to execute.

The designer of the system also has an alternate approach of using the DMA controller to implement the same transfer. The DMA controller requires 20 clock cycles for initialization and other overheads. Each DMA transfer cycle takes two clock cycles to transfer one byte of data from the device to the memory.

What is the approximate speed up when the DMA controller-based design is used in a place of the interrupt driven program-based input-output?

$$\text{Interrupt time} = 2 + (500 * 7) = 3502$$

$$\text{DMA time} = 20 + (2 * 500) = 1020$$

$$\text{Speed up} = \frac{\text{Interrupt I/O time}}{\text{DMA time}} = \frac{3502}{1020} \approx 3.4$$

#Q. The DMA controller has data count register of size 8-bits. The memory is byte addressable. The maximum number of bytes the DMA can transfer to memory at a time without giving the control of the buses back to CPU?

$$\text{max value of 8 bits} = (111111)_8 = (255)_{10}$$

$$\text{Ans} = 255$$

#Q. The DMA controller has data count register of size 8-bits. The memory is byte addressable.

1. Minimum how many times DMA needs to take control from CPU to transfer a file of 500 bytes?
2. Minimum how many times DMA needs to take control from CPU to transfer a file of 15K bytes?

$$1. \left\lceil \frac{500 \text{ B}}{(2^8 - 1) \text{ B}} \right\rceil = 2$$

$$2. \left\lceil \frac{15 * 1024 \text{ B}}{255} \right\rceil = 61$$

GATE - PYQ

- #Q. The size of the data count register of a DMA controller is 16bits.  
The processor needs to transfer a file of 29, 154 kilobytes from disk to main memory.  
The memory is byte addressable. The minimum number of times the DMA controller needs to get the control of the system bus from the processor to transfer the file from the disk to main memory is 456.

$$= \left\lceil \frac{29154 * 1024B}{(2^{16}-1)B} \right\rceil = 456$$

$$t_x = t_y$$

#Q. If a word preparation time in IO device and word transfer time to memory from IO device are same. Then?

$$\text{cycle stealing, \% of time CPU blocked} = \frac{t_y}{t_{sc}} * 100\% \\ = 100\%$$

- A ✓ 100% time CPU is blocked due to DMA in cycle stealing mode
- B 50% time CPU is blocked due to DMA in cycle stealing mode
- C 100% time CPU is blocked due to DMA in burst mode
- D ✓ 50% time CPU is blocked due to DMA in burst mode

for burst mode,

$$= \frac{t_y}{t_x + t_y} * 100\% = 50\%$$

#Q. Which one of the following statements is FALSE?

- A** In the cycle stealing mode of DMA, one word of data is transferred between an I/O device and main memory in a stolen cycle
- B** The CPU can start executing an interrupt service routine faster with vectored interrupts than with non-vectored interrupts
- C** For bulk data transfer, the burst mode of DMA has a higher throughput than the cycle stealing mode
- D**  Programmed I/O mechanism has a better CPU utilization than the interrupt driven I/O mechanism

#Q. Consider the following statements:

- I. Daisy chaining is used to assign priorities in attending interrupts.
- II. When a device raises a vectored interrupt, the CPU does polling to identify the source of interrupt.
- III. In polling, the CPU periodically checks the status bits to know if any device needs its attention.
- IV. During DMA, both the CPU and DMA controller can be bus masters at the same time.

Which of the above statements is/are TRUE?

**A**

I, II only

**C**

I and III only

**B**

I and IV only

**D**

III only

#Q. A keyboard connected to a computer is used at a rate of 1 keystroke per second. The computer system polls the keyboard every 10 ms (milli seconds) to check for a keystroke and consumes 100  $\mu$ s (micro seconds) for each poll. If it is determined after polling that a key has been pressed, the system consumes an additional 200  $\mu$ s to process the keystroke. Let  $T_1$  denote the fraction of a second spent in polling and processing a keystroke.

In an alternative implementation, the system uses interrupts instead of polling. An interrupt is raised for every keystroke. It takes a total of 1 ms for servicing an interrupt and processing a keystroke. Let  $T_2$  denote the fraction of a second spent in servicing the interrupt and processing a keystroke.

The ratio  $\frac{T_1}{T_2}$  is 10.2. (Rounded off to one decimal place)

$$\frac{T_1}{T_2} = \frac{10.2 \text{ ms}}{1 \text{ ms}} = 10.2$$

Soln

$$\text{no. of polls in 1 sec} = \frac{1 \text{ sec}}{10 \text{ msec}} = 100$$

$$\begin{aligned}\text{Total time in polling per sec} &= 100 * 100 \text{ usec} \\ &= 10 \text{ msec}\end{aligned}$$

$$\text{keystroke processing time} = 200 \text{ usec} = 0.2 \text{ ms}$$

$$\begin{aligned}T_1 &= 10 + 0.2 \\ &= 10.2 \text{ ms}\end{aligned}$$

$$T_2 = 1 \text{ msec}$$



## 2 mins Summary



**Topic**

DMA

**Topic**

Cycle Stealing



# Happy Learning

## THANK - YOU