

CS-537: Midterm (Spring 2016)  
*The Timeline*

**Please Read All Questions Carefully!**

There are eighteen (18) total numbered pages, with fourteen (14) questions.

Please put your FULL NAME (mandatory) on THIS page only.

Name: \_\_\_\_\_ [MASTER]

## **Grading Page**

|       | Points | Total Possible |
|-------|--------|----------------|
| Q1    |        | 10             |
| Q2    |        | 10             |
| Q3    |        | 10             |
| Q4    |        | 10             |
| Q5    |        | 10             |
| Q6    |        | 10             |
| Q7    |        | 10             |
| Q8    |        | 10             |
| Q9    |        | 10             |
| Q10   |        | 10             |
| Q11   |        | 10             |
| Q12   |        | 10             |
| Q13   |        | 10             |
| Q14   |        | 10             |
| Total |        | 140            |

## Directions

Welcome to the 537 Midterm! It shouldn't be too hard, unless, of course, you haven't prepared! Then it will be hard, unless you are a genius.

The theme of the exam is **the timeline**. Timelines show the behavior of things over time (naturally), and so in this exam we will mostly look at timelines to see what the system is doing, or create timelines to describe various behaviors.

Please **read each question carefully**.

Exam-taking strategy should be: **easiest-problem-first**. This scheduling discipline will ensure you finish as much of the exam as possible, and also builds your confidence. Don't get stuck working on one hard problem.

Good luck!

# EACH WORTH $\frac{1}{2}$ PT

## 1. Base And Bounds Blues

A system uses simple base/bounds to virtualize address spaces. In each of the traces below, your job is to fill in the missing values of either virtual addresses, physical addresses, base register, or bounds register. The bounds holds the size of the valid region of the address space. In some cases, you won't be able to precisely know the values, so just put as precise of an answer as you can (i.e., perhaps a range, not a single number).

All numbers are in decimal format.

Virtual Address  $\rightarrow$  Physical Address

|      |         |
|------|---------|
| 0    | 1000    |
| 100  | 1100    |
| 1999 | 2999    |
| 2000 | [fault] |

Base? 1000  
Bounds? 2000

Virtual Address  $\rightarrow$  Physical Address

|      |      |
|------|------|
| 0    | 1000 |
| 100  | 1100 |
| 1999 | 2999 |
| 2000 | 3000 |

Base? 1000  
Bounds? > 2000

Virtual Address  $\rightarrow$  Physical Address

|      |               |
|------|---------------|
| 100  | 3400          |
| 2000 | 5300          |
| 2001 | <u>5301</u> ? |
| 3000 | 6300          |

Base? 3300  
Bounds? > 3000

Virtual Address  $\rightarrow$  Physical Address

|      |               |
|------|---------------|
| 0    | <u>6050</u> ? |
| 100  | <u>6150</u> ? |
| 2000 | <u>8050</u> ? |
| 2001 | [fault]       |

Base? 6050  
Bounds? 2001

OR [fault]  
" "  
bounds 0

Virtual Address  $\rightarrow$  Physical Address

|                    |         |
|--------------------|---------|
| <u>400</u> ?       | 0900    |
| <u>600</u> ?       | 1100    |
| <u>2500</u> ?      | 3000    |
| <u>&gt; 3000</u> ? | [fault] |

Base? 500  
Bounds? 3000

Virtual Address  $\rightarrow$  Physical Address

|      |       |
|------|-------|
| 9000 | 10001 |
| 100  | 1101  |
| 2000 | 3001  |
| 2001 | 3002  |

Base? 1001  
Bounds? > 9000

EACH WORTH 1 1/2 PTS

$$\text{MAX } [-10]$$

## 2. Deadlock Or Not Deadlock

Some of the following arrangement of threads, and the locks that they will grab, can lead to deadlock. Which ones?

Thread 1: will try to grab locks 1 and 2 in an arbitrary order.  
Thread 2: will try to grab locks 1 and 2 in fixed order, 1 then

Could this deadlock?

YES



Thread 1: will try to grab locks 1 and 2 in fixed order, 1 then 2.  
Thread 2: will try to grab locks 1 and 2 in fixed order, 2 then 1.

Could this deadlock?

YES



Thread 1: will try to grab locks 1 and 2 in fixed order, 1 then 2.

Thread 2: will try to grab locks 1, 2 and 3 in fixed order, 1 then 2 then 3.

Could this deadlock?

NO

(no cycle)

Thread 1: will try to grab locks 1 and 2 in arbitrary order.  
Thread 2: will try to grab locks 2 and 3 in arbitrary order.  
Thread 3: will try to grab locks 1 and 3 in arbitrary order.

Could this deadlock?

YES



```
Thread 1: will try to grab locks 1 and 2 in arbitrary order.  
Thread 2: will try to grab locks 2 and 3 in arbitrary order.  
Thread 3: will try to grab lock 1.
```

Could this deadlock?

NO



Thread 1: will try to grab locks 1 and 2 in fixed order, 1 then 2.  
Thread 2: will try to grab locks 2 and 3 in fixed order, 2 then 3.  
Thread 3: will try to grab locks 1 and 3 in fixed order, 1 then 3.

Could this deadlock?

NO

(ordered acquire)

Thread 1: will try to grab locks 1 and 2 in fixed order, 1 then 2.  
Thread 2: will try to grab locks 2 and 3 in fixed order, 2 then 3.  
Thread 3: will try to grab locks 1 and 2 in arbitrary order.

Could this deadlock?

YES



EACH LINE | PT  
MAX -10

### 3. Fork Exec Wait Oh My

Remember fork(), exec(), and wait()? In this question, we trace when each of these system calls is CALLED, and when they RETURN. Thus, write down something like "fork called" if fork() has been called, and "fork returned" if it returns. A particular answer may have more than one action.

Successful exec does not return

Successful fork returns twice (parent, child)



EACH  $\frac{1}{2}$  PT

#### 4. Hardware Or Software Or User Program Or Give Up

This question is about the Limited Direct Execution protocol. Some of these steps are performed by the OS; some are handled by hardware (HW); some are in the user program itself (USER). Mark (by circling the correct answer), in the timeline below, which steps are taken by OS, HW, or USER program in this example of a process being created, running, issuing a system call, and exiting.

|                                          |    |    |      |
|------------------------------------------|----|----|------|
| Create entry for process list            | OS | HW | USER |
| Allocate memory for program              | OS | HW | USER |
| Load program into memory                 | OS | HW | USER |
| Setup user stack with argv               | OS | HW | USER |
| Fill kernel stack with reg/PC            | OS | HW | USER |
| execute return-from-trap instruction     | OS | HW | USER |
| restore regs from kernel stack           | OS | HW | USER |
| switch to user mode                      | OS | HW | USER |
| set PC to main()                         | OS | HW | USER |
| Start running in main()                  | OS | HW | USER |
| Call a system call                       | OS | HW | USER |
| execute trap instruction                 | OS | HW | USER |
| save regs to kernel stack                | OS | HW | USER |
| switch to kernel mode                    | OS | HW | USER |
| set PC to OS trap handler                | OS | HW | USER |
| Handle trap                              | OS | HW | USER |
| Do work of syscall                       | OS | HW | USER |
| execute return-from-trap instruction     | OS | HW | USER |
| restore regs from kernel stack           | OS | HW | USER |
| switch to user mode                      | OS | HW | USER |
| set PC to instruction after earlier trap | OS | HW | USER |
| Call exit() system call                  | OS | HW | USER |

# EACH M/H $\frac{1}{4}$ PT

## 5. Physical Memory Is Just A Cache

Here is a timeline of virtual memory references, given by the virtual page number:

0 1 4 0 1 3 0 1 4 1

Your job is simple: for each scenario below, determine whether the virtual page access will lead to a HIT ("H") as the page is FOUND in the memory of the system or a MISS ("m") as it is not (the page must be retrieved from the disk's swap space).

All pages begin on disk; no pages are in memory at the start (and thus must be referenced to be brought into memory).

Policy FIFO, Cache Size 3

0 1 4 0 1 3 0 1 4 1  
 M M M (H) (H) M M M M (H)

$\emptyset \times \emptyset \emptyset 014$

Policy FIFO, Cache Size 5

0 1 4 0 1 3 0 1 4 1  
 M M M (H) (H) M (H) (H) (H)

$\emptyset \times \emptyset \emptyset 014$

Policy LRU, Cache Size 3

0 1 4 0 1 3 0 1 4 1  
 M M M (H) (H) M (H) (H) M (H)

$\emptyset \times \emptyset \emptyset 014$

notice:  
same  
result  
because  
cache  
is  
big

Policy LRU, Cache Size 1000

0 1 4 0 1 3 0 1 4 1  
 M M M (H) (H) M (H) (H) (H)

EACH QUESTION

1 1/2 PTS

## 6. Producers And Consumers: We Need Both

Assume the following producer/consumer implementation for the famous bounded buffer problem.

```
int buffer[max];

void *producer(void *arg) {
    for (i = 0; i < loops; i++) {
        Pthread_mutex_lock(&mutex); // p1
        while (count == max) // p2
            Pthread_cond_wait(&empty, &mutex); // p3
        put(i); // p4
        Pthread_cond_signal(&fill); // p5
        Pthread_mutex_unlock(&mutex); // p6
    }
}

void *consumer(void *arg) {
    while (1) {
        Pthread_mutex_lock(&mutex); // c1
        while (count == 0) // c2
            Pthread_cond_wait(&fill, &mutex); // c3
        int tmp = get(); // c4
        Pthread_cond_signal(&empty); // c5
        Pthread_mutex_unlock(&mutex); // c6
        printf("%d\n", tmp);
    }
}
```

Assume further that the only way a thread stops running is when it explicitly blocks in either a condition variable or lock (in other words, no untimely interrupts switch from one thread to the other).

Also assume there are NO SPURIOUS WAKEUPS from wait().

In the following, show which lines of code run given the particular scenario.  
For example:

Thread Pa: p1p2p4p5p6p1p2p3  
Thread Ca: c1c2c4c5c6c1c2

You can also show this more concisely if you like, e.g.:

Pa: 1,2,4,5,6,1,2,3  
Ca: 1,2,4,5,6,1,2

Trace 1: 1 producer (Pa), 1 consumer (Ca), max=1. Producer Pa runs first. Stop when consumer has consumed one entry.

Pa: 12456123

Ca: 124

Trace 2: 1 producer (Pa), 1 consumer (Ca), max=3. Producer Pa runs first. Stop when consumer has consumed one entry.

Pa: 12456 12456 12456 123

Ca: 124

3 times

```

void *producer(void *arg) {
    for (i = 0; i < loops; i++) {
        Pthread_mutex_lock(&mutex);           // p1
        while (count == max)                 // p2
            Pthread_cond_wait(&empty, &mutex); // p3
        put(i);                            // p4
        Pthread_cond_signal(&fill);          // p5
        Pthread_mutex_unlock(&mutex);        // p6
    }
}

void *consumer(void *arg) {
    while (1) {
        Pthread_mutex_lock(&mutex);           // c1
        while (count == 0)                   // c2
            Pthread_cond_wait(&fill, &mutex); // c3
        int tmp = get();                  // c4
        Pthread_cond_signal(&empty);         // c5
        Pthread_mutex_unlock(&mutex);        // c6
    }
}

```

Trace 3: 1 producer (Pa), 1 consumer (Ca), max=1. Consumer Ca runs first. Stop when consumer has consumed one entry.

Pa: 12456123

Ca: 123

② 4

critical: recheck

Trace 4: 1 producer (Pa), 2 consumers (Ca, Cb), max=1. Consumer Ca runs first, then Cb, then Pa. Stop when producer Pa has produced an entry.

Pa: 124

Ca: 123

Cb: 123

Trace 5: Now we change the "while" loops to "if" statements. Show a trace of behavior, using one producer and two consumers, where this leads to problems (assume max is set to whatever you like).

Pa: 12456123

Ca: 123 | wait wake

Cb:

12456123

[consumes entry]

5 no 2!

max=1

④ tries to consume, nothing is there

Trace 6: Now we use "while" loops but only one condition variable, not two as above. Show a trace of behavior, using one producer and two consumers, where this leads to problems (assume max is set to whatever you like).

Pa: 12456123 | wait

Ca: 123 | wait wake

Cb: 123 | wait

245 | signal

→ could wake

consumer but must wake producer

# EACH 1 PT

## (MAX -10)

### 7. Remember Scheduling

The following timelines show a set of jobs arriving, and then being executed on a processor. What is the TURNAROUND TIME for job A?

Assume A arrives at the beginning of the time unit where the "\*" is on the timeline, A ends at the end of the time unit where "x" and that each tick moves time forward 1 time unit.

Example

ABABA  
\* x

This means that A starts at time=0, runs right away, and then finishes at time=5. B runs from time=1-2 and time=3-4.



Turnaround Time (A) ?

|           |          |
|-----------|----------|
| <u>9</u>  | $(9-0)$  |
| <u>9</u>  | $(11-2)$ |
| <u>5</u>  | $(5-0)$  |
| <u>10</u> | $(10-0)$ |
| <u>11</u> | $(11-0)$ |
| <u>5</u>  | $(10-5)$ |

Now do the same for RESPONSE TIME for A (note that the traces are not identical to those above):

Response Time (A) ?



|          |  |
|----------|--|
| <u>0</u> |  |
| <u>2</u> |  |
| <u>0</u> |  |
| <u>5</u> |  |
| <u>0</u> |  |
| <u>5</u> |  |

DON'T GRADE  
(SAME AS)

VALID ENTRIES

1 1/2 PTS EACH

INVALID

1 PT EACH

## 8. Reverse Engineering The Page Table

In this problem, we consider address translation in a system with a simple linear page table (an array of page table entries, or PTEs).

Parameters:

- Virtual address space size is 32KB
- Page size is 4KB
- Physical memory size is 64KB

Here is a trace of virtual addresses and the physical addresses they translate to (or perhaps an invalid access):

VA 0x1063 --> PA 0x2063  
VA 0x67b4 --> PA 0x67b4  
VA 0x584a --> PA 0xe84a  
VA 0x4dfe --> Invalid  
VA 0x388a --> Invalid  
VA 0x1c6b --> PA 0x2c6b  
VA 0x50a9 --> PA 0xe0a9  
VA 0x0bc6 --> Invalid  
VA 0x2a9f --> PA 0x9a9f  
VA 0x742b --> Invalid  
VA 0x4b5e --> Invalid  
VA 0x5597 --> PA 0xe597

1 → 2  
6 → 6  
5 → e  
4 x  
3 x  
0 x  
2 → 9

offset  
4KB → 12 bits  
→ 3 hex digits

Thus, VPN or PFN is top hex digit

e.g. 0x 

⇒ 0x 

Can you reconstruct the page table entries from this? For each entry that you can construct, please do so; otherwise, mark down the entry as "UNKNOWN".

Format: Valid bit followed by Physical Frame Number (PFN)  
[ 1 or 0 then PFN ]

|                     | Valid | PFN |
|---------------------|-------|-----|
| Page Table Entry 0: | 0     | —   |
| Page Table Entry 1: | 1     | 2   |
| Page Table Entry 2: | 1     | 9   |
| Page Table Entry 3: | 0     | —   |
| Page Table Entry 4: | 0     | —   |
| Page Table Entry 5: | 1     | 0xE |
| Page Table Entry 6: | 1     | 6   |
| Page Table Entry 7: | 0     | —   |

# EACH 1 PT [MAX -10]

## 9. Segmentation Is Fun Until The Stack Segment

Segmentation is another approach to supporting virtual memory. In this question, you will examine some timelines of virtual memory addresses and try to set the base and bounds registers, per segment, correctly so as to NEVER GENERATE a SEGMENTATION FAULT, and to make sure that the virtual addresses in the trace get translated to the proper physical address.

All other virtual addresses (not seen in the trace) should generate a SEGMENTATION FAULT.

Here we assume a simple segmentation approach that splits the virtual address space into two segments. The top bit of the virtual address determines which segment it is in.

Segment 0 acts like a code and heap segment; the heap grows towards higher addresses.

Segment 1 acts like a stack segment; it grows backwards towards lower addresses. For this segment, we follow convention that the book follows: the base register points to the physical address one past the last byte of the stack.

In both segments, the bounds (or limit) register just contains the "size" of the segment, i.e., the number of bytes valid.

Trace 1: Assume a 16-byte (4-bit) virtual address space (tiny!).

- Virtual address trace: 0,1,2,3,15,14,13 (all of these accesses should be valid)
- Virtual address 1 translates to physical address 101
- Virtual address 13 translates to physical address 998

Segment 0 Base? 100

Segment 1 Base? 1001

$13 \rightarrow 998$

Segment 0 Limit? 4

Segment 1 Limit? 3

$14 \rightarrow 999$

$15 \rightarrow 1000$

Trace 2: Assume a 64-byte (6-bit) virtual address space.

- Virtual address trace: 0,1,63 (all of these accesses should be valid)
- Virtual address 1 translates to physical address 1001
- Virtual address 63 translates to physical address 899

Segment 0 Base? 1000

Segment 1 Base? 900

Segment 0 Limit? 2

Segment 1 Limit? 1

Trace 3: Assume a 8-byte (3-bit) virtual address space.

- Virtual address trace: 0,1,2,3 (all of these accesses should be valid)
- Virtual address 3 translates to physical address 100

Segment 0 Base? 97

Segment 1 Base? —

Segment 0 Limit? 4

Segment 1 Limit? 0

# EACH 1.5 PTS

## 10. Spin Locks For The Win

Assume we are using a spinlock to protect a critical section. In the timelines below, "c" means a thread is doing some computation; "S" means a thread is spinning on a lock, waiting for it to become available; "A" means a thread acquired a lock; "R" means a thread has released the lock.

Describe, in words, what happened in each of the following timelines. In some cases, the timelines shouldn't be possible. If so, say so, and describe why. In all cases, there is ONLY ONE LOCK that is being used.

Thread 1: ccccAccccRcccc

Thread 2: ccccccAccccccRcccc

What happened?

*T<sub>1</sub> acquired, released lock ; T<sub>2</sub> then did same*

Thread 1: ccccAccccccc

Thread 2: cccccSSSSSSSSSSSSSSSS

What happened?

*T<sub>1</sub> got lock, was interrupted; T<sub>2</sub> ran and spun, couldn't get lock; interrupted, T<sub>1</sub> ran and released lock*

Thread 1: ccccAcc ccccccR

Thread 2: cccccAccccR

What happened?

**BAD:** Looks like T<sub>1</sub>, T<sub>2</sub> both got lock

Thread 1: ccccAcc

Thread 2: cccSSSSSSSSSSSS ... (forever)

What happened?

*T<sub>1</sub> got lock, interrupted; T<sub>2</sub> ran @ higher priority, tries to acquire, spins forever*

Thread 1: AR AR AR AR ...

Thread 2: AR AR AR AR

What happened?

*lots of interrupts switching between T<sub>1</sub>, T<sub>2</sub> each grabs/releases lock*

Thread 1: ccccSSSSSS SSSSSSSSSS ... (forever)

Thread 2: cccccSSSSSS SSSSSSSSSS ... (forever)

What happened?

**BAD:** no one was able to get lock (spin forever)

PART 1: correct ones missed, - $\frac{1}{2}$  PT (MAX - 5)  
bad ones circled

PART 2: EACH 1 PT

(MAX - 5)

### 11. The Dreaded MLFQ Question

The MLFQ (multi-level feedback queue) is a scheduling discipline. It consists of a number of rules. First, circle the rules that are actually part of the final MLFQ policy:

pri

1. If Priority(A) > Priority(B), A runs (B doesn't).
2. If Priority(A) < Priority(B), A runs (B doesn't).
3. If Priority(A) = Priority(B), A & B run in round-robin fashion.
4. If Priority(A) = Priority(B), A runs to completion, then B.
5. When a job enters the system, it is placed at the highest priority (the topmost queue).
6. When a job enters the system, it is placed in any queue.
7. When a job enters the system, it is placed in the lowest queue.
8. Once a job uses up its time slice at a given level, its priority is reduced.
9. Once a job uses up its time slice at a given level, it moves to the end of the round-robin queue.
10. Once a job uses up its time slice at a given level, it exits.
11. After some time period, move each job up to a higher priority queue.
12. After some time period, move each job down to a lower priority queue.
13. After some time period, move all the jobs in the system to the top-most (highest-priority) queue.

1, 3, 5, 8, 13

RR

enter

reduce

boost

Now, write down the rule (or rules) that come into play in each of the following example traces of MLFQ behavior (use the numbers from above). Note that \* marks when A arrives, if the information is relevant.

Rule(s)?

Q3: A  
Q2: AA  
Q1: AAAAAAAA ...

5, 8 (twice)

Q3: \* A  
Q2: AA  
Q1: BBBBBBBB

5, 1, 8

Q3: \* A 5, 1  
Q2: AA 8 8  
Q1: BBBBBBBB AAAABBBB ...  
3

5, 1, 8, 3 (2x)

Q3:  
Q2:  
Q1: AAAABBBAAAABBBAAAABBB

3

Q3: AB AB  
Q2: AABB AABB  
Q1: AAABBAAABB AAABBB...

5 (or 13), 3, 8, 1, 13

(1 when A is done  
at given level)

## 12. The Even More Dreaded Multi-Level Page Table

Assume you have a 15-bit virtual address, with page size = 32 bytes. Assume further a two-level page table, with a page directory which points to pieces of the page table. Each page directory entry is 1 byte, and consists of a valid bit and PFN of the page of the page table. Each page table entry is similar: a valid bit followed by the PFN of the physical page where the desired data resides.

The page directory resides in physical page 18.

The following physical page contents are made available to you:

page 10: 7f f0 7f a4 7f 7f 7f 7f 7f  
7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f 7f



In translating virtual address 0x3a3a, which physical pages are accessed?

18, 90, 98

] 3 PTS

What is the final data value returned?

O<sub>x</sub> 00

] 2 PTS

In translating virtual address 0x74f6, which physical pages are accessed?

18, 57, 30

] 3 PTS

What is the final data value returned?

Ox 1c

7 2 PTS

EACH WRONG STATE -½ PT  
 MAX -10

### 13. Too Many Forking Questions

We have a system with three processes (A, B, C) and a single CPU. Processes can be in one of three states: Running, Ready, Blocked. If the process does not exist (yet), or has exited, just put a "----" down or leave the entry blank.

Below is a timeline of process behavior. Fill in the states of each process in the diagram:



#### 14. You Put The T In TLB

The following question traces TLB behavior over time. Each question will give you a few assumptions; you should then produce a series of hits ("H") and misses ("m"). The string "mHmH" would mean "TLB miss" followed by a "TLB hit" followed by a "miss" followed by a "hit".

In all cases, ignore instruction references (i.e., do not worry about their effects on the TLB).

Also, always assume the array (discussed below) is PAGE ALIGNED.

Assume you have a 1-entry TLB. Assume you access contiguous 4-byte integers in a large array, starting at index 0 and going to the max size. Assume the page size is 32-bytes. What is the hit/miss pattern for that access pattern?



Pattern:

2 PTS

[m H H H H H H H] repeated  
(until done)

Assume you have a 2-entry TLB with LRU replacement of TLB entries. Assume you access contiguous 4-byte integers in a large array, again starting at 0. Assume the page size is 32-bytes. What is the hit/miss pattern for that access pattern?

Pattern:

2

[m H H H H H H H] repeated  
(until done) (same  
as above)

Assume you have a 1-entry TLB. Assume you access \*every other\* 4-byte integer in a large contiguous array, starting at index=0, then index=2, etc. Assume the page size is 32-bytes. What is the hit/miss pattern for that access pattern?



Pattern:

3

[m H H H] repeated (until done)

Assume you have a 16-entry TLB with FIFO replacement. Assume you repeatedly access all 4-byte integers in a small contiguous array of 24 integers, in a loop. Assume the page size is 32-bytes. What is the hit/miss pattern for that access pattern, for the \*fourth\* run through the loop?



8 ints/page

Pattern:

3

[H] repeated 24 times  
(all hits)