

# Bab 12

## Code Generation dan Target Machine

### Sub-CPMK yang Dicakup dalam Bab Ini:

- **Sub-CPMK 5.3:** Mengimplementasikan code generator untuk arsitektur target

### 12.1 Pengenalan Target Machine (RISC vs CISC)

#### 12.1.1 Arsitektur RISC (Reduced Instruction Set Computer)

Contoh: RISC-V, ARM.

- Instruksi berukuran tetap (biasanya 32-bit).
- Hanya instruksi Load dan Store yang bisa mengakses memori.
- Memiliki banyak register umum (*general purpose*).

#### 12.1.2 Arsitektur CISC (Complex Instruction Set Computer)

Contoh: x86 (Intel/AMD).

- Instruksi berukuran variabel (1-15 byte).
- Instruksi aritmatika bisa langsung mengakses operan di memori.
- Jumlah register fisik lebih terbatas dibanding RISC.

#### 12.1.3 Dampak pada Code Generation

Code generator harus menyadari arsitektur target. Pada RISC, operasi aritmatika yang kompleks harus diurai menjadi beberapa instruksi dasar load-load-op-store.

## 12.2 Pemilihan Instruksi (Instruction Selection)

*Instruction Selection* adalah proses memetakan instruksi tingkat menengah (*TAC*) ke instruksi spesifik mesin target yang memberikan performa terbaik.

### 12.2.1 Pattern Matching

Kompilator modern sering menggunakan pencocokan pola pohon (*Tree Pattern Matching*) untuk memilih instruksi kompleks. Misal:  $a = b * c + d$  dapat diganti dengan satu instruksi *Multiply-Add* jika CPU mendukungnya.

### 12.2.2 Biaya Instruksi (Cost Estimation)

Setiap opsi instruksi memiliki bobot (siklus CPU). *Instruction selector* mencoba meminimalkan total bobot instruksi yang dihasilkan untuk sebuah blok kode.

## 12.3 Register Allocation

### 12.3.1 Register Allocation Problem

Masalah alokasi register:

- Variabel terbatas vs unlimited temporaries
- Register interference
- Spilling ke memory
- Calling convention constraints

### 12.3.2 Linear Scan Allocation

```
1 typedef struct {
2     char *var_name;
3     int start_point;      // First use
4     int end_point;        // Last use
5     int register_num;    // Assigned register (-1 if spilled)
6 } LiveRange;
7
8 void linear_scan_allocation(LiveRange *ranges, int count,
9                             int num_registers) {
10    // Sort ranges by start point
11    sort_ranges_by_start(ranges, count);
12
13    bool *registers_used = calloc(num_registers, sizeof(bool));
14}
```

```

15    for (int i = 0; i < count; i++) {
16        // Free registers whose ranges have ended
17        free_expired_registers(ranges[i].start_point,
18                                registers_used, num_registers);
19
20        // Find free register
21        int reg = find_free_register(registers_used, num_registers);
22        if (reg != -1) {
23            ranges[i].register_num = reg;
24            registers_used[reg] = true;
25        } else {
26            // Spill to memory
27            ranges[i].register_num = -1;
28        }
29    }
30}

```

## 12.4 Target Architecture

### 12.4.1 x86 Architecture

Karakteristik x86:

- CISC (Complex Instruction Set Computer)
- Variable-length instructions
- Rich addressing modes
- Backward compatibility

```

1 // x86 instruction examples
2 MOV EAX, EBX          ; Register to register
3 MOV EAX, [EBX+4]       ; Base + offset addressing
4 MOV EAX, [EBX+ECX*4]  ; Base + index*scale
5 LEA EAX, [EBX+ECX*2] ; Load effective address
6 PUSH EAX             ; Stack operation
7 POP EBX              ; Stack operation

```

### 12.4.2 RISC Architecture

Karakteristik RISC (MIPS/ARM):

- Fixed-length instructions
- Load/store architecture
- Simple addressing modes

- Large register file

```
1 // MIPS instruction examples
2 ADD $t0, $t1, $t2      ; $t0 = $t1 + $t2
3 LW $t0, 4($t1)         ; Load word from memory
4 SW $t0, 8($t1)         ; Store word to memory
5 ADDI $t0, $t1, 10       ; $t0 = $t1 + 10 (immediate)
```

## 12.5 Code Generation Algorithm

### 12.5.1 Basic Block Code Generation

```
1 void generate_block_code(BasicBlock *block,
                           RegisterAllocator *alloc) {
2     for (int i = 0; i < block->instruction_count; i++) {
3         TACInstruction *inst = &block->instructions[i];
4
5         // Allocate registers for operands
6         int reg1 = allocate_register(inst->arg1, alloc);
7         int reg2 = allocate_register(inst->arg2, alloc);
8         int reg3 = allocate_register(inst->result, alloc);
9
10        // Generate target instruction
11        switch (inst->op) {
12            case OP_ADD:
13                emit_add(reg3, reg1, reg2);
14                break;
15            case OP_MUL:
16                emit_mul(reg3, reg1, reg2);
17                break;
18            case OP_ASSIGN:
19                emit_mov(reg3, reg1);
20                break;
21            // ... other operations
22        }
23
24        // Release temporary registers
25        release_register(reg1, alloc);
26        release_register(reg2, alloc);
27    }
28 }
29 }
```

### 12.5.2 Function Call Generation

```
1 void generate_function_call(FunctionCall *call) {
2     // Save caller-saved registers
3     save_caller_saved_registers();
4
5     // Push parameters (right-to-left for cdecl)
6     for (int i = call->param_count - 1; i >= 0; i--) {
7         push_parameter(call->parameters[i]);
```

```

8     }
9
10    // Call function
11    emit_call(call->function_name);
12
13    // Clean up stack (callee or caller depending on convention)
14    cleanup_stack(call->param_count * sizeof(int));
15
16    // Restore caller-saved registers
17    restore_caller_saved_registers();
18
19    // Move return value to target register
20    if (call->has_return_value) {
21        emit_mov(call->return_reg, EAX);
22    }
23}

```

## 12.6 Optimization in Code Generation

### 12.6.1 Peephole Optimization

```

1 void peephole_optimization(Instruction *instructions, int count) {
2     for (int i = 0; i < count - 1; i++) {
3         // MOV reg, reg -> NOP
4         if (is_mov_reg_to_reg(&instructions[i])) {
5             instructions[i].opcode = NOP;
6         }
7
8         // PUSH reg; POP reg -> NOP
9         if (is_push_pop_same_reg(&instructions[i], &instructions[i+1])) {
10            instructions[i].opcode = NOP;
11            instructions[i+1].opcode = NOP;
12        }
13
14         // MOV reg, imm; ADD reg, imm -> LEA reg, [imm]
15         if (can_convert_to_lea(&instructions[i], &instructions[i+1])) {
16             convert_to_lea(&instructions[i], &instructions[i+1]);
17         }
18     }
19 }

```

### 12.6.2 Instruction Scheduling

```

1 void schedule_instructions(Instruction *instructions, int count) {
2     // Simple list scheduling for basic blocks
3     Instruction *scheduled[count];
4     int scheduled_count = 0;
5
6     while (scheduled_count < count) {
7         // Find ready instructions (no dependencies)
8         for (int i = 0; i < count; i++) {

```

```
9     if (!is_scheduled(&instructions[i]) &&
10        is_ready(&instructions[i], scheduled, scheduled_count)) {
11            scheduled[scheduled_count] = instructions[i];
12            break;
13        }
14    }
15 }
16
17 // Copy scheduled instructions back
18 memcpy(instructions, scheduled, count * sizeof(Instruction));
19 }
```

## Aktivitas Pembelajaran

1. **Instruction Selection:** Implementasikan instruction selector untuk subset x86.
2. **Register Allocation:** Bangun linear scan register allocator.
3. **Code Generation:** Implementasikan code generator untuk simple expressions.
4. **Peephole Optimization:** Buat peephole optimizer untuk assembly code.
5. **Function Calls:** Generate code untuk function calls dengan calling conventions.

## Latihan dan Refleksi

1. Generate assembly code untuk expression tree kompleks!
2. Implementasikan register allocator dengan spilling strategy!
3. Analisis instruction selection for different target architectures!
4. Optimasi generated code dengan peephole optimizations!
5. Generate code untuk recursive functions dengan proper stack management!
6. **Refleksi:** Bagaimana target architecture mempengaruhi code generation strategy?

## Asesmen (Evaluasi Kinerja)

### Instrumen Penilaian untuk Sub-CPMK 5.3

#### A. Pilihan Ganda

1. Register allocation problem terjadi karena:
  - (a) Terlalu banyak variabel
  - (b) Terbatasnya jumlah register
  - (c) Memory terlalu kecil
  - (d) Instruksi terlalu kompleks
2. CISC architecture memiliki:
  - (a) Fixed-length instructions
  - (b) Variable-length instructions
  - (c) Load/store only
  - (d) Large register file
3. Peephole optimization bekerja pada:
  - (a) Single instruction
  - (b) Small window of instructions
  - (c) Entire program
  - (d) Basic blocks

## B. Essay

1. Jelaskan complete code generation pipeline dari three-address code ke assembly!
2. Implementasikan code generator untuk bahasa sederhana dengan arithmetic expressions dan function calls!

**Rubrik Penilaian:** Lihat Lampiran A

### Checklist Pencapaian Kompetensi

*Centang item berikut setelah Anda yakin telah menguasainya:*

- Saya dapat mengimplementasikan code generator untuk arsitektur target
- Saya dapat melakukan instruction selection yang efisien
- Saya dapat mengimplementasikan register allocation algorithms
- Saya dapat menggenerate code untuk function calls
- Saya dapat melakukan peephole optimizations

- Saya memahami perbedaan CISC dan RISC architectures

## Rangkuman

Bab ini membahas code generation dan target machine, termasuk instruction selection, register allocation, target architectures, dan optimization techniques. Mahasiswa belajar membangun code generator yang efisien.

### Poin Kunci:

- Code generation mengkonversi intermediate code ke target code
- Instruction selection memilih optimal target instructions
- Register allocation mengelola limited register resources
- Target architecture mempengaruhi generation strategy
- Peephole optimization mengoptimasi local instruction patterns
- Function calls memerlukan proper calling convention handling

**Kata Kunci:** *Code Generation, Instruction Selection, Register Allocation, Target Architecture, x86, RISC, Peephole Optimization, Calling Convention*