

# **Senior Instruction Set Manual**

November 13, 2024



# Contents

|                                              |           |
|----------------------------------------------|-----------|
| <b>1 Senior hardware description</b>         | <b>5</b>  |
| 1.1 General . . . . .                        | 5         |
| 1.2 Accumulators . . . . .                   | 5         |
| 1.3 Registers . . . . .                      | 8         |
| 1.4 Memory . . . . .                         | 8         |
| 1.5 Coding convention . . . . .              | 9         |
| 1.6 Constants . . . . .                      | 10        |
| 1.7 Operations . . . . .                     | 11        |
| 1.7.1 Scaling . . . . .                      | 11        |
| 1.7.2 Rounding . . . . .                     | 11        |
| 1.7.3 Saturation . . . . .                   | 12        |
| 1.7.4 8 bit mode . . . . .                   | 12        |
| 1.8 Software stack . . . . .                 | 12        |
| 1.9 FFT addressing . . . . .                 | 12        |
| <b>2 Move-load-store instructions</b>        | <b>15</b> |
| <b>3 Short arithmetic instructions, 16'b</b> | <b>31</b> |
| <b>4 Short logic instructions, 16'b</b>      | <b>45</b> |
| <b>5 Short shift instructions, 16'b</b>      | <b>53</b> |
| <b>6 Long arithmetic instructions, 32'b</b>  | <b>63</b> |
| <b>7 Iterative instructions</b>              | <b>83</b> |
| <b>8 Flow control instructions</b>           | <b>85</b> |
| <b>9 Alias instructions</b>                  | <b>91</b> |



# 1 Senior hardware description

## 1.1 General

The **Senior** processor is a single issue DSP processor for applications including voice codec, audio decoder, bit manipulations and program flow controller for video codec. The instruction set supports running basic kernels of BDTI benchmarking.

The core includes data path, control path and address path. The data path consists of a general register file, ALU and a MAC. The addressing path consists of four circuit modules, AG0–AG3. AG0–AG1 supporting modulo, variable size post increment addressing and bit reversal addressing. The control path consists of a PC FSM, a loop controller supporting REPEAT of subroutine up to 127 instructions.

Interrupt handler and timer are directly connected to the core. The instruction decoder checks interrupt interface first before decoding an instruction. All access requests from other peripheral components to the core are controlled by the interrupt handler. Pipeline of different instructions is listed in Table 1.1. The pipeline architecture for all instructions except **conv** (convolution) is given in the simplified Fig. 1.1. The pipeline architecture for **conv** is given in the simplified Fig. 1.2.

The typical pipeline for RISC instructions is IF(P1)→ID(P2)→OF(P3)→EX(P4). Write back operations are executed in P4 right after execution during the execution cycle.

## 1.2 Accumulators

There are 4 accumulators (32 bits wide), **acr0** to **acr3**, used for double precision computing. There are 8 guard bits in the MAC unit giving a total of 40 bits internal resolution for the accumulators during computing.

| Pipe | RISC-E1/E2                | RISC memory load/store   | CISC-convolution    |
|------|---------------------------|--------------------------|---------------------|
| P1   | IF: instr fetch           | IF: instr fetch          | IF: instr fetch     |
| P2   | ID: instr decode          | ID: instr decode         | ID: instr decode    |
| P3   | OF: operand fetch         | OF+AG: compute addr      | OF+AG: compute addr |
| P4   | EX1:execution (set flags) | MEM:read/write           | MEM: read           |
| P5   | EX2:only for MAC, RWB     | WB: write back (if load) | send data mem→mul   |
| P6   |                           |                          | MUL                 |
| P7   |                           |                          | accumulation        |

Table 1.1: Pipeline specification



Figure 1.1: Pipeline processor architecture for all instructions but `conv`



Figure 1.2: Pipeline processor architecture for conv

## 1.3 Registers

There are 32 general registers (16 bits wide), **r0** to **r31**, used as computing buffers. General registers can be addressed using 5 bits binary code. There are also 32 special registers (16 bits wide, except **bitrev** that is 3 bits wide), **sr0** to **sr31**, that have special functions. They are addressed using 5 bits binary code and allocated either in the control path or in the address generator AGU or in the MAC (guard bits). Defined special registers can be seen in Table 1.2.

| Mnemonic | Location | Address code | Specification                    |
|----------|----------|--------------|----------------------------------|
| ar0      | AG       | 00000        | Address register 0               |
| ar1      | AG       | 00001        | Address register 1               |
| ar2      | AG       | 00010        | Address register 2               |
| ar3      | AG       | 00011        | Address register 3               |
| sp       | AG       | 00100        | Stack pointer                    |
| bot0     | AG       | 00101        | Bottom for AR0                   |
| top0     | AG       | 00110        | Top for AR0                      |
| step0    | AG       | 00111        | Step size for AR0                |
| bot1     | AG       | 01000        | Bottom for AR1                   |
| top1     | AG       | 01001        | Top for AR1                      |
| step1    | AG       | 01010        | Step size for AR1                |
| bitrev   | AG       | 01011        | Number of bits to reverse-6      |
| fftbase  | AG       | 01100        | Base address for FFT addressing  |
| fftstage | AG       | 01101        | Current stage of FFT addressing  |
| intaddr  | AG       | 01110        | Start address for interrupts     |
| f10      | CP       | 01111        | Flags, processor status register |
| f11      | CP       | 10000        | Flags, core control register     |
| loopn    | CP       | 10001        | Number of iterations in loop     |
| loopb    | CP       | 10010        | Loop start address               |
| loope    | CP       | 10011        | Loop end address                 |
| intmask  | CP       | 10100        | <i>Reserved(interrupt mask)</i>  |
| guards01 | MAC      | 10101        | Guard for ACR0 and ACR1          |
| guards23 | MAC      | 10110        | Guard for ACR2 and ACR3          |

Table 1.2: Special purpose registers

Special registers (**sr0** to **sr31**) can only be accessed with move instructions. Only general registers involves in arithmetic and load-store instructions.

The **bitrev** special purpose register determines the number of bits to reverse when using bit reversed addressing mode. **bitrev+6** bits will be reversed starting at the second least significant bit. That is the least significant bit will not be affected by the reversal.

Processor status register (**f10**) and core control vector register (**f11**) are in the following Table 1.3.

## 1.4 Memory

Three memories can be directly connected to the core, the PM (program memory) which is 32 bits wide and at most  $2^{16}$  words and DM0 (data memory 0) and DM1 (data memory 1) which are 16 bits wide and at most  $2^{15}$  words. The data memories are addressed with 16 bits and the lsb of

| Name | Bit assignment | Specification                            |
|------|----------------|------------------------------------------|
| AZ   | f10[0]         | ALU zero flag                            |
| AN   | f10[1]         | ALU sign flag                            |
| AC   | f10[2]         | ALU carry/saturation flag                |
| AV   | f10[3]         | ALU ALU overflow flag                    |
| MZ   | f10[4]         | MAC zero flag                            |
| MN   | f10[5]         | MAC sign flag                            |
| MS   | f10[6]         | MAC saturation flag                      |
| MV   | f10[7]         | MAC overflow flag (sticky)               |
| FFT  | f11[0]         | Actives FFT addressing for DMO           |
| TR   | f11[]          | <i>Reserved(Trace mode)</i>              |
| IE   | f11[]          | <i>Reserved(Global interrupt enable)</i> |

Table 1.3: Control and status registers

| Group code | Type [31:30]          | Specification                                                |
|------------|-----------------------|--------------------------------------------------------------|
| 00         | Move--load--store     | Memory access and register move instructions                 |
| 01         | Arithmetic operations | 16'b arithmetic/logic/shift and 32'b arithmetic instructions |
| 10         | Program flow control  | Jumps, calls, REPEAT, NOP, SLEEP, SW-trap instructions       |
| 11         | Accelerations         | Reserved for all accelerations                               |

Table 1.4: Instruction groups

the address is used to point out which half of the word is being addressed. 8 bits can be read or written at any address but 16 bits can only be read or written at even addresses. 8 bit reading or writing is specified with the flag b8.

## 1.5 Coding convention

Instruction are classified into four groups, as seen in Table 1.4. Flags of the DSP core expose behaviour of the most recent results from ALU and MAC. Flags are specified in Table 1.5. Most move and arithmetic instructions can be conditionally executed. Because of the limitation of the instruction word width, conditional executions are not available when carrying long constants as

| Flag | Signed computing                      | Unsigned computing                           |
|------|---------------------------------------|----------------------------------------------|
| AZ   | ALU result is zero                    | ALU result is zero                           |
| AN   | ALU result is negative                | ALU result is zero                           |
| AC   | ALU saturated (or carry out)          | ALU carry out                                |
| AV   | ALU result overflowed                 | No meaning                                   |
| MZ   | MAC result is zero                    | N/A, MAC does not support unsigned computing |
| MN   | MAC result is negative                | N/A, MAC does not support unsigned computing |
| MS   | MAC saturated, sticky                 | N/A, MAC does not support unsigned computing |
| MV   | MAC result overflow (>40bits), sticky | N/A, MAC does not support unsigned computing |

Table 1.5: Flag specification (when flag is “1”)

| Mnemonic | Code  | Description/specification          | Flag test      |
|----------|-------|------------------------------------|----------------|
|          | 00000 | Unconditionally true               | none           |
|          | 00001 | <i>Unused</i>                      |                |
| eq       | 00010 | ALU equal/zero                     | AZ=1           |
| ne       | 00011 | ALU not equal/not zero             | AZ=0           |
| ugt      | 00100 | ALU unsigned greater than          | AC=1 and AZ=0  |
| uge/cs   | 00101 | ALU unsigned greater than or equal | AC=1           |
| ule      | 00110 | ALU unsigned less than or equal    | AC=0 or AZ=1   |
| ult/cc   | 00111 | ALU unsigned less than             | AC=0           |
| sgt      | 01000 | ALU signed greater than            | AN=AV and AZ=0 |
| sge      | 01001 | ALU signed greater than or equal   | AN=AV          |
| sle      | 01010 | ALU signed less than or equal      | AZ=1 or AN≠AV  |
| slt      | 01011 | ALU signed less than               | AN≠AV          |
| mi       | 01100 | ALU negative                       | AN=1           |
| pl       | 01101 | ALU positive                       | AN=0           |
| vs       | 01110 | ALU has overflowed                 | AV=1           |
| vc       | 01111 | ALU has not overflowed             | AV=0           |
| meq      | 10000 | MAC or MUL equal                   | MZ=1           |
| mne      | 10001 | MAC or MUL not equal               | MZ=0           |
| mgt      | 10010 | MAC or MUL greater than            | MN=0 and MZ=0  |
| mge/mp1  | 10011 | MAC or MUL positive or zero        | MN=0           |
| mle      | 10100 | MAC or MUL less than or equal      | MN=1 or MZ=1   |
| mlt/mmi  | 10101 | MAC or MUL negative or less than   | MN=1           |
| mvs      | 10110 | MAC was saturated                  | MS=1           |
| mvc      | 10111 | MAC was not saturated              | MS=0           |

Table 1.6: Testing flags for condition

immediate data or immediate address. Some coding fields are coded for all applications through the manual. CDT is a 5-bit coding field for conditional execution and conditional jumps. All Address computing algorithms are “right-aligned” using either data type without saturation and rounding. There is no overflow check in hardware. Overflow is actually an implicit mode of the  $2^{16}$  operation.

## 1.6 Constants

A binary constant is introduced by %, a decimal by (nothing) and a hexadecimal by \$ or 0x. Therefore % and \$ are only used to denote data types. For example:

- %0001 1111 1111 1111 = \$7FFF = 0x7FFF = 32767 in case of 16 bits
- %0111 11111 1111 = \$7FF = 0x7FF = 2047 in case of 12 bits (right aligned, integer mode)
- %0111 1 = \$F = 0xF = 15 in case of 5 bits

| AM   | Code | Addressing model   | Coding                | Algorithm specification                         |
|------|------|--------------------|-----------------------|-------------------------------------------------|
| INDR | 000  | Reg indirect       | Any R0-R31            | $A \leftarrow R_n$                              |
| INDX | 001  | Indexed            | Any AR0-AR3 or R0-R31 | $A \leftarrow A_{Rn} + R_m$                     |
| INC  | 010  | Post-add           | Any AR0-AR3           | $A \leftarrow A_{Rn}; A_{Rn} += S_{Tn}$         |
| DEC  | 011  | Pre-subtract       | Any AR0-AR3           | $A_{Rn} -= S_{Tn}; A \leftarrow A_{Rn}$         |
| OFS  | 100  | Offset addressing  | Any AR0-AR3, offset   | $A \leftarrow + \text{signed offset}$           |
| MINC | 101  | Post add, mod addr | Any AR0-AR3           | $A \leftarrow A_{Rn}; A_{Rn} += S_{Tn}$         |
| ABS  | 110  | Absolute           | Immediate 16'b        | $A \leftarrow \text{abs}(16\text{b immediate})$ |
| BRV  | 111  | Bit reversal       | Any AR0-AR3           | $A \leftarrow BR(A_{Rn}); A_{Rn} += S_{Tn}$     |

Table 1.7: Addressing modes

## 1.7 Operations

### 1.7.1 Scaling

Scaling operation can be performed for the instructions `move`, `postop`, `mul`, `mac`, `mdm` and `conv`. It operates on the source or destination accumulator (depending on the instruction) together with the instruction operation itself. Table 1.8 lists the optional scale factors.

| Mnemonics | Code | Scale factor | Description          |
|-----------|------|--------------|----------------------|
|           | 000  | 1            | No scaling           |
| mul2      | 001  | 2            | Multiply by 2        |
| mul4      | 010  | 4            | Multiply by 4        |
| div2      | 011  | 0.5          | Divide by 2          |
| div4      | 100  | 0.25         | Divide by 4          |
| div8      | 101  | 0.125        | Divide by 8          |
| div16     | 110  | 0.0625       | Divide by 16         |
| mul65536  | 111  | $2^{16}$     | Multiply by $2^{16}$ |

Table 1.8: Scale factors

### 1.7.2 Rounding

Rounding operation can be performed for the instructions `move`, `addl`, `sublst` and `postop`. It operates on the source or destination accumulator (depending on the instruction) together with the instruction operation itself. Table 1.9 lists the rounding operator.

| Mnemonics | Code | Description                       |
|-----------|------|-----------------------------------|
|           | 0    | No rounding                       |
| rnd       | 1    | Rounding to nearest integer value |

Table 1.9: Rounding

### 1.7.3 Saturation

Saturation operation can be performed for the instructions `move`, `abs`, `absl`, `negl`, `addl`, `subl` and `postop`. It operates on the destination operand together with the instruction operation itself. Table 1.10 lists the saturation operator.

| Mnemonics        | Code | Description                                                 |
|------------------|------|-------------------------------------------------------------|
|                  | 0    | No saturation                                               |
| <code>sat</code> | 1    | Saturate to within signed destination operand bitsize range |

Table 1.10: Saturation

### 1.7.4 8 bit mode

8 bit mode can be performed for the instructions `ld0`, `ld1`, `st0` and `st1`. If the `b8` flag is left out, it will be a normal 16 bit operation. When using the `b8` flag, a store at an even address will result in putting the 8 lsb of the register into the 8 lsb of the memory. A store at an odd address will result in putting the 8 lsb of the register into the 8 msb of the memory. The other 8 bits of the line in memory is not affected. When loading with 8 bit mode, an even address will load from the 8 lsb and an odd address will load from the 8 msb. The loaded data will be put into the 8 lsb of the register and the 8 msb of the register will be cleared. 8 bit mode causes the stepsize of `ar2` and `ar3` to become 1, instead of 2 in normal mode. The stepsize of `ar0` and `ar1` is not affected by 8 bit mode.

## 1.8 Software stack

The software stack is located in DM1 and grows towards lower addresses. `sp++` in this manual will therefore in reality be handled as `sp = sp - 2`. The stack is implicitly used for subroutine calls and interrupts. The special purpose register `sp` is used as the address pointer for the stack and may only be used for post-incremental, pre-decremental and offset addressing using `ld1` and `st1`. The `sp` register must be set before any subroutine calls are made or any interrupts occur.

## 1.9 FFT addressing

The Senior processor contains special hardware for generating addresses during FFT computations. When `f11[0]` is set, all memory accessing instructions on DM0 uses the address from this hardware, no matter the addressing mode specified in the instruction.

If the input samples for the FFT is located in a continuous order in DM0 starting with the real part and then the imaginary part for each sample, then the first generated address will point at the real part of the second input sample to the first butterfly. The next address points to the imaginary part. Then the real and the imaginary part of the first input is pointed out. Next the real part of first input is addressed again followed by the imaginary part and then the second input. This to store the results of the butterfly operation in an in-place FFT. The next generated address points

at the real part of the second input of the next butterfly.

The special register **fftbase** is used to tell the FFT addressing hardware where the input samples are located in DMO. It should be set to the address of the real value of the first input to the first butterfly. This must be set in the beginning of each stage (should be the same value for each stage) as **fftbase** is updated by the hardware on each memory accessing instruction using DMO.

To get the addressing inside a stage correct, the special register **fftstage** is used to tell the FFT addressing hardware which stage of the FFT that is currently being computed. **fftstage** = 1 corresponds to the first stage that is using bit reverse addressing in DIT FFTs. Note that bit reverse addressing is not supported by the FFT addressing hardware.



## 2 Move-load-store instructions

Move-load-store instructions concern operations of 16 bit data transfer between general registers, special registers, accumulators, IO ports and data memories. The move instructions (**move** and **set**) can only work with registers and accumulators (**move** only) for data access, as the load-store instructions (**ld0**, **ld1**, **st0**, **st1**, **dblld**, **dblst**, **in**, **out**) can use various addressing modes for data memory access. Table 2.1 lists the move-load-store group of instructions.

| Mnemonic | Description                                  | Page |
|----------|----------------------------------------------|------|
| move     | copy data to/from registers and accumulators | 16   |
| set      | set constant to register                     | 17   |
| ld0      | load register from data memory 0             | 18   |
| ld1      | load register from data memory 1             | 19   |
| st0      | store register to data memory 0              | 20   |
| st1      | store register to data memory 1              | 21   |
| dblld    | double load                                  | 22   |
| dblst    | double store                                 | 23   |
| in       | read IO port to register                     | 24   |
| out      | write register to IO port                    | 25   |
| dblinc   | write dm0 and dm1 from IO port               | 26   |
| dblout   | write IO port from dm0 and dm1               | 27   |

Table 2.1: move-load-store instructions

# MOVE — move

## Syntax

```

1  move      rd,sra
2  move      srd,ra
3  move[.cdt] rd,[sat] [rnd] [sf] acrA
4  move      acrD[.h|.1],ra
5  move[.cdt] rd,ra
aε[0..31], dε[0..31], Aε[0..3], Dε[0..3]
sfε[mul2|mul4|div2|div4|div8|div16|mul65536]
```

## Operation

```

1  rd←sra
2  srd←ra
3  if cdt is true: rd←saturation(round(scale(acrA)))
4  if h is used: acrD[31..16]←ra, if l is used: acrD[15..0]←ra
5  if cdt is true: rd←ra
```

## Description

Copy contents from source to destination.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
|    | U  |    |    |

## Comment

Syntax no.5 above is implemented as orn[.cdt]ra,rb,rb. Use the instruction exch to exchange contents between special register and register.

## Example

```

move    r1,sr2
move    sr1,r2
move.eq r22,rnd mul2 acr3
move    acr2.h,r12
move    r7,r14
```

Instruction 2.1: MOVE — move

# SET — set register to constant

## Syntax

1    **set**               $rd, K$   
2    **set**               $srd, K$   
 $d \in [0..31], K \in [\$0000..\$FFFF]$

## Operation

1     $rd \leftarrow K$   
2     $srd \leftarrow K$

## Description

Set destination register to an unsigned 16'b constant

## Flags

No flags affected

## Comment

## Example

```
set      r21,711
set      sr12,$2C7
set      r3,%1111000111
```

Instruction 2.2: SET — set register to constant

# LD0 — load register from DM0

## Syntax

|   |                  |                                |
|---|------------------|--------------------------------|
| 1 | <code>ld0</code> | <code>[b8] rd, (rb)</code>     |
| 2 | <code>ld0</code> | <code>[b8] rd, (ara,rb)</code> |
| 3 | <code>ld0</code> | <code>[b8] rd, (arc++)</code>  |
| 4 | <code>ld0</code> | <code>[b8] rd, (--arc)</code>  |
| 5 | <code>ld0</code> | <code>[b8] rd, (ara,K)</code>  |
| 6 | <code>ld0</code> | <code>[b8] rd, (are++%)</code> |
| 7 | <code>ld0</code> | <code>[b8] rd, (L)</code>      |
| 8 | <code>ld0</code> | <code>rd,br(arc)</code>        |

$a \in [0..3]$ ,  $b \in [0..31]$ ,  $c \in [0..3]$ ,  $d \in [0..31]$ ,  $e \in [0..1]$ ,  $K \in [-2048..2047]$ ,  $L \in \$0000..\$7FFF$

## Operation

- 1  $rd \leftarrow DM0(ra)$
- 2  $rd \leftarrow DM0(ar_a + rb)$
- 3  $rd \leftarrow DM0(ar_c)$ , if  $c \in [0..1]$  then  $ar_c \leftarrow ar_c + step_c$  else  $ar_c \leftarrow ar_c + 1/2$
- 4 if  $c \in [0..1]$  then  $ar_c \leftarrow ar_c - step_c$  else  $ar_c \leftarrow ar_c - 1/2$ ,  $rd \leftarrow DM0(ar_c)$
- 5  $rd \leftarrow DM0(ar_a + K)$
- 6  $rd \leftarrow DM0(ar_e)$ , if  $ar_e = tope$  then  $ar_e \leftarrow bote$  else  $ar_e \leftarrow ar_e + step_e$
- 7  $rd \leftarrow DM0(L)$
- 8  $rd \leftarrow DM0(bitrev(ar_c))$ , if  $c \in [0..1]$  then  $ar_c \leftarrow ar_c + step_c$  else  $ar_c \leftarrow ar_c + 2$

## Description

Load a general register with data from data memory 0 using various addressing modes. If 8 bit mode (b8) is used, 8 bits are stored in the register and the 8 msb are cleared.

## Flags

No flags affected

## Comment

Syntax no.5 can be used without the constant  $K$ , in which case it will be assumed to be 0.

Operation no.6: observe that  $ar_e$  will only be set to  $bote$  when the condition  $ar_e = tope$  is met. There is nothing preventing  $ar_e$  from becoming larger than  $tope$  at the  $ar_e \leftarrow ar_e + step_e$  operation.  $1/2$  in stepsize means 1 in 8-bit mode and 2 otherwise.

## Example

`ld0 r1, (ar1,r9)`

Instruction 2.3: LD0 — load register from DM0

# LD1 — load register from DM1

## Syntax

|    |     |                    |
|----|-----|--------------------|
| 1  | ld1 | [b8] rd , (rb)     |
| 2  | ld1 | [b8] rd , (ara,rb) |
| 3  | ld1 | [b8] rd , (arc++)  |
| 4  | ld1 | [b8] rd , (--arc)  |
| 5  | ld1 | [b8] rd , (ara,K)  |
| 6  | ld1 | [b8] rd , (are++%) |
| 7  | ld1 | [b8] rd , (L)      |
| 8  | ld1 | rd , br(arc)       |
| 9  | ld1 | rd , (--sp)        |
| 10 | ld1 | rd , (sp,K)        |

a $\in$ [0..3], b $\in$ [0..31], c $\in$ [0..3], d $\in$ [0..31], e $\in$ [0..1], K $\in$ [-2048..2047], L $\in$ [\$0000..\$7FFF]

## Operation

|    |                                                                                                                                                                                            |
|----|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1  | rd $\leftarrow$ DM1(ra)                                                                                                                                                                    |
| 2  | rd $\leftarrow$ DM1(ar <sub>a</sub> +rb)                                                                                                                                                   |
| 3  | rd $\leftarrow$ DM1(ar <sub>c</sub> ), if c $\in$ [0..1] then ar <sub>c</sub> $\leftarrow$ ar <sub>c</sub> +step <sub>c</sub> else ar <sub>c</sub> $\leftarrow$ ar <sub>c</sub> +1/2       |
| 4  | if c $\in$ [0..1] then ar <sub>c</sub> $\leftarrow$ ar <sub>c</sub> -step <sub>c</sub> else ar <sub>c</sub> $\leftarrow$ ar <sub>c</sub> -1/2, rd $\leftarrow$ DM1(ar <sub>c</sub> )       |
| 5  | rd $\leftarrow$ DM1(ar <sub>a</sub> +K)                                                                                                                                                    |
| 6  | rd $\leftarrow$ DM1(ar <sub>e</sub> ), if are=tope then are $\leftarrow$ bote else are $\leftarrow$ are+step <sub>e</sub>                                                                  |
| 7  | rd $\leftarrow$ DM1(L)                                                                                                                                                                     |
| 8  | rd $\leftarrow$ DM1(bitrev(ar <sub>c</sub> )), if c $\in$ [0..1] then ar <sub>c</sub> $\leftarrow$ ar <sub>c</sub> +step <sub>c</sub> else ar <sub>c</sub> $\leftarrow$ ar <sub>c</sub> +2 |
| 9  | sp $\leftarrow$ sp-1, rd $\leftarrow$ DM1(sp)                                                                                                                                              |
| 10 | rd $\leftarrow$ DM1(sp+K)                                                                                                                                                                  |

## Description

Load a general register with data from data memory 1 using various addressing modes. In 8 bit mode the 8 msb in the register are cleared.

## Flags

No flags affected

## Comment

Syntax no.5 can be used without the constant K, in which case it will be assumed to be 0.

Operation no.6: observe that are will only be set to bote when the condition are=tope is met.

There is nothing preventing are from becoming larger than top<sub>e</sub> at the are  $\leftarrow$  are+step<sub>e</sub> operation.

1/2 in stepsize means 1 in 8-bit mode and 2 otherwise.

## Example

```
ld1      r7, (channel) ;channel is a previously defined constant
```

# ST0 — store register to DM0

## Syntax

|   |            |                  |
|---|------------|------------------|
| 1 | <b>st0</b> | [b8] (rd),rb     |
| 2 | <b>st0</b> | [b8] (arf,rd),rb |
| 3 | <b>st0</b> | [b8] (arg++),rb  |
| 4 | <b>st0</b> | [b8] (--arg),rb  |
| 5 | <b>st0</b> | [b8] (arf,K),rb  |
| 6 | <b>st0</b> | [b8] (arh++%),rb |
| 7 | <b>st0</b> | [b8] (L),rb      |
| 8 | <b>st0</b> | br(arg),rb       |

b $\in$ [0..31], d $\in$ [0..31], f $\in$ [0..3], g $\in$ [0..3], h $\in$ [0..1], K $\in$ [-2048..2047], L $\in$ [\$0000..\$7FFF]

## Operation

- 1 DM0(rd) $\leftarrow$ rb
- 2 DM0(arf+rd) $\leftarrow$ rb
- 3 DM0(arg) $\leftarrow$ rb, if g $\in$ [0..1] then arg $\leftarrow$ arg+stepg else arg $\leftarrow$ arg+1/2
- 4 if g $\in$ [0..1] then arg $\leftarrow$ arg-stepg else arg $\leftarrow$ arg-1/2, DM0(arg) $\leftarrow$ rb
- 5 DM0(arf+K) $\leftarrow$ rb
- 6 DM0(arh) $\leftarrow$ rb, if arh=toph then arh $\leftarrow$ both else arh $\leftarrow$ arh+steph
- 7 DM0(L) $\leftarrow$ rb
- 8 DM0(bitrev(arg)) $\leftarrow$ rb, if c $\in$ [0..1] then arg $\leftarrow$ arg+stepg else arg $\leftarrow$ arg+2

## Description

Store contents of a general register to DM0 (data memory 0) using various addressing modes. When 8 bit mode (b8) is used, the 8 lsb of the register are placed in the memory at given address. The other 8 bits in the same word in the memory are not affected.

## Flags

No flags affected

## Comment

Syntax no.5 can be used without the constant K, in which case it will be assumed to be 0.  
 Operation no.6: observe that arh will only be set to both when the condition arh=toph is met. There is nothing preventing arh from becoming larger than topd at the arh $\leftarrow$ arh+steph operation. 1/2 in stepsize means 1 in 8-bit mode and 2 otherwise.

## Example

st0 (ar2++,r31

Instruction 2.5: ST0 — store register to DM0

# ST1 — store register to DM1

## Syntax

```
1  st1      [b8]  (rd),rb
2  st1      [b8]  (arf,rd),rb
3  st1      [b8]  (arg++),rb
4  st1      [b8]  (--arg),rb
5  st1      [b8]  (arf,K),rb
6  st1      [b8]  (arh++%),rb
7  st1      [b8]  (L),rb
8  st1      br(arg),rb
9  st1      (sp++),rb
10 st1     (sp,K),rb
b[0..31], d[0..31], f[0..3], g[0..3], h[0..1], K[-2048..2047], L[$0000..$7FFF]
```

## Operation

```
1  DM1(rd)←rb
2  DM1(arf+rd)←rb
3  DM1(arg)←rb, if g[0..1] then arg←arg+stepg else arg←arg+1/2
4  if g[0..1] then arg←arg-stepg else arg←arg-1/2, DM1(arg)←rb
5  DM1(arf+K)←rb
6  DM1(arh)←rb, if arh=toph then arh←both else arh←arh+steph
7  DM1(L)←rb
8  DM1(bitrev(arg))←rb, if c[0..1] then arg←arg+stepg else arg←arg+1
9  DM1(sp)←rb, sp←sp+2
10 DM1(sp+K)←rb
```

## Description

Store contents of a general register to DM1 (data memory 1) using various addressing modes. When 8 bit mode (b8) is used, the 8 lsb of the register are placed in the memory at given address. The other 8 bits in the same word in the memory are not affected.

## Flags

No flags affected

## Comment

Syntax no.5 can be used without the constant  $K$ , in which case it will be assumed to be 0.

Operation no.6:  $\text{arh}$  will only be set to  $\text{both}$  when  $\text{arh}=\text{toph}$ . There is nothing preventing  $\text{arh}$  from becoming larger than  $\text{toph}$  at the  $\text{arh}\leftarrow\text{arh}+\text{steph}$  operation.

## Example

```
st1      (ar0++%),r13
```

# DBLLD — double load

## Syntax

```

1   dblld      rd,(ara),rf,(arb)
2   dblld      rd,(ara++),rf,(arb++)
3   dblld      rd,(--ara),rf,(--arb)
4   dblld      rd,(arc++%),rf,(are++%)
5   dblld      rd,br(ar a),rf,br(ar b)
ae[0..3], be[0..3], ce[0..1], de[0..31], ee[0..1], fe[0..31]

```

## Operation

- 1  $rd \leftarrow DM0(ar_a)$  and  $rf \leftarrow DM1(ar_b)$
- 2  $rd \leftarrow DM0(ar_a)$ , if  $ae[0..1]$  then  $ar_a \leftarrow ar_a + step_a$  else  $ar_a \leftarrow ar_a + 2$ , and  
 $rf \leftarrow DM1(ar_b)$ , if  $be[0..1]$  then  $ar_b \leftarrow ar_b + step_b$  else  $ar_b \leftarrow ar_b + 2$
- 3 if  $ae[0..1]$  then  $ar_a \leftarrow ar_a - step_a$  else  $ar_a \leftarrow ar_a - 2$ ,  $rd \leftarrow DM0(ar_a)$ , and  
if  $be[0..1]$  then  $ar_b \leftarrow ar_b - step_b$  else  $ar_b \leftarrow ar_b - 2$ ,  $rf \leftarrow DM1(ar_b)$
- 4  $rd \leftarrow DM0(ar_c)$ , if  $arc=top_c$  then  $arc \leftarrow bot_c$  else  $arc \leftarrow arc + step_c$ , and  
 $rf \leftarrow DM1(ar_e)$ , if  $are=top_e$  then  $are \leftarrow bot_e$  else  $are \leftarrow are + step_e$
- 5  $rd \leftarrow DM0(bitrev(ar_a))$ , if  $ae[0..1]$  then  $ar_a \leftarrow ar_a + step_a$  else  $ar_a \leftarrow ar_a + 2$ , and  
 $rf \leftarrow DM1(bitrev(ar_b))$ , if  $be[0..1]$  then  $ar_b \leftarrow ar_b + step_b$  else  $ar_b \leftarrow ar_b + 2$

## Description

Load one general register with data from DM0 (data memory 0) and another general register with data from DM1 (data memory 1).

## Flags

No flags affected

## Comment

In the syntax section the supported addressing modes are listed. DM0 and DM1 do not have to use the same addressing mode.

## Example

```
dblld    r1,(ar1),r9,(ar0++)
```

Instruction 2.7: DBLLD — double load

# DBLST — double store

## Syntax

```
1  dblst      (ard),ra,(arf),rb
2  dblst      (ard++),ra,(arf++),rb
3  dblst      (--ard),ra,(--arf),rb
4  dblst      (arg++%),ra,(arh++%),rb
5  dblst      br(ard),ra,br(arf),rb
ae[0..31], be[0..31], de[0..3], fe[0..3], ge[0..1], he[0..1]
```

## Operation

- 1  $DM0(\text{ard}) \leftarrow ra$  and  $DM1(\text{arf}) \leftarrow rb$
- 2  $DM0(\text{ard}) \leftarrow ra$ , if  $de[0..1]$  then  $\text{ard} \leftarrow \text{ard} + \text{step}_d$  else  $\text{ard} \leftarrow \text{ard} + 2$ , and  
 $DM1(\text{arf}) \leftarrow rb$ , if  $fe[0..1]$  then  $\text{arf} \leftarrow \text{arf} + \text{step}_f$  else  $\text{arf} \leftarrow \text{arf} + 2$
- 3 if  $de[0..1]$  then  $\text{ard} \leftarrow \text{ard} - \text{step}_d$  else  $\text{ard} \leftarrow \text{ard} - 2$ ,  $DM0(\text{ard}) \leftarrow ra$ , and  
if  $fe[0..1]$  then  $\text{arf} \leftarrow \text{arf} - \text{step}_f$  else  $\text{arf} \leftarrow \text{arf} - 2$ ,  $DM1(\text{arf}) \leftarrow rb$
- 4  $DM0(\text{arg}) \leftarrow ra$ , if  $\text{arg} = \text{top}_g$  then  $\text{arg} \leftarrow \text{bot}_g$  else  $\text{arg} \leftarrow \text{arg} + \text{step}_g$ , and  
 $DM1(\text{arh}) \leftarrow rb$ , if  $\text{arh} = \text{top}_h$  then  $\text{arh} \leftarrow \text{both}$  else  $\text{arh} \leftarrow \text{arh} + \text{step}_h$
- 5  $DM0(\text{bitrev}(\text{ard})) \leftarrow ra$ , if  $de[0..1]$  then  $\text{ard} \leftarrow \text{ard} + \text{step}_d$  else  $\text{ard} \leftarrow \text{ard} + 2$ , and  
 $DM1(\text{bitrev}(\text{arf})) \leftarrow rb$ , if  $fe[0..1]$  then  $\text{arf} \leftarrow \text{arf} + \text{step}_f$  else  $\text{arf} \leftarrow \text{arf} + 2$

## Description

Store contents of one general register to DM0 (data memory 0) and contents of another general register to DM1 (data memory 1).

## Flags

No flags affected

## Comment

In the syntax section the supported addressing modes are listed. DM0 and DM1 do not have to use the same addressing mode.

## Example

```
dblst  (ar2++),r31,br(ar0),r29
```

Instruction 2.8: DBLST — double store

# IN — read IO port to register

## Syntax

```
1   in          rd,K  
dε[0..31], Kε[$0000..$FFFF]
```

## Operation

```
1   rd←PORT(K)
```

## Description

Read data from an IO port and store in a general register.

## Flags

No flags affected

## Comment

Note: The address is more than the peripheral number. Read the *Senior Peripheral Firmware User Manual* for more information.

## Example

```
in      r12,$0010
```

Instruction 2.9: IN — read IO port to register

# OUT — write register to IO port

## Syntax

```
1    out      K,ra  
aε[0..31], Kε[$0000..$FFFF]
```

## Operation

```
1    PORT(K)←ra
```

## Description

Write data from a general register to an IO port

## Flags

No flags affected

## Comment

Note: The address is more than the peripheral number. Read the *Senior Peripheral Firmware User Manual* for more information.

## Example

```
out      $0012, r10
```

Instruction 2.10: OUT — write register to IO port

# DBLIN — IO to DM0 and DM1

## Syntax

```

1  dblin      (ard),(arf),K
2  dblin      (ard++),(arf++),K
3  dblin      (--ard),(--arf),K
4  dblin      (arg++%),(arh++%),K
5  dblin      br(ard),br(arf),K
dε[0..3], fε[0..3], gε[0..1], hε[0..1], Kε[0..63]

```

## Operation

- 1  $DM0(\text{ard}) \leftarrow PORT0(K)$  and  $DM1(\text{arf}) \leftarrow PORT1(K)$
- 2  $DM0(\text{ard}) \leftarrow PORT0(K)$ , if  $d\epsilon[0..1]$  then  $\text{ard} \leftarrow \text{ard} + \text{step}_d$  else  $\text{ard} \leftarrow \text{ard} + 2$ , and  
 $DM1(\text{arf}) \leftarrow PORT1(K)$ , if  $f\epsilon[0..1]$  then  $\text{arf} \leftarrow \text{arf} + \text{step}_f$  else  $\text{arf} \leftarrow \text{arf} + 2$
- 3 if  $d\epsilon[0..1]$  then  $\text{ard} \leftarrow \text{ard} - \text{step}_d$  else  $\text{ard} \leftarrow \text{ard} - 2$ ,  $DM0(\text{ard}) \leftarrow PORT0(K)$ , and  
if  $f\epsilon[0..1]$  then  $\text{arf} \leftarrow \text{arf} - \text{step}_f$  else  $\text{arf} \leftarrow \text{arf} - 2$ ,  $DM1(\text{arf}) \leftarrow PORT1(K)$
- 4  $DM0(\text{arg}) \leftarrow PORT0(K)$ , if  $\text{arg} = \text{top}_g$  then  $\text{arg} \leftarrow \text{bot}_g$  else  $\text{arg} \leftarrow \text{arg} + \text{step}_g$ , and  
 $DM1(\text{arh}) \leftarrow PORT1(K)$ , if  $\text{arh} = \text{top}_h$  then  $\text{arh} \leftarrow \text{bot}_h$  else  $\text{arh} \leftarrow \text{arh} + \text{step}_h$
- 5  $DM0(\text{bitrev}(\text{ard})) \leftarrow PORT0(K)$ , if  $d\epsilon[0..1]$  then  $\text{ard} \leftarrow \text{ard} + \text{step}_d$  else  $\text{ard} \leftarrow \text{ard} + 2$ , and  
 $DM1(\text{bitrev}(\text{arf})) \leftarrow PORT1(K)$ , if  $f\epsilon[0..1]$  then  $\text{arf} \leftarrow \text{arf} + \text{step}_f$  else  $\text{arf} \leftarrow \text{arf} + 2$

## Description

Store I/O input data to DM0 and DM1. The 16 lsb of the I/O input data is stored in DM0 and the 16 msb is stored in DM1.

## Flags

No flags affected

## Comment

DM0 and DM1 do not have to use the same addressing mode. The address constant should be the peripheral address, and unlike the address used for IN and OUT it shouldn't contain special targeted registers in the lsb.

## Example

```
dblin      (ar2++),br(ar0),0x14
```

Instruction 2.11: DBLIN — IO to DM0 and DM1

# DBLOUT — IO from DM0 and DM1

## Syntax

```
1  dblout      K,(ara),(arb)
2  dblout      K,(ara++),(arb++)
3  dblout      K,(--ara),(--arb)
4  dblout      K,(arc++%),(are++%)
5  dblout      K,br(ar a),br(ar b)
aε[0..3], bε[0..3], cε[0..1], eε[0..1], Kε[0..63]
```

## Operation

- 1 PORT0(K)←DM0(**ara**) and PORT1(K)←DM1(**arb**)
- 2 PORT0(K)←DM0(**ara**), if  $a\in[0..1]$  then  $\text{ara} \leftarrow \text{ara} + \text{step}_a$  else  $\text{ara} \leftarrow \text{ara} + 2$ , and  
PORT1(K)←DM1(**arb**), if  $b\in[0..1]$  then  $\text{arb} \leftarrow \text{arb} + \text{step}_b$  else  $\text{arb} \leftarrow \text{arb} + 2$
- 3 if  $a\in[0..1]$  then  $\text{ara} \leftarrow \text{ara} - \text{step}_a$  else  $\text{ara} \leftarrow \text{ara} - 2$ , PORT0(K)←DM0(**ara**), and  
if  $b\in[0..1]$  then  $\text{arb} \leftarrow \text{arb} - \text{step}_b$  else  $\text{arb} \leftarrow \text{arb} - 2$ , PORT1(K)←DM1(**arb**)
- 4 PORT0(K)←DM0(**arc**), if  $\text{arc} = \text{top}_c$  then  $\text{arc} \leftarrow \text{bot}_c$  else  $\text{arc} \leftarrow \text{arc} + \text{step}_c$ , and  
PORT1(K)←DM1(**are**), if  $\text{are} = \text{top}_e$  then  $\text{are} \leftarrow \text{bot}_e$  else  $\text{are} \leftarrow \text{are} + \text{step}_e$
- 5 PORT0(K)←DM0(bitrev(**ara**)), if  $a\in[0..1]$  then  $\text{ara} \leftarrow \text{ara} + \text{step}_a$  else  $\text{ara} \leftarrow \text{ara} + 2$ , and  
PORT1(K)←DM1(bitrev(**arb**)), if  $b\in[0..1]$  then  $\text{arb} \leftarrow \text{arb} + \text{step}_b$  else  $\text{arb} \leftarrow \text{arb} + 2$

## Description

Output data from DM0 to the 16 lsb of I/O data out and output data from DM1 to the 16 msb of I/O data out.

## Flags

No flags affected

## Comment

DM0 and DM1 do not have to use the same addressing mode. The address constant should only contain the peripheral number.

## Example

```
dblout 0x5,(ar1),(ar0++)
```

Instruction 2.12: DBLOUT — IO from DM0 and DM1

# OUT0 — out DM0 data to IO port

## Syntax

```

1  out0      K,ara
2  out0      K,(ara++)
3  out0      K,(--ara)
aε[0..3], Kε[$0000..$FFFF]

```

## Operation

```

1  PORT(K)←DM0(ara)
2  PORT(K)←DM0(ara),if aε[0..1] then ar a←ara+step a else ar a←ara+2
3  if aε[0..1] then ar a←ara-step a else ar a←ara-2, PORT(K)←DM0(ara)

```

## Description

Write data from DM0 to an IO port, DM0 address is from register **ara**, output data is 16 bit

## Flags

No flags affected

## Comment

Note: The address is more than the peripheral number. Read the *Senior Peripheral Firmware User Manual* for more information.

## Example

```
out0    $0012, (ar0++)
```

Instruction 2.13: OUT0 — out DM0 data to IO port

# OUT1 — out DM1 data to IO port

## Syntax

```
1  out1      K,ara
2  out1      K,(ara++)
3  out1      K,(--ara)
aε[0..3], Kε[$0000..$FFFF]
```

## Operation

```
1  PORT(K)←DM1(ara)
2  PORT(K)←DM1(ara),if aε[0..1] then ara←ara+stepa else ara←ara+2
3  if aε[0..1] then ara←ara-stepa else ara←ara-2, PORT(K)←DM1(ara)
```

## Description

Write data from DM1 to an IO port, DM1 address is from register **ara**, output data is 16 bit

## Flags

No flags affected

## Comment

Note: The address is more than the peripheral number. Read the *Senior Peripheral Firmware User Manual* for more information.

## Example

```
out1    $0012, (ar3)
```

Instruction 2.14: OUT1 — out DM1 data to IO port



### 3 Short arithmetic instructions, 16'b

The short arithmetic instructions concern 16 bit arithmetic operations. All instructions can make use of conditional execution (depending on ALU/MAC flags status) when no constant operands are used. Because of code size limitations constant operands are restricted to a size of 12 bits, but always sign extended to 16 bits before use. Only the **cmp** instruction can carry a 16 bit constant. Table 3.1 lists the short arithmetic instructions.

| Mnemonic | Description                                                | Page |
|----------|------------------------------------------------------------|------|
| add      | addition of registers and constants                        | 32   |
| addn     | addition of registers and constants with no flag change    | 33   |
| addc     | addition of registers and constants with carry             | 34   |
| adds     | addition of registers and constants with saturation        | 35   |
| sub      | subtraction of registers and constants                     | 36   |
| subn     | subtraction of registers and constants with no flag change | 37   |
| subc     | subtraction of registers and constants with carry          | 38   |
| subs     | subtraction of registers and constants with saturation     | 39   |
| cmp      | compare registers and constants                            | 40   |
| max      | return maximum of registers and constants                  | 41   |
| min      | return minimum of registers and constants                  | 42   |
| abs      | return absolute of a register                              | 43   |
| ext      | 8 to 16 bit sign extension on registers                    | 44   |

Table 3.1: short arithmetic instructions

# ADD — addition

## Syntax

```

1  add[.cdt]      rd,ra,rb
2  add           rd,K,rb
3  add           rd,rb,K
    ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]

```

## Operation

```

1  if cdt is true: rd ← ra + rb
2,3 rd ← K + rb

```

## Description

1. Add general registers using a condition
- 2,3. Add a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```

add.ne  r7,r12          ;the same as add.ne r7,r12,r7
add     r23,r0,-31
add     r17,%11001001    ;the same as add r17,%11001001,r17
add     r2,c_tick,r6      ;c_tick is previously defined as a constant

```

Instruction 3.1: ADD — addition

# ADDN — addition with no flag change

## Syntax

```
1  addn[.cdt]    rd,ra,rb
2  addn          rd,K,rb
3  addn          rd,rb,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]
```

## Operation

```
1  if cdt is true: rd ← ra + rb
2,3 rd ← K + rb
```

## Description

1. Add general registers using a condition
- 2,3. Add a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

No flags affected

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
addn.ne r7,r12           ;the same as addn.ne r7,r12,r7
addn   r23,r0,-31
addn   r17,%11001001      ;the same as addn r17,%11001001,r17
addn   r2,c_tick,r6       ;c_tick is previously defined as a constant
```

Instruction 3.2: ADDN — addition with no flag change

# ADDC — addition with carry

## Syntax

```

1  addc[.cdt]    rd,ra,rb
2  addc          rd,K,rb
3  addc          rd,rb,K
    ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]

```

## Operation

```

1  if cdt is true: rd←ra+rb+AC
2,3 rd←K+rb+AC

```

## Description

1. Add general registers and carry bit using a condition
- 2,3. Add a 12 bit signed constant (with 16 bit signed extension), a general register and carry bit

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```

addc.ne r7,r12           ;the same as addc.ne r7,r12,r7
addc   r23,r0,-31
addc   r17,%11001001      ;the same as addc r17,%11001001,r17
addc   r2,c_tick,r6       ;c_tick is previously defined as a constant

```

Instruction 3.3: ADDC — addition with carry

# ADDS — addition with saturation

## Syntax

```
1  adds[.cdt]    rd,ra,rb
2  adds          rd,K,rb
3  adds          rd,rb,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]
```

## Operation

```
1  if cdt is true: rd←saturation(ra+rb)0
2,3 rd←saturation(K+rb)
```

## Description

1. Add with saturation general registers using a condition
- 2,3. Add with saturation a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register ( $ra$  or  $rb$ ) in the syntax, in which case the destination register ( $rd$ ) will be used in its place.

## Example

```
adds.ne r7,r12           ;the same as adds.ne r7,r12,r7
adds     r23,r0,-31
adds     r17,%11001001    ;the same as adds r17,%11001001,r17
adds     r2,c_tick,r6      ;c_tick is previously defined as a constant
```

Instruction 3.4: ADDS — addition with saturation

# SUB — subtraction

## Syntax

```

1  sub[.cdt]      rd,ra,rb
2  sub           rd,K,rb
  ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]

```

## Operation

- 1 if *cdt* is true:  $rd \leftarrow ra - rb$
- 2  $rd \leftarrow K - rb$

## Description

- 1 Subtract general registers using a condition
- 2 Subtract a general register from a 12 bit signed constant (with 16 bit signed extension)

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (*ra* or *rb*) in the syntax, in which case the destination register (*rd*) will be used in its place.

## Example

```

sub.mi  r3,r4          ;the same as sub.mi r3,r4,r3
sub     r14,0x7A,r0

```

Instruction 3.5: SUB — subtraction

# SUBN — subtraction with no flag change

## Syntax

```
1  subn[.cdt]    rd,ra,rb  
2  subn          rd,K,rb  
  ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]
```

## Operation

- 1 if *cdt* is true:  $rd \leftarrow ra - rb$
- 2  $rd \leftarrow K - rb$

## Description

- 1 Subtract general registers using a condition
- 2 Subtract a general register from a 12 bit signed constant (with 16 bit signed extension)

## Flags

No flags affected

## Comment

It is possible to omit one source register (*ra* or *rb*) in the syntax, in which case the destination register (*rd*) will be used in its place.

## Example

```
subn.mi  r3,r4          ;the same as subn.mi r3,r4,r3  
subn     r14,0x7A,r0
```

Instruction 3.6: SUBN — subtraction with no flag change

# SUBC — subtraction with carry

## Syntax

```

1  subc[.cdt]    rd,ra,rb
2  subc          rd,K,rb
  ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]

```

## Operation

- 1 if *cdt* is true:  $rd \leftarrow ra - rb - 1 + AC = ra + rb \oplus \$FFFF + AC$
- 2  $rd \leftarrow K - rb - 1 + AC = K + rb \oplus \$FFFF + AC$

## Description

1. Subtract general registers and add carry if condition is true
2. Subtract a general register from a 12 bit signed constant (with 16 bit signed extension) and add carry

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (*ra* or *rb*) in the syntax, in which case the destination register (*rd*) will be used in its place.

## Example

```

subc.mi  r3,r4           ;the same as subc.mi r3,r4,r3
subc     r14,0x7A,r0

```

Instruction 3.7: SUBC — subtraction with carry

# SUBS — subtraction with saturation

## Syntax

```
1  subs[.cdt]    rd,ra,rb  
2  subs          rd,K,rb  
  ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]
```

## Operation

- 1 if *cdt* is true:  $rd \leftarrow \text{saturation}(ra - rb)$
- 2  $rd \leftarrow \text{saturation}(K - rb)$

## Description

1. Subtract with saturation general registers using a condition
2. Subtract with saturation a general register from a 12 bit signed constant (with 16 bit signed extension)

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (*ra* or *rb*) in the syntax, in which case the destination register (*rd*) will be used in its place.

## Example

```
subs.mi  r3,r4           ;the same as subs.mi r3,r4,r3  
subs     r14,0x7A,r0
```

Instruction 3.8: SUBS — subtraction with saturation

# CMP — compare

## Syntax

```

1   cmp[.cdt]      ra,rb
2   cmp            K,rb
ae[0..31], be[0..31], Ke[-32768..32767]

```

## Operation

- 1 if *cdt* is true: ALUflags  $\curvearrowright$  **ra - rb**
- 2 ALUflags  $\curvearrowright$  **K - rb**

## Description

1. Compare contents of general registers using a condition
2. Compare a constant and a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```

cmp.pl  r7,r12
cmp    -23456,r3

```

Instruction 3.9: CMP — compare

# MAX — return maximum value

## Syntax

```
1  max[.cdt]      rd,ra,rb
2  max            rd,K,ra
3  max            rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]
```

## Operation

```
1  if cdt is true: rd←signed maximum(ra,rb)
2,3 rd←signed maximum(ra,K)
```

## Description

1. Return maximum of the general source registers, using a condition
- 2,3. Return maximum of a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

No flags affected

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
max.eq    r11,r7          ;the same as max.eq r11,r7,r11
max      r3,$F44          ;the same as max r3,$F44,r3
max      r1,r2,-3
```

Instruction 3.10: MAX — return maximum value

# MIN — return minimum value

## Syntax

```

1  min[.cdt]      rd,ra,rb
2  min           rd,K,ra
3  min           rd,ra,K
    ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]

```

## Operation

```

1  if cdt is true: rd ← signed minimum(ra,rb)
2,3 rd ← signed minimum(ra,K)

```

## Description

1. Return minimum of the general source registers, using a condition
- 2,3. Return minimum of a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

No flags affected

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```

min.eq   r11,r7          ;the same as min.eq r11,r7,r11
min      r3,$F44          ;the same as min r3,$F44,r3
min      r1,r2,-3

```

Instruction 3.11: MIN — return minimum value

# ABS — return absolute value

## Syntax

```
1 abs[.cdt]      [sat] rd,ra  
aε[0..31], dε[0..31]
```

## Operation

1 if *cdt* is true:  $rd \leftarrow \text{absolute}(ra)$  or  $rd \leftarrow \text{absolute}(\text{saturation}(ra))$

## Description

Return absolute value of an optionally saturated general register, using a condition

## Flags

No flags affected

## Comment

Observe: for the special case when  $ra = \$8000$  saturation is needed

## Example

```
abs      r12,r13
```

Instruction 3.12: ABS — return absolute value

# EXT — 8 to 16 bit sign extension

## Syntax

```
1   ext[.cdt]      rd,ra
    ae[0..31], de[0..31]
```

## Operation

1 if *cdt* is true:  $rd \leftarrow \text{sign\_extension}(ra[7..0])$

## Description

1. Extending a 8 bit value into a 16 bit value, using a condition.

## Flags

No flags affected

## Comment

It is possible to omit the source register (**ra**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
ext.eq  r3,r2
ext      r1           ;Write back to r1
```

Instruction 3.13: EXT — 8 to 16 bit sign extension

## 4 Short logic instructions, 16'b

The short logic instructions concern 16 bit logic operations. All instructions can make use of conditional execution (depending on ALU/MAC flags status) when no constant operands are used. Because of code size limitations constant operands are restricted to a size of 12 bits, but always sign extended to 16 bits before use. Table 4.1 lists the short logic instructions.

| Mnemonic | Description                                                   | Page |
|----------|---------------------------------------------------------------|------|
| and      | logic and between registers and constants                     | 46   |
| andn     | logic and between registers and constants with no flag change | 47   |
| or       | logic or between registers and constants                      | 48   |
| orn      | logic or between registers and constants with no flag change  | 49   |
| xor      | logic xor between registers and constants                     | 50   |
| xorn     | logic xor between registers and constants with no flag change | 51   |

Table 4.1: short logic instructions

# AND — logic and

## Syntax

```

1  and[.cdt]      rd,ra,rb
2  and            rd,K,ra
3  and            rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]

```

## Operation

```

1  if cdt is true: rd←ra•rb
2,3 rd←ra•K

```

## Description

1. Logic and between general registers, using a condition
- 2,3. Logic and between a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```

and.ugt  r3,r2          ;the same as and.ugt r3,r2,r3
and      r14,$83A        ;the constant $83A will sign extend to $F83A

```

Instruction 4.1: AND — logic and

# ANDN — logic and with no flag change

## Syntax

```
1  andn[.cdt]    rd,ra,rb
2  andn          rd,K,ra
3  andn          rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]
```

## Operation

```
1  if cdt is true: rd←ra•rb
2,3 rd←ra•K
```

## Description

1. Logic and between general registers, using a condition
- 2,3. Logic and between a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

No flags affected

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
andn.ugt r3,r2          ;the same as andn.ugt r3,r2,r3
andn      r14,$83A       ;the constant $83A will sign extend to $F83A
```

Instruction 4.2: ANDN — logic and with no flag change

# OR — logic or

## Syntax

```

1  or[.cdt]      rd,ra,rb
2  or            rd,K,ra
3  or            rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]

```

## Operation

```

1  if cdt is true: rd←ra∨rb
2,3 rd←ra∨K

```

## Description

1. Logic or between general registers, using a condition
- 2,3. Logic or between a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```

or.ugt    r3,r2          ;the same as or.ugt r3,r2,r3
or        r14,$83A        ;the constant $83A will sign extend to $F83A

```

Instruction 4.3: OR — logic or

# ORN — logic or with no flag change

## Syntax

```
1  orn[.cdt]      rd,ra,rb
2  orn           rd,K,ra
3  orn           rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]
```

## Operation

```
1  if cdt is true: rd←ra∨rb
2,3 rd←ra∨K
```

## Description

1. Logic or between general registers, using a condition
- 2,3. Logic or between a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

No flags affected

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
orn.ugt r3,r2          ;the same as orn.ugt r3,r2,r3
orn      r14,$83A       ;the constant $83A will sign extend to $F83A
```

Instruction 4.4: ORN — logic or with no flag change

# XOR — logic Xor

## Syntax

```

1  xor[.cdt]      rd,ra,rb
2  xor            rd,K,ra
3  xor            rd,ra,K
    ae[0..31], be[0..31], de[0..31], Ke[-2048..2047]

```

## Operation

```

1  if cdt is true: rd←ra⊕rb
2,3 rd←ra⊕K

```

## Description

1. Logic xor between general registers, using a condition
- 2,3. Logic xor between a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```

xor.ugt  r3,r2          ;the same as xor.ugt r3,r2,r3
xor      r14,$83A        ;the constant $83A will sign extend to $F83A

```

Instruction 4.5: XOR — logic Xor

# XORN — logic xor with no flag change

## Syntax

```
1  xorn[.cdt]    rd,ra,rb
2  xorn          rd,K,ra
3  xorn          rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[-2048..2047]
```

## Operation

```
1  if cdt is true: rd←ra⊕rb
2,3 rd←ra⊕K
```

## Description

1. Logic xor between general registers, using a condition
- 2,3. Logic xor between a 12 bit signed constant (with 16 bit signed extension) and a general register

## Flags

No flags affected

## Comment

It is possible to omit one source register (**ra** or **rb**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
xorn.ugt r3,r2          ;the same as xorn.ugt r3,r2,r3
xorn      r14,$83A        ;the constant $83A will sign extend to $F83A
```

Instruction 4.6: XORN — logic xor with no flag change



## 5 Short shift instructions, 16'b

The short shift instructions concern 16 bit shift and rotate operations. All instructions can make use of conditional execution (depending on ALU/MAC flags status). All instructions use either a register or a constant to note the number of shift/rotate operations.

**OBSERVE**, when more than 16 shift/rotate operations are used results may be unexpected.  
Table 5.1 lists the short shift instructions.

| Mnemonic | Description                | Page |
|----------|----------------------------|------|
| asr      | arithmetic shift right     | 54   |
| asl      | arithmetic shift left      | 55   |
| lsr      | logic shift right          | 56   |
| lsl      | logic shift left           | 57   |
| ror      | rotate right               | 58   |
| rol      | rotate left                | 59   |
| rcr      | rotate right through carry | 60   |
| rcl      | rotate left through carry  | 61   |

Table 5.1: short shift instructions

# ASR — arithmetic shift right

## Syntax

```
1  asr[.cdt]      rd,ra,rb
2  asr[.cdt]      rd,ra,K
    ae[0..31], be[0..31], de[0..31], Ke[0..31]
```

## Operation



## Description

**ra** is the register to be shifted, **rb** or **K** holds the number of steps to shift, the result is stored in register **rd**

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (**ra**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
asr.ne   r12,7           ;the same as asr.ne r12,r12,7
```

Instruction 5.1: ASR — arithmetic shift right

## ASL — arithmetic shift left

# Syntax

```

1  asl[.cdt]      rd,ra,rb
2  asl[.cdt]      rd,ra,K
a[0..31], b[0..31], d[0..31], K[0..31]

```

# Operation

← 15 14 ... 1 0 →

1,2      ← 0

## Description

**rs** is the register to be shifted, **rb** or **K** holds the number of steps to shift, the result is stored in register **rd**

# Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (`ra`) in the syntax, in which case the destination register (`rd`) will be used in its place.

## Example

as1.eq r12,r3 ;the same as as1.eq r12,r12,r3

Instruction 5.2: ASL — arithmetic shift left

# LSR — logic shift right

## Syntax

```
1 lsr[.cdt]      rd,ra,rb
2 lsr[.cdt]      rd,ra,K
    ae[0..31], be[0..31], de[0..31], Ke[0..31]
```

## Operation



## Description

**ra** is the register to be shifted, **rb** or **K** holds the number of steps to shift, the result is stored in register **rd**

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (**ra**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
lsr.ne   r4,22           ;the same as lsr.ne r4,r4,22
```

Instruction 5.3: LSR — logic shift right

# LSL — logic shift left

# Syntax

```

1  lsl[.cdt]      rd,ra,rb
2  lsl[.cdt]      rd,ra,K
a[0..31], b[0..31], d[0..31], K[0..31]

```

## Operation



## Description

**rs** is the register to be shifted, **rb** or **K** holds the number of steps to shift, the result is stored in register **rd**

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (`ra`) in the syntax, in which case the destination register (`rd`) will be used in its place.

## Example

lsl.eq r3,7 ;the same as lsl.eq r3,r3,7

Instruction 5.4: LSL — logic shift left

# ROR — rotate right

## Syntax

```
1  ror[.cdt]      rd,ra,rb
2  ror[.cdt]      rd,ra,K
    ae[0..31], be[0..31], de[0..31], Ke[0..15]
```

## Operation



## Description

**ra** is the register to be rotated, **rb** or **K** holds the number of steps to rotate, the result is stored in register **rd**

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (**ra**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
ror.ne   r5,2           ;the same as ror.ne r5,r5,2
```

Instruction 5.5: ROR — rotate right

# ROL — rotate left

## Syntax

1    `rol[.cdt]      rd,ra,rb`  
2    `rol[.cdt]      rd,ra,K`  
 $a \in [0..31], b \in [0..31], d \in [0..31], K \in [0..15]$

## Operation



## Description

`ra` is the register to be rotated, `rb` or `K` holds the number of steps to rotate, the result is stored in register `rd`

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (`ra`) in the syntax, in which case the destination register (`rd`) will be used in its place.

## Example

`rol.eq r15,r1                                  ;the same as rol.eq r15,r15,r1`

Instruction 5.6: ROL — rotate left

# RCR — rotate right through carry

## Syntax

```
1  rcr[.cdt]      rd,ra,rb
2  rcr[.cdt]      rd,ra,K
    ae[0..31], be[0..31], de[0..31], Ke[0..15]
```

## Operation



## Description

**ra** is the register to be rotated, **rb** or **K** holds the number of steps to rotate, the result is stored in register **rd**

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (**ra**) in the syntax, in which case the destination register (**rd**) will be used in its place.

## Example

```
rcr.ne    r7,7           ;the same as rcr.ne r7,r7,7
```

Instruction 5.7: RCR — rotate right through carry

# RCL — rotate left through carry

# Syntax

```

1   rcl[.cdt]      rd,ra,rb
2   rcl[.cdt]      rd,ra,K
aε[0..31], bε[0..31], dε[0..31], Kε[0..15]

```

# Operation



## Description

**ra** is the register to be rotated, **rb** or **K** holds the number of steps to rotate, the result is stored in register **rd**

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
|    | U  | U  | U  |

## Comment

It is possible to omit the source register (`ra`) in the syntax, in which case the destination register (`rd`) will be used in its place.

## Example

rcl.eq r17,r2 ;the same as rcl.eq r17,r17,r2

Instruction 5.8: RCL — rotate left through carry



## 6 Long arithmetic instructions, 32'b

The long arithmetic instructions concern 32 bit arithmetic operations. All instructions (except **sublst0**, **sublst1**, **mul**, **mulld0**, **mulld1**, **muldblld**, **mac**, **macld0**, **macld1**, **mdm** and **conv**) can make use of conditional execution (depending on ALU/MAC flags status). Operations are carried out with 8 guard bits on all operands giving 40 bits internal resolution in the MAC unit. Table 6.1 lists the long arithmetic instructions.

| Mnemonic | Description                                               | Page |
|----------|-----------------------------------------------------------|------|
| addl     | long addition between accumulators and registers          | 64   |
| subl     | long subtraction between accumulators and registers       | 65   |
| sublst0  | long subtraction and store register to data memory 0      | 66   |
| sublst1  | long subtraction and store register to data memory 1      | 67   |
| cmpl     | long compare between accumulators and registers           | 68   |
| absl     | long absolute of accumulator or register                  | 69   |
| negl     | long negation of accumulator or register                  | 70   |
| moveL    | long move of register to accumulator                      | 71   |
| clr      | clear of accumulator                                      | 72   |
| postop   | post accumulator operation                                | 73   |
| mul      | multiply                                                  | 74   |
| mulld0   | multiply and load register from data memory 0             | 75   |
| mulld1   | multiply and load register from data memory 1             | 76   |
| muldblld | multiply and double load                                  | 77   |
| mac      | multiply and accumulate                                   | 78   |
| macld0   | multiply, accumulate and load register from data memory 0 | 79   |
| macld1   | multiply, accumulate and load register from data memory 1 | 80   |
| mdm      | multiply and diminish                                     | 81   |
| conv     | convolution                                               | 82   |

Table 6.1: long arithmetic instructions

# ADDL — long addition

## Syntax

```

1  addl[.cdt]    acrD,acrA,acrB
2  addl[.cdt]    acrD,acrA,rb:0
3  addl[.cdt]    acrD,acrA,1:rb
4  addl[.cdt]    acrD,acrA,ra:rb
5  addl[.cdt]    [sat] [rnd] rd,acrA,rb:0
6  addl[.cdt]    [sat] [rnd] rd,acrA,1:rb
ae[0..31], be[0..31], de[0..31], Aε[0..3], Bε[0..3], Dε[0..3]

```

## Operation

```

1  if cdt is true: acrD ← acrA + acrB
2  if cdt is true: acrD ← acrA + zero_extension(rb)
3  if cdt is true: acrD ← acrA + sign_extension(rb)
4  if cdt is true: acrD ← acrA + register_extension(rb)
5  if cdt is true: rd ← saturation(round(acrA + zero_extension(rb)))
6  if cdt is true: rd ← saturation(round(acrA + sign_extension(rb)))

```

## Description

Operations are carried out with 8 guard bits on all operands

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```

addl.meq acr2,acr1,acr0
addl      acr1,acr3,r2:0
addl.mvs sat rnd r15,acr0,1:r4

```

Instruction 6.1: ADDL — long addition

# SUBL — long subtraction

## Syntax

```
1  subl[.cdt]    acrD,acrA,acrB
2  subl[.cdt]    acrD,acrA,rb:0
3  subl[.cdt]    acrD,acrA,1:rb
4  subl[.cdt]    acrD,acrA,ra:rb
aε[0..31], bε[0..31], Aε[0..3], Bε[0..3], Dε[0..3]
```

## Operation

- 1 if *cdt* is true:  $\text{acrD} \leftarrow \text{acrA} - \text{acrB}$
- 2 if *cdt* is true:  $\text{acrD} \leftarrow \text{acrA} - \text{zero\_extension}(\text{rb})$
- 3 if *cdt* is true:  $\text{acrD} \leftarrow \text{acrA} - \text{sign\_extension}(\text{rb})$
- 4 if *cdt* is true:  $\text{acrD} \leftarrow \text{acrA} - \text{register\_extension}(\text{rb})$

## Description

Operations are carried out with 8 guard bits on all operands

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

## Example

```
subl.mle acr2,acr1,1:r14
subl      acr1,acr3,r7:r8
```

Instruction 6.2: SUBL — long subtraction

# SUBLST0 — long subtraction and store register to DM0

## Syntax

```

1  sublst0      [sat] [rnd] rd,ra:0,acrB,(arf++),rb
2  sublst0      [sat] [rnd] rd,1:ra,acrB,(arf++),rb
    ae[0..31], be[0..31], de[0..31], fe[0..3], Be[0..3]

```

## Operation

- 1  $rd \leftarrow \text{saturation}(\text{round}(\text{zero\_extension}(ra) - acrB))$  and  
 $\text{DM0(arf)} \leftarrow rb$ , if  $fe[0..1]$  then  $arf \leftarrow arf + step_f$  else  $arf \leftarrow arf + 1$
- 2  $rd \leftarrow \text{saturation}(\text{round}(\text{sign\_extension}(ra) - acrB))$  and  
 $\text{DM0(arf)} \leftarrow rb$ , if  $fe[0..1]$  then  $arf \leftarrow arf + step_f$  else  $arf \leftarrow arf + 1$

## Description

Subtract an accumulator from an extended general purpose register and write back the result to a general purpose register. And store contents of a general purpose register to DM0 (data memory 0) using post-incremental addressing mode.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```
sublst0 sat rnd r5,1:r3,acr0,(ar3++),r7
```

Instruction 6.3: SUBLST0 — long subtraction and store register to DM0

# SUBLST1 — long subtraction and store register to DM1

## Syntax

```
1  sublst1      [sat] [rnd] rd,ra:0,acrb,(arf++),rb
2  sublst1      [sat] [rnd] rd,1:ra,acrb,(arf++),rb
    ae[0..31], be[0..31], de[0..31], fe[0..3], Be[0..3]
```

## Operation

- 1  $rd \leftarrow \text{saturation}(\text{round}(\text{zero\_extension}(ra) - acrb))$  and  
 $\text{DM1(arf)} \leftarrow rb$ , if  $fe[0..1]$  then  $\text{arf} \leftarrow \text{arf} + \text{stepf}$  else  $\text{arf} \leftarrow \text{arf} + 1$
- 2  $rd \leftarrow \text{saturation}(\text{round}(\text{sign\_extension}(ra) - acrb))$  and  
 $\text{DM1(arf)} \leftarrow rb$ , if  $fe[0..1]$  then  $\text{arf} \leftarrow \text{arf} + \text{stepf}$  else  $\text{arf} \leftarrow \text{arf} + 1$

## Description

Subtract an accumulator from an extended general purpose register and write back the result to a general purpose register. And store contents of a general purpose register to DM1 (data memory 1) using post-incremental addressing mode.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```
sublst1 r2,r4:0,acrl,(ar0++),r14
```

Instruction 6.4: SUBLST1 — long subtraction and store register to DM1

# CMPL — long compare

## Syntax

```

1  cmpl[.cdt]    acrA,acrB
2  cmpl[.cdt]    acrA,rb:0
3  cmpl[.cdt]    acrA,1:rb
4  cmpl[.cdt]    acrA,ra:rb
aε[0..31], bε[0..31], Aε[0..3], Bε[0..3]

```

## Operation

- 1 if *cdt* is true: MACflags $\cap$ saturation(**acrA** $-$ **acrB**)
- 2 if *cdt* is true: MACflags $\cap$ saturation(**acrA** $-$ zero\_extension(**rb**))
- 3 if *cdt* is true: MACflags $\cap$ saturation(**acrA** $-$ sign\_extension(**rb**))
- 4 if *cdt* is true: MACflags $\cap$ saturation(**acrA** $-$ register\_extension(**rb**))

## Description

Operations are carried out with 8 guard bits on all operands

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```

cmpl.mne acr3,acr2
cmpl      acr1,r7:0

```

Instruction 6.5: CMPL — long compare

# ABSL — long absolute

## Syntax

```
1  abs1[.cdt]    [sat] acrD,acrA  
2  abs1[.cdt]    [sat] acrD[.h|.l],ra  
   ae[0..31], Ae[0..3], De[0..3]
```

## Operation

- 1 if *cdt* is true:  $\text{acrD} \leftarrow \text{saturation}(\text{absolute}(\text{acrA}))$
- 2 if *cdt* is true; if **h** is used  $\text{acrD} \leftarrow \text{saturation}(\text{absolute}(\text{ra}:[0..0]))$  else  $\text{acrD} \leftarrow \text{saturation}(\text{absolute}([0..0]:\text{ra}))$

## Description

Return the absolute value of an accumulator or an extended general register.

The [.h|.l] option will select either the high order bits (bits 31 through 16) when using **h** or the low order bits (bits 15 through 0) when using **l** as destination for **ra** in **acrD**. When using **l**, the value will be sign extended up to 32 bits. Using **h** will clear the low order bits.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```
abs1.mle sat acr2,acr3  
abs1      acr0.h,r4
```

Instruction 6.6: ABSL — long absolute

# NEGL — long negation

## Syntax

```

1  negl[.cdt]    [sat] acrD,acrA
2  negl[.cdt]    [sat] acrD[.h|.1],ra
  ae[0..31], Ae[0..3], De[0..3]

```

## Operation

- 1 if *cdt* is true:  $\text{acrD} \leftarrow \text{sat}(\text{neg}(\text{acrA}))$
- 2 if *cdt* is true; if **h** is used  $\text{acrD} \leftarrow \text{sat}(\text{neg}(\text{ra}:[0..0]))$  else  $\text{acrD} \leftarrow \text{sat}(\text{neg}([0..0]:\text{ra}))$

## Description

Return the negated value of an accumulator or an extended general register.

The [.h|.1] option will select either the high order bits (bits 31 through 16) when using **h** or the low order bits (bits 15 through 0) when using 1 (default) as destination for **ra** in **acrD**. When using 1, the value will be sign extended up to 32 bits. Using **h** will clear the low order bits.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```

negl.mne acr1,acr2
negl      sat acr3.1,r17

```

Instruction 6.7: NEGL — long negation

# MOVEL — long move

## Syntax

```
1  movel[.cdt]  acrD,ra:rb
2  movel[.cdt]  acrD[.h|.l],ra
   aε[0..31], bε[0..31], Dε[0..3]
```

## Operation

- 1 if *cdt* is true:  $\text{acrD} \leftarrow \text{ra:rb}$
- 2 if *cdt* is true: if **h** is used  $\text{acrD} \leftarrow \text{sat}(\text{ra}:[0..0])$  else  $\text{acrD} \leftarrow \text{sat}([0..0]:\text{ra})$

## Description

The [.h|.l] option will select either the high order bits (bits 31 through 16) when using **h** or the low order bits (bits 15 through 0) when using **l** (default) as destination for **ra** in **acrD**. When using **l**, the value will be sign extended up to 32 bits. Using **h** will clear the low order bits.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
|    |    | U  | U  |

## Comment

## Example

```
moveL    acr2,r14:r2
moveL    acr0,r5          ;low order bits of acr0 will be target
```

Instruction 6.8: MOVEL — long move

# CLR — clear accumulator

## Syntax

1    **clr**[.*cdt*]      **acrD**  
*D*<sub>ε</sub>[0..3]

## Operation

1    if *cdt* is true: **acrD** ← 0

## Description

Clear an accumulator, using a condition.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
|    |    | U  | U  |

## Comment

## Example

**clr.eq**    **acr2**

Instruction 6.9: CLR — clear accumulator

# POSTOP — post accumulator operation

## Syntax

```
1  postop[.cdt] [sf] [rnd] [sat] acrD,acrA  
Aε[0..3], Dε[0..3]  
sfε[mul2|mul4|div2|div4|div8|div16|mul65536]
```

## Operation

1 if *cdt* is true: **acrD**←saturation(round(scale(**acrA**)))

## Description

The postop instruction offers the possibility to perform a scale, round and/or saturation only operation for an accumulator.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

## Example

```
postop    mul2 rnd sat acr0,acr1  
postop    rnd acr0,acr1
```

Instruction 6.10: POSTOP — post accumulator operation

# MUL<sub>xx</sub> — multiply

## Syntax

```
1 mulxx      [sf] acrD,ra,rb
  aε[0..31], bε[0..31], Dε[0..3]
  sfε[mul2|mul4|div2|div4|div8|div16|mul65536]
```

## Operation

1  $\text{acr}D \leftarrow \text{scale}(\text{signed/unsigned}(ra) * \text{signed/unsigned}(rb))$

## Description

Multiply two general registers with chosen signed/unsigned interpretation and scale the result.  
The *xx* notation in the instruction shall be replaced with a two character combination of the letters **s** or **u**, and chooses signed or unsigned interpretation respectively for the source registers **ra** and **rb**, in that order.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

## Example

```
muluu    div2 acr0,r2,r4 ;interpret r2 unsigned, r4 unsigned
mulsu    acr1,r7,r23   ;interpret r7 signed, r23 unsigned
```

Instruction 6.11: MUL<sub>xx</sub> — multiply

# MULLD0 — multiply and load register from DM0

## Syntax

```
1  mulld0      acrD,ra,rb,rd,(arc)
2  mulld0      acrD,ra,rb,rd,(arc++)
3  mulld0      acrD,ra,rb,rd,(--arc)
4  mulld0      acrD,ra,rb,rd,(are++%)
5  mulld0      acrD,ra,rb,rd,br(arc)
ae[0..31], be[0..31], ce[0..3], de[0..31], ee[0..1], D[0..3]
```

## Operation

```
1  acrD $\leftarrow$ signed(ra) * signed(rb) and rd $\leftarrow$ DM0(arc)
2  acrD $\leftarrow$ signed(ra) * signed(rb) and
   rd $\leftarrow$ DM0(arc), if ce[0..1] then arc $\leftarrow$ arc+stepc else arc $\leftarrow$ arc+1
3  acrD $\leftarrow$ signed(ra) * signed(rb) and
   if ce[0..1] then arc $\leftarrow$ arc-stepc else arc $\leftarrow$ arc-1, rd $\leftarrow$ DM0(arc)
4  acrD $\leftarrow$ signed(ra) * signed(rb) and
   rd $\leftarrow$ DM0(are), if are=tope then are $\leftarrow$ bote else are $\leftarrow$ are+stepc
5  acrD $\leftarrow$ signed(ra) * signed(rb) and
   rd $\leftarrow$ DM0(bitrev(arc)), if ce[0..1] then arc $\leftarrow$ arc+stepc else arc $\leftarrow$ arc+1
```

## Description

Multiply two general registers with signed interpretation and mul4 scaling and load a general register with data from DM0 (data memory 0) using various addressing modes.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

Operation no.4: observe that are will only be set to bote when the condition are=tope is met. There is nothing preventing are from becoming larger than tope at the are $\leftarrow$ are+stepc operation.

## Example

```
mulld0  acr0,r0,r1,r20,(ar0++)
```

Instruction 6.12: MULLD0 — multiply and load register from DM0

# MULLD1 — multiply and load register from DM1

## Syntax

```

1  mulld1      acrD,ra,rb,rd,(arc)
2  mulld1      acrD,ra,rb,rd,(arc++)
3  mulld1      acrD,ra,rb,rd,(--arc)
4  mulld1      acrD,ra,rb,rd,(are++%)
5  mulld1      acrD,ra,rb,rd,br(arc)
ae[0..31], be[0..31], cε[0..3], dε[0..31], eε[0..1], Dε[0..3]

```

## Operation

```

1  acrD ← signed(ra) * signed(rb) and rd ← DM1(arc)
2  acrD ← signed(ra) * signed(rb) and
   rd ← DM1(arc), if cε[0..1] then arc ← arc + stepc else arc ← arc + 1
3  acrD ← signed(ra) * signed(rb) and
   if cε[0..1] then arc ← arc - stepc else arc ← arc - 1, rd ← DM1(arc)
4  acrD ← signed(ra) * signed(rb) and
   rd ← DM1(are), if are = tope then are ← bote else are ← are + stepc
5  acrD ← signed(ra) * signed(rb) and
   rd ← DM1(bitrev(arc)), if cε[0..1] then arc ← arc + stepc else arc ← arc + 1

```

## Description

Multiply two general registers with signed interpretation and mul4 scaling and load a general register with data from DM1 (data memory 1) using various addressing modes.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

Operation no.4: observe that **are** will only be set to **bote** when the condition **are = tope** is met. There is nothing preventing **are** from becoming larger than **tope** at the **are ← are + stepc** operation.

## Example

```
mulld1  acr3,r14,r9,r0,(ar0)
```

Instruction 6.13: MULLD1 — multiply and load register from DM1

# MULDBLLD — multiply and double load

## Syntax

```
1 muldblld [br0] [br1] acrD,ra,rb,rd,rf  
aε[0..31], bε[0..31], dε[0..31], fε[0..31], Dε[0..3]
```

## Operation

```
1 acrD ← signed(ra) * signed(rb), and  
if br0 is used then rd ← DM0(bitrev(ar0)) and ar0 ← ar0 + step0, else rd ← DM0(ar0), and  
if br1 is used then rf ← DM1(bitrev(ar3)) else rf ← DM1(ar3), ar3 ← ar3 + 1
```

## Description

Multiply two general registers with signed interpretation and mul4 scaling and load one general register with data from DM0 (data memory 0) and another general register with data from DM1 (data memory 1).

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

## Example

```
muldblld br0 acr0,r15,r5,r22,r23 ;Bit reverse addressing for DM0  
;Incremental addressing for DM1  
muldblld acr1,r7,r23,r4,r1 ;Indirect addressing for DM0  
;Incremental addressing for DM1
```

Instruction 6.14: MULDBLLD — multiply and double load

# MAC<sub>xx</sub> — multiply and accumulate

## Syntax

```
1  macxx      [sf] acrD,ra,rb
  aε[0..31], bε[0..31], Dε[0..3]
  sfε[mul2|mul4|div2|div4|div8|div16|mul65536]
```

## Operation

1  $\text{acr}D \leftarrow \text{acr}D + \text{scale}(\text{signed}/\text{unsigned}(\text{ra}) * \text{signed}/\text{unsigned}(\text{rb}))$

## Description

Multiply two general registers with chosen signed/unsigned interpretation, scale the result and add to the destination accumulator.

The *xx* notation in the instruction shall be replaced with a two character combination of the letters **s** or **u**, and chooses signed or unsigned interpretation respectively for the source registers **ra** and **rb**, in that order.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

## Example

```
macss    div8 acr3,r17,r14 ;interpret r17 signed, r14 signed
macus    acr1,r17,r3       ;interpret r17 unsigned, r3 signed
```

Instruction 6.15: MAC<sub>xx</sub> — multiply and accumulate

# MACLD0 — multiply, accumulate and load register from DM0

## Syntax

```
1  macld0      acrD,ra,rb,rd,(arc)
2  macld0      acrD,ra,rb,rd,(arc++)
3  macld0      acrD,ra,rb,rd,(--arc)
4  macld0      acrD,ra,rb,rd,(are++%)
5  macld0      acrD,ra,rb,rd,br(arc)
ae[0..31], be[0..31], ce[0..3], de[0..31], ee[0..1], D[0..3]
```

## Operation

```
1  acrD ← acrD + (signed(ra) * signed(rb)) and rd ← DM0(arc)
2  acrD ← acrD + (signed(ra) * signed(rb)) and
   rd ← DM0(arc), if ce[0..1] then arc ← arc + stepc else arc ← arc + 1
3  acrD ← acrD + (signed(ra) * signed(rb)) and
   if ce[0..1] then arc ← arc - stepc else arc ← arc - 1, rd ← DM0(arc)
4  acrD ← acrD + (signed(ra) * signed(rb)) and
   rd ← DM0(are), if are = tope then are ← bote else are ← are + stepc
5  acrD ← acrD + (signed(ra) * signed(rb)) and
   rd ← DM0(bitrev(arc)), if ce[0..1] then arc ← arc + stepc else arc ← arc + 1
```

## Description

Multiply two general registers with signed interpretation and mul4 scaling, add the result to the destination accumulator and load a general register with data from DM0 (data memory 0) using various addressing modes.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

Operation no.4: observe that `are` will only be set to `bote` when the condition `are = tope` is met. There is nothing preventing `are` from becoming larger than `tope` at the `are ← are + stepc` operation.

## Example

```
macld0  acr3,r14,r9,r0,(ar0++%)
```

Instruction 6.16: MACLD0 — multiply, accumulate and load register from DM0

# MACLD1 — multiply, accumulate and load register from DM1

## Syntax

```

1  macld1      acrD,ra,rb,rd,(arc)
2  macld1      acrD,ra,rb,rd,(arc++)
3  macld1      acrD,ra,rb,rd,(--arc)
4  macld1      acrD,ra,rb,rd,(are++%)
5  macld1      acrD,ra,rb,rd,br(arc)
ae[0..31], be[0..31], cε[0..3], dε[0..31], eε[0..1], Dε[0..3]

```

## Operation

```

1  acrD ← acrD + (signed(ra) * signed(rb)) and rd ← DM1(arc)
2  acrD ← acrD + (signed(ra) * signed(rb)) and
   rd ← DM1(arc), if cε[0..1] then arc ← arc + stepc else arc ← arc + 1
3  acrD ← acrD + (signed(ra) * signed(rb)) and
   if cε[0..1] then arc ← arc - stepc else arc ← arc - 1, rd ← DM1(arc)
4  acrD ← acrD + (signed(ra) * signed(rb)) and
   rd ← DM1(are), if are = tope then are ← bote else are ← are + stepc
5  acrD ← acrD + (signed(ra) * signed(rb)) and
   rd ← DM1(bitrev(arc)), if cε[0..1] then arc ← arc + stepc else arc ← arc + 1

```

## Description

Multiply two general registers with signed interpretation and mul4 scaling, add the result to the destination accumulator and load a general register with data from DM1 (data memory 1) using various addressing modes.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

Operation no.4: observe that `are` will only be set to `bote` when the condition `are = tope` is met. There is nothing preventing `are` from becoming larger than `tope` at the `are ← are + stepc` operation.

## Example

```
macld1  acr0,r2,r19,r3,(ar0++)
```

Instruction 6.17: MACLD1 — multiply, accumulate and load register from DM1

# MDM<sub>xx</sub> — multiply and diminish

## Syntax

```
1  mdmxx      [sf] acrD,ra,rb  
aε[0..31], bε[0..31], Dε[0..3]  
sfε[mul2|mul4|div2|div4|div8|div16|mul65536]
```

## Operation

```
1  acrD ← acrD – scale(signed/unsigned(ra) * signed/unsigned(rb))
```

## Description

Multiply two general registers with chosen signed/unsigned interpretation, scale the result and subtract from the destination accumulator.

The *xx* notation in the instruction shall be replaced with a two character combination of the letters **s** or **u**, and chooses signed or unsigned interpretation respectively for the source registers **ra** and **rb**, in that order.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    | U  | U  |

## Comment

## Example

```
mdmss    mul2 acr0,r11,r5 ;interpret r11 signed, r5 signed  
mdmuu    acr2,r7,r13      ;interpret r7 unsigned, r13 unsigned
```

Instruction 6.18: MDM<sub>xx</sub> — multiply and diminish

# CONV $xx$ — convolution

## Syntax

```
1   conv $xx$       [ap|am] [sf] acrD , DMO , DM1
DMO can be one of (ara++), (--ara) or (arc++%)
DM1 can be one of (arb++), (--arb) or (are++%)
a $\in$ [0..3], b $\in$ [0..3], c $\in$ [0..1], e $\in$ [0..1], D $\in$ [0..3]
sf $\in$ [mul2|mul4|div2|div4|div8|div16|mul65536]
```

## Operation

Pre operation:

The ( $--\text{ar}x$ ) addressing mode will: if  $x \in [0..1]$  then  $\text{ar}x \leftarrow \text{ar}x - \text{step}_x$  else  $\text{ar}x \leftarrow \text{ar}x - 1$

1  $\text{acr}D \leftarrow \text{acr}D + / - \text{scale}(\text{signed}/\text{unsigned}(\text{DM}0(\text{ar})) * \text{signed}/\text{unsigned}(\text{DM}1(\text{ar})))$

Post operation:

The ( $\text{ar}x++$ ) addressing mode will: if  $x \in [0..1]$  then  $\text{ar}x \leftarrow \text{ar}x + \text{step}_x$  else  $\text{ar}x \leftarrow \text{ar}x + 1$

The ( $\text{ar}x++%$ ) addressing mode will: if  $\text{ar}x = \text{top}_x$  then  $\text{ar}x \leftarrow \text{bot}_x$  else  $\text{ar}x \leftarrow \text{ar}x + \text{step}_x$

## Description

Multiply two data memory operands with chosen signed/unsigned interpretation, scale the result and add to or subtract from the destination accumulator.

The  $xx$  notation in the instruction shall be replaced with a two character combination of the letters **s** or **u**, and chooses signed or unsigned interpretation respectively for the source operands  $DM_0$  and  $DM_1$ , in that order.

The **ap** operator (default) denotes addition in the convolution operation.

The **am** operator denotes subtraction in the convolution operation.

## Flags

| MV | MS | MN | MZ |
|----|----|----|----|
| U  |    |    |    |

## Comment

## Example

```
convus    am acr1,(ar2++),(--ar1)
convuu    acr0,(ar0++%),(ar1++%)
```

Instruction 6.19: CONV $xx$  — convolution

## 7 Iterative instructions

The iterative instruction (**repeat**) makes it possible to repeat instructions in sequence a certain number of times. All **repeat** does is setup the internal loop registers (**loopn**, **loopb** and **loope**) and then the internal structure of the processor conducts the iteration. Nested iteration loops are not possible without the risk of breaking the flow of outer loops, since inner **repeat** instructions may rewrite the loop registers. Table 7.1 lists the iterative instructions.

| Mnemonic | Description                                               | Page |
|----------|-----------------------------------------------------------|------|
| repeat   | repeat instructions in sequence a certain number of times | 84   |

Table 7.1: iterative instructions

# REPEAT — repeat instructions

## Syntax

```
1   repeat      K,L
K-pc-1ε[0..127], Lε[$0001..$0FFF]
```

## Operation

1    *loopn*←*L*, *loopb*←*pc*+1, *loope*←*K*-1  
*pc* is the current program counter value

## Description

Repeat instructions up to (not including) address *K*, *L* times.

*K* is typically a label that must reside within 127 addresses after the **repeat** instruction.

## Flags

No flags affected

## Comment

A nested **repeat** instruction, that is a **repeat** within range of a previous **repeat**, will if executed rewrite the loop registers (**loopn**, **loopb** and **loope**) and break the outer loop.

## Example

```
repeat    label1,700
  set      r4,$FA72          ;this and the next instruction ...
  move    r1,sr3            ;...will repeat 700 times in sequence
label1
  move    r17,r31           ;same address as label1, but outside loop
```

Instruction 7.1: REPEAT — repeat instructions

## 8 Flow control instructions

The flow control instructions determine the program flow, in one way or another. Instructions **jump**, **call** and **ret** make use of 0 to 3 delay slots. With 3 delay slots the first 3 instructions following a jump will be executed before the jump takes place. With 2 delay slots the first 2 instructions and 1 **nop** (2 **nop** for ret) will be executed before the jump takes place, and so on. Table 8.1 lists the flow control instructions.

| Mnemonic | Description                               | Page |
|----------|-------------------------------------------|------|
| jump     | jump to address                           | 86   |
| call     | call to subroutine                        | 87   |
| ret      | return from subroutine                    | 88   |
| reti     | return from interrupt                     | 88   |
| nop      | no operation, only used for time shimming | 90   |

Table 8.1: flow control instructions

# JUMP — jump to address

## Syntax

```

1   jump[.cdt]    [dsx] ra
2   jump[.cdt]    [dsx] K
  aε[0..31], xε[0..3], Kε[$0000..$FFFF]

```

## Operation

- 1 if *cdt* is true: after *x* cycles:  $pc \leftarrow (ra)$
- 2 if *cdt* is true: after *x* cycles:  $pc \leftarrow K$

## Description

The **dsx** option denotes a number of cycles (default is 0) to wait before executing the jump. After the jump there are  $3-x$  empty cycles where no instructions are executed.  
*K* is typically a label

## Flags

No flags affected

## Comment

## Example

```

jump.ne  ds2 label4
move     r1,sr3          ;this will execute
set      r2,7            ;this will execute
move     r12,r3          ;this will NOT execute
label4
set     r7,3

```

Instruction 8.1: JUMP — jump to address

# CALL — call to subroutine

## Syntax

```
1  call      [dsx] ra
2  call      [dsx] K
aε[0..31], xε[0..3], Kε[$0000..$FFFF]
```

## Operation

Pre operation:

$DM1[sp] \leftarrow pc + 1 + x, sp \leftarrow sp + 1$

1 after  $x$  cycles:  $pc \leftarrow (ra)$

2 after  $x$  cycles:  $pc \leftarrow K$

## Description

The **dsx** option denotes a number of cycles (default is 0) to wait before executing the call. After the call there are  $3 - x$  empty cycles where no instructions are executed.

$K$  is typically a label

## Flags

No flags affected

## Comment

## Example

```
call    ds2 label7
move   r1,sr3          ;this will execute
set    r2,7            ;this will execute
jump   label4          ;this will execute AFTER return (ret)
label7
set    r7,3
ret    ds1
move   r1,r2          ;this will execute BEFORE return takes place
set    r11,7           ;this will NOT execute
```

Instruction 8.2: CALL — call to subroutine

# RET — return from subroutine

## Syntax

1    **ret**                **dsx**  
 $x \in [0..3]$

## Operation

Pre operation:  $sp \leftarrow sp - 1$   
 1    after  $x$  cycles:  $pc \leftarrow DM1[sp]$

## Description

The **dsx** option denotes a number of cycles (default is 0) to wait before executing the return. After the return there are  $4-x$  empty cycles where no instructions are executed.

## Flags

No flags affected

## Comment

## Example

see the **call** instruction example

Instruction 8.3: RET — return from subroutine

# RETI — return from interrupt

## Syntax

```
1 reti
```

## Operation

Pre operation:  $sp \leftarrow sp - 1$

1  $pc \leftarrow DM1(sp)$

one cycle later:  $f10 \leftarrow DM1(--sp)$

## Description

Pops PC and then flags from the stack.

## Flags

All flags in  $f10$  are updated.

## Comment

## Example

```
reti ;return from interrupt routine
```

Instruction 8.4: RETI — return from interrupt

# NOP — no operation

## Syntax

1    `nop`

## Operation

1     $pc \leftarrow pc + 1$

## Description

No operation

## Flags

No flags affected

## Comment

## Example

`nop` ; simple enough

Instruction 8.5: NOP — no operation

## 9 Alias instructions

The alias instructions are instructions with no physical implementation in the processor. They exist only for the benefit of the programmer and make use of special cases of other physical instructions for their own implementation. Table 9.1 lists the alias instructions.

| Mnemonic | Description                                    | Page |
|----------|------------------------------------------------|------|
| inc      | increase a register by one                     | 92   |
| incn     | increase a register by one with no flag change | 93   |
| dec      | decrease a register by one                     | 94   |
| decn     | decrease a register by one with no flag change | 95   |
| push     | push register to stack                         | 96   |
| pop      | pop stack to register                          | 97   |

Table 9.1: alias instructions

# INC — increase register

## Syntax

1 inc rd

## Operation

1  $rd \leftarrow rd + 1$

## Description

Add one to a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

This instruction is implemented as `add rd,rd,1`

## Example

inc r2

Instruction 9.1: INC — increase register

# INCN — increase register with no flag change

## Syntax

1 incn            *rd*

## Operation

1  $rd \leftarrow rd + 1$

## Description

Add one to a general register

## Flags

No flags affected

## Comment

This instruction is implemented as `addn rd,rd,1`

## Example

incn            r7

Instruction 9.2: INCN — increase register with no flag change

# DEC — decrease register

## Syntax

1    **dec**                **rd**

## Operation

1    **rd**  $\leftarrow$  **rd** - 1

## Description

Subtract one from a general register

## Flags

| AV | AC | AN | AZ |
|----|----|----|----|
| U  | U  | U  | U  |

## Comment

This instruction is implemented as **add rd,rd,-1**

## Example

**dec**                **r12**

Instruction 9.3: DEC — decrease register

# DECN — decrease register with no flag change

## Syntax

1    **decn**              **rd**

## Operation

1     $rd \leftarrow rd - 1$

## Description

Subtract one from a general register

## Flags

No flags affected

## Comment

This instruction is implemented as **addn rd,rd,-1**

## Example

**decn**      **r17**

Instruction 9.4: DECN — decrease register with no flag change

# PUSH — push register to stack

## Syntax

1    **push**              **ra**  
   $a \in [0..31]$

## Operation

1     $DM1[sp] \leftarrow ra$ , increment sp

## Description

Copy contents of source register to top of stack

## Flags

No flags affected

## Comment

This instruction is implemented as **st1 (sp++) ,ra**

## Example

**push**        **r30**

Instruction 9.5: PUSH — push register to stack

# POP — pop stack to register

## Syntax

1    **pop**              **rd**  
 $d \in [0..31]$

## Operation

1    decrement sp,  $rd \leftarrow DM1[sp]$

## Description

Copy contents from top of stack to destination register

## Flags

No flags affected

## Comment

This instruction is implemented as `ld1 rd, (--sp)`

## Example

**pop**              **r7**

Instruction 9.6: POP — pop stack to register



# List of instructions

|      |                                                  |    |
|------|--------------------------------------------------|----|
| 2.1  | MOVE — move . . . . .                            | 16 |
| 2.2  | SET — set register to constant . . . . .         | 17 |
| 2.3  | LD0 — load register from DM0 . . . . .           | 18 |
| 2.4  | LD1 — load register from DM1 . . . . .           | 19 |
| 2.5  | ST0 — store register to DM0 . . . . .            | 20 |
| 2.6  | ST1 — store register to DM1 . . . . .            | 21 |
| 2.7  | DBLLD — double load . . . . .                    | 22 |
| 2.8  | DBLST — double store . . . . .                   | 23 |
| 2.9  | IN — read IO port to register . . . . .          | 24 |
| 2.10 | OUT — write register to IO port . . . . .        | 25 |
| 2.11 | DBLIN — IO to DM0 and DM1 . . . . .              | 26 |
| 2.12 | DBLOUT — IO from DM0 and DM1 . . . . .           | 27 |
| 2.13 | OUT0 — out DM0 data to IO port . . . . .         | 28 |
| 2.14 | OUT1 — out DM1 data to IO port . . . . .         | 29 |
| 3.1  | ADD — addition . . . . .                         | 32 |
| 3.2  | ADDN — addition with no flag change . . . . .    | 33 |
| 3.3  | ADDC — addition with carry . . . . .             | 34 |
| 3.4  | ADDS — addition with saturation . . . . .        | 35 |
| 3.5  | SUB — subtraction . . . . .                      | 36 |
| 3.6  | SUBN — subtraction with no flag change . . . . . | 37 |
| 3.7  | SUBC — subtraction with carry . . . . .          | 38 |
| 3.8  | SUBS — subtraction with saturation . . . . .     | 39 |
| 3.9  | CMP — compare . . . . .                          | 40 |
| 3.10 | MAX — return maximum value . . . . .             | 41 |
| 3.11 | MIN — return minimum value . . . . .             | 42 |
| 3.12 | ABS — return absolute value . . . . .            | 43 |
| 3.13 | EXT — 8 to 16 bit sign extension . . . . .       | 44 |
| 4.1  | AND — logic and . . . . .                        | 46 |
| 4.2  | ANDN — logic and with no flag change . . . . .   | 47 |
| 4.3  | OR — logic or . . . . .                          | 48 |
| 4.4  | ORN — logic or with no flag change . . . . .     | 49 |
| 4.5  | XOR — logic Xor . . . . .                        | 50 |
| 4.6  | XORN — logic xor with no flag change . . . . .   | 51 |
| 5.1  | ASR — arithmetic shift right . . . . .           | 54 |
| 5.2  | ASL — arithmetic shift left . . . . .            | 55 |

|                                                                         |    |
|-------------------------------------------------------------------------|----|
| 5.3 LSR — logic shift right . . . . .                                   | 56 |
| 5.4 LSL — logic shift left . . . . .                                    | 57 |
| 5.5 ROR — rotate right . . . . .                                        | 58 |
| 5.6 ROL — rotate left . . . . .                                         | 59 |
| 5.7 RCR — rotate right through carry . . . . .                          | 60 |
| 5.8 RCL — rotate left through carry . . . . .                           | 61 |
|                                                                         |    |
| 6.1 ADDL — long addition . . . . .                                      | 64 |
| 6.2 SUBL — long subtraction . . . . .                                   | 65 |
| 6.3 SUBLST0 — long subtraction and store register to DM0 . . . . .      | 66 |
| 6.4 SUBLST1 — long subtraction and store register to DM1 . . . . .      | 67 |
| 6.5 CMPL — long compare . . . . .                                       | 68 |
| 6.6 ABSL — long absolute . . . . .                                      | 69 |
| 6.7 NEGL — long negation . . . . .                                      | 70 |
| 6.8 MOVEL — long move . . . . .                                         | 71 |
| 6.9 CLR — clear accumulator . . . . .                                   | 72 |
| 6.10 POSTOP — post accumulator operation . . . . .                      | 73 |
| 6.11 MUL $xx$ — multiply . . . . .                                      | 74 |
| 6.12 MULLD0 — multiply and load register from DM0 . . . . .             | 75 |
| 6.13 MULLD1 — multiply and load register from DM1 . . . . .             | 76 |
| 6.14 MULDBLLD — multiply and double load . . . . .                      | 77 |
| 6.15 MAC $xx$ — multiply and accumulate . . . . .                       | 78 |
| 6.16 MACLD0 — multiply, accumulate and load register from DM0 . . . . . | 79 |
| 6.17 MACLD1 — multiply, accumulate and load register from DM1 . . . . . | 80 |
| 6.18 MDM $xx$ — multiply and diminish . . . . .                         | 81 |
| 6.19 CONV $xx$ — convolution . . . . .                                  | 82 |
|                                                                         |    |
| 7.1 REPEAT — repeat instructions . . . . .                              | 84 |
|                                                                         |    |
| 8.1 JUMP — jump to address . . . . .                                    | 86 |
| 8.2 CALL — call to subroutine . . . . .                                 | 87 |
| 8.3 RET — return from subroutine . . . . .                              | 88 |
| 8.4 RETI — return from interrupt . . . . .                              | 89 |
| 8.5 NOP — no operation . . . . .                                        | 90 |
|                                                                         |    |
| 9.1 INC — increase register . . . . .                                   | 92 |
| 9.2 INCN — increase register with no flag change . . . . .              | 93 |
| 9.3 DEC — decrease register . . . . .                                   | 94 |
| 9.4 DECN — decrease register with no flag change . . . . .              | 95 |
| 9.5 PUSH — push register to stack . . . . .                             | 96 |
| 9.6 POP — pop stack to register . . . . .                               | 97 |