



# Block-Level Protocols

2021.1

© Copyright 2021 Xilinx



## Objectives

**After completing this module, you will be able to:**

- ▶ List the types of block-level protocols abstracted in the Vitis™ HLS tool

# Block-Level Protocol

Block-level protocol generates the **handshaking/control signals**, which control the HLS block

When the block can start to perform its standard operation (ap\_start)

When the block operation ends (ap\_done)

What is the status of the block operation (ap\_ready and ap\_idle)

Block-level protocols are specified on the function or the function return

Let's first understand what a block-level protocol is.

The block-level protocol generates the handshaking or control signals, which control the HLS block.

These control signals indicate:

- When the block can start to perform its standard operation (ap\_start)
- When the block operation ends (ap\_done)
- What is the status of the block operation (ap\_ready and ap\_idle)

These block-level protocols are specified on the function or the function return.

# Block-Level Protocol

Three types of block-level protocols:

- `ap_ctrl_hs`
- `ap_ctrl_chain`
- `ap_ctrl_none`

Function return → specified as an AXI4-Lite protocol using `s_axilite`

- All the ports in the block-level protocol are grouped into the AXI4-Lite protocol
- Common practice when another device, such as a CPU or a processor, is used to configure and control when the block starts and stops operation



4

© Copyright 2021 Xilinx

XILINX.

There are three types of block-level protocols:

- `ap_ctrl_hs`
- `ap_ctrl_chain`
- `ap_ctrl_none`

Apart from this, if the function return is specified as an AXI4-Lite protocol using `s_axilite`, all the ports in the block-level protocol are grouped into the AXI4-Lite protocol. This is a common practice when another device, such as a CPU or a processor, is used to configure and control when the block starts and stops operation.

# Block-Level Protocol

Block-Level Protocol

| Argument Type  | Scalar |        |
|----------------|--------|--------|
| Interface Mode | Input  | Return |
| ap_ctrl_hs     |        | D      |
| ap_ctrl_none   |        |        |
| ap_ctrl_chain  |        |        |

## ap\_ctrl\_hs

Behavior of the control signals created by ap\_ctrl\_hs

After the reset signal is applied:

1. Design starts when “ap\_start” control signal goes High
2. ap\_start signal should remain High until ap\_ready signal goes high



Let's discuss what happens when the ap\_ctrl\_hs block-level protocol is used. The behavior of the control signals created by ap\_ctrl\_hs block-level protocol is shown here.

After the reset signal is applied:

1. Block waits for “ap\_start” control signal to go High before it begins operation.
2. The ap\_start signal should remain High until ap\_ready signal indicates that the design is ready for new inputs.

## ap\_ctrl\_hs

3. Output “ap\_idle” goes Low when “ap\_start” is sampled High
4. Data can now be read on the input ports and write on the output ports
  - a. First input data can be sampled on the first clock edge after “ap\_idle” goes Low



7

© Copyright 2021 Xilinx

 XILINX.

3. Output “ap\_idle” goes Low when “ap\_start” is sampled High.
4. Data can now be read on the input ports and write on the output ports.
  - a. The first input data can be sampled on the first clock edge after “ap\_idle” goes Low.

## ap\_ctrl\_hs

5. When the block completes all operations, any return value is written to port ap\_return
  - a. If there was no function return, there is no ap\_return port on the RTL
  - b. Other outputs may be written to at any time until the block completes and are independent of this I/O protocol



8

© Copyright 2021 Xilinx

 XILINX.

5. When the block completes all operations, any return value is written to port ap\_return.
  - a. If there was no function return, there is no ap\_return port on the RTL block.
  - b. Other outputs may be written to at any time until the block completes and are independent of this I/O protocol.

## ap\_ctrl\_hs

6. Output “ap\_done” goes High when the block completes operation
7. Idle signal goes High one cycle after “ap\_done” and remains High until the next time “ap\_start” is sampled High



9

© Copyright 2021 Xilinx

XILINX.

6. Output “ap\_done” goes High when the block completes operation.
  - a. When the design is ready to accept new inputs, the ap\_ready signal goes High.
  - b. If there is an ap\_return port, the data on this port is valid when “ap\_done” is High.
7. The idle signal goes High one cycle after “ap\_done” and remains High until the next time “ap\_start” is sampled High (indicating the block should begin operation).

## ap\_ctrl\_hs

8. If the “ap\_start” signal is High when “ap\_done” goes High:
  - a. “ap\_idle” signal remains Low
  - b. Block immediately starts its next execution (or next transaction)
  - c. Next input can be read on the next clock edge



10

© Copyright 2021 Xilinx

XILINX.

8. If the “ap\_start” signal is High when “ap\_done” goes High:
  - a. The “ap\_idle” signal remains Low.
  - b. Block immediately starts its next execution (or next transaction).
  - c. The next input can be read on the next clock edge.

## ap\_ctrl\_chain

ap\_ctrl\_chain protocol is used to support the pipeline chains

Similar to ap\_ctrl\_hs protocol but provides an additional input port ap\_continue

Asserting the ap\_continue signal Low:

- Informs the design that the downstream block that consumes the data is not ready to accept new data
- Design stalls when it reaches the final state of the current transaction
- Output data is presented on the protocol
- ap\_done signal can be asserted High and the design remains in this state until ap\_continue is asserted High



11

© Copyright 2021 Xilinx

XILINX.

Let's discuss what happens when the ap\_ctrl\_chain block-level protocol is used. The ap\_ctrl\_chain protocol is used to support the pipeline chains. It is similar to the ap\_ctrl\_hs protocol but provides an additional input port ap\_continue.

Asserting the ap\_continue signal Low:

- Informs the design that the downstream block that consumes the data is not ready to accept new data.
- Design stalls when it reaches the final state of the current transaction.
- Output data is presented on the protocol.
- ap\_done signal can be asserted High and the design remains in this state until ap\_continue is asserted High.

## ap\_ctrl\_chain

Provides "back pressure" signal which allows the downstream blocks to prevent any more data being generated

When integrating several HLS modules together in RTL domain



The ap\_ctrl\_chain protocol provides "back pressure" signal which allows the downstream blocks to prevent any more data being generated.

Xilinx recommends using the ap\_ctrl\_chain block-level I/O protocol when chaining the HLS tool blocks together.

## ap\_ctrl\_none

Simplest protocol with no other control signals associated with it

The only ports in the RTL design are those specified in the source code

Useful for the module where there is no need to control stop/go

Does not require

Additional hardware overhead

Requires

Producer blocks provide data to the input port at the correct time or hold it for the length of a transaction

Consumer blocks to read the output ports at the correct time

*A design with ap\_ctrl\_none protocol specified on an output cannot be verified using the cosim\_design feature*

The ap\_ctrl\_none block-level protocol is the simplest protocol and has no other control signals associated with it.

Neither the input nor the output signals have any associated control ports indicating when data is read or written. The only ports in the RTL design are those specified in the source code.

It is useful for the module where there is no need to control stop/go. For example: combinational logic example with Initiation Interval equals to 1.

An ap\_ctrl\_none protocol requires no additional hardware overhead but does require that the producer blocks provide data to the input port at the correct time or hold it for the length of a transaction, until the design completes, and consumer blocks are required to read the output ports at the correct time.

A design with ap\_ctrl\_none protocol specified on an output cannot be verified using the cosim\_design feature.

## **s\_axilite**

AXI Slave Lite I/O protocol used to control the HLS block: s\_axilite

Common practice used when the processing system (PS) or processor is used to configure the HLS block

| Argument Type    | Scalar |        | HLS Stream |
|------------------|--------|--------|------------|
| Interface Mode   | Input  | Return | Output     |
| ap_ctrl_none     |        |        | O          |
| ap_ctrl_hs       |        | D      |            |
| ap_ctrl_chain    |        |        |            |
| axis             |        |        |            |
| <b>s_axilite</b> |        |        |            |
| m_axi            |        |        |            |

Wire handshake Protocol

Memory Interface Protocol: RAM : FIFO

Legend: Supported D = Default Interface Not Supported

14

© Copyright 2021 Xilinx

 XILINX.

An AXI Slave Lite I/O protocol can be used to control the HLS block by using the memory-mapped block-level: s\_axilite.

This is a common practice used when the processing system or processor is used to configure the HLS block.

## **s\_axilite**

Features of the AXI4 Slave Lite protocol

Multiple ports can  
be grouped into the  
same AXI4 Slave  
Lite protocol

When the design is  
exported to the IP  
catalog, the output  
includes C function  
and header files for the  
use with the code  
running on a processor

The features of the AXI4 Slave Lite protocol provided by Vitis HLS are:

- Multiple ports can be grouped into the same AXI4 Slave Lite protocol
- When the design is exported to the IP Catalog, the output includes C function and header files for the use with the code running on a processor

## s\_axilite

Hardware header file provides a complete list of the memory mapped locations for the ports grouped into the AXI4 Slave Lite protocol

Understand how the hardware ports operate

Device will read any inputs grouped into the AXI4 Slave Lite protocol from the register in the protocol

```
// 0x00 : Control signals
//      bit 0 - ap_start (Read/Write/SC)
//      bit 1 - ap_done (Read/COR)
//      bit 2 - ap_idle (Read)
//      bit 3 - ap_ready (Read)
//      bit 7 - auto_restart (Read/Write)
//      others - reserved
// 0x04 : Global Interrupt Enable Register
//      bit 0 - Global Interrupt Enable (Read/Write)
//      others - reserved
// 0x08 : IP Interrupt Enable Register (Read/Write)
//      bit 0 - Channel 0 (ap_done)
//      bit 1 - Channel 1 (ap_ready)
// 0x0c : IP Interrupt Status Register (Read/TOW)
//      bit 0 - Channel 0 (ap_done)
//      others - reserved
```

The hardware header file provides a complete list of the memory mapped locations for the ports grouped into the AXI4 Slave Lite protocol. To correctly program the registers in the AXI4 Slave Lite protocol, there is some requirement to understand how the hardware ports operate.

For example, to start the block operation the ap\_start register must be set to 1.

The device will then proceed and read any inputs grouped into the AXI4 Slave Lite protocol from the register in the protocol.

## s\_axilite

When the block completes operation:

- ap\_done, ap\_idle, and ap\_ready registers will be set by the hardware output ports
- Results for any output ports grouped into the AXI4 Slave Lite protocol read from appropriate register

```
// 0x00 : Control signals
//      bit 0 - ap_start (Read/Write/SC)
//      bit 1 - ap_done (Read/COR)
//      bit 2 - ap_idle (Read)
//      bit 3 - ap_ready (Read)
//      bit 7 - auto_restart (Read/Write)
//      others - reserved
// 0x04 : Global Interrupt Enable Register
//      bit 0 - Global Interrupt Enable (Read/Write)
//      others - reserved
// 0x08 : IP Interrupt Enable Register (Read/Write)
//      bit 0 - Channel 0 (ap_done)
//      bit 1 - Channel 1 (ap_ready)
// 0x0c : IP Interrupt Status Register (Read/TOW)
//      bit 0 - Channel 0 (ap_done)
//      others - reserved
```

When the block completes operation, the ap\_done, ap\_idle, and ap\_ready registers will be set by the hardware output ports and the results for any output ports grouped into the AXI4 Slave Lite protocol read from the appropriate register.

## Block-Level for C/RTL Co-simulation

C/RTL co-simulation feature is used to verify the RTL design with block-level I/O  
**one or more of which must be true**

Conditions:

- Top-level function must be synthesized using an ap\_ctrl\_hs or ap\_ctrl\_chain function-level protocol
- Design must be purely combinational
- Top-level function must have an initiation interval of 1
- Interface must be all arrays that are streaming and implemented with ap\_fifo, ap\_hs, or axis interface modes

As we know that the C/RTL co-simulation feature is used to verify the RTL design with block-level, here are few conditions one or more of which must be true:

- Top-level function must be synthesized using an ap\_ctrl\_hs or ap\_ctrl\_chain function-level protocol
- Design must be purely combinational
- Top-level function must have an initiation interval of 1
- Interface must be all arrays that are streaming and implemented with ap\_fifo, ap\_hs, or axis interface modes

## Block-Level for C/RTL Co-simulation

If one of these conditions is not met, C/RTL co-simulation halts with the message shown below

```
@E [SIM-345] Cosim only supports the following 'ap_ctrl_none' designs: (1)  
combinational designs; (2) pipelined design with task interval of 1; (3) designs with  
array streaming or hls_stream ports.  
@E [SIM-4] *** C/RTL co-simulation finished: FAIL ***
```

If one of these conditions is not met, C/RTL co-simulation halts with the message shown below.

## Clock, Reset, and RTL Output

### For C and C++ Designs

- Only a single clock is supported
- Same clock is applied to all functions in the design



20

© Copyright 2021 Xilinx

 XILINX.

Let's look at how the clock, reset, and RTL output is managed using block-level protocols.

For C and C++ designs, only a single clock is supported. The same clock is applied to all functions in the design.

To calculate the effective clock period used for synthesis by the HLS tool, the Vitis HLS tool subtracts the clock uncertainty from the clock period as shown.

Using the clock frequency and device target information...

## Clock, Reset, and RTL Output

Vitis HLS tool estimates the timing of operations in the design

Provides a user-specified timing margin to ensure downstream processes

By default, the clock uncertainty is 12.5% of the cycle time

...the Vitis HLS tool estimates the timing of operations in the design and provides a user-specified margin to ensure downstream processes, such as logic synthesis and place & route, have enough timing margin to complete their operations.

By default, the clock uncertainty is 12.5% of the cycle time. The value can be explicitly specified beside the clock period.

# Clock, Reset, and RTL Output

Important aspect of RTL configuration is selecting the reset behavior

## Differences between initialization and reset

### Initialization

In C, the variables defined with the static qualifier and in the global scope are by default initialized to zero

Assigned a specific initial value at compile time

Same initial value is implemented in the RTL

### Reset

Reset port is used in an FPGA to return the registers and block RAMs in the design to an initial state/value any time the reset signal is applied

Typically, the most important aspect of RTL configuration is selecting the reset behavior. When discussing reset behavior, it is important to understand the differences between initialization and reset.

### Initialization

In C, the variables defined with the static qualifier and those defined in the global scope are by default initialized to zero.

Optionally, these variables may be assigned a specific initial value at compile time (at time zero) and never again. In both cases, the same initial value is implemented in the RTL.

### Reset

The reset port is used in an FPGA to return the registers and block-RAMs in the design to an initial state/value any time when the reset signal is applied.

## Clock, Reset, and RTL Output

Presence and behavior of the RTL reset port is controlled using the **config\_rtl** configuration

Accessed using menus Solution > Solution Settings > General > config\_rtl



The presence and behavior of the RTL reset port is controlled using the **config\_rtl** configuration.

This configuration is accessed using menus Solution > Solution Settings > General > config\_rtl as shown.

## Clock, Reset, and RTL Output

The reset settings include:

- Ability to set the polarity of the reset
- Whether the reset is synchronous or asynchronous
- Controls which registers are reset when the reset signal is applied

| Name                   | Value     | Default | Reset |
|------------------------|-----------|---------|-------|
| config_unroll          |           |         |       |
| config_storage         |           |         |       |
| config_schedule        |           |         |       |
| config_rtl             |           |         |       |
| header                 |           |         |       |
| kernel_profile         | 0         | 0       |       |
| module_auto_prefix     | 1         | 1       |       |
| module_prefix          |           |         |       |
| mult_keep_attribute    | 0         | 0       |       |
| register_all_io        | 0         | 0       |       |
| register_reset_num     | 0(vivado) |         |       |
| reset                  | none      | control | x     |
| reset_async            | none      | 0       |       |
| reset_level            | control   | high    |       |
| config_op              |           |         |       |
| config_interface       |           |         |       |
| config_array_partition |           |         |       |
| config_export          |           |         |       |

The reset settings include the ability to set the polarity of the reset and whether the reset is synchronous or asynchronous but more importantly it controls, through the reset option, which registers are reset when the reset signal is applied.

## Clock, Reset, and RTL Output

The reset option has four settings:

- **none**: No reset is added to the design
- **control**: Default and ensures all control registers are reset
- **state**: Adds a reset to control registers plus any registers or memories derived from static and global variables in the C code
- **all**: This adds a reset to all registers and memories in the design

| Name                   | Value     | Default | Reset |
|------------------------|-----------|---------|-------|
| config_unroll          |           |         |       |
| config_storage         |           |         |       |
| config_schedule        |           |         |       |
| config_rtl             |           |         |       |
| header                 |           |         |       |
| kernel_profile         | 0         | 0       |       |
| module_auto_prefix     | 1         | 1       |       |
| module_prefix          |           |         |       |
| mult_keep_attribute    | 0         | 0       |       |
| register_all_io        | 0         | 0       |       |
| register_reset_num     | 0(vivado) |         |       |
| reset                  | none      | control | x     |
| reset_async            | none      | 0       |       |
| reset_level            | control   | high    |       |
| config_op              | state     |         |       |
| config_interface       | all       |         |       |
| config_array_partition |           |         |       |
| config_export          |           |         |       |

The reset option has four settings:

- none: No reset is added to the design
- control: This is the default and ensures all control registers are reset
- state: This option adds a reset to control registers plus any registers or memories derived from static and global variables in the C code
- all: This adds a reset to all registers and memories in the design

## Apply Your Knowledge

### Multiple Choice Question

What control signals are added by the ap\_ctrl\_hs protocol? (Select all that apply)

- ap\_start
- ap\_idle
- ap\_continue
- ap\_done

#### Correct answers:

“ap\_start”, “ap\_idle” & “ap\_done”

#### Correct feedback:

Your response is correct.

#### Incorrect feedback:

The correct answers are “ap\_start”, “ap\_idle” & “ap\_done”.

#### Try again feedback:

**Hint:** Recall the main difference between the ap\_ctrl\_hs and ap\_ctrl\_chain protocols

## Apply Your Knowledge

### True or False Question

Block-level protocols are specified on the function or the function return.

- True
- False

**Correct answer:**

True

**Correct feedback:**

You selected the correct answer

**Incorrect feedback:**

The correct answer is “True”.

## Summary

- ▶ Block-level interface protocols generate the handshaking or control signals that control the HLS block
- ▶ Three types of block-level protocols:
  - ap\_ctrl\_none: No block-level protocol
  - ap\_ctrl\_hs: Default block-level protocol
  - ap\_ctrl\_chain: Protocol to support chaining