

# UNSW COMP3601 - Design Project A - 24T3

## FPGA Microphone Audio System



### Introduction:

Audio-based applications are becoming increasingly advanced as technology progresses. From music production to virtual reality, film soundscapes and live teleconferences, there is a growing need for specialized audio solutions that can handle complex processing tasks in real-time. Such demands can exceed the capabilities of general purpose hardware: this is where *custom* hardware can come into play.

Custom hardware is specifically designed to meet the unique needs of an application, providing improved performance and efficiency. For advanced audio applications, custom hardware can be used to implement specialized algorithms, such as digital signal processing (DSP) routines and/or interfaces that enable real-time processing of large amounts of audio data.

Field Programmable Gate Arrays (FPGAs) are the most popular avenue for the development and deployment of custom hardware. They are flexible and highly configurable devices that can be programmed to perform a wide range of digital processing tasks. Compared to traditional single-purpose hardware, FPGAs can reconfigure themselves to achieve application specific needs: such as those posed by modern audio engineering.

### Overview:

In this course you will work both individually and as part of a team to implement a custom microphone system using an FPGA-based Multiprocessor System-on-Chip (MPSoC) - the Xilinx Kria KV260. This is a ‘best of both worlds’ device that has both FPGA fabric (to provide the advantages of custom hardware) as well as a ‘hard’ ARM quad-core processor (to provide the benefits of a traditional processor).

As an individual, you will first familiarize yourself with the FPGA by writing some custom hardware to blink an LED, then connect this and a button to the internal ARM core via AXI. We term this the ‘Individual Development’. Once this is completed, you can join a team.

In your team you will connect an I2S MEMS microphone to the KV260 by developing an I2S bus master and connecting to the ARM AXI. You will use this to collect audio to a file (the ‘Basic Requirements’). Once this is achieved, you will then take ownership of the project and develop it in a unique way specific to your interests (the ‘Application’). The teaching staff have provided you with some application project ideas, or you can work with them to design your own!

## Setup for the Basic Requirements:



Figure 1: Basic Requirements System Diagram

In this project, you and your team will first implement the basic system diagram specified in Figure 1: That is, you will connect the I2S microphone to the Kria board and capture the audio data. This data will need to be output either via the Terminal or by saving it as a .wav audio file.

## The Development Board:

The FPGA kit used in this project is the Xilinx Kria KV260 Vision AI Starter Kit, a development board including a Zynq UltraScale+ MPSoC. This platform combines a quad-core ARM Cortex A53 hard processor (the Processing System / PS) and an FPGA (the Programmable Logic / PL). The processor will run a custom Linux image called PetaLinux. In PetaLinux, device drivers and application software can be executed - you will develop these to save audio samples from the microphone and perform any other needed operations.

## I2S MEMS Microphone:

The provided microphone is a MEMS device which converts sound to voltage, giving out the sampled audio as a purely digital signal. It is capable of capturing sound waves with frequencies from 50Hz to 15KHz, which is good for all general audio recording/processing applications. The microphone supports I2S (Inter-IC Sound) serial bus interface protocol, which is used for connecting digital audio devices together. This will be the protocol used in this project to transfer the sampled audio data to the FPGA-based audio processing pipeline.

## Provided Components (per individual):

|                                       |                   |                          |
|---------------------------------------|-------------------|--------------------------|
| 1 x Kria KV260 Vision AI Starter Kit; | 1 x Power Supply; | 1 x I2S MEMS Microphone; |
| 1 x KV260 PMOD board                  | 1 x USB cable     |                          |

## Requestable Components

Breadboards, Wires, LEDs, switches, additional microphone, I2S amplifiers, speakers

## Basic Project Layout (Source):



Figure 2: Basic Requirements Project Source Layout

You will be provided with a template project (Figure 2). You will be able to use some existing code, and will need to write some of your own. Once completed, you will have achieved the Basic Requirements - and you can then choose what to extend to achieve your Application! Several Application ideas are included in this specification document (Appendix 1).

### Reference material:

- <https://www.xilinx.com/support/download.html>
- <https://www.xilinx.com/products/som/kria/kv260-vision-starter-kit.html>
- <https://www.xilinx.com/products/silicon-devices/soc/zyng-ultrascale-mpsoc.html>
- <https://www.adafruit.com/product/3421>
- <https://digilent.com/reference/pmod/pmodbb/reference-manual?redirect=1>
- <https://en.wikipedia.org/wiki/I%C2%BA2S>

### Course Project Rubric / Marking Guidance:

The project is divided up into a number of deliverables, divided up over a number of *milestones*. ID in Milestones 1 and 2 indicates an Individual Development and is completed before you join a team. BR in Milestones 3 and 4 indicates a Basic Requirement (must be completed).

Applications in Milestone 5 will be graded on an effort/difficulty scale, i.e. **they do not need to be at 100% to receive full credit**, but work must demonstrate good faith progress to the final goal. Markers should be convinced that given enough time, you *could* have finished the application. Please see Appendix 2 for more details.

Note that although this is a group project, only 30% of the marks are group marks and you are primarily assessed **individually**. See Appendix 2 for more details.

## Appendix 1 - Application Ideas:

|                                                                                     |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
|-------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|    | <p><b>Hardware Audio Output:</b> In the basic requirements, we examine only what is required to capture audio to .wav files. Ask your TA for additional hardware for I2S output and a speaker (or pair of speakers - try for multiple channels!)</p> <p>In this application, you will build a similar pipeline to that already completed, but this time for the production of audio. You should make it such that it can either reproduce the audio from the mic. or play .wav files directly.</p>                                              |
|    | <p><b>Sound Mixing:</b> Advanced audio applications need to be able to combine and adjust different audio tracks.</p> <p>In this application, consider adding the second microphone to your FPGA, then add support for rudimentary (or advanced!) adjustments: choosing between microphone channels, volume and frequency adjustments. You could have software or hardware control of this process (e.g. add expansion hardware with buttons and knobs).</p>                                                                                    |
|   | <p><b>Sound FX:</b> Some of the most distinctive villains in film history owe much of their identity to vocal / audio augmentation.</p> <p>Similarly to Sound Mixing above, in this application you could thus experiment with the kinds of filters and processing which are used to add effects to sound (e.g. echos, distortions, delays, tonal shifts). To implement these you will need to investigate various filters, such as FIR and IIR. Work towards goals such as real-time processing and implementations using custom hardware!</p> |
|  | <p><b>Musical Instrument:</b> As another variant of the above sound mixers/FX, music synthesizers can be created by the mixing of tones in hardware or software.</p> <p>In this application you would add support to your platform for the production of music by using off-chip hardware for the playable space. Consider your input options! You could use analog inputs (e.g. LDR) to make a quirky theramin, or use switches and keys for making an instrument closer to a keyboard or digital piano.</p>                                   |
|  | <p><b>Compression:</b> Modern technology tends not to use .wav files for the transfer of audio - we instead compress the sound waves to better formats. This is most often done with custom hardware, e.g. mp3 encoder/decoder co-processors!</p> <p>In this application, you will explore how these are achieved, and likewise implement some kind of compression pipeline which works with your processor. For bonus marks, ensure that the files are playable using another device (e.g. VLC on a PC!).</p>                                  |
|  | <p><b>Networking:</b> Modern audio setups often feature multiple devices connected via networks or the internet (single room theaters to international teleconferencing).</p> <p>Here, you will explore how your audio device may be made part of a greater whole. This could be done through the DEP platform provided by Audinate or via a custom stack that you define yourself. Your audio should be receivable / replayable by another machine!</p>                                                                                        |

|                                                                                                                                                                        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <br> | <p><b>Security:</b> In World War II, the SIGSALY project was used to encrypt the highest level of Allied communications. It performed this by cryptographically scrambling the audio waves in hardware.</p> <p>In this project you could similarly protect your microphone data. To avoid the risks of software compromise in PetaLinux, you could add custom hardware to encrypt your audio signal before it is exposed to the processor.</p> <p>Of course, encrypting the data is no good if we can't decrypt it, so you should provide additional components (which could be in hardware or software) which are capable of later decoding your audio.</p> <p>For this purpose, you may consider investigating algorithms like AES, DES, RC5, or ASCON and design a way for them to be combined with the audio processing pipeline.</p> |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|

### Hard application ideas ↓

|                                                                                     |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|-------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|   | <p><b>Voice Recognition / "Wake word":</b> While full speech-to-text transcription is unlikely to be able to run locally, you could combine a networking goal with an online transcription service such as OpenAI's 'Whisper'. However, what should be sent? Voice 'AI assistants' like Amazon Alexa have a 'wake word' which is processed locally before sending the audio stream to a server.</p> <p>In this application, you could try making one or other (or both) of these components, possibly utilizing statistical or AI models.</p> |
|  | <p><b>Sound Localization:</b> Many real-world products in the consumer and industrial space feature sound localization, where you determine the direction of an audio source (e.g. Amazon Alexa directional lights).</p> <p>Developing a system that can identify the direction from which a sound is coming can be challenging. You will need to combine multiple microphone inputs with signal processing and direction-of-arrival (DoA) algorithms.</p>                                                                                    |
|  | <p><b>Noise Cancellation:</b> Over-ear headphones are frequently equipped with noise cancellation technology, which is a system that uses inverse sound wave techniques to reduce or eliminate surrounding noise.</p> <p>By combining both 'hardware audio output' and 'sound effects' you can try out techniques for this purpose - by combining 'audio mixing' for a playback stream you're then well on the way to a real product!</p>                                                                                                     |
|  | <p><b>Something else:</b> You might find that you have another idea from that provided above, and wish to try forge your own path forwards!</p> <p>In this course, that is completely fine: you will just need to coordinate with the teaching staff in advance so that we can verify that your application idea has a suitable scope and level of difficulty for the course.</p>                                                                                                                                                             |

## Appendix 2 - Deliverables:

This course is divided into five milestones with the following deliverables:

| Points (/100) | Course Component                            | Task/Feature                                                                                                                                                                                                                                                                                                                                                                     | Milestone  |
|---------------|---------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|
| 10            | Individual FPGA Demonstration (Part 1)      | <u>ID</u> : Custom hardware can be deployed to the MPSoC FPGA fabric<br><br><i>(10 pts) Demonstrated via in-lab demo to a tutor. Q&amp;A to test understanding.</i>                                                                                                                                                                                                              | M1 (wk1/2) |
| 10            | Individual FPGA Demonstration (Part 2)      | <u>ID</u> : Custom hardware integrating LED, button, and Linux MPSoC<br><br><i>(10 pts) Demonstrated via in-lab demo to a tutor. Q&amp;A to test understanding.</i>                                                                                                                                                                                                              | M2 (wk2/3) |
| 5             | Group Project Demonstration                 | <u>BR</u> : I2S Bus Master implemented correctly. Some data can be received in the Zynq PS.<br><br><i>(5 pts) In-lab demo (preliminaries) - showing group ability to capture audio to the Zynq PS.</i>                                                                                                                                                                           | M3 (wk5)   |
| 10            | Individual Report and Peer Review (scaled!) | Preliminary Project Plan:<br>Page 1: Report on the group project + plan,<br>Page 2: Report on your personal contribution,<br>Page 3: Peer review of the other group members.<br><br><i>(10 pts) PDF report describing progress and your individual contribution. Marks may be scaled based on peer reviews.</i>                                                                  | M3 (wk5)   |
| 5             | Group Project Demonstration                 | <u>BR</u> : A software program written with clean code running on the Zynq PS which saves the audio .wav files readable by other computers.<br><br><i>(5 pts) In-lab demo, showing all basic requirements.</i>                                                                                                                                                                   | M4 (wk7)   |
| 5             | Group Project Files and Documentation       | <u>BR</u> : I2S Bus Master implementation with clean code and good documentation.<br><br><i>(5 pts) BR project files (zip) and Interim documentation.</i>                                                                                                                                                                                                                        | M4 (wk7)   |
| 5             | Individual FPGA Demonstration (Part 3A)     | <u>BR</u> : Individual interview with tutor. A tutor will conduct Q&A on each individual's contribution to the project and test understanding.<br><br><i>(5 pts) Alongside group in-lab demo. Q&amp;A.</i>                                                                                                                                                                       | M4 (wk7)   |
| 10            | Individual Report and Peer Review (scaled!) | Interim Report:<br>Page 1: Report on the group BR outcomes<br>Page 2: Application plan + System Diagram<br>Page 3: Tasks and timeline<br>Page 4: Individual contribution + peer review<br><br>Provide justifications and “sell the solution”!<br><br><i>(10 pts) PDF report describing progress and your individual contribution. Marks may be scaled based on peer reviews.</i> | M4 (wk7)   |

|    |                                             |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |           |
|----|---------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------|
| 10 | Group Project Demonstration                 | <u>Application:</u> Implementation<br><i>(10 pts) In-lab demo of application.</i>                                                                                                                                                                                                                                                                                                                                                                                                 | M5 (wk10) |
| 5  | Individual FPGA Demonstration (Part 3B)     | <u>Application:</u> Individual interview with tutor. A tutor will conduct Q&A on each individual's contribution to the project and test understanding.<br><br><i>(5 pts) Alongside group in-lab demo. Q&amp;A.</i>                                                                                                                                                                                                                                                                | M5 (wk10) |
| 5  | Group Project Files and Documentation       | <u>Application:</u> Application implementation with clean code and good documentation.<br><br><i>(5 pts) Well-written documentation and software. In the documentation, describe what was provided and how it did/would have fit into the bigger picture of what the application intended e.g. via architecture diagram.</i>                                                                                                                                                      | M5 (wk10) |
| 20 | Individual Report and Peer Review (scaled!) | Individual Report and Peer Review<br>Maximum 6 pages including a peer review.<br><br><i>(20 pts) Reflection on the project, on your individual contribution, on challenges encountered and solutions found, what went well and what could have gone better. Explain what the project intended to accomplish and your feelings on whether or not this was or could have been achieved. Describe how your efforts fitted into the broader team. Discuss how the team performed.</i> | M5 (wk10) |

### Appendix 3 - Learning Principles

The course relates to the following learning principles:

| ESY Embedded Systems                |                                                            |
|-------------------------------------|------------------------------------------------------------|
| ESY-1                               | History and overview                                       |
| ESY-2                               | Relevant tools & standards                                 |
| ESY-3                               | Characteristics of embedded Systems                        |
| ESY-6                               | Asynchronous and synchronous serial communications         |
| ESY-7                               | Periodic interrupts, waveform generation, time measurement |
| ESY-8                               | Data acquisition, control, sensors, actuators              |
| ESY-13                              | Computing platforms for embedded systems                   |
| SPE Systems and Project Engineering |                                                            |
| SPE-3                               | Project management principles                              |
| SPE-6                               | Hardware and software processes                            |
| SPE-7                               | Requirements analysis and elicitation                      |
| SPE-8                               | System specifications                                      |
| SPE-9                               | System architectural design and evaluation                 |
| SPE-10                              | Concurrent hardware and software design                    |
| SPE-11                              | System integration, testing, and validation                |

### Late penalty (standard UNSW policy)

- 5% per day off maximum/awarded mark, after 5 days no submissions accepted,
- Special considerations aside.