



Karachi Institute of Engineering & Technology (KIET)  
CoCIS Department

# IoT Bitstream Log Validator (TM Based)

Theory of Automata Project

**Course Name:** Theory of Automata  
**CID:** 118692  
**Instructor:** Miss Misbah

## Group Members

| Name                | Student ID |
|---------------------|------------|
| Syed Muhammad Nihal | 66044      |
| Zaid Maju           | 66057      |

# Abstract

When making and testing IoT and embedded systems, we often record serial communication logs. These logs help us find problems like messed-up data or framing issues. Our project made a simple Turing Machine (TM) simulator in Python. It checks UART-like frames in these logs.

The TM reads a raw data stream and checks its structure: the start bit, data bits, parity, and stop bit. Then, it writes notes on the tape to show if each frame is good or bad. We focused on making sure you can follow every step. Every decision the TM makes—reading, writing, moving—is clear, which is perfect for showing how Theory of Automata works.

# Contents

|          |                                         |          |
|----------|-----------------------------------------|----------|
| <b>1</b> | <b>Introduction</b>                     | <b>1</b> |
| 1.1      | Why We Did This . . . . .               | 1        |
| 1.2      | Why This Matters . . . . .              | 1        |
| 1.3      | What We Wanted to Do . . . . .          | 1        |
| <b>2</b> | <b>Other Projects Like This</b>         | <b>2</b> |
| 2.1      | Automata and TM Simulators . . . . .    | 2        |
| 2.2      | Protocol Validation . . . . .           | 2        |
| 2.3      | UART Framing and Parity . . . . .       | 2        |
| <b>3</b> | <b>How We Did It</b>                    | <b>3</b> |
| 3.1      | Overall Approach . . . . .              | 3        |
| 3.2      | Input Format . . . . .                  | 3        |
| 3.2.1    | Frame Format (8E1 Model) . . . . .      | 3        |
| 3.3      | TM Tape Design . . . . .                | 3        |
| 3.4      | How the TM Head Moves . . . . .         | 4        |
| 3.5      | Python Implementation Details . . . . . | 4        |
| 3.5.1    | Core Components . . . . .               | 4        |
| 3.5.2    | Output . . . . .                        | 4        |
| <b>4</b> | <b>Results &amp; Experiments</b>        | <b>5</b> |
| 4.1      | Experimental Setup . . . . .            | 5        |
| 4.2      | Test Data . . . . .                     | 5        |
| 4.3      | Observed Output . . . . .               | 5        |
| 4.4      | Screenshots . . . . .                   | 6        |
| 4.5      | Discussion . . . . .                    | 6        |
| <b>5</b> | <b>Conclusion</b>                       | <b>7</b> |

# 1 Introduction

## 1.1 Why We Did This

IoT devices commonly communicate using lightweight serial protocols, and during debugging it is normal to capture raw bitstreams for offline analysis. A practical challenge is verifying whether frames in a captured log follow the expected format (e.g. correct start/stop bits and correct parity). This project uses a Turing Machine to model that verification process in a way that is explainable step-by-step, instead of taking validation as a black-box function.

## 1.2 Why This Matters

A finite state machine can model control flow, but a TM can additionally rewrite its tape and revisit earlier positions, that makes it a strong educational model for “mechanical” computation. In this project, the TM does not only just accept/reject; it also rewrites the tape to produce a marked output stream that looks like a tester’s validation report. This makes the project meaningful both as (i) a Theory of Automata implementation and (ii) a simplified example of protocol trace validation.

## 1.3 What We Wanted to Do

- Make a Python simulator for a single-tape, deterministic TM
- Check UART-like 8E1 frames in a recorded data stream.
- Show the TM head moving both ways (right to read, left to mark results).
- Make a report file that shows the original log and the checked output.

## 2 Other Projects Like This

This project got its ideas from two main areas: (1) simulators for learning about automata and TMs, and (2) ways to check communication protocols.

### 2.1 Automata and TM Simulators

Hamada (2013) discusses simulators for computational models (including Turing machines) as a learning environment where students can experiment with machine behavior step-by-step [4]. Such work supports the idea that a TM simulator is not only a theory exercise but also a practical educational tool.

### 2.2 Protocol Validation

Holzmann (1987) presents an approach to automated protocol validation (Argos) based on formal models of communicating finite state machines, emphasizing systematic validation and analysis of protocol behaviors [5]. While our implementation is intentionally simplified, the overall motivation is similar: validate communication behavior using a formal model and produce interpretable traces.

### 2.3 UART Framing and Parity

UART framing is commonly described using a start bit, data bits, an optional parity bit, and a stop bit [2, 3]. Parity is used as a basic error detection mechanism, where even parity sets the parity bit so the total number of 1s becomes even [1].

# 3 How We Did It

## 3.1 Overall Approach

The solution is split into two layers:

- **TM Engine Layer:** Implements tape, head, states, transition function, and step execution.
- **Protocol Validator TM:** A concrete TM “program” (transition table) that validates UART-like frames in a stream.

## 3.2 Input Format

The input is a text log file (e.g., `iot_capture.log`), where each line contains metadata and a `raw=` field holding a bitstream. Frames are separated by the delimiter `|`. Noise/idle bits may appear between frames.

### 3.2.1 Frame Format (8E1 Model)

We validate a simplified UART-like frame:

$$0 \ d_0d_1d_2d_3d_4d_5d_6d_7 \ p \ 1$$

where 0 is start bit,  $d_0..d_7$  are 8 data bits, p is the parity bit, and 1 is the stop bit [2, 3]. Even parity is enforced so that the parity bit makes the total number of 1s in data+parity even [1].

## 3.3 TM Tape Design

- Tape alphabet includes: 0, 1, |, \_ plus annotation symbols s, K, E.
- s is a temporary marker written on the start bit when a frame is detected.
- K indicates a validated (OK) frame; E indicates an invalid frame.

## 3.4 How the TM Head Moves

The validation cycle for each frame is:

1. Move **right** to locate a start bit (0) and mark it as **s**.
2. Move **right** through data bits and parity bit, tracking parity in the control states.
3. Read stop bit and decide OK/ERROR.
4. Move **left** back to the **s** marker and overwrite it with K or E.
5. Move **right** to the next delimiter | and repeat.

This explicitly demonstrates that the TM uses both directions and uses tape rewriting as part of computation.

## 3.5 Python Implementation Details

### 3.5.1 Core Components

- **Tape**: a sparse mapping from indices to symbols, supporting infinite extension.
- **TuringMachine**: executes  $\delta(q, a) = (q', b, m)$  via a transition dictionary.
- **Trace Mode**: prints step-by-step state, head index, read symbol, write symbol, and movement.

### 3.5.2 Output

After validation, the program writes `validated_output.log` containing:

- Original selected log line
- Raw bitstream
- TM-marked bitstream
- Summary counts: number of OK frames and error frames

# 4 Results & Experiments

## 4.1 Experimental Setup

Experiments were run on a standard development environment using PyCharm. The input log file contains multiple captured lines; each test selects one entry and validates its `raw=` field.

## 4.2 Test Data

The dummy log simulates real capture-style entries, including timestamps, device IDs, and a raw stream with multiple frames separated by `|`. Some frames are intentionally constructed to contain parity or stop-bit errors to demonstrate detection.

## 4.3 Observed Output

The main result is a transformed bitstream where each frame's start bit is overwritten with:

- K for valid frames
- E for invalid frames

This output is saved into `validated_output.log` for traceability.

## 4.4 Screenshots

(a) Console run (record selection and raw stream).

(b) Trace mode showing read/write/move steps.

(c) Generated validated\_output.log (report file).

Figure 4.1: Execution evidence of the TM-based validator.

## 4.5 Discussion

In trace mode, the step- by -step output confirms that validation is achieved through TM transitions rather than direct high- level parsing . The leftward “return” phase is especially important in showing tape rewriting: the TM revisits the start of the frame to write the final decision symbol.

## 5 Conclusion

This project implemented a deterministic single-tape Turing Machine simulator and applied it to a realistic testing scenario: validating UART-like frames inside an IoT serial capture log. The main contribution is not only accept/reject behavior but also a rewritten tape that marks frame-level outcomes, producing an output artifact similar to a tester’s report. Future work could extend the validator to more advanced framing (e.g., bit-stuffing protocols) or to compute stronger integrity checks.

# Bibliography

- [1] Uart: A hardware communication protocol understanding ...  
<https://www.analog.com/en/resources/analog-dialogue/articles/uart-a-hardware-communication-protocol.html>.
- [2] Stm32 uart frame formats: Start, data, parity, and stop bits. <https://fastbitlab.com/stm32-uart-lecture-3-uart-frame-formats/>.
- [3] Universal asynchronous receiver-transmitter. [https://en.wikipedia.org/wiki/Universal\\_asynchronous\\_receiver-transmitter](https://en.wikipedia.org/wiki/Universal_asynchronous_receiver-transmitter).
- [4] Mohamed Hamada. Turing machine and automata simulators. *Procedia Computer Science*, 2013. URL <https://www.sciencedirect.com/science/article/pii/S1877050913004572>. Available online via ScienceDirect.
- [5] Gerard J. Holzmann. Automated protocol validation in argos: Assertion proving and scatter searching. *IEEE Transactions on Software Engineering*, 1987. URL <https://spinroot.com/gerard/pdf/inprint/ieee87.pdf>. PDF available online.