

# Rudrajyoti Roy

Ann Arbor, MI, 48105 | [rudraroy@umich.edu](mailto:rudraroy@umich.edu) | [rudrajyotiroy@gmail.com](mailto:rudrajyotiroy@gmail.com) | (+1)734-596-4864

[linkedin.com/in/rudrajyoti-roy-9a9417187](https://www.linkedin.com/in/rudrajyoti-roy-9a9417187) | [github.com/rudrajyotiroy](https://github.com/rudrajyotiroy) | [rudrajyotiroy.github.io](https://rudrajyotiroy.github.io) | G-Scholar

## Education

|                                                                                                                                                                                                                       |                     |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------|
| <b>University of Michigan, Ann-Arbor, MI, USA</b>                                                                                                                                                                     | Aug 2024 – May 2026 |
| • <b>MS in Computer Science and Engineering</b> (Current GPA: 4.0/4.0), Advisor : <b>Satish Narayanasamy</b>                                                                                                          |                     |
| • <b>Research:</b> Exploring ways to ensure EULA compliance of on-premise AI and HPC chips remotely through hardware-enforced workload intent authorization and performance regulation ( <i>why is this useful?</i> ) |                     |
| • <b>Coursework:</b> Adv Computer Architecture, Adv Compilers, Scalable Systems for GenAI, Privacy-Enhancing Tech                                                                                                     |                     |
| <b>Indian Institute of Technology (IIT), Kharagpur, WB, India</b>                                                                                                                                                     | Aug 2018 – May 2022 |
| • <b>B.Tech Major in Electronics &amp; Elec. Comm. Eng.</b> GPA : 9.26/10 (3.9/4.0 US equivalent)                                                                                                                     |                     |
| • <b>Minor in Computer Science and Engineering, Micro-specialisation</b> in Embedded Software Modelling & Design                                                                                                      |                     |
| • <b>Major Coursework:</b> Adv VLSI, Digital Design, Embedded System, Digital Signal Processing, Control Theory                                                                                                       |                     |
| • <b>Minor Coursework:</b> Algorithms, AI, Computer Architecture, CUDA/OpenMP Programming, Hardware Security                                                                                                          |                     |

## Work Experience

|                                                                                                                                                                                                                                       |                     |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------|
| <b>Graduate Student Instructor (GSI)</b> , EECS183, University of Michigan – Ann Arbor, MI                                                                                                                                            | Aug 2025 – Present  |
| • Managed a class of 805 students (35 students per GSI) learning introduction to programming in C++, Python.                                                                                                                          |                     |
| • Conducted weekly labs, office hours, prepared and graded exams and actively contributed to holistic curriculum development, student experience and staff productivity improvement. (Average Student Rating: <b>4.5 out of 1-5</b> ) |                     |
| <b>HW System Modeling Engineer</b> , Qualcomm – Bengaluru, India                                                                                                                                                                      | Dec 2023 – Aug 2024 |
| • Single-handedly developed and verified a functional <b>co-simulation model</b> of High-Performance Audio Engine in C++ and integrated with QEMU Virtual Platform to enable <b>RTL-agnostic audio driver testing</b> .               |                     |
| • Facilitated full register set programmability with functionality and timing accurate modelling, enabling fast prototyping and <b>expediting SW driver development by 6 months</b> ahead of RTL maturity.                            |                     |
| • <b>Individually recognized by VP</b> (Audio Systems, Toronto), for pioneering first-of-its-kind execution.                                                                                                                          |                     |
| <b>Associate HW Verification Engineer</b> , Qualcomm – Bengaluru, India                                                                                                                                                               | Jul 2022 – Nov 2023 |
| • Proposed and <b>published</b> a segregable, scalable, fault-tolerant hierarchical <b>ML framework</b> for <b>early detection, termination and intelligent triaging of deadlocks</b> encountered during long-running HW simulations. |                     |
| • Integrated the proposed framework with CI/CD pipeline for automated deployment across projects. Achieved <b>40%</b> reduction in early-stage debugging time and significantly minimised cluster resource wastage.                   |                     |
| • Developed an automated assertion generation and binding flow for in-situ monitoring of post-reset memory reads and flagging incorrect POR values, improved power-aware GLS sign-off TAT by <b>25%</b> .                             |                     |
| • Took ownership of fixing testbench and writing testcases for <b>two IP blocks</b> within audio core across projects.                                                                                                                |                     |
| <b>HW Engineering Intern</b> , Qualcomm – Bengaluru, India                                                                                                                                                                            | May 2021 – Jul 2021 |
| • <b>Developed a secondary UVM monitor</b> to detect bus stalls by passively tracking bus read/write transactions.                                                                                                                    |                     |
| • Automated its integration with AXI/AHB, achieving up to <b>80% bus stall detection</b> with <5% false positives.                                                                                                                    |                     |

## Publications

|                                                                                                       |              |
|-------------------------------------------------------------------------------------------------------|--------------|
| <sup>[1]</sup> <b>Scheduling &amp; Routing Strategies for Executing Task Graphs on AdHoc Networks</b> | Nov 3, 2025  |
| Chhavi Chaudhury; <b>Rudrajyoti Roy</b> ; Rajesh Devaraj; Arnab Sarkar                                |              |
| Ad Hoc Networks Journal (Elsevier) DOI: 10.1016/j.adhoc.2025.104084                                   |              |
| <sup>[2]</sup> <b>Harnessing Machine Learning in DTM in CPU-GPU Embedded Platforms</b>                | Jan 10, 2025 |

|                                                                                                 |                  |
|-------------------------------------------------------------------------------------------------|------------------|
| [3] <b>ML Based Scalable Plug-and-Play Framework for Early Hang Detection</b>                   | May 8 - 9, 2024  |
| <u>Rudrajyoti Roy</u> ; Anshul Sengar, Ronak Shah                                               | Bengaluru, India |
| <b>Qualcomm Global SoC Conference, IP-Cores track (BEST PAPER AWARD)</b>                        |                  |
| [4] <b>Future aware Dynamic Thermal Management in CPU-GPU Embedded Platforms</b>                | Dec 5 - 8, 2022  |
| Srijeeta Maity*, <u>Rudrajyoti Roy</u> *; Anirban Majumder; Soumyajit Dey; Ashish R Hota        | Houston, TX      |
| IEEE Real-Time Systems Symposium (RTSS) DOI: 10.1109/RTSS55097.2022.00041                       |                  |
| [5] <b>Selective detection of multiple VOCs employing ZnO nanorods and PCA</b>                  | Oct 2-4, 2020    |
| Avik Sett; Tanisha Rana; <u>Rudrajyoti Roy</u> ; Tufan Saha; Tarun Kanti Bhattacharyya          | Kolkata, India   |
| Intl Conference on Electronics & Materials Engineering DOI: 10.1109/IEMENTech51367.2020.9270117 |                  |

## Projects

---

|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                       |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| <b>Attention As You Need It</b>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | Sept 2025 - Present   |
| <ul style="list-style-type: none"><li>Proposed automated rewriting framework to dynamically generate tiled and <b>fused attention kernel</b> variants (like FlexAttention) tailored to maximize performance under various <b>LLM inference</b> deployment scenarios.</li><li>Currently developing a framework to translate PyTorch kernels into compute graphs, followed by cost-constrained exploratory state-space search to <b>optimise compute and memory bottlenecks</b>.</li></ul>                                                                                                              |                       |
| <b>MirrorMaze: Compiler-guided control-flow obfuscation to prevent timing attacks</b>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | Jan 2025 - Apr 2025   |
| <ul style="list-style-type: none"><li>Developed an <b>LLVM extension</b> that automatically detects secret-dependent control flow divergence (taint analysis) and intelligently inserts minimal dummy operations to make all branches computationally equivalent.</li><li>Achieved <b>4-5% performance boost</b> over state-of-the-art obfuscation method without compromising security.</li></ul>                                                                                                                                                                                                    |                       |
| <b>Contention Aware Task Scheduling on Arbitrarily Networked Execution Platforms</b>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | July 2021 - July 2023 |
| <ul style="list-style-type: none"><li>Developed a framework for task-graph generation and scheduling on arbitrarily distributed execution platform.</li><li>Proposed two novel heuristics towards scheduling computation and communication for minimizing total makespan while adhering to contention constraints. Conducted extensive Monte-Carlo simulations to evaluate.</li></ul>                                                                                                                                                                                                                 |                       |
| <b>Future-Aware Dynamic Thermal Management in Heterogenous MPSoC Platforms</b>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | Mar 2021 - June 2022  |
| <ul style="list-style-type: none"><li>Designed and implemented a Future-Aware Supervisory Control Framework for real-time resource configuration for dynamically arriving OpenCL kernels on embedded platforms, that minimizes <b>peak platform temperature</b>.</li><li>Formulated and trained Newtonian thermal model and analytical performance model for discretized MPC.</li><li>Leveraged a thermal axiom as a heuristic upperbound to reduce an exponential state-space search into linear-time beam search, enabling real-time deployment and outperforming SOTA by 14% on average.</li></ul> |                       |
| <b>Efficient GPGPU Parallelization of RCNN pipeline for Real-time Object Detection</b>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Feb 2021 - Apr 2021   |
| <ul style="list-style-type: none"><li>Constructed efficient CUDA convolution kernels by formulating as multiplication (GEMM) in fourier domain.</li><li>Achieved 100x speedup over naive approach by leveraging cuFFT/cuBLAS support and fine-tuning using GPU optimization techniques like Im2Col flattening, tiling, fusion and memory access coalescing.</li></ul>                                                                                                                                                                                                                                 |                       |

## Skills

---

**Languages:** C, C++, Embedded C, CUDA, OpenCL, Python, Systemverilog, SystemC, MATLAB, Shell, LLVM  
**Tools:** UVM, Synopsis VCS, Verdi, gem5, SniperSim, gppuSim, Hexagon SDK, PyTorch, Docker, Git, ClearCase

## Extra-curricular Activities

---

**Robotics:** SWARM UG research group; Runner-up for JLR(**Inter-IIT Tech Meet**) and Tessaract (as Team Leader)  
**Sports and Cultural:** Chess, Table-Tennis, Badminton, Whitewater Kayaking, Music (Vocal and Keyboard)  
**Mentoring:** SWG Peer Mentor for UG Students, tutored underprivileged school children as NSS volunteer