

# Soumil Paranjpay

San Diego, CA | +1 (619) 953-7058

[soumil07.com](http://soumil07.com) | [linkedin](https://www.linkedin.com/in/soumil-paranjpay/) | [soumil.paranjpay@gmail.com](mailto:soumil.paranjpay@gmail.com)

## Education

---

|                                                                                                     |                       |
|-----------------------------------------------------------------------------------------------------|-----------------------|
| <b>University of California, San Diego</b>                                                          | Sep. 2024 – Dec. 2025 |
| Master of Science in Electrical & Computer Engineering – CGPA: <b>3.78/4.00</b>                     |                       |
| <b>Vishwakarma Institute of Technology, Pune</b>                                                    | Aug. 2020 – Jun. 2024 |
| B. Tech in Electronics & Telecommunication Engineering – CPI: <b>8.89/10.00</b> (Class Rank: 5/307) |                       |

## Work Experience

---

|                                                                                                                                                                                      |                                          |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------|
| <b>Apple – GPU Architecture Modelling Engineer</b>                                                                                                                                   | Santa Clara, CA   Jan. 2026 – Present    |
| • Incoming Jan. 2026                                                                                                                                                                 |                                          |
| <b>JPMorganChase – Software Engineering Intern</b>                                                                                                                                   | Mumbai, India   Jan. 2024 – Jun. 2024    |
| • Spearheaded the development of an <b>automation tool</b> for .NET Framework to Core migration, achieving a <b>95% reduction in migration times</b> through advanced code analysis. |                                          |
| • Collaborated with cross-functional teams to ensure seamless integration and adoption of the tool, significantly enhancing project efficiency.                                      |                                          |
| <b>JPMorganChase – Software Engineering Intern</b>                                                                                                                                   | Bengaluru, India   Jun. 2023 – Jul. 2023 |
| • Assisted in deployment, debugging, and writing Kubernetes manifests for the WMDM team in the document management space.                                                            |                                          |

## Projects

---

|                                                                                                                                                                                                                                                                |                       |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| <b>Out-of-Order RISC-V Processor (C++)</b>                                                                                                                                                                                                                     | Jan. 2025 – Mar. 2025 |
| • Architected and modelled a scalar Out-of-Order (OoO) processor incorporating <b>speculative execution</b> (GShare predictor), <b>dynamic instruction scheduling</b> via an instruction queue, and precise state management with <b>in-order retirement</b> . |                       |
| • Developed a configurable C++ <b>performance simulator</b> to quantify the CPI impact of varying microarchitectural parameters (e.g., instruction queue depth, predictor table size) across different SPEC-CPU workloads.                                     |                       |

|                                                                                                                                                                     |                       |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| <b>Reconfigurable Systolic Array AI Accelerator (Verilog, Python)</b>                                                                                               | Sep. 2025 – Dec. 2025 |
| • RTL Design, prototyping, and verification of a 16x16 systolic array AI accelerator, with reconfigurable SIMD and output-stationary modes for maximum flexibility. |                       |
| • Trained quantized <b>VGGNet</b> and validated modified convolution layer to 16x16 accelerator tile.                                                               |                       |
| • Mapped to Altera Cyclone FPGA and optimized for power and throughput with HW/SW codesign                                                                          |                       |

|                                                                                                                                       |                       |
|---------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| <b>Low-Power Dual Core Machine Learning Accelerator (Verilog)</b>                                                                     | Jan. 2025 – Mar. 2025 |
| • Designed and optimized RTL for a 16x16 <b>systolic array</b> for attention calculation.                                             |                       |
| • Implemented multi-VT place-and-route, <b>clock gating, power gating</b> to reduce power and improve PPA metrics by <b>45%</b> .     |                       |
| • Optimized RTL for sparse vector multiplication and implemented dual-core communication using async <b>4-way handshake</b> protocol. |                       |

## Skills

---

**Languages and Tools:** C, C++, Python, Verilog, SystemVerilog, TCL, Gem5, Verilator