

**New York University Tandon School of Engineering**  
Electrical and Computer Engineering  
Course Outline ECE 6913 [Computing Systems Architecture], Fall 2025  
**Instructor: Azees Bhavnagarwala**

*Instructor Contact :* [ajb20@nyu.edu](mailto:ajb20@nyu.edu)

*Course Assistants & Graders contact:*

Abhishek Koti, [ak11546@nyu.edu](mailto:ak11546@nyu.edu)

Riya Shah, [rs9177@nyu.edu](mailto:rs9177@nyu.edu)

Vishwajeet Kulkarni, [vk2630@nyu.edu](mailto:vk2630@nyu.edu)

Shreyas Kootiganahalli, [sk12200@nyu.edu](mailto:sk12200@nyu.edu)

Kushal Gadhiya, [khg9056@nyu.edu](mailto:khg9056@nyu.edu)

Sakshi Bhavsar, [sqb9086@nyu.edu](mailto:sqb9086@nyu.edu)

*Instructor Off. Hours:*

Directly after Class

*Course Assistant Off. Hours:*

Monday, Tuesday, Wednesday, Thursday **9:30 AM – 11:00 AM**

*Instruction:*

In-person Classroom discussions:

Section A: Thursday 2:00 PM – 4:30 PM (5 MetroTech, Pfizer Auditorium)

Section B: Tuesday 5:00 AM – 7:30 PM (6 MetroTech, Jacobs Hall Room 475)

**INET Section:** Discussion Hour, Wednesdays 9AM – 10AM (Zoom)

**Course Prerequisites:** Basic knowledge of digital logic and Computer Architecture is assumed. If you have not taken an undergraduate level class on Computer Architecture, you will need to supplement course work with additional preparation – *please review introductory texts posted on Brightspace.*

## Weekly Schedule:

| ECE 6913                    | MON                               | TUE                                                                     | WED                                    | THU                                                                         | FRI                                      |
|-----------------------------|-----------------------------------|-------------------------------------------------------------------------|----------------------------------------|-----------------------------------------------------------------------------|------------------------------------------|
| <b>9 AM – 10 AM</b>         |                                   |                                                                         | <b>Discussion Hour<br/>(Zoom) INET</b> |                                                                             |                                          |
| <b>9:30-11:00 AM</b>        | <b>CA Office Hours<br/>(Zoom)</b> | <b>CA Office Hours<br/>(Zoom)</b>                                       | <b>CA Office Hours<br/>(Zoom)</b>      | <b>CA Office Hours<br/>(Zoom)</b>                                           |                                          |
| <b>2:00 PM –<br/>4:30PM</b> |                                   |                                                                         |                                        | <b>Lecture</b><br>Section A: MT 5<br>Pfizer Auditorium<br><b>Instructor</b> |                                          |
| <b>5:00 – 7:30 PM</b>       |                                   | <b>Lecture</b><br>Section B: MT 6<br>Jacobs Rm 475<br><b>Instructor</b> |                                        |                                                                             | <b>Weekly HW<br/>due by 11:55<br/>PM</b> |

## Detailed Course Description:

ECE 6913 provides a solid foundation for graduate students to understand modern computing systems architecture and to apply these insights and principles to design computer systems given the emerging age of domain specific architectures and an unprecedented growth in markets for trillions of pervasive, energy-efficient and ultra-low-cost processor chips.

The course begins with an introduction to the foundations of general-purpose computing - the Von Neumann architecture and incumbent Instruction Set Architectures (ISAs). Understanding the evolution of computing applications and more relevantly, the limitations of underlying silicon CMOS and interconnect technologies enable us to appreciate the challenges the industry now sees and the opportunities for innovation across the compute stack we could pursue to resolve these in the foreseeable future. RISC computing is introduced as an ISA that emerged from the falling cost of Memory per bit and the need for efficient execution using simple, regular and small instruction sets. Older ISAs were required to use powerful and complex instructions to maintain a compact program size give the higher system cost from storing larger programs.

The course reviews in detail the open-source RISC-V ISA that has proliferated over the last 5-7 years across IoT, mobile, embedded and HPC systems. We will review the formats and the encoding of its compact Base Integer ISA of just 47 instructions. The course continues with IEEE 754 representation of Real numbers for Floating Point arithmetic components, reviewing the RV32FD - Single/Double Precision Floating Point RISC V extensions, RV32C extensions for Compressed instructions and RV32V - Vector extensions of the RISC V ISA

We will look at the basics of pipelining and its implications on the data path of a general-purpose processor - its implementation & control of the classic five-stage RISC pipeline in detail, and will then examine its performance considering hazards - how these degrade performance and how they are resolved. Branch prediction is introduced as an advanced technique to reduce direct stalls attributable to branches. Out of order instruction execution is introduced as a technique where an instruction executes as soon as its data operands are available – reducing stalls seen in an in-order execution pipeline.

Because high-end processors have multiple cores, the bandwidth requirements on the memory hierarchy are greater than for single cores with the gap between CPU memory demand and on-chip bandwidth continuing to grow with the number of cores. The course details an example memory hierarchy of the Intel quad Core i7 6700 that delivers a total peak data and instruction reference demand bandwidth of 409.6 GiB/s at a clock rate of 4.2GHz - accomplished by multiporting and pipelining the caches using three levels

of caches with two private levels per core and a shared L3 with separate instruction and data arrays at the first level.

Architecture innovations over the past few decades may not be a good match for the emerging AI domains supporting image or speech recognition and GenAI. The class reviews example AI Hardware accelerators – a GPU and a custom CMOS ASIC - the TPU for the data center. Cost-performance comparisons of the TPU with CPUs and GPUs using DNN benchmarks reveal the opportunity of an upcoming renaissance for computer architecture

**Online Course Content Schedule:** Weekly lecture slide sets and HW assignments, reading assignments to be made available on NYU Brightspace. Homework Assignments are due weekly. Please submit HW assignments as PDFs of Word documents with your identifying information and not on hand-written sheets of paper. Please prefix your HW assignment submission file name with your netID followed by the HW assignment. For example, I would submit HW 4 as a PDF document with filename: *ajb20\_HW4.pdf*

#### **Course structure:**

Your performance in the course will be assessed via **weekly HW assignments (10% of total grade)** Project (**10% of total grade**): RISCV processor simulator [**Project A**], **2 Midterms (50% of total grade)** and a final (**25% of total grade**). Class Participation (**5% of grade**). There will also be pop-quizzes and (extra credit) HW assignments. Participation in these activities is highly encouraged.

**Course Textbooks:** [1] Hennessy and Patterson (RISC-V edition) of “Computer Organization & Design, Hardware-Software Interface” [2] Hennessy and Patterson, “Computer Architecture: A Quantitative Approach” [6<sup>th</sup> Edition], Morgan Kaufmann.

#### **Policy on Academic Honesty:**

In pursuing these goals, NYU expects and requires its students to adhere to the highest standards of scholarship, research and academic conduct. Essential to the process of teaching and learning is the periodic assessment of students' academic progress through measures such as papers, examinations, presentations, and other projects. Academic dishonesty compromises the validity of these assessments as well as the relationship of trust within the community. Students who engage in such behavior will be subject to review and the possible imposition of penalties in accordance with the standards, practices, and procedures of NYU and its colleges and schools. Violations may result in failure on a particular assignment, failure in a course, suspension or expulsion from the University, or other penalties.

More details about specific actions that constitute a violation of the NYU policy can be found here. <https://www.nyu.edu/about/policies-guidelines-compliance/policies-and-guidelines/academic-integrity-for-students-at-nyu.html>

#### **Moses Center Statement of Disability:**

If you are student with a disability who is requesting accommodations, please contact New York University's Moses Center for Students with Disabilities at 212-998-4980 or [mosescsd@nyu.edu](mailto:mosescsd@nyu.edu). You must be registered with CSD to receive accommodations. Information about the Moses Center can be found at [www.nyu.edu/csd](http://www.nyu.edu/csd). The Moses Center is located at 726 Broadway on the 2nd floor.

**Course Schedule:**

| <b>Week</b> | <b>Weekly HW Release date</b> | <b>ECE 6913 Content</b>                                                                                                                                                                                                                                                                                                               | <b>Assignment</b>                                                                                      |
|-------------|-------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------|
| <b>1</b>    | 9/5                           | Quantitative Design & Analysis, Physical limits on scaling CMOS – End of Dennard Scaling and Moore’s Law. Evolution of System architecture, cost, energy efficiency and performance driven by emerging workloads dominated by AI. Hardware limitations and opportunities from ISAs, DSAs and Wafer Scale/ 3D Heterogenous Integration | HW 1                                                                                                   |
| <b>2</b>    | 9/12                          | Introduction to the RISC-V, RISC-V Instructions, Instruction formats, encoding, memory management. Open-Source RISCV ISA review. Comparisons with older ISAs                                                                                                                                                                          | HW 2, 3                                                                                                |
| <b>3</b>    | 9/19                          |                                                                                                                                                                                                                                                                                                                                       |                                                                                                        |
| <b>4</b>    | 9/26                          | <i>Introductory Review of Project, Midterm 1 Review Problems</i>                                                                                                                                                                                                                                                                      | HW 4                                                                                                   |
| <b>5</b>    | 10/3                          | <b>Midterm I</b> (Tue Sept 30, Sec B) (Thu Oct 2 <sup>nd</sup> Sec A)                                                                                                                                                                                                                                                                 | Project Phase 0 Due:<br>Complete trial submission of Shell assignment to align with auto grader format |
| <b>6</b>    | 10/10                         | No Class on Tue Oct 14 ( for Sec B). <u>Class will be held on Thu Oct 16 (for Sec A) Monday Schedule at University on Tue 10/14</u>                                                                                                                                                                                                   | HW 5                                                                                                   |
| <b>7</b>    | <b>Oct 17</b>                 | IEEE 754 representation & Floating-point Arithmetic for Computers. RV32FD – FP registers, FP load/stores/arithmetic, FP moves/converts                                                                                                                                                                                                | HW 6                                                                                                   |
| <b>8</b>    | 10/24                         | Pipelining: Basic & Intermediate concepts. Classic 5 stage RISC processor pipeline.                                                                                                                                                                                                                                                   | HW 7                                                                                                   |
| <b>9</b>    | 10/31                         | Pipeline Hazards, Pipelining implementation                                                                                                                                                                                                                                                                                           | HW 8                                                                                                   |
| <b>10</b>   | 11/7                          | <i>Introduction to Phase II of Project, Midterm 2 Review Problems</i>                                                                                                                                                                                                                                                                 | <b>Phase I report of Project Due</b>                                                                   |
| <b>11</b>   | 11/14                         | <b>Midterm II</b> (Tue, Nov 11 <sup>th</sup> Sec B) (Thu, Nov 13 <sup>th</sup> Sec A)                                                                                                                                                                                                                                                 | -                                                                                                      |
| <b>12</b>   | <b>Nov 21</b>                 | Introduction to Memory Hierarchy: Memory technologies, Caches, Cache performance. Comparison of ARM Cortex A-53 and the Intel quad Core i7 6700 Memory hierarchy. Virtual Machines, Virtual Memory, FSM for simple cache controller, Cache coherence                                                                                  | HW 9                                                                                                   |
| <b>13</b>   | 11/28                         | <b>Thanksgiving Recess.</b> <u>No class on Thu 11/27 (Sec A) Class will be held on Tue 11/25 (Sec B)</u>                                                                                                                                                                                                                              | -                                                                                                      |
| <b>14</b>   | 12/5                          | Introduction to Parallel Processing SISD, MIMD, SIMD, SPMD, and Vector machines, Hardware Multithreading Multicore and Other Shared Memory Multiprocessors. Introduction to the GPU                                                                                                                                                   | HW 10                                                                                                  |
| <b>15</b>   | 12/12                         | <i>Closing Topics, Review Problems for Final</i>                                                                                                                                                                                                                                                                                      | <b>Phase II report of Project Due</b>                                                                  |
| <b>16</b>   | 12/19                         | <b>Final: Tue, Dec 16 (Sec B) &amp; Thu Dec 18 (Sec A)</b>                                                                                                                                                                                                                                                                            |                                                                                                        |