Skip to content
View howiejayz's full-sized avatar
🤿
Focusing
🤿
Focusing

Organizations

@ROCmSoftwarePlatform @ROCm
Block or Report

Block or report howiejayz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
howiejayz/README.md

Hello, I'm Junhao Zhang! 👋

wakatime GitHub followers

🚀 About Me

  • 🎓 Alumnus of Imperial College London
  • 💼 Machine Learning Software Engineer at AMD
  • 🌱 I’m highly skilled in C++ and Python.
  • 🔭 I have deep knowledge of PyTorch, CUDA, and HIP.

📫 How to Reach Me

🛠 Tech Stack

Tech Stack

👷 Recently Projects

  • Flash Attention: Implemented Flash Attention algorithm on MI200/MI300 using PyTorch and Composable Kernel.

⌨️ My Development Breakdown

From: 27 October 2023 - To: 30 April 2024

Total Time: 202 hrs 14 mins

Python             93 hrs 42 mins  ⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣄⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   44.71 %
C++                38 hrs 23 mins  ⣿⣿⣿⣿⣦⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   18.32 %
Bash               27 hrs 18 mins  ⣿⣿⣿⣤⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   13.03 %
Docker             12 hrs 1 min    ⣿⣦⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   05.74 %
Other              7 hrs 21 mins   ⣷⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   03.51 %
TypeScript         7 hrs 11 mins   ⣷⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   03.43 %
Text               6 hrs 57 mins   ⣷⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   03.32 %
CMake              3 hrs 43 mins   ⣦⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   01.78 %
Cuda               3 hrs 29 mins   ⣦⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   01.67 %
Markdown           2 hrs 12 mins   ⣤⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀⣀   01.06 %

Pinned

  1. flappy-bird-wechat-minigame flappy-bird-wechat-minigame Public

    a implementation on Wechat Minigame of Flappy Bird

    JavaScript

  2. graph-digitiser graph-digitiser Public

    Individual project @ University of Liverpool

    C++

  3. Video-Player-Controlled-by-Action-Recognition Video-Player-Controlled-by-Action-Recognition Public

    Forked from Diregie-J/Video-Player-Controlled-by-Action-Recognition

    Jupyter Notebook

  4. ROCm/flash-attention ROCm/flash-attention Public

    Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python 86 25

  5. flash-attention flash-attention Public

    Forked from ROCm/flash-attention

    Fast and memory-efficient exact attention

    C++