  • 👋 Hi, I’m @yiakwy-xpu-ml-framework-team
  • 👀 I’m interested in accelerating the world through algorithms, chips, and intelligence (compilers/transpilers, C++ for performance-critical paths, and Python bindings for HPC applications).
  • 🌱 I’m currently working on core framework infrastructure and AI compiler technologies.
  • 📫 Please drop me a message through yiak.wy@gmail.com

Popular repositories

  1. gbp-poplar (Public)

    Forked from joeaortiz/gbp-poplar

    Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)

    C++ 2

  2. NVIDIA-DOCA-App-Code-Sharing (Public)

    Forked from openhackathons-org/NVIDIA-DOCA-App-Code-Sharing

    DOCA Application code sharing Contest

    2

  3. NV-nccl-tests (Public)

    Forked from NVIDIA/nccl-tests

    NCCL Tests

    Cuda 2

  4. llama.cpp (Public)

    Forked from ggerganov/llama.cpp

    Port of Facebook's LLaMA model in C/C++

    C 1

  5. llama-cpp-python (Public)

    Forked from abetlen/llama-cpp-python

    Python bindings for llama.cpp

    Python 1

  6. NV_grouped_gemm (Public)

    Forked from fanshiqing/grouped_gemm

    PyTorch bindings for CUTLASS grouped GEMM for MoE.

    Cuda 1