Skip to content
View xiayuqing0622's full-sized avatar
  • Microsoft Research Asia
  • Beijing, China

Block or report xiayuqing0622

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. customized-flash-attention customized-flash-attention Public

    Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python 3

  2. microsoft/nnfusion microsoft/nnfusion Public

    A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

    C++ 951 158

  3. cutlass cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++