Popular repositories
214 contributions in the last year
Contribution activity
March 2021
Created 1 commit in 1 repository
Created 1 repository
Created a pull request in microsoft/onnxruntime that received 10 comments
Ability to fuse non-square (pruned) attention weights for BERT-like models
Following @tianleiwu's implementation for non-square (i.e. pruned) attention layers, this PR introduces the necessary machinery to fuse Attention laye…
+14 −11 • 10 comments

