Senior Researcher at Microsoft GenAI | CS Ph.D. at Johns Hopkins University | ex-Intern at Microsoft Research| ex-intern at Meta AI | ex-intern at Amazon Alexa
-
Microsoft
- Seattle
- https://www.fe1ixxu.com/
- @fe1ixxu
Pinned Loading
-
Intra-Distillation
Intra-Distillation PublicThis is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".
-
Stratified_Mixture_of_Experts
Stratified_Mixture_of_Experts PublicThis is the repository for our EMNLP 2023 paper: Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity.
Python 5
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.