Experience

 
 
 
 
 
Applied Scientist Intern
May. 2024 – Aug. 2024 Redmond, WA, USA
  • Long-form ASR and Speech In-context Learning
    • Proposed an efficient architecture for long-form speech recognition
    • Successfully achieved general speech in-context learning capabilities with provided contexts
  •  
     
     
     
     
    Research Intern
    Aug. 2022 – Dec. 2022 Cambridge, MA, USA
  • Diffusion-Based Speech Enhancement
    • Leveraged Diffusion-based model and proposed a novel unfolding training procedure for speech enhancement tasks
    • Significantly shrunk the performance gap between probabilistic diffusion model and conventional discriminative models
  •  
     
     
     
     
    AIML - ASR Understanding Intern
    May. 2022 – Aug. 2022 Seattle, WA, USA
    • Embedding-Matching Acoustic-to-Word ASR
      • Exposed limitations of existing embedding-matching acoustic-to-word (A2W) that previous studies did not point out
      • Proposed generating multiple embeddings as well as using pronunciation-based embeddings, to make significant accuracy improvements to embedding-matching A2W
     
     
     
     
     
    Research Assistant
    Jul. 2019 – Jan. 2021 Taipei, Taiwan
    • 2020 Detection and Classification of Acoustic Scenes and Events
      • Outperformed baseline score by relative improvement of 19.45% on task 4: Sound Event Detection and Separation in Domestic Environments
      • Implemented models and designed training procedure with few labeled data (less than 1600) and over 10000 unlabeled data