Experience

Applied Scientist Intern

May. 2024 – Aug. 2024 Redmond, WA, USA

Long-form ASR and Speech In-context Learning

Proposed an efficient architecture for long-form speech recognition
Successfully achieved general speech in-context learning capabilities with provided contexts

Research Intern

Aug. 2022 – Dec. 2022 Cambridge, MA, USA

Diffusion-Based Speech Enhancement

Leveraged Diffusion-based model and proposed a novel unfolding training procedure for speech enhancement tasks
Significantly shrunk the performance gap between probabilistic diffusion model and conventional discriminative models

AIML - ASR Understanding Intern

May. 2022 – Aug. 2022 Seattle, WA, USA

Embedding-Matching Acoustic-to-Word ASR
- Exposed limitations of existing embedding-matching acoustic-to-word (A2W) that previous studies did not point out
- Proposed generating multiple embeddings as well as using pronunciation-based embeddings, to make significant accuracy improvements to embedding-matching A2W

Research Assistant

Jul. 2019 – Jan. 2021 Taipei, Taiwan

2020 Detection and Classification of Acoustic Scenes and Events
- Outperformed baseline score by relative improvement of 19.45% on task 4: Sound Event Detection and Separation in Domestic Environments
- Implemented models and designed training procedure with few labeled data (less than 1600) and over 10000 unlabeled data