Yiming Wang | Speech Recognition Researcher at NVIDIA
My
	Image Yiming Wang (王一鸣)

Research Scientist at NVIDIA
NeMo team
NVIDIA Corporation, Santa Clara, CA, USA
E-mail: freewym AT gmail DOT com

Google Scholar
LinkedIn
GitHub
Biography
  • I am a Research Scientist at NVIDIA, working on Multimodal LLMs. Before joining NVIDIA, I worked in Microsoft CoreAI under Jinyu Li, after receiving my Ph.D. degree in Computer Science from Johns Hopkins University. At JHU, I was also affiliated with the Center for Language and Speech Processing (CLSP), advised by Prof. Sanjeev Khudanpur and former JHU Prof. Daniel Povey. I am mostly working on speech recognition (ASR) problems, and have broad interests in machine learning and natural language processing as well. I am one of the major contributors of the Kaldi project, and the owner of the open-source end-to-end ASR toolkit Espresso. I interned at Google’s speech team and Amazon’s Alexa ASR team in 2017 and 2018 respectively, working on end-to-end ASR.

  • I received my B.S. and M.S. degree in Computer Science at Nanjing University in 2009 and 2012, respectively. My master advisor was Prof. Tong Lu.

Education
Work Experience
  • Staff Research Scientist
    NeMo team, NVIDIA Corporation, Santa Clara, CA, USA (Apr 2026 - present)
    Supervisor: Dr. Boris Ginsburg
  • Principal Applied Scientist
    CoreAI, Microsoft Corporation, Redmond, WA, USA (Sep 2020 - Apr 2026)
    Supervisor: Dr. Jinyu Li
  • Applied Scientist Intern
    Amazon.com, Inc., Seattle, WA, USA (May 2018 - Aug 2018)
    I worked with Dr. Xing Fan, Dr. I-Fan Chen and Dr. Yuzong Liu on improving Seq2Seq ASR model with information extracted from anchored words for Amazon Alexa.
Teaching Experience
Talks
Publications
Patents


View My Stats