Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
Posts
Reflections as a Newbie Tech Lead
Published:
From a technically-oriented newbie leader — takeaways on staying sharp, navigating conflict, and leading with character.
Conformer: Combining CNNs and Transformers for Speech Recognition
Published:
A deep dive into the Conformer architecture, which fuses CNNs and Transformers for state-of-the-art automatic speech recognition.
Paper Reading: Progressive Neural Networks
Published:
Paper Reading: Learning without Forgetting
Published:
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Portfolio item number 2
Short description of portfolio item number 2 
publications
Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People
Published in WASPAA, 2025
Speech foundation models (SFMs) have shown strong performance across various downstream tasks, including speech intelligibility prediction. In this paper, we provide a comprehensive guide highlighting the key considerations necessary to maximize prediction performance.
Recommended citation: Zhou, H., Cao, B., Mo, C., Li, L., & Wang, S. X. (2025). Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People. WASPAA 2025.
Download Paper
No Audiogram: Leveraging Existing Scores for Personalized Speech Intelligibility Prediction
Published in Interspeech, 2025
In this paper, instead of using audiograms, we propose a new paradigm for personalized speech intelligibility prediction, which utilizes one’s existing intelligibility scores to predict the intelligibility score on new audio clips.
Recommended citation: Zhou, H., Mo, C., Cao, B., Li, L., & Wang, S. X. (2025). No Audiogram: Leveraging Existing Scores for Personalized Speech Intelligibility Prediction. Interspeech 2025.
Download Paper
Intrusive Intelligibility Prediction with ASR Encoders
Published in Interspeech, 2025
Speech foundation models have demonstrated strong performance in speech intelligibility prediction. In this paper, we present a set of practical techniques that enabled us to achieve first place in the Third Clarity Prediction Challenge.
Recommended citation: Yu, H., Zhou, H., Cao, B., Mo, C., Li, L., & Wang, S. X. (2025, August). Intrusive Intelligibility Prediction with ASR Encoders. In Clarity-2025: The 6th Clarity Workshop on Improving Speech-in-Noise for Hearing Devices (Satellite of Interspeech 2025), Delft, The Netherlands.
Download Paper
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Teaching experience 2
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.
