Publications of the Statistical Speech Technology Group
University of Illinois at Urbana-Champaign
- Ming Liu, Xi Zhou, Mark Hasegawa-Johnson, Thomas S. Huang, and
Zhengyou Zhang, "Frequency
Domain Correspondence for Speaker Normalization," in
Proc. Interspeech, Antwerp, August, 2007.
- Xi Zhou, Yu Fun, Ming Liu, Mark Hasegawa-Johnson, and Thomas
Huang, "Robust Analysis and Weighting on
MFCC Components for Speech Recognition and Speaker
Identification," ICME 2007.
- Sarah Borys and Mark Hasegawa-Johnson, "Distinctive Feature Based SVM
Discriminant Features for Improvements to Phone Recognition on
Telephone Band Speech." ISCA Interspeech, October 2005.
- Mark Hasegawa-Johnson, Sarah Borys and Ken Chen, ``Experiments in
Landmark-Based Speech Recognition.'' Sound to Sense: Workshop in
Honor of Kenneth N. Stevens, June, 2004.
- M. Kamal Omar and Mark Hasegawa-Johnson, "Model Enforcement: A Unified Feature
Transformation Framework for Classification and Recognition," IEEE
Transactions on Signal Processing, vol. 52, no. 10, pp. 2701-2710,
2004.
- Stefan Geirhofer, Feature
Reduction with Linear Discriminant Analysis and its Performance on
Phoneme Recognition. Undergraduate research project.
- Mohamed Kamal Mahmoud Omar, Acoustic Feature Design for Speech
Recognition: A Statistical Information-Theoretic Approach.
Ph.D. Thesis, 2003.
- M. Kamal Omar and Mark Hasegawa-Johnson, "Approximately Independent Factors of
Speech Using Nonlinear Symplectic Transformation," IEEE
Transactions on Speech and Audio Processing, vol. 11, no. 6,
pp. 660-671, 2003.
- M. Omar and M. Hasegawa-Johnson, "Non-Linear Independent Component Analysis
for Speech Recognition," International Conference on Computer,
Communication and Control Technologies (CCCT '03), 2003.
- M. Omar and M. Hasegawa-Johnson, "Strong-Sense Class-Dependent Features for
Statistical Recognition," IEEE Workshop on Statistical Signal
Processing, St. Louis, MO, 2003, 473-476.
- M. K. Omar and M. Hasegawa-Johnson, "Maximum Conditional Mutual
Information Projection For Speech Recognition," Interspeech,
September, 2003, 505-508.
- M. K. Omar and M. Hasegawa-Johnson, "Non-Linear Maximum Likelihood
Feature Transformation For Speech Recognition," Interspeech,
September, 2003, 2497-2500.
- M. Hasegawa-Johnson, "Finding the Best Acoustic
Measurements for Landmark-Based Speech Recognition," Accumu
Magazine, Kyoto Computer Gakuin, Kyoto, Japan, 2002.
- M. Omar, K. Chen, M. Hasegawa-Johnson and Y. Brandman, "An Evaluation of using Mutual
Information for Selection of Acoustic-Features Representation of
Phonemes for Speech Recognition," Interspeech, Denver, CO,
September 2002, pp. 2129-2132.
- Z. Jing and M. Hasegawa-Johnson, "Auditory-Modeling Inspired Methods of
Feature Extraction for Robust Automatic Speech Recognition,"
ICASSP Student Session, May 2002, IV:4176.
- M. K. Omar and M. Hasegawa-Johnson, "Maximum Mutual Information Based
Acoustic Features Representation of Phonological Features for Speech
Recognition," ICASSP, May 2002, I:81-84.
- Zhinian Jing, Voice Index and Frame
Index for Recognition of Digits in Speech Background.
M.S. Thesis, 2002.
- W. Gunawan and M. Hasegawa-Johnson, "PLP Coefficients can be Quantized at 400
bps," ICASSP, Salt Lake City, UT, pp. 2.2.1-4, 2001.
- Bowon Lee, Robust Speech
Recognition in a Car Using a Microphone Array." Ph.D. thesis,
2006.
- Chitturi, R. and Hasegawa-Johnson, M. "Novel Entropy-Based
Moving Average Refiners for HMM Landmarks." Interspeech, September
2006.
- Mark Hasegawa-Johnson, James Baker, Sarah Borys, Ken Chen, Emily
Coogan, Steven Greenberg, Amit Juneja, Katrin Kirchhoff, Karen
Livescu, Srividya Mohan, Jennifer Muller, Kemal Sönmez, and Tianyu
Wang, "Landmark-Based
Speech Recognition: Report of the 2004 Johns Hopkins Summer
Workshop." ICASSP, March 2005.
- Yeojin Kim and Mark Hasegawa-Johnson, "Phonetic Segment Rescoring Using SVMs."
Midwest Computational Linguistics Colloquium, Columbus, OH, 2005.
- Mark Hasegawa-Johnson, James Baker, Steven Greenberg, Katrin
Kirchhoff, Jennifer Muller, Kemal Sonmez, Sarah Borys, Ken Chen, Amit
Juneja, Katrin Kirchhoff, Karen Livescu, Srividya Mohan, Emily Coogan,
and Tianyu Wang, "Landmark-Based
Speech Recognition: Report of the 2004 Johns Hopkins Summer
Workshop." technical report of the Johns Hopkins Center for
Language and Speech Processing, 2005.
- Mark Hasegawa-Johnson, Landmark-Based
Speech Recognition: The Marriage of High-Dimensional Machine Learning
Techniques with Modern Linguistic Representations," talk given at
Tsinghua University, October 2004.
- Ameya Deoras and Mark Hasegawa-Johnson, "A Factorial HMM Approach
to Robust Isolated Digit Recognition in Background Music."
Interspeech, October, 2004.
- Ameya Deoras and Mark Hasegawa-Johnson, "A Factorial HMM Approach to
Simultaneous Recognition of Isolated Digits Spoken by Multiple Talkers
on One Audio Channel," ICASSP 2004.
- Yanli Zheng and Mark Hasegawa-Johnson, "Acoustic segmentation using switching
state Kalman Filter," ICASSP 2003, April 2003, I:752-755.
- Ameya Deoras, A Factorial HMM
Approach to Robust Isolated Digit Recognition in Non-Stationary
Noise. B.S. Thesis, 2003.
- M. K. Omar, M. Hasegawa-Johnson and S. E. Levinson, "Gaussian Mixture Models of Phonetic
Boundaries for Speech Recognition," ASRU 2001.
- M. Hasegawa-Johnson, "Multivariate-State Hidden
Markov Models for Simultaneous Transcription of Phones and
Formants," ICASSP, Istanbul, pp. 1323-26, 2000
- Mark Hasegawa-Johnson, Karen Livescu, Partha Lal and Kate Saenko,
"Audiovisual Speech
Recognition with Articulator Positions as Hidden Variables," in
Proc. International Congress on Phonetic Sciences (ICPhS),
Saarbrücken, August, 2007.
- Mark Hasegawa-Johnson, "Audio-Visual Speech
Recognition: Audio Noise, Video Noise, and Pronunciation
Variability," talk given to the Signal Processing Society, IEEE
Japan, June 2007.
- Yun Fu, Xi Zhou, Ming Liu, Mark Hasegawa-Johnson, and Thomas
S. Huang, "Lipreading by Locality
Discriminant Graph," IEEE International Conference on Image
Processing (ICIP) 2007.
- Karen Livescu, Ozgur Cetin, Mark Hasegawa-Johnson, Simon King,
Chris Bartels, Nash Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari
Bezman, Stephen Dawson-Haggerty, Bronwyn Woods, Joe Frankel, Matthew
Magimai-Doss, and Kate Saenko, "Articulatory Feature-Based Methods
for Acoustic and Audio-Visual Speech Recognition: Summary from the
2006 JHU Summer Workshop." ICASSP, May 2007.
- Karen Livescu, Özgür Çetin, Mark Hasegawa-Johnson, Simon King,
Chris Bartels, Nash Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari
Bezman, Stephen Dawson-Hagerty, Bronwyn Woods, Joe Frankel, Mathew
Magimai-Doss, and Kate Saenko, "Articulatory-Feature-Based Methods for
Acoustic and Audio-Visual Speech Recognition: 2006 JHU Summer Workshop
Final Report." Johns Hopkins Center for Language and Speech
Processing, 2007.
- Mark Hasegawa-Johnson, Object Tracking and Asynchrony
in Audio-Visual Speech Recognition. talk given to the Artificial
Intelligence, Vision, and Robotics seminar series, August, 2006.
- Mark Hasegawa-Johnson, Dealing with
Acoustic Noise. Part IIII: Video. tutorial presentation given at
WS06, Center for Language and Speech Processing, July 2006.
- Camille Goudeseune and Bowon Lee, AVICAR: Audio-Visual Speech Recognition in
a Car Environment. Promotional Film, 2006.
- Bowon Lee, Mark Hasegawa-Johnson, Camille Goudeseune, Suketu
Kamdar, Sarah Borys, Ming Liu, and Thomas Huang, "AVICAR: Audio-Visual Speech Corpus
in a Car Environment." Interspeech, October 2004.
- S.E. Levinson, T.S. Huang, M.A. Hasegawa-Johnson, K. Chen,
S. Chu, A. Garg, Z. Jing, D. Li, J. Lin, M. Omar and Z. Wen, "Multimodal Dialog Systems Research at
Illinois," ARPA Workshop on Multimodal Speech Recognition and
SPINE, June, 2002.
- Karen Livescu, Özgür Çetin, Mark Hasegawa-Johnson, Simon King,
Chris Bartels, Nash Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari
Bezman, Stephen Dawson-Hagerty, Bronwyn Woods, Joe Frankel, Mathew
Magimai-Doss, and Kate Saenko, "Articulatory-Feature-Based Methods for
Acoustic and Audio-Visual Speech Recognition: 2006 JHU Summer Workshop
Final Report." Johns Hopkins Center for Language and Speech
Processing, 2007.
- Karen Livescu, Ozgur Cetin, Mark Hasegawa-Johnson, Simon King,
Chris Bartels, Nash Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari
Bezman, Stephen Dawson-Haggerty, Bronwyn Woods, Joe Frankel, Matthew
Magimai-Doss, and Kate Saenko, "Articulatory Feature-Based Methods
for Acoustic and Audio-Visual Speech Recognition: Summary from the
2006 JHU Summer Workshop." ICASSP, May 2007.
- Ken Chen and Mark Hasegawa-Johnson, "Modeling pronunciation variation
using artificial neural networks for English spontaneous speech."
Interspeech, October 2004.
- Jui-Ting Huang and Mark Hasegawa-Johnson, Unsupervised Prosodic Break Detection in Mandarin Speech, SpeechProsody 2008
- Xiaodan Zhuang and Mark Hasegawa-Johnson, Towards Interpretation of Creakiness in Switchboard, SpeechProsody 2008
- Taejin Yoon, Jennifer Cole, and Mark Hasegawa-Johnson, Detecting Non-Modal Phonation in Telephone Speech, SpeechProsody, 2008
- Taejin Yoon, A Predictive Model of Prosody Through Grammatical Interface: A Computational Approach, Ph.D. Thesis, 2007.
- Ken Chen, Mark Hasegawa-Johnson and Jennifer Cole, "A Factored Language Model for
Prosody-Dependent Speech Recognition," Speech Synthesis and
Recognition, Vedran Kordic, Ed., Advanced Robotic Systems, 2007.
- Mark Hasegawa-Johnson, Jennifer Cole, Ken Chen, Partha Lal, Amit
Juneja, Taejin Yoon, Sarah Borys, and Xiaodan Zhuang, "Prosodically Organized
Automatic Speech Recognition." Linguistic Processes in Spontaneous
Speech, Academica Sinica, Taiwan, November 2006.
- Mark Hasegawa-Johnson, "Phonology and the Art of
Automatic Speech Recognition," Director's Seminar Series, Beckman
Institute, University of Illinois at Urbana-Champaign, November 2006.
- Taejin Yoon, Xiaodan Zhuang, Jennifer Cole, and Mark
Hasegawa-Johnson, "Voice Quality
Dependent Speech Recognition." Midwest Computational Linguistics
Colloquium, Urbana, IL, 2006.
- Cole, Jennifer, Mark Hasegawa-Johnson, Chilin Shih, Eun-Kyung Lee,
Heejin Kim, H. Lu, Yoonsook Mo, Tae-Jin Yoon. (2005). "Prosodic Parallelism as a Cue to
Repetition and Hesitation Disfluency," Proceedings of DISS'05 (An
ISCA Tutorial and Research Workshop), Aix-en-Provence, France,
pp. 53-58.
- Mark Hasegawa-Johnson, Ken Chen, Jennifer Cole, Sarah Borys,
Sung-Suk Kim, Aaron Cohen, Tong Zhang, Jeung-Yoon Choi, Heejin Kim,
Taejin Yoon, and Sandra Chavarria, "Simultaneous Recognition of
Words and Prosody in the Boston University Radio Speech Corpus."
Speech Communication 46(3-4):418-439, 2005.
- Ken Chen, Mark Hasegawa-Johnson, Aaron Cohen, Sarah Borys,
Sung-Suk Kim, Jennifer Cole and Jeung-Yoon Choi, "Prosody Dependent Speech Recognition on
Radio News Corpus of American English." IEEE Transactions on
Speech and Audio Processing, 14(1):232-245, 2006.
- Sarah Borys, Mark Hasegawa-Johnson, Ken Chen, and Aaron Cohen, "Modeling and Recognition of
Phonetic and Prosodic Factors for Improvements to Acoustic Speech
Recognition Models." Interspeech, October, 2004.
- Mark Hasegawa-Johnson, Speech Recognition
Models of the Interdependence Among Syntax, Prosody, and Segmental
Acoustics," talk given at Tsinghua University, October 2004.
- Mark Hasegawa-Johnson, Jennifer Cole, Chilin Shih, Ken Chen,
Aaron Cohen, Sandra Chavarria, Heejin Kim, Taejin Yoon, Sarah Borys,
and Jeung-Yoon Choi, "Speech Recognition Models of
the Interdependence Among Syntax, Prosody, and Segmental
Acoustics," Human Language Technologies: Meeting of the North
American Chapter of the Association for Computational Linguistics
(HLT/NAACL), Workshop on Higher-Level Knowledge in Automatic Speech
Recognition and Understanding, May, 2004.
- Ken Chen and Mark Hasegawa-Johnson, "How Prosody Improves Word
Recognition," SpeechProsody 2004, Nara, Japan, March 2004,
583-586.
- Ken Chen, Mark Hasegawa-Johnson and Sung-Suk Kim, ``An Intonational Phrase Boundary and Pitch
Accent Dependent Speech Recognizer.'' International Conference on
Systems, Cybernetics, and Intelligence, 2003.
- Ken Chen and Mark Hasegawa-Johnson, ``Improving the robustness of prosody
dependent language modeling based on prosody syntax
cross-correlation.'' ASRU, 2003.
- Ken Chen, Mark Hasegawa-Johnson and Jennifer Cole, "Prosody Dependent Speech Recognition on
Radio News," IEEE Workshop on Statistical Signal Processing,
St. Louis, MO, 2003.
- K. Chen, M. Hasegawa-Johnson, A. Cohen, S. Borys, and J. Cole, "Prosody Dependent Speech
Recognition with Explicit Duration Modelling at Intonatinal Phrase
Boundaries." Interspeech, September, 2003, 393-396.
- Sarah Borys, Recognition of
Prosodic Factors and Detection of Landmarks for Improvements to
Continuous Speech Recognition Systems. B.S. Thesis, 2003.
- Sarah Borys, Mark Hasegawa-Johnson and Jennifer Cole, "The Importance of Prosodic Factors in
Phoneme Modeling with Applications to Speech Recognition," ACL
Student Session, 2003.
- Sarah Borys, Mark Hasegawa-Johnson and Jennifer Cole, "Prosody as a Conditioning Variable in
Speech Recognition," Illinois Journal of Undergraduate Research, 2003.
- Weimo Zhu, Mark Hasegawa-Johnson, Karen Chapman-Novakofski, and
Arthur Kantor, "Cellphone-Based Nutrition
E-Diary." National Nutrient Database Conference, 2007.
- Weimo Zhu, Mark Hasegawa-Johnson, Arthur Kantor, Dan Roth, Yong
Gao, Youngsik Park, and Lin Yang, "E-coder for Automatic Scoring
Physical Activity Diary Data: Development and Validation." ACSM,
2007.
- Mark Hasegawa-Johnson, Jonathan Gunderson, Adrienne Perlman, and
Thomas Huang, "HMM-Based
and SVM-Based Recognition of the Speech of Talkers with Spastic
Dysarthria," ICASSP, May 2006.
- Weimo Zhu, Mark Hasegawa-Johnson, and Mital Arun Gandhi, ``Accuracy of Voice-Recognition Technology in
Collecting Behavior Diary Data.'' Association of Test Publishers
(ATP): Innovations in Testing, March 2005.
- Tong Zhang, Mark Hasegawa-Johnson and Stephen E. Levinson, "Extraction of Pragmatic and Semantic
Salience from Spontaneous Spoken English," Speech Communication, 2007.
- Tong Zhang, Mark Hasegawa-Johnson and Stephen E. Levinson, "Cognitive State Classification in a spoken
tutorial dialogue system," Speech Communication 48(6):616-632,
2006.
- Cole, Jennifer, Mark Hasegawa-Johnson, Chilin Shih, Eun-Kyung Lee,
Heejin Kim, H. Lu, Yoonsook Mo, Tae-Jin Yoon. (2005). "Prosodic Parallelism as a Cue to
Repetition and Hesitation Disfluency," Proceedings of DISS'05 (An
ISCA Tutorial and Research Workshop), Aix-en-Provence, France,
pp. 53-58.
- Tong Zhang, Mark Hasegawa-Johnson, and Stephen E. Levinson, "A Hybrid Model for Spontaneous Speech
Understanding." AAAI 2005.
- Tong Zhang, Mark Hasegawa-Johnson, and Stephen E. Levinson, "Children's Emotion
Recognition in an Intelligent Tutoring Scenario." Interspeech,
October, 2004.
- Tong Zhang, Mark Hasegawa-Johnson and Stephen E. Levinson, "Automatic detection of
contrast for speech understanding." Interspeech, October, 2004.
- Yuexi Ren, Mark Hasegawa-Johnson and Stephen
E. Levinson. "Semantic analysis for a speech user interface in an
intelligent-tutoring system", Intl. Conf. on Intelligent User
Interfaces. Madeira, Portugal, 2004.
- Tong Zhang, Mark Hasegawa-Johnson, and Stephen E. Levinson, "An empathic-tutoring system using spoken
language," Australian conference on computer-human interaction
(OZCHI 2003).
- Tong Zhang, Mark Hasegawa-Johnson, and Stephen E. Levinson, "Mental State Detection of Dialogue System
Users via Spoken Language," ISCA/IEEE Workshop on Spontaneous
Speech Processing and Recognition (SSPR), April 2003, MAP17.1-4.
- Ming Liu, Xi Zhou, Mark Hasegawa-Johnson, Thomas S. Huang, and
Zhengyou Zhang, "Frequency Domain
Correspondence for Speaker Normalization," in Proc. Interspeech,
Antwerp, August, 2007.
- Xi Zhou, Yu Fun, Ming Liu, Mark Hasegawa-Johnson, and Thomas
Huang, "Robust Analysis and Weighting on
MFCC Components for Speech Recognition and Speaker
Identification," ICME 2007.
- Ming Liu, Zhengyou Zhang, Mark Hasegawa-Johnson, and Thomas Huang,
"Exploring Discriminative Learning for
Text-Independent Speaker Recognition," ICME 2007.
- Mark Hasegawa-Johnson, Shamala Pizza, Abeer Alwan, Jul Cha, and
Katherine Haker, "Vowel
Category Dependence of the Relationship Between Palate Height, Tongue
Height, and Oral Area," Journal of Speech, Language, and Hearing
Research, vol. 46, no. 3, pp. 738-753, 2003.
- Yanli Zheng, Mark Hasegawa-Johnson, and Shamala Pizza, "PARAFAC Analysis of the Three dimensional
tongue Shape," Journal of the Acoustical Society of America,
vol. 113, no. 1, pp. 478-486, January 2003.
- Mark Hasegawa-Johnson, "Line Spectral Frequencies are
the Poles and Zeros of a Discrete Matched-Impedance Vocal Tract
Model," Journal of the Acoustical Society of America, vol. 108,
no. 1, pp. 457-460, 2000.
- Y. Zheng and M. Hasegawa-Johnson, "Three Dimensional Tongue shape Factor
Analysis," American Speech-Language Hearing Association National
Convention, Washington, DC, 2000. Published in the magazine ASHA
Leader, 5(16):144.
- M. Hasegawa-Johnson, "Preliminary Work and Proposed
Continuation: Imaging of Speech Anatomy and Behavior." Talk given
at the Universities of Illinois Inter-campus Biomedical Imaging Forum,
2001.
- M. Hasegawa-Johnson, J. Cha and K. Haker, "CTMRedit: A Matlab-based tool
for segmenting and interpolating MRI and CT images in three orthogonal
planes," 21st Annual International Conference of the IEEE/EMBS
Society, pp. 1170. 1999.
- M. Hasegawa-Johnson, "Combining magnetic resonance image planes in
the Fourier domain for improved spatial resolution." International
Conference On Signal Processing Applications and Technology, Orlando,
FL, pp. 81.1-5, 1999
- Mark Hasegawa-Johnson, "Electromagnetic Exposure
Safety of the Carstens Articulograph AG100," Journal of the
Acoustics Society of America, vol. 104, pp. 2529-2532, 1998.
- M. A. Johnson, "Using beam elements to model the vocal fold length
in breathy voicing," JASA 91:2420-2421, 1992.
- Soo-Eun Chang, Nicoline Ambrose, Kirk Erickson, and Mark
Hasegawa-Johnson,
"Brain Anatomy Differences in Childhood
Stuttering." Neuroimage, in press.
- Soo-Eun Chang, Kirk I. Erickson, Nicoline G. Ambrose, Mark
Hasegawa-Johnson, and C.L. Ludlow, "Deficient white matter development
in left hemisphere speech-language regions in children who stutter."
Society for Neuroscience, Atlanta, GA, 2006.
- Soo-Eun Chang, Nicoline Ambrose, and Mark Hasegawa-Johnson, "An
MRI (DTI) study on children with persistent developmental stuttering."
2004 ASHA Convention, American Speech Language and Hearing
Association, November, 2004.
- Mark Hasegawa-Johnson, "Bayesian Learning for Models of
Human Speech Perception," IEEE Workshop on Statistical Signal
Processing, St. Louis, MO, 2003, 393-396.
- S. Takayanagi, M. Hasegawa-Johnson, L. S. Eisner and
A. Schaefer-Martinez, "Information
theory and variance estimation techniques in the analysis of category
rating data and paired comparisons." JASA, 102:3091, 1997
- Bowon Lee and Mark Hasegawa-Johnson, "Minimum Mean Squared Error A
Posteriori Estimation of High Variance Vehicular Noise," in 2007
Biennial on DSP for In-Vehicle and Mobile Systems, Istanbul, June,
2007.
- Bowon Lee, Robust Speech
Recognition in a Car Using a Microphone Array." Ph.D. thesis,
2006.
- Mark Hasegawa-Johnson, Dealing with
Acoustic Noise. Part II: Beamforming. tutorial presentation given
at WS06, Center for Language and Speech Processing, July 2006.
- Mark Hasegawa-Johnson, Dealing with
Acoustic Noise. Part I: Spectral Estimation. tutorial
presentation given at WS06, Center for Language and Speech Processing,
July 2006.
- Laehoon Kim and Mark Hasegawa-Johnson, "Generalized Optimal
Multi-Microphone Speech Enhancement Using Sequential Minimum Variance
Distortionless Response (MVDR) Beamforming and Postfiltering," ICASSP,
May 2006.
- Laehoon Kim and Mark Hasegawa-Johnson, "Generalized multi-microphone spectral
amplitude estimation based on correlated noise model." 119th
Convention of the Audio Engineering Society, New York, October 2005.
- Bowon Lee, Mark Hasegawa-Johnson, and Camille Goudeseune, "Open Loop Multichannel Inversion of Room
Impulse Response," JASA 113(4):2202-3, 2003.
- M. Hasegawa-Johnson and A. Alwan, "Speech Coding:
Fundamentals and Applications," Wiley Encyclopedia of
Telecommunications and Signal Processing, J. Proakis, Ed., Wiley and
Sons, NY, December 2002.
- W. Gunawan and M. Hasegawa-Johnson, "PLP Coefficients can be Quantized at
400 bps," ICASSP, Salt Lake City, UT, pp. 2.2.1-4, 2001.
- Mark Hasegawa-Johnson and T. Taniguchi, "On-line and off-line computational
reduction techniques using backward filtering in CELP speech
coders," IEEE Transactions Acoustics, Speech, and Signal
Processing, vol. 40, pp. 2090-2093, 1992.
- M. A. Johnson and T. Taniguchi, "Low-complexity multi-mode
VXC using multi-stage optimization and mode selection," ICASSP,
Toronto, Canada, pp. 221-224, 1991.
- T. Taniguchi, M. A. Johnson, and Y. Ohta, "Pitch sharpening for
perceptually improved CELP, and the sparse-delta codebook for reduced
computation," ICASSP, Toronto, Canada, pp. 241-244, 1991.
- T. Taniguchi, F. Amano, and M. A. Johnson, "Improving the
performance of CELP-based speech coding at low bit rates,"
International Symposium on Circuits and Systems, Singapore, 1991.
- M. A. Johnson and T. Taniguchi, "Computational reduction in
sparse-codebook CELP using backward-weighting of the input," Institute
of Electr., Information, and Comm. Eng. Symposium, DSP 90-15, Hakata,
61-66, 1990.
- T. Taniguchi, M. A. Johnson, and Y. Ohta, "Multi-vector
pitch-orthogonal LPC: quality speech with low complexity at rates
between 4 and 8 kbps," ICSLP, Kobe, pp. 113-116, 1990.
- M. A. Johnson and T. Taniguchi, "Pitch-orthogonal code-excited
LPC," IEEE Global Telecommunications Conference (GLOBECOM), San Diego,
CA, pp. 542-546, 1990.
- Yoonsook Mo, "Temporal, spectral
evidence of devoiced vowels in Korean," in Proc. International
Congress on Phonetic Sciences (ICPhS), Saarbrücken, August, 2007.
- Chitturi, R. and Hasegawa-Johnson, M. "Novel Time-Domain Multi-class
SVMs for Landmark Detection." Interspeech, September 2006.
- M. Hasegawa-Johnson, "Time-Frequency
Distribution of Partial Phonetic Information Measured Using Mutual
Information," Interspeech IV:133-136, Beijing, 2000.
- M. A. Hasegawa-Johnson, "Burst spectral measures and formant
frequencies can be used to accurately discriminate stop place of
articulation," JASA, 98:2890, 1995
- Mark A. Johnson, "A mapping between trainable generalized
properties and the acoustic correlates of distinctive features," MIT
Speech Communication Group Working Papers, vol. 9, pp. 94-105, 1994.
- M. Johnson, "Automatic context-sensitive measurement of the
acoustic correlates of distinctive features," ICSLP, Yokohama,
pp. 1639-1643, 1994
- M. A. Johnson, "A mapping between trainable generalized properties
and the acoustic correlates of distinctive features," JASA, vol. 94,
p. 1865, 1993.
- Yanli Zheng, "Feature Extraction
and Acoustic Modeling for Speech Recognition." Ph.D. Thesis, 2005.
- Yanli Zheng and Mark Hasegawa-Johnson, "Stop Consonant Classification by
Dynamic Formant Trajectory." Interspeech, October, 2004.
- Yanli Zheng and Mark Hasegawa-Johnson, "Formant Tracking by Mixture State
Particle Filter," ICASSP 2004.
- Y. Zheng and M. Hasegawa-Johnson, "Particle Filtering Approach to Bayesian
Formant Tracking," IEEE Workshop on Statistical Signal Processing,
September, 2003, 581-584.
- Taejin Yoon, Jennifer Cole and Mark Hasegawa-Johnson, "On the edge: Acoustic cues to layered
prosodic domains," in Proc. International Congress on Phonetic
Sciences (ICPhS), Saarbrücken, August, 2007.
- Taejin Yoon, Jennifer Cole and Mark Hasegawa-Johnson, "On the edge: Acoustic cues to layered
prosodic domains." 81st Annual Meeting of the Linguistic Society
of America, Anaheim, CA, January 5, 2007.
- Jennifer Cole, Heejin Kim, Hansook Choi, and Mark
Hasegawa-Johnson, "Prosodic effects on acoustic cues to stop voicing
and place of articulation: Evidence from Radio News speech." J
Phonetics 35:180-209, 2007.
- Kim, H., Yoon, T., Cole, J., and Hasegawa-Johnson, M. "Acoustic differentiation of L- and L-L% in
Switchboard and Radio News speech." Proceedings of Speech Prosody
2006, Dresden.
- Taejin Yoon, "Mapping Syntax and
Prosody." Midwest Computational Linguistics Colloquium, Columbus,
OH, 2005.
- Jeung-Yoon Choi, Mark Hasegawa-Johnson, and Jennifer Cole, "Finding Intonational Boundaries Using
Acoustic Cues Related to the Voice Source." Journal of the Acoustical
Society of America 118(4):2579-88, 2005.
- Cole, Jennifer, Mark Hasegawa-Johnson, Chilin Shih, Eun-Kyung Lee,
Heejin Kim, H. Lu, Yoonsook Mo, Tae-Jin Yoon. (2005). "Prosodic Parallelism as a Cue to
Repetition and Hesitation Disfluency," Proceedings of DISS'05 (An
ISCA Tutorial and Research Workshop), Aix-en-Provence, France,
pp. 53-58.
- Yoon, Tae-Jin, Cole, Jennifer, Mark Hasegawa-Johnson, and Chilin
Shih. "Detecting Non-modal
Phonation in Telephone Speech." Unpublished manuscript, 2005.
- Yoon, Tae-Jin, Cole, Jennifer, Mark Hasegawa-Johnson, and Chilin
Shih. (2005). "Acoustic correlates of
non-modal phonation in telephone speech," The Journal of the
Acoustical Society of America 117(4), p. 2621.
- Taejin Yoon, Sandra Chavarria, Jennifer Cole, and Mark
Hasegawa-Johnson, "Intertranscriber Reliability of
Prosodic Labeling on Telephone Conversation Using ToBI."
Interspeech, October, 2004.
- Sung-Suk Kim, Mark Hasegawa-Johnson, and Ken Chen, "Automatic Recognition of Pitch Movements
Using Multilayer Perceptron and Time-Delay Recursive Neural
Network," IEEE Signal Processing Letters 11(7):645-648, 2004.
- Yuexi Ren, Sung-Suk Kim, Mark Hasegawa-Johnson, and Jennifer
Cole, "Speaker-Independent Automatic
Detection of Pitch Accent," SpeechProsody 2004, Nara, Japan, March
2004, 521-524.
- Tae-Jin Yoon, Heejin Kim, and Sandra Chavarría. "Local Acoustic Cues Distinguishing Two
Levels of prosodic Phrasing: Speech Corpus Evidence," Lab phon 9,
University of Illinois at Urbana-Champaign, 2004.
- Aaron Cohen, A Survey of Machine
Learning Methods for Predicting Prosody in Radio Speech.
M.S. Thesis, 2004.
- Heejin Kim, Jennifer Cole, Hansook Choi, and Mark
Hasegawa-Johnson, "The Effect of Accent on
Acoustic Cues to Stop Voicing and Place of Articulation in Radio News
Speech," SpeechProsody 2004, Nara, Japan, March 2004, 29-32.
- Sandra Chavarria, Taejin Yoon, Jennifer Cole, and Mark
Hasegawa-Johnson, "Acoustic
differentiation of ip and IP boundary levels: Comparison of L- and
L-L% in the Switchboard corpus," Speech Prosody 2004, Nara, Japan,
March 2004, 333-336.
- Ken Chen, Mark Hasegawa-Johnson, Aaron Cohen, and Jennifer Cole,
"A Maximum Likelihood Prosody
Recognizer," SpeechProsody 2004, Nara, Japan, March 2004, 509-512.
- Ken Chen and Mark Hasegawa-Johnson, "An Automatic Prosody Labeling System
Using ANN-Based Syntactic-Prosodic Model and GMM-Based
Acoustic-Prosodic Model," ICASSP 2004.
- J. Cole, H. Choi, H. Kim, and M. Hasegawa-Johnson, "The Effect of Accent on the Acoustic Cues
to Stop Voicing in Radio News Speech," Proceedings of the
International Congress of Phonetic Sciences, Barcelona, Spain, August,
2003.
- Mark A. Johnson, "Analysis of durational rhythms in two poems by
Robert Frost," MIT Speech Communication Group Working Papers, vol. 8,
pp. 29-42, 1992.
- Xi Zhou, Xiaodan Zhuang, Ming Lui, Hao Tang, Mark Hasgeawa-Johnson
and Thomas Huang, "HMM-Based Acoustic
Event Detection with AdaBoost Feature Selection," Proc. CLEAR
Evaluation and Workshop (Classification of Events, Activities, and
Relationships), Baltimore, May, 2007.
- Mital Gandhi and Mark Hasegawa-Johnson, "Source Separation using Particle
Filters." Interspeech, October 2004.
- David Petruncio, Evaluation of
Various Features for Music Genre Classification with Hidden Markov
Models. B.S. Thesis, 2002.
- J. Beauchamp, H. Taube, S. Tipei, S. Wyatt, L. Haken and
M. Hasegawa-Johnson, "Acoustics, Audio, and Music Technology Education
at the University of Illinois," JASA, 110(5):2961, 2001.
- M. Hasegawa-Johnson, J. Cha, S. Pizza and K. Haker, "CTMRedit: A case study in
human-computer interface design," International Conference On
Public Participation and Information Tech., Lisbon, pp. 575-584, 1999