Speech Processing References
525.747
The following speech processing related books and articles marked with an ► have been found to be particularly useful to students in this class and those marked with a ® are on reserve at the JHU Dorsey Center for student use.
In addition to the following, see papers in the following journals: AT&T Technical Journal; Computer Speech and Language; Digital Signal Processing; IEEE ASSP Magazine; IEEE Communications Magazine; IEEE Transactions on Speech and Audio Processing (formerly IEEE Transactions on Acoustics, Speech, and Signal Processing); Journal of the Acoustical Society of America; and Speech Communication. Also see papers in the following conference proceedings: European Signal Processing Conference (EUSIPCO), International Conference on Acoustics, Speech, and Signal Processing (ICASSP); International Conference on Spoken Language Processing (ICSLP); and IEEE Speech Coding Workshops (Kluwer book series).
® Atal, B. S., V. Cuperman, and A. Gersho, ed. Advances in Speech Coding. IEEE Workshop on Speech Coding for Telecommunications. Boston: Kluwer Academic Publishers, 1991a.
Atal, B. S., J. L. Miller, and R. D. Kent, ed. Papers in Speech Communication: Speech Processing. Vol. 3. Papers in Speech Communication. Woodbury: Acoustical Society of America, 1991b.
Borden, G. and K. Harris. Speech Science Primer. 2nd ed., Baltimore: Williams & Wilkins, 1984.
► Campbell, J., T. Tremain, and V. Welch. "The Federal Standard 1016 4800 bps CELP Voice Coder." Digital Signal Processing: A Review Journal 1, no. 3 (1991): 145 - 155.
Cooke, M., S. Beet, and M. Crawford, ed. Visual Representations of Speech Signals. Chichester: John Wiley & Sons, 1993.
► Cox, R. "Three New Speech Coders from the ITU Cover a Range of Applications." IEEE Communications Magazine 35, no. 9 (September 1997, special issue on Standardization and Characterization of G.729): 40 - 47.
► Deller, J., J. Proakis, and J. Hansen. Discrete-Time Processing of Speech Signals. New York: Macmillan, 1993.
Denes, P. and E. Pinson. The Speech Chain. New York: Doubleday, 1973.
Dixon, N. R. and T. B. Martin, ed. Automatic Speech & Speaker Recognition. New York: IEEE Press, 1979.
Edwards, H. T. Applied Phonetics: The Sounds of American English. San Diego: Singular Publishing, 1992.
Fallside, F. and W. Woods, ed. Computer Speech Processing. London: Prentice/Hall International, 1985.
► Fant, G. Acoustic Theory of Speech Production: With Calculations based on X-Ray Studies of Russian Articulations. Description and Analysis of Contemporary Standard Russian, The Hague: Mouton, 1970.
Flanagan, J. Speech Analysis Synthesis and Perception. 2nd ed., Berlin: Springer-Verlag, 1972.
Furui, S. Digital Speech Processing, Synthesis, and Recognition. New York: Marcel Dekker, 1989.
Furui, S. and M. M. Sondhi, ed. Advances in Speech Signal Processing. New York: Marcel Dekker, 1992.
Gersho, A. "Advances in Speech and Audio Compression." Proceedings of the IEEE 82, no. 6 (1994): 900 - 918.
Gersho, A. and R. M. Gray. Vector Quantization and Signal Compression. Boston: Kluwer, 1992.
Hess, W. Pitch Determination of Speech Signals: Algorithms and Devices. Information Sciences, ed. M. Schroeder. Berlin: Springer-Verlag, 1983.
► Jayant, N. S. and P. Noll. Digital Coding of Waveforms: Principles and Applications to Speech and Video. Signal Processing Series, ed. A. V. Oppenheim. Englewood Cliffs: Prentice-Hall, 1984.
Kleijn, W. B. and K. K. Paliwal, ed. Speech Coding and Synthesis. Amsterdam: Elsevier, 1995.
Lee, C.-H., F. K. Soong, and K. K. Paliwal, ed. Automatic Speech & Speaker Recognition: Advanced Topics. International Series in Engineering & Computer Science, Natural Language Processing & Machine Translation: Multimedia Systems & Applications. Boston: Kluwer Academic Publishers, 1996.
® Lee, K.-F. Automatic Speech Recognition - The Development of the SPHINX System. Boston: Kluwer Academic Publishers, 1989.
Lim, J., ed. Speech Enhancement. Englewood Cliffs: Prentice-Hall, 1983.
► Max, J. "Quantizing for Minimum Distortion." IRE Trans. Inform. Theory IT-6, (March 1960): 7 - 12.
► Makhoul, J. "Linear Prediction: A Tutorial Review." Proceedings of the IEEE 63, no. 4 (1975): 561 - 580.
Markel, J. and A. Gray. Linear Prediction of Speech. Communication and Cybernetics, Berlin: Springer-Verlag, 1976.
► O’Shaughnessy, D. Speech Communication, Human and Machine. Digital Signal Processing, Reading: Addison-Wesley, 1987.
Olive, J. P., A. Greenwood, and J. S. Coleman. Acoustics of American English Speech: A Dynamic Approach. New York: Springer-Verlag, 1993.
Oppenheim, A. and R. Schafer. Digital Signal Processing. Englewood Cliffs: Prentice-Hall, 1975.
Owens, F. J. Signal Processing of Speech. New York: McGraw-Hill, 1993.
Parsons, T. Voice and Speech Processing. Communications and Signal Processing, ed. S. Director. New York: McGraw-Hill, 1987.
Potter, R. A., G. A. Kopp, and H. C. Green. Visible Speech. Van Nostrand, 1947.
► Rabiner, L. and B.-H. Juang. Fundamentals of Speech Recognition. Signal Processing, ed. A. Oppenheim. Englewood Cliffs: Prentice Hall, 1993.
► Rabiner, L. and B.-H. Juang. "An Introduction to Hidden Markov Models." IEEE ASSP Magazine. January 1986, 4 - 16.
► ® Rabiner, L. and R. Schafer. Digital Processing of Speech Signals. Signal Processing, ed. A. Oppenheim. Englewood Cliffs: Prentice-Hall, 1978.
Saito, S. and K. Nakata. Fundamentals of Speech Signal Processing. Tokyo: Academic Press, 1985.
Scharf, L. Statistical Signal Processing: Detection, Estimation, and Time Series Analysis. Digital Signal Processing, ed. R. Roberts. Reading: Addison-Wesley, 1991.
Schroeder, M. R. Speech and Speaker Recognition. New York: Karger, 1985.
Waibel, A. and K.-F. Lee, ed. Readings in Speech Recognition. San Mateo: Morgan Kaufmann, 1990.
► Zelinski, R. and P. Noll. "Adaptive Transform Coding of Speech Signals." IEEE Trans. on Acoust., Speech, and Signal Processing ASSP-25, no. 4 (August 1977): 299 - 309.