I received my Ph.D. in Electrical Engineering and Computer Science department from M.I.T. in 1999. At M.I.T., I worked as a research assistant in the Machine Listening Group of the MIT Media Lab. My undergraduate education was completed at Cornell University, in Electrical Engineering, with a concentration on signal processing and some work in music and the psychology of hearing.
My research interests are interdisciplinary, drawing from computer science, signal processing, pattern recognition, psychology, and cognitive science. My Master's Thesis was a model of human spatial hearing, using interaural differences to resolve sound source position in both azimuth and elevation. My Masters Thesis Proposal, which gives a reasonable introduction to my thinking about the topic, is online in hypertext, though it isn't pretty.
In 1996, I mainly worked on systems for automatic music transcription. I was never very satisfied with the results, but if you are really interested, here's a web page from mid 1996 that describes my thinking at the time (the page describes work-in-progress on a later abandoned blackboard system for automatic polyphonic transcription). There are three related references below in the publications section.
My dissertation is about building a system that listens to music and can identify the instruments. he dissertation itself is titled "Sound-source recognition: A theory and computational model".
If you want to know more about my professional life, here's my resume.
Martin, K. D., Scheirer, E. D., and Vercoe, B. L. (1998). Music content analysis through models of audition. Presented at the 1998 ACM Multimedia Workshop on Content Processing of Music for Multimedia Applications, Bristol, England, September, 1998. Submitted to the Journal of New Music Research.
Martin, K. D. (1998). Toward automatic sound source recognition: Identifying musical instruments. Presented at the 1998 NATO Advanced Study Institute on Computational Hearing, Il Ciocco, Italy, July, 1998. [Paper] [Poster]
Martin, K. D. (1997). Echo Suppression in a Computational Model of the Precedence Effect. Presented at the 1997 IEEE Mohonk workshop on Applications of Signal Processing to Acoustics and Audio, 1997.
Martin, K. D. and Scheirer, E. D. (1997). Automatic Transcription of Simple Polyphonic Music: Integrating Musical Knowledge. Presented at SMPC, 1997.
Martin, K. D. (1996). Automatic Transcription of Simple Polyphonic Music: Robust Front End Processing. M.I.T. Media Lab Perceptual Computing Technical Report #399, November 1996, presented at the Third Joint Meeting of the Acoustical Societies of America and Japan, December, 1996.
Martin, K. D. (1996). A Blackboard System for Automatic Transcription of Simple Polyphonic Music. M.I.T. Media Lab Perceptual Computing Technical Report #385, July 1996.
Martin, K. D. (1995), ``Estimating Azimuth and Elevation from Interaural Differences'', presented at the 1995 IEEE Mohonk workshop on Applications of Signal Processing to Acoustics and Audio, October 1995.
Martin, K. D. (1995), A Computational Model of Spatial Hearing, Master's thesis, EECS dept, MIT, June 1995.
Gardner, William G. and Martin, Keith D. (1995), ``HRTF Measurements of a KEMAR'', J. Acoust. Soc. Am., 97(6), June 1995.
Martin, Keith, ``A computational model of auditory localization'', presented at the 128th meeting of the Acoustical Society of America, November, 1994.
Gardner, Bill and Martin, Keith (1994), ``HRTF Measurements of a KEMAR Dummy-Head Microphone,'' MIT Media Laboratory Perceptual Computing Technical Report #280, May, 1994. Included on the CD-ROM "Standards in Computer Generated Music", Goffredo Haus and Isabella Pighi, eds., published by the IEEE CS Technical Committee on Computer Generated Music, 1996.
Back to my homepage...