| New Phase-Vocoder Techniques for Real-Time Pitch-Shifting, Chorusing, Harmonizing and Other Exotic Audio Modifications (2000) | |||||||||||||||
Abstract | |||||||||||||||
| The phase-vocoder is a well-established tool for timescaling and pitch shifting speech and audio signals. Its theory is now well understood and improvements have been proposed to reduce artifacts commonly encountered when time-expanding signals by large factors [1, 2, 3, 4]. In the literature, the phase-vocoder has been described primarily as a tool for time-scaling rather than pitch shifting, the latter usually being achieved by a combination of time-scaling and sampling rate conversion [5, 6]. This article focuses mainly on pitch-scale modification of speech and audio signals, and discusses the drawbacks of the standard timescale /resampling technique. Two alternative techniques are presented that significantly reduce the complexity and computational cost, while offering dramatically extended capabilities. In particular, the new techniques, which operate solely in the frequency domain, enable chorusing, harmonizing and non-standard frequency modifications such as partial-stretching (nonlinear frequency scaling), frequency inversions and so on. | |||||||||||||||
Publication details | |||||||||||||||
| |||||||||||||||