CDP Software Maintenance engineer Richard Dobson writes regarding the question about PVOC frequency & amplitude v. phase shift & amplitude:
The short answer is, yes, we really do mean frequencies. Techncially, "frequency" (as a nominally continuous artefact) is defined as the rate of change of phase. This is why sometimes the phase vocoder has been referred to as a "tracking" vocoder, simnply because it tracks the (rate of) change of phase over time. A zero rate of change would equate to a constant DC level.
In the original pvoc, understood very much as an accumulating (sliding/overlapping frames) FFT-based process, data could be output in all the possible formats: raw "complex" values as delivered by the FFT, or either of amplitude/phase or all the way to ampltude/frequency. For pure analysis tasks, amp/phase is still often used. But in CDP (and likewise in Csound etc), we want to make weird transformations of the source and resynthesise it - this is very much easier and more "intuitive" with the classic amp/frequency format. It means, for example, that resynthesis can be done (from the same data) either by pvoc or by direct oscillator bank.
Some transformations (such as zeroing the amplitudes in some analysis bins) are "incorrect" mathematically (in terms of strict band-limited "linear" DSP), but composers rarely care about that. So long as the results are both interesting and mostly correlate with the intention, we just get on with it!
The "pvocex" program now included in the CDP system is a straight port of the origial pvoc (but creating pvx files), with (almost) all the original flag options.
Cheers,
Richard
The short answer is, yes, we really do mean frequencies. Techncially, "frequency" (as a nominally continuous artefact) is defined as the rate of change of phase. This is why sometimes the phase vocoder has been referred to as a "tracking" vocoder, simnply because it tracks the (rate of) change of phase over time. A zero rate of change would equate to a constant DC level.
In the original pvoc, understood very much as an accumulating (sliding/overlapping frames) FFT-based process, data could be output in all the possible formats: raw "complex" values as delivered by the FFT, or either of amplitude/phase or all the way to ampltude/frequency. For pure analysis tasks, amp/phase is still often used. But in CDP (and likewise in Csound etc), we want to make weird transformations of the source and resynthesise it - this is very much easier and more "intuitive" with the classic amp/frequency format. It means, for example, that resynthesis can be done (from the same data) either by pvoc or by direct oscillator bank.
Some transformations (such as zeroing the amplitudes in some analysis bins) are "incorrect" mathematically (in terms of strict band-limited "linear" DSP), but composers rarely care about that. So long as the results are both interesting and mostly correlate with the intention, we just get on with it!
The "pvocex" program now included in the CDP system is a straight port of the origial pvoc (but creating pvx files), with (almost) all the original flag options.
Cheers,
Richard