PSOLA pitch shifting

dealerpriest

Hello!
Would be grateful for any help/advice!
I'm working on a multi effect for saxophone using Bela. I already have a few effects working well. One of those is a time-domain pitch-shifter using amdf as described in this paper:
http://ant-s4.unibw-hamburg.de/dafx/paper-archive/2007/Papers/p007.pdf

This pitchshifting algorithm works rather well and I feel the responsiveness and low latency is muuch better than some other pitchshifting effects I've tried. But I stumbled upon the PSOLA algorithm and was intrigued that it supposedly keeps the formants of the signal!
So I read up a bit on the PSOLA algorithm and took a try at implementing it in the BELA. I now have something that works. Kinda...
But. There are some things that aren't totally clear to me.

The "pitch marks". Must they be positioned on a high energy peak and if so, why? I'm currently using the amdf for the pitch tracking and it won't find peaks in the periodic signal.

The grains/splices/segments. Must/should they be 2 periods long as I've read in many places? Why? Should it always be a new grain for every period, regardless of grain size (i.e. with a grain size of 2 periods, there would always be two grains overlapping)?

The windowing of the grains. Must/should it be the same window regardless of distance between grains, or does it make sense to aim for crossfadeish between grains? Or at least make the grainsize/window larger when the grains are far apart? As of now, a down shift of two octaves, will introduce a lot of zero samples between the grains.

Intuitively I have a feeling that the windowing/grain size is key in creating the pitch shift. I was considering the case of a grain/window size of 3 periods and a pitch ratio of 0.5 (shifting down 1 octave). This would have the combined grains overlap by exactly one period, basically resulting in no pitch shift.

Would be happy to get some better understanding of this. Grateful for any help!

MattB

Hey - I'm trying to do a similar project. Do you have any code for Bela you're willing to share?

dealerpriest

Hello @MattB !
Here is a link to the repo:
https://github.com/Dealerpriest/bela-octave

I haven't worked on it for a long while, but actually started looking into the code again just a few days ago!
The repo poorly documented because it's at this point only for my personal purposes. But you are welcome to have a look!!

dealerpriest

I'm not sure (can verify later), but i think you could try out the working AMDF-based pitchshifting algorithm by uncommenting this line:
https://github.com/Dealerpriest/bela-octave/blob/1b88606b85e5c5766b4b1d44f6f272e762d16ee0/render.cpp#L349-L356

That should enable switching between AMDF and PSOLA algorithms.
A while back I separated the amdf search and the actual sample rate change into separate classes. So when using AMDF-algorithm the classes amdf.cpp and pitchshifter.cpp work together to perform the pitchshifting. The Amdf class calculates the position of the pitched ring buffer pointer, and the pitchshifter picks samples and resample from the ring buffer.

ryjobil

I recall autotalent implemented a PSOLA algorithm in case this sparks any interest:
http://tombaran.info/autotalent.html

Volod

A tad bit late but Clouds from Mutable Instruments has a WSOLA implementation (superior to PSOLA) that you could adapt with ease. https://github.com/pichenettes/eurorack/blob/master/clouds/dsp/fx/pitch_shifter.h