Skip to main content

Notice

Please note that most of the software linked on this forum is likely to be safe to use. If you are unsure, feel free to ask in the relevant topics, or send a private message to an administrator or moderator. To help curb the problems of false positives, or in the event that you do find actual malware, you can contribute through the article linked here.
Topic: Pitch Shifting in the Encoded Domain (Read 2030 times) previous topic - next topic
0 Members and 1 Guest are viewing this topic.

Pitch Shifting in the Encoded Domain

This is a question for those who (unlike me) understand the nuts and bolts of compression algorithms.

I have an interest in high quality real-time tempo shifting (which requires pitch shifting to restore the pitch after resampling).  I presume one way of pitch shifting involves FFT into the frequency domain, shifting the frequency components, and then FFT back into the time domain.

What I am curious about is whether any compression formats save the payload as frequency components, and could therefore perform the pitch shifting directly, in the compressed data prior to decoding for playback.  Eg wavelets.

It's your privilege to disagree, but that doesn't make you right and me wrong.

Re: Pitch Shifting in the Encoded Domain

Reply #1
I believe MOST lossy compression works in the frequency domain.   So it should be possible. 

But existing editors like Mp3DirectCut that work without de-compressing/re-compressing don't go that "deep" into processing.   i.e. They  don't do equalization or mixing, etc.

Re: Pitch Shifting in the Encoded Domain

Reply #2
have an interest in high quality real-time tempo shifting (which requires pitch shifting to restore the pitch after resampling).  I presume one way of pitch shifting involves FFT into the frequency domain, shifting the frequency components, and then FFT back into the time domain.

There is a lot more to it.  Take a look at this tutorial:  https://www.guitarpitchshifter.com/algorithm.html

What I am curious about is whether any compression formats save the payload as frequency components, and could therefore perform the pitch shifting directly, in the compressed data prior to decoding for playback.  Eg wavelets.

Almost all lossy formats do, but that doesn't really help you with tempo adjustment.

Re: Pitch Shifting in the Encoded Domain

Reply #3
Okay... I guess I knew that much already.

Is it true that the ear is only sensitive to frequency, not phase?

Supposing that to be true, then if a signal were represented as a series of overlapping wavelets (which themselves do not contain discontinuities), wouldn't transposing wavelets with equivalent wavelets but at a shifted frequency achieve the effect of pitch shifting?  That's my idea, anyway.
It's your privilege to disagree, but that doesn't make you right and me wrong.