EDIT:
To summarize everything below into simpler steps:
1) Figure out how to make a background task and test it with a printf statement or something. Look at the examples in the Bela git repo for how to schedule a non-RT task. Your system will choke if you try to do this kind of processing inside the RT audio thread.
This is out of sequence from the advice I gave earlier, but thinking practically: implementing cross-correlation without a non-RT environment to run it in means doing something like processing wave files on your computer instead, and that mixes in an extra challenge for somebody new to programming (learning WAV file I/O and type-casting).
2) Prevent the task from being rescheduled until it is complete (something like a bool gTaskCompleted variable that the audio thread clears when it schedules the task and the non-RT thread sets when it completes; the audio thread tests it on every loop to decide whether to reschedule). This is essentially a semaphore (really a spin-lock). Normally a spin-lock is considered a crude and inefficient way to synchronize threads, but in the current context it doesn't cost you much (just an if() statement each loop). You might use the usleep() function in the non-RT thread to make it take half a second to complete, then do a printf() when it finishes so you can see the background task ran.
3) You need a circular buffer (delay line) for each channel in the RT audio thread even though the output of the Mic Input will not be delayed. This is just for recording data to be processed: 1 for Line In and 1 for Mic In. Line In channel is larger than Mic In channel by an additional number of samples representing something more than the maximum delay you expect between the 2 channels. For the Line In channel this circular buffer doubles as your delay line since you can pull your Line output from this buffer lagging the write pointer by however much delay you need.
4) Every time gTaskCompleted is set, "unwrap" the circular buffers into a pair of linear buffers (something that doesn't change while you are processing it). Remember, Line is longer than Mic by the maximum expected delay between the 2. These 2 linear buffers are what you process with the cross-correlation function. When you get cross-correlation implemented, print an integer value for the delay between these 2 buffers, or maybe write it out to a log file that you can examine in a spreadsheet or even just a text editor. Here is an example of using fprintf() (C-style): http://www.cplusplus.com/reference/cstdio/fprintf/
5) When you feel like your log file is giving you reasonable values for delay when signal level is high enough to be valid, then you can work on the peak/level detector to trigger processing.
6) Implement peak level detector. Just a really rough sketch below (Copy/paste ok if you understand how to fill in the missing pieces):
#define T_ATK (10.0f/1000.0f) //Attack time (s) on peak detector to avoid activating on brief glitches, pops, noise, etc. NB: plain 10/1000 is integer division and evaluates to 0.
#define T_RLS (50.0f/1000.0f) //Release time (s) to hold the peak value between cycles
float gLineLevel;
float gMicLevel;
float gaa, gab, gra, grb; //1-pole LPF coefficients, 1 set for attack, 1 set for release
.
.
.
//Initialize
float dt = 1.0f/sampleRate; //get the sample rate from context (context->audioSampleRate); note that 1/SampleRate truncates to 0 if SampleRate is an integer type
//setup coefficients
// see https://en.wikipedia.org/wiki/Low-pass_filter#Simple_infinite_impulse_response_filter
gaa = dt/(dt + T_ATK);
gra = dt/(dt + T_RLS);
gab = 1.0 - gaa;
grb = 1.0 - gra;
gLineLevel = 0.0;
gMicLevel = 0.0;
.
.
.
//In render loop, do this for every sample:
// This is a 1/2-wave rectifier. I doubt you will need more than this for simple
// signal level detection. To do full-wave, use fabsf(input_mic[n]) (abs() is for ints).
if(input_mic[n] > gMicLevel) {
	gMicLevel = gaa*input_mic[n] + gab*gMicLevel; //run 1-pole LPF with attack time
} else {
	gMicLevel = gra*input_mic[n] + grb*gMicLevel; //run 1-pole LPF with release time
}
//Repeat above for line input channel, then:
if( (gMicLevel > gProcessingThreshold) && (gLineLevel > gProcessingThreshold) && (gTaskCompleted == true) ) {
//needs some logic to check that it has been above the threshold enough during the past
//MAX_LENGTH samples so you know the majority of your circular buffers are full of valid audio
//frames instead of noise or silence.
//then populate buffers and schedule background processing.
}
Here is a sketch of brute-force cross-correlation:
//EDIT: See following post. I worked out the details to make it into code that compiles and works as expected.
After several of these cycles, do some statistics on the computed delay times to gain confidence in the mean (remove extreme outliers). After that you should be able to stop computing cross-correlation unless you want to periodically re-evaluate to make sure somebody didn't move the mic.
I don't know your level of knowledge in signal processing theory, but a cross-correlation is a convolution between two signals where one is time-reversed. This convolution can be performed by a single point-wise multiplication in the frequency domain + the cost of FFT and iFFT instead of NxN multiplications.
Compare these:
https://en.wikipedia.org/wiki/Cross-correlation
https://en.wikipedia.org/wiki/Convolution
You will see the formulas are identical except for the sign on τ, which is merely a time-reference reversal: a minor detail to keep in mind when interpreting the results of a convolution in the context of cross-correlation.
You can do this efficiently with an FFT:
http://dsp.stackexchange.com/questions/736/how-do-i-implement-cross-correlation-to-prove-two-audio-files-are-similar
But you probably want to prove that out to yourself in MATLAB (octave) or similar environment before you attempt to implement it in C++.
Because you can push this to the background, outside of the RT audio thread, and you don't need to keep up with it in real time, the brute-force method may be fast enough. If you do want to optimize for performance, I would put off the FFT convolution method until the very last thing you do.
Here is an example for how to schedule a low priority auxiliary task:
https://github.com/BelaPlatform/Bela/blob/master/examples/11-Extras/oscillator-bank/render.cpp