I would probably use one or more [bp~] to isolate the frequencies of interest, going into [env~ 8192] to get the RMS value over a reasonably large window, then into a [lop~] with a low enough cutoff to turn it back into a signal and add a bit more smoothing, then into a properly tuned [threshold~]. That object outputs two bangs: one on the left when the signal goes above the higher threshold, and one on the right when the signal drops back below the lower threshold. The left bang can be used to start the recording if it hasn't started already. The right bang probably needs to go into a [delay] so that the recording doesn't stop immediately when the signal gets quieter (you wouldn't want it to stop in between words, for instance). The [delay] should be cancelled whenever a left bang arrives, so that a pending stop can't fire while you are waiting for the next right bang.
Once you have it basically working, you'll spend some time tuning the input filter, the [threshold~] levels and the [delay] times.
Something like this
