I wanted to write a code to detect baby cry sound. I am using Windows as platform. Presently, I am able to get audio samples and its frequency plot(using FFT) but not sure how to proceed forward.
I wanted to ask what steps I should follow to detect the baby cry sound given its time-frequency plot.
I saw some methods such as median filter followed by HMM in speech recognition. But for simple sound detection do I need to go for such sophiticated method?
I will be very grateful if you could help me.