I agree with Bryn that Audacity should do the trick. I use it to join up speech and music for podacasts all the time, ok mostly in mp3 but WAV too. I do a lot of manual click removal with Audacity cutting and pasting in WAVs works just fine. You can stack up the waveforms of 'tracks' and adjust the silences between (milliseconds or seconds) then save the whole lot as a single wav file. (my podcasts are made up of 22 waveforms, stacked up on Audacity then saved as one) so your silences between music can be set accurately as your ears wish
I'm currently using their 1.3 Beta version but I see they are now offering 2.0
http://audacity.sourceforge.net/download/



Reply With Quote
