Bounty: Implement noise cancellation on RPi-3 based hardware devices (Mark 1 and Picroft) #1478

KathyReid · 2018-03-14T11:06:25Z

NOTE: This issue supercedes Issue #57

Problem statement

The current audio bus on the Mark 1 and Picroft images does not eliminate the speaker audio from the microphone. This leads to undesirable device behavior, most noticeably when an audio stream is playing and the user is unable to “barge in” easily with a Hey Mycroft.

The device is aware of what audio is being output from the speaker. The essential idea desired is to subtract the speaker audio-out from the microphone audio-in using an appropriate approach - such as time-shifting the outbound audio and matching it to the audio in from the microphone.

Acceptance criteria

The solution must work on a Mark 1 reference hardware device. Picroft is OK for testing or proof of concept, but the solution must work in a Mark 1 enclosure acoustic environment
The solution must work with an audio stream that is being played at 3/4 volume, such as Pandora, Spotify, Mopidy or other streaming audio
The solution must work with the default Precise Wake Word detection software.
A user must be able to interrupt the audio input/output stream by speaking the Wake Word - ie ‘Hey Mycroft’ at normal volume (ie not shouting).
The solution must work within the CPU limitations of RPi 3 hardware (the hardware used for both Mark 1 and Picroft). Namely, not exceeding a 3.0 load average when running the top command.

Useful information

Key technical contact - Steve Penrod (@penrods) (@steve-mycroft at https://chat.mycroft.ai)

Bounty

The Bounty for this feature request is $USD1000, as well as a free Mark 1 and a Gold Mycroft Challenge Coin.

The text was updated successfully, but these errors were encountered:

stephanelpaul · 2018-03-15T15:11:19Z

I'm going to take a look at this shortly

ekjswim · 2018-03-16T20:17:47Z

Info that may be helpful re: OSS DSP:
http://www.audioxpress.com/news/the-linux-foundation-adopts-sound-open-firmware-project-enabling-developers-to-adapt-operating-systems-for-audio-devices

pcwii · 2018-03-19T11:27:15Z

More helpful information:
PulseAudio supports module-echo-cancelation.
More information here...https://arunraghavan.net/2016/05/improvements-to-pulseaudios-echo-cancellation/

el-tocino · 2018-03-19T16:57:43Z

Some hopefully useful links about the pulse module:
https://www.freedesktop.org/wiki/Software/PulseAudio/Documentation/User/Modules/#index45h3
https://wiki.archlinux.org/index.php/PulseAudio/Troubleshooting#Enable_Echo.2FNoise-Cancelation
The echo cancellation module can also do beamforming...

pcwii · 2018-03-19T17:32:05Z

@KathyReid @penrods
Has anyone explored this option (pulse audio echo cancelation) previously? I am willing to give it a go although I only have a picroft to work with.

forslund · 2018-03-22T16:11:02Z

I believe it was tried a couple of years ago but the cpu strain was quite high. (This is what I've heard so no personal experience on the Pi). The pulse audio echo cancellation works great on my workstation so it'd be cool if it could work on the Pi as well. If it's too intensive on the hardware maybe there are tweaks that can be made.

Give it a try, and see what the result is!

roadriverrail · 2018-04-12T16:53:22Z

I've worked on projects using a Broadcom chipset not unlike that of the BCM2837 (which is used in RPi3) and we'd seen good success using the Opus echo canceler. It does take CPU to do, but it wasn't particularly bad. Unfortunately, I don't have the necessary free time to contribute to the bounty hunt, but I thought perhaps suggesting this would help someone else.

KathyReid · 2018-04-20T11:27:11Z

Thanks for your feedback, @roadriverrail - great suggestion!

el-tocino · 2018-04-21T05:32:59Z

Potentially interesting:
https://github.com/xiph/rnnoise
and based on that:
https://github.com/werman/noise-suppression-for-voice
(the above are significantly slower than viable, alas: ~8:1 increase in processing)

tlc · 2018-04-24T18:47:42Z

@forslund, When working on a workstation with the mycroft source, does pulse echo cancellation get loaded automatically or do we have to do that ourselves?

Do USB speakerphone devices such as the Jabra 410 (popular in the forums) do echo cancellation? I'm using one with a RPi 3B+ and "Hey Mycroft, stop" seems to work. Although, I'm not sure if it works "well" at "normal volume".

el-tocino · 2018-04-24T19:01:52Z

Currently, no distros load the pulse echo cancellation (that I know of).
Per https://www.jabra.com/business/speakerphones/jabra-speak-series/jabra-speak-410 "Digital Signal Processing (DSP ) technology
Crystal clear sound without echoes or or distorted sounds even at max volume level" which sounds a lot like it has some sort of echo canceling.

forslund · 2018-04-27T07:07:42Z

@tlc as @el-tocino states the echo cancellation isn't loaded by default. Loading it creates a virtual microphone that you need to set as default to use with mycroft. (basically selecting it in the pulse audio volume control)

KathyReid · 2018-05-11T10:16:55Z

How are we all going with this one - any questions? Any information we could provide to help?

j1nx · 2018-08-23T07:23:22Z

Not my work, but just ran into it;

https://github.com/voice-engine/ec

Looks interesting and ticking the boxes.

domcross · 2018-08-26T19:37:28Z

I have experimented with voice-engine/ec (which is basically a wrapper for speex) and PulseAudio's echo-cancel module (you have to install PA 7.1 from the Debian-Jessie-Backports for that) using algorithms "webrtc" and "speex" (adrian is not usable at all) but had no luck so far. I see mainly two reasons:

when music is played over the Mark-I speaker the mic of the Mark-I almost only picks up the music (this is caused of the physical construction), in addition the mic/preamp picks up a lot of electric/radio noise. This makes it really tough for any noise/echo-cancel algorithm.
The RPI3 timing of the internal clock is not stable enough for this kind of realtime processing - the permanent timedrift confuses the echo-cancel algorithms as well.
I will give "rnnoise" a try shortly (have it already compiled for RPI but some problems configuring it for PA) but don't have to high exspectation for the above reasons

penrods · 2018-08-26T19:57:23Z

I'd be willing to consider a solution that requires a minor and cheap add-on or modification to the Mark 1, e.g. acoustic foam separating the mic and speaker or wire rerouting. But not board level changes.

el-tocino · 2018-08-30T07:22:40Z

Beamforming based on the mic position plus a cheapo usb mic might be an option. One or two of these mini mics (search "overfly portable usb 2.0 mic") set in the ports combined with the audio from the existing mic run through a beamformer should be able to do aec and improve listening. I haven't tried it myself yet, alas.

domcross · 2018-08-31T19:07:18Z

After some more experimenting I have a configuration with the PulseAudio echo-cancel module that works reasonably^* with volume levels up to 5 (Mark-1's maximum is 11) within a distance of approx. 4 feet. There is some more room for tweaking parameters that might increase reliability.
I didn't try the hardware tweaking (acoustic foam) yet. In addition I am considering changes in Mycroft Audioservices, e.g. duck/mute music as soon as wake-word is detected in order to get a clean utterance...

^*depends on the music material, the more compressed (see "loudness war") the less reliable it works.

j1nx · 2018-08-31T19:34:19Z

I believe @forslund already did some work on the ducking part. Believe it is already in PR / Issue section somewhere.

With you that AEC has to be combined with audio ducking.

el-tocino · 2018-12-07T17:49:30Z

I used some door/window insulating foam (similar: https://www.homedepot.com/p/Frost-King-3-4-in-x-5-16-in-x-10-ft-Black-Rubber-Foam-Weatherseal-Tape-R534H/202262324) to make a barrier around the front of the mic between the face circuitboard and the faceplate. Secondarily to that, I covered the back of the speaker with foam as well.

KathyReid added help wanted Difficulty: medium bounty labels Mar 14, 2018

onpon4 mentioned this issue Jul 19, 2020

[Proposal] Built-in interruption support through Wake Word #2638

Open

krisgesling removed the bounty label Sep 16, 2020

krisgesling added the Type: Enhancement - proposed New proposal for a feature that is not currently a priority on the roadmap. label Sep 24, 2020

krisgesling mentioned this issue Sep 24, 2020

Echo cancelation in software #57

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bounty: Implement noise cancellation on RPi-3 based hardware devices (Mark 1 and Picroft) #1478

Bounty: Implement noise cancellation on RPi-3 based hardware devices (Mark 1 and Picroft) #1478

KathyReid commented Mar 14, 2018 •

edited

stephanelpaul commented Mar 15, 2018

ekjswim commented Mar 16, 2018

pcwii commented Mar 19, 2018

el-tocino commented Mar 19, 2018

pcwii commented Mar 19, 2018

forslund commented Mar 22, 2018

roadriverrail commented Apr 12, 2018

KathyReid commented Apr 20, 2018

el-tocino commented Apr 21, 2018 •

edited

tlc commented Apr 24, 2018

el-tocino commented Apr 24, 2018

forslund commented Apr 27, 2018

KathyReid commented May 11, 2018

j1nx commented Aug 23, 2018

domcross commented Aug 26, 2018

penrods commented Aug 26, 2018

el-tocino commented Aug 30, 2018

domcross commented Aug 31, 2018

j1nx commented Aug 31, 2018

el-tocino commented Dec 7, 2018

Bounty: Implement noise cancellation on RPi-3 based hardware devices (Mark 1 and Picroft) #1478

Bounty: Implement noise cancellation on RPi-3 based hardware devices (Mark 1 and Picroft) #1478

Comments

KathyReid commented Mar 14, 2018 • edited

Problem statement

Acceptance criteria

Useful information

Bounty

stephanelpaul commented Mar 15, 2018

ekjswim commented Mar 16, 2018

pcwii commented Mar 19, 2018

el-tocino commented Mar 19, 2018

pcwii commented Mar 19, 2018

forslund commented Mar 22, 2018

roadriverrail commented Apr 12, 2018

KathyReid commented Apr 20, 2018

el-tocino commented Apr 21, 2018 • edited

tlc commented Apr 24, 2018

el-tocino commented Apr 24, 2018

forslund commented Apr 27, 2018

KathyReid commented May 11, 2018

j1nx commented Aug 23, 2018

domcross commented Aug 26, 2018

penrods commented Aug 26, 2018

el-tocino commented Aug 30, 2018

domcross commented Aug 31, 2018

j1nx commented Aug 31, 2018

el-tocino commented Dec 7, 2018

KathyReid commented Mar 14, 2018 •

edited

el-tocino commented Apr 21, 2018 •

edited