Skip to content
View trumpepsteinaudio's full-sized avatar
Block or Report

Block or report trumpepsteinaudio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
trumpepsteinaudio/README.md

It has been a while? Do you have audio/tips to share? Email to trumpepsteinaudio@gmail.com

Let's put our AI audio processing skills to good use.

We sure would like to know what Trump and Epstein said to each other during this exchange in the 90s. https://www.youtube.com/watch?v=KLcfpU2cubo&feature=youtu.be Unfortunately, the song "Rhythm is a Dancer" by Snap! is playing loudly. With the correct reference track, it is possible that an echo canceller can remove the music and expose the conversation. Let's try it as a collective.

In an echo canceller, the reference track is signal that you want to remove. In this case, it would be a clean recording of the song playing in the club. You can use AI to build a model of the music track in the club so that it can be removed.

UPDATE: In record time, a community member has found what appears to be the correct reference track! It is reference.wav, now in repo!

FIRST PROBLEM: The official version of this song (below) does not appear to be the one playing in the club. Can you help us find (or construct) the perfect reference track?You can submit to this repo or mailto: trumpepsteinaudio@gmail.com. A complete listing of the versions of this song are on wikipedia https://en.wikipedia.org/wiki/Rhythm_Is_a_Dancer#Track_listing_and_formats. Known song versions is in a text file in this repo. You can edit, or indicate which ones should be removed. If you want to download youtube videos https://ytdl-org.github.io/youtube-dl/ is an easy command-line utility.

SECOND PROBLEM Let's try some echo cancellers. Share any tips in the issues tab or helpful code, or email trumpepsteinaudio@gmail.com.

UPDATE: speex codec seems to not work well. how can we better model the transfer functions of the speaker->room->microphone? Are there papers the community can recommend? A standard open source version is available here: https://www.speex.org/ (see Acoustic Echo Canceller) A python implementation is here: https://github.com/xiongyihui/speexdsp-python

Apple has done advanced work in this space: https://machinelearning.apple.com/research/optimizing-siri-on-homepod-in-far-field-settings

Successful models might want to submit to the Acoustic Echo Cancellation Challenge at ICASSP 2021 (organized by Microsoft)

Repo contains the starting files.

OTHER APPROACHES Beyond echo cancellation, you might want to try models that use video to select a target voice such as: https://arxiv.org/abs/1804.03619 and https://arxiv.org/pdf/1804.04121.pdf

Or you might want to try video-to-audio lipreading models: http://cvit.iiit.ac.in/research/projects/cvit-projects/speaking-by-observing-lip-movements

Send us any references/links/code that could be useful, and we will post here.

Relevant Online Commnuities:

Music-IR Mailing List https://groups.google.com/a/ismir.net/g/community?pli=1

Machine Listening Mailing List: https://groups.google.com/forum/#!forum/machinelistening

Music IR Slack: https://launchpass.com/mircommunity

Popular repositories

  1. trumpepsteinaudio trumpepsteinaudio Public

    15