Math 480 Final Project

THIS README IS INCOMPLETE - PLEASE REFER TO: http://bryantwong.github.io/Math480_FinalProject/

Introduction

This project was done for Math 480b at the University of Washington, Spring 2014 by Bryant Wong, Yuki Sheng, and Tony Xu.

The basic premise of our project is that we are interested in figuring out why different instruments sound different, i.e. what makes instruments have different timbres.

Our process is to input some uncompressed audio file of an instrument playing at 440HZ (A4), process that file at a sampling rate of 44.1 kHz, run it through a self-implemented version of the Fast Fourier Transform, and look at the amplitude and frequncies of the resulting sine waves. At this stage, there are various things we can do with this output, some of which we will investigate.

Most of our code is currently written in pure Python, but one of the things we would like to do is to speed up our FFT algorithm by rewriting it in Cython.

Implementation/Process

Initially we were interested in working with Fourier transforms to derive some qualities from an audio clip of a song, such as BPM. However, since we have not done too much with discrete Fourier transforms before, this project was scrapped in favor of the one we have now.

We knew that the Python package numpy already has an extensive implementation of the FFT/DFT, but we felt like that it would 1) be beneficial to our programming skills and 2) add to the mathematical rigor of our project if we implemented an FFT algorithm ourselves. Initially we looked at how other packages like numpy implemented their algorithms. Most use the Cooley-Tukey Fast Fourier Transform algorithm which is extremely optimized, however we decided to use a simpler algorithm that we found more understandable that utilizes recursion. As a result, our FFT algorithm is not very fast, but we think that it's more understandable and clearer to someone who has not had much experience with Discrete Fourier Transforms.

The algorithm we implemented is the following piece of code:

	# The actual FFT function 
	def fft(signal):
   		n = len(signal)
    
   		# if the input is only one sample then we can't really do a fft 
    
   		if n == 1:
    	  	return signal
    
   		else:
      		# breaking up into odd and even pieces
      		F_even = fft([signal[i] for i in xrange(0, n, 2)])
      		F_odd = fft([signal[i] for i in xrange(1, n, 2)])
        
      		# defining new empty array with n entries
      		combined = [0] * n
      
     		# implementation of the alg (using roots of unity)
      			for m in xrange(n/2):
         		combined[m] = F_even[m] + omega(n, -m) * F_odd[m]
         		combined[m + n/2] = F_even[m] - omega(n, -m) * F_odd[m]
 
		    return combined

(INSERT MATHY STUFF ABOUT FFTS HERE) + explanation of the algorithm will be done for final project.

Example

For our example, we will take a 1 second clip of an oboe playing A4 (440 HZ) and run it through our algorithm. Here is the original sample:

Running it in the terminal (GUI in progress):

	python finalProj.py oboe.wav

After about a minute, (will implement in Cython ASAP to reduce computation time), the program outputs an image with the frequencies and their associated relative amplitudes: http://i.imgur.com/t5RDUOS.png

One interesting thing that we can do is actually try to reconstruct the sound of the oboe using the information given above using Audacity by adding up a bunch of sine waves with the appropriate frequencies and amplitudes: http://i.imgur.com/qgpheYg.png

What we get is a sound that is extremely close to the original file:

Things to do

We are currently working on adding a GUI or some sort of interface so that people can take their own sounds and choose sampling rate/number of samples to get their own plots. Other small things on our to-do list include adding docstrings to our functions, as well as rewriting the FFT portion of our code in Cython to get a speedup and remove the bottleneck in our program.

Possible directions we have been thinking of include some sort of synthesizer where people can blend sounds of different instruments and automatic reconstruction of the sound in Audacity.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
javascripts		javascripts
stylesheets		stylesheets
README.md		README.md
cello.png		cello.png
compile.py		compile.py
finalProj.py		finalProj.py
finalProj.pyc		finalProj.pyc
finalProj2.py		finalProj2.py
finalProject.py		finalProject.py
flute.png		flute.png
frame2.py		frame2.py
harp.png		harp.png
index.html		index.html
oboe.wav		oboe.wav
oboe_digital.png		oboe_digital.png
oboe_digital.wav		oboe_digital.wav
organ.png		organ.png
params.json		params.json
piano.wav		piano.wav
signal.png		signal.png
sin.png		sin.png
violin1.png		violin1.png

bryantwong/Math480_FinalProject

Folders and files

Latest commit

History

Repository files navigation

Math 480 Final Project

THIS README IS INCOMPLETE - PLEASE REFER TO: http://bryantwong.github.io/Math480_FinalProject/

Introduction

Implementation/Process

Example

Things to do

About

Resources

Stars

Watchers

Forks

Languages