Lab: Master Theorem

Notice that this lab is not hosted on github but on a competitor website gitlab. There are many different websites for hosting git repositories, but all of them are just front-ends for the git command line utility. We are using github in class because it provides a convenient and free continuous integration system (github actions) for open source projects, so you don't have to pay every time you submit an assignment and run the test cases.

The purpose of this lab is both to get you familiar with using these non-github webpages and to practice the master theorem.

Tasks

Clone this repo onto the lambda server.

NOTE: I'm not going to give you the exact command to run in these instructions because that's a gauche thing to do. Real world instructions often assume you are capable of figuring out these sorts of details for yourself. The command will look like the clone commands you've been running before, just with a gitlab.com url instead of a github.com url. (I am of course happy to answer any questions if you can't get this to work.)

Now you will practice using the master theorem.

The file search.py contains three functions. Run the doctests to ensure that they are implemented correctly.

NOTE: Running doctests should become a habit for you on every new python file you see. Again, I'm not going to give you the command because that's uncouth. The exact commands can vary slightly from system to system, and so library/tutorial authors usually assume you'll be able to figure out these details yourself. I want you to get some practice figuring things out for yourself, but I am of course happy to answer any questions if you can't get this to work.

The first two functions in search.py are the sequential search and binary search algorithms you saw last week.

Recall that the sequential search had a worst case runtime of $\Theta(n)$.

NOTE: I'm being careful to state that this was a worst case runtime of $\Theta(n)$. The best case runtime happens when xs[0] == y, and the runtime is then $\Theta(1)$ regardless of the size of the input list xs. When we talk about just the runtime without a qualifier, this includes all possible cases. There is no $\Theta$ bound that can hold for both cases, and so the best statement that we can make is that the runtime is $O(n)$. In casual conversation, programmers often neglect these formal differences, but it is important to emphasize them when solving problems in this class or in an interview situation.
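To make the best case and worst case concrete, here is a minimal sketch that counts comparisons in a left-to-right scan. The function name `sequential_search_steps` and its return value are hypothetical and chosen for illustration; this is not the implementation in search.py.

```python
def sequential_search_steps(xs, y):
    """Scan xs left to right; return (found, number_of_comparisons).

    Hypothetical helper for illustration, not the function in search.py.
    """
    comparisons = 0
    for x in xs:
        comparisons += 1
        if x == y:
            return True, comparisons
    return False, comparisons


xs = list(range(1000))
best = sequential_search_steps(xs, 0)    # target is xs[0]: one comparison
worst = sequential_search_steps(xs, -1)  # target is absent: len(xs) comparisons
print(best, worst)
```

The best case does a constant amount of work no matter how long xs is, while the worst case grows linearly with len(xs), which is why only the $O(n)$ statement covers both.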

The binary search algorithm greatly improved on sequential search by reducing the worst case runtime to $\Theta(\log n)$.

NOTE: People often say informally that the binary search runtime is $O(\log n)$. While that is a true statement, it is also a true statement that the binary search runtime is $O(n)$. (Because $O$ is an upper bound, any larger upper bound is guaranteed to also hold.) Because the big-O notation can potentially be loose in this way and mask the improvement of binary search over sequential search, we prefer to be explicit with $\Theta$ when possible.

But is $\Theta(\log n)$ the best we can do? If splitting the data into 2 recursive subproblems was good, maybe splitting it into 3 recursive subproblems will be better. The trinary_search function does just that.
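For readers who want to see the idea in code, the following is one possible way to split a sorted sequence into thirds. It is only a sketch under my own assumptions: the name `trinary_search_sketch`, the iterative style, and the choice of cut points are hypothetical and may differ from the trinary_search function in search.py.

```python
def trinary_search_sketch(xs, y):
    """Return True if y occurs in the sorted sequence xs.

    Hypothetical sketch; the real trinary_search in search.py may differ.
    """
    lo, hi = 0, len(xs)              # search the half-open range xs[lo:hi]
    while lo < hi:
        third = (hi - lo) // 3
        m1 = lo + third              # first cut point
        m2 = hi - 1 - third          # second cut point, always >= m1
        if xs[m1] == y or xs[m2] == y:
            return True
        if y < xs[m1]:
            hi = m1                  # keep the left third
        elif y > xs[m2]:
            lo = m2 + 1              # keep the right third
        else:
            lo, hi = m1 + 1, m2      # keep the middle third
    return False


print(trinary_search_sketch([1, 3, 5, 7, 9, 11, 13], 9))
print(trinary_search_sketch([1, 3, 5, 7, 9, 11, 13], 4))
```

Each pass through the loop does a constant number of comparisons and keeps roughly one third of the remaining range, which is the structure the recurrence relation needs to capture.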

Your task is to analyze the runtime of this function.

First we will analyze the runtime theoretically. Modify the README file to include:
1. The recurrence relation that describes the function's runtime: $$T(n) = T(n/3) + 1$$
2. The solution to the recurrence you wrote above as provided by the master theorem: $$T(n) = \Theta(\log n)$$
(Feel free to check your answers with me before moving on.)
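As a sketch of how the master theorem applies to a recurrence of this shape, here is the case analysis written out in the standard notation $T(n) = aT(n/b) + f(n)$. The symbols $a$, $b$, and $f$ come from the usual textbook statement of the theorem, not from this lab's files.

$$a = 1, \quad b = 3, \quad f(n) = \Theta(1), \quad n^{\log_b a} = n^{\log_3 1} = n^0 = 1$$

$$f(n) = \Theta(n^{\log_b a}) \implies \text{Case 2} \implies T(n) = \Theta(n^{\log_b a} \log n) = \Theta(\log n)$$

The base of the logarithm does not matter inside $\Theta$, because $\log_3 n = \log_2 n / \log_2 3$ differs from $\log_2 n$ only by a constant factor.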

Next, we will use the timeit module to analyze the function empirically. The theoretical results above should give you a good prediction about how the empirical experiments below will turn out.

Your task is to complete the table below, which shows the actual runtime of the binary_search and trinary_search functions. To complete this table, modify the timeit command from the Runtime vs N section of lab-timeit2 so that it: (1) uses the appropriate functions from this lab, and (2) uses a numpy array instead of a list as the input data structure. As in the previous lab, I recommend using a bash for loop to complete this task.
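The lab asks for the timeit command line inside a bash loop; as an alternative illustration, the same kind of measurement can be sketched with the timeit module from inside Python. Everything named here is an assumption for the sake of the example: `stand_in_search` is a placeholder built on the standard library's bisect module (not a function from search.py), `seconds_per_call` is a hypothetical helper, and the input is a plain list so the sketch runs without extra packages.

```python
import timeit
from bisect import bisect_left


def stand_in_search(xs, y):
    """Placeholder binary search built on bisect; not the lab's function."""
    i = bisect_left(xs, y)
    return i < len(xs) and xs[i] == y


def seconds_per_call(search, xs, y, number=1000, repeat=3):
    """Best-of-`repeat` average time of one search(xs, y) call, in seconds."""
    totals = timeit.repeat(lambda: search(xs, y), number=number, repeat=repeat)
    return min(totals) / number


for exponent in range(0, 11, 5):
    n = 2 ** exponent
    xs = list(range(n))  # build the input once, outside the timed call
    t = seconds_per_call(stand_in_search, xs, -1)  # -1 is never in xs
    print(f"n=2**{exponent}: {t * 1e6:.3f} usec")
```

To match the lab's instructions you would pass the functions from search.py and a numpy array in place of the stand-ins. Building the input outside the timed call matters, because otherwise the cost of creating the data is counted as part of the search.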

| input size | `binary_search` | `trinary_search` |
| --- | --- | --- |
| `n=2**0` | 0.353 usec | 0.562 usec |
| `n=2**1` | 0.554 usec | 1.01 usec |
| `n=2**2` | 0.833 usec | 0.974 usec |
| `n=2**3` | 0.879 usec | 1.05 usec |
| `n=2**4` | 1.18 usec | 0.295 usec |
| `n=2**5` | 1.39 usec | 1.16 usec |
| `n=2**6` | 1.72 usec | 1.9 usec |
| `n=2**7` | 2.33 usec | 2.09 usec |
| `n=2**8` | 2.88 usec | 2.65 usec |
| `n=2**9` | 3.83 usec | 2.41 usec |
| `n=2**10` | 6.28 usec | 4.22 usec |
| `n=2**11` | 10.6 usec | 6.23 usec |
| `n=2**12` | 17.7 usec | 10.6 usec |
| `n=2**13` | 35.2 usec | 17.7 usec |
| `n=2**14` | 71.8 usec | 33 usec |
| `n=2**15` | 180 usec | 75 usec |
| `n=2**16` | 434 usec | 188 usec |
| `n=2**17` | 938 usec | 428 usec |
| `n=2**18` | 1.92 msec | 937 usec |
| `n=2**19` | 4.44 msec | 2.03 msec |
| `n=2**20` | 12.7 msec | 5.87 msec |
| `n=2**21` | 40.6 msec | 14.1 msec |
| `n=2**22` | 82.3 msec | 34.4 msec |

Use the master theorem to solve the following recurrence relations, and modify the table to include the solutions. There is no experimental portion for this problem.

| recurrence | solution | practical application |
| --- | --- | --- |
| T(n) = T(n/2) + n | $\Theta(n)$ | runtime of the bad binary search |
| T(n) = T(n/2) + 1 | $\Theta(\log n)$ | runtime of the correct binary search |
| T(n) = T(n/3) + 1 | $\Theta(\log n)$ | runtime of "trinary search" |
| T(n) = 2T(n/2) + 1 | $\Theta(n)$ | runtime for finding the median of an unsorted list |
| T(n) = 2T(n/2) + n | $\Theta(n \log n)$ | runtime of merge sort |
| T(n) = 3T(n/3) + n | $\Theta(n \log n)$ | runtime of a trinary merge sort |
| T(n) = T(n/2) + n^2 | $\Theta(n^2)$ | |
| T(n) = 2T(n/2) + n^2 | $\Theta(n^2)$ | |
| T(n) = 3T(n/2) + n^2 | $\Theta(n^2)$ | |
| T(n) = 3T(n/2) + n | $\Theta(n^{\log_2 3})$ | runtime of Karatsuba's integer multiplication algorithm; HINT: Case 1 |
| T(n) = 7T(n/2) + n^2 | $\Theta(n^{\log_2 7})$ | runtime of Strassen's matrix multiplication algorithm |
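Every recurrence in this problem has the form $T(n) = aT(n/b) + \Theta(n^d)$, so the master theorem reduces to comparing $\log_b a$ with $d$. The sketch below automates that comparison as a way to double check hand-worked answers. The function name `master_theorem` and its string output format are hypothetical, and it only handles polynomial driving terms $n^d$ (for which the regularity condition of Case 3 holds automatically).

```python
import math


def master_theorem(a, b, d):
    """Solve T(n) = a*T(n/b) + Theta(n**d), assuming a >= 1, b > 1, d >= 0.

    Hypothetical helper for checking answers; returns a description string.
    """
    critical = math.log(a, b)               # the critical exponent log_b(a)
    if math.isclose(critical, d):           # Case 2: every level costs the same
        return f"Theta(n^{d:g} log n)" if d else "Theta(log n)"
    if critical > d:                        # Case 1: the leaves dominate
        return f"Theta(n^{critical:.3f})"
    return f"Theta(n^{d:g})"                # Case 3: the root dominates


print(master_theorem(1, 3, 0))  # trinary search
print(master_theorem(2, 2, 1))  # merge sort
print(master_theorem(3, 2, 1))  # Karatsuba
print(master_theorem(7, 2, 2))  # Strassen
```

The exponents printed for Case 1 are decimal approximations, for example $\log_2 3 \approx 1.585$ and $\log_2 7 \approx 2.807$; in written answers the exact form $n^{\log_b a}$ is preferable.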

Upload your changes to github (and not gitlab) by using the following steps.
1. Create a new github repo. Ensure that you do not add any default files/branches to this repo, and that it is created empty.
2. Add this newly created github repo as a remote by running
```
$ git remote add github $url
```
  where $url is the url of your new repo.
3. Add and commit your changes like normal. Then run
```
$ git push github master
```
Notice that there is no problem working with both github and gitlab on a single repository. Major open source projects are regularly hosted on many providers at the same time. For example, the linux kernel is mirrored on gitlab here and on github here, but they don't actually use either website as their primary development platform. Instead, this is handled by a custom web frontend at https://kernel.org.

Submission

Upload the url of your github repo to sakai.
