ElfProbTET

Probabilistic Treatment of Execution Times

This is the main repository of my computer science course conclusion project 1.

Directory Structure

codes-R/ - Code used to perform statistical inference (in R) and analysis (in Python).
dijkstra/ - Code used to collect execution times of the Dijkstra algorithm.
experiments/- The execution times collected. Each file contains multiple (3 or 4) sets of 1000 execution times.
mandelbrot/ - Code for collecting execution times of the Mandelbrot algorithm.
min-estimators/ - [unused now] I did initial experiments on min-estimators here.
optimization-packages/ - I tested some R optimization packages here.
sqldb\_manipulation/ - Code for collecting execution times of the database manipulation algorithm.

Objectives

The objective of this project focuses on the scheduling problem, but in a way that may also contribute to the two other areas mentioned in the previous section. More specifically, focus on the problem of DAG scheduling[1], in which tasks are organized as a dependency graph and the objective is to find the best order for their execution on the available computational resources, aiming to achieve the lowest overall execution time (also called makespan). The problem is made more complicated because, as often happens, the DAG being scheduled will be executed alongside multiple other DAGs (varying with time), and these will make use of the same available resources.

The scheduling problem has often been approached in a deterministic way, using the average execution time of tasks to perform the scheduling. Some probabilistic models have also been proposed for the problem (in this case it is called stochastic scheduling), mostly under the assumption that the probability distribution of tasks is known, such as done by Li and Antonio[2] and Zheng and Sakellariou[1]. In both of these papers, in order to validate their results, the authors performed simulations where the the distributions of tasks are somewhat arbitrarily chosen to be either normal or uniform. In this project, we would like to further investigate whether these are reasonable choices of distributions for execution times. Under this light, we propose the following:

Hypothesis 1. Given a computer architecture, it is possible to determine a minimal probability model that can be fit to the execution time of any program for a fixed input.

In order to investigate this hypothesis, a set of programs will be implemented and will undergo experiments to generate samples of execution times. Based on these samples, suitable probability models will be determined, and inference (conventional and/or bayesian) will be performed to find the parameters of these distributions for each sample of execution times.

If Hypothesis 1 is true, simulations will be performed to validate or reproduce the results given in existing studies regarding stochastic scheduling. If it is not true, the reasons thereof will be investigated, and the hypothesis will be narrowed down to more specific classes of programs and machine circumstances, which will undergo the same procedure exposed above.

[1] - ZHENG, W.; SAKELLARIOU, R. Stochastic dag scheduling using a monte carlo approach. Journal of Parallel and Distributed Computing, Elsevier, v. 73, n. 12, p. 1673–1689, 2013.
[2] - LI, Y. A.; ANTONIO, J. K. Estimating the execution time distribution for a task graph in a heterogeneous computing system. In: IEEE. Proceedings Sixth Heterogeneous Computing Workshop (HCW’97). [S.l.], 1997. p. 172–184.

Results

Funding & Support

Computational resources from CeMEAI (FAPESP grant 2013/07375-0).

Name		Name	Last commit message	Last commit date
Latest commit History 181 Commits
codes-R		codes-R
dijkstra		dijkstra
experiments		experiments
gengamma		gengamma
mandelbrot		mandelbrot
min-estimators		min-estimators
optimization-packages		optimization-packages
sqldb_manipulation		sqldb_manipulation
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
plot_hist.py		plot_hist.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

codes-R

codes-R

dijkstra

dijkstra

experiments

experiments

gengamma

gengamma

mandelbrot

mandelbrot

min-estimators

min-estimators

optimization-packages

optimization-packages

sqldb_manipulation

sqldb_manipulation

LICENSE

LICENSE

README.md

README.md

TODO.md

TODO.md

plot_hist.py

plot_hist.py

Repository files navigation

ElfProbTET

Directory Structure

Objectives

Results

Funding & Support

About

Releases

Packages

Languages

License

matheushjs/ElfProbTET

Folders and files

Latest commit

History

Repository files navigation

ElfProbTET

Directory Structure

Objectives

Results

Funding & Support

About

Resources

License

Stars

Watchers

Forks

Languages