Updates on How To Characterize Systems using GEOPM #3183

Open
lhlawson wants to merge 25 commits into dev from public-lhlawson-system-characterization-validation

Conversation

@lhlawson lhlawson commented Oct 5, 2023

This PR covers the validation and documentation of system characterization using GEOPM.

@lhlawson lhlawson force-pushed the public-lhlawson-system-characterization-validation branch 2 times, most recently from b25c601 to aa5de33 on October 6, 2023 00:16
@lhlawson lhlawson mentioned this pull request Oct 5, 2023
15 tasks
integration/test/test_gpu_activity_agent.py (outdated, resolved)
integration/test/test_gpu_activity_agent.py (outdated, resolved)
experiment_cli_args=experiment_cli_args)

base_dir = 'test_gpu_activity_output'
cls._gpu_ca_characterization = GPUCACharacterization(base_dir=base_dir)

Add comment explaining what's being done here.

Comment on lines 68 to 81
node_count=node_count,
parres_cores_per_node=None,
parres_gpus_per_node=None,
parres_cores_per_rank=1,
parres_init_setup=None,
parres_exp_setup=None,
parres_teardown=None,
parres_args=None,
trial_count=1,
cool_off_time=3,
enable_traces=False,
enable_profile_traces=False,
phi_list=None,

This should not have been removed.

lhlawson and others added 14 commits November 17, 2023 08:45
Signed-off-by: Lowren Lawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Add comments to InitControl file describing each setting.

Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Co-authored-by: Brad Geltz <brgeltz@gmail.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: Brad Geltz <brad.geltz@intel.com>
Signed-off-by: Brad Geltz <brad.geltz@intel.com>

WIP Use geopm-ctl-local with test_gpu_activity

Signed-off-by: Brad Geltz <brad.geltz@intel.com>

Fix bad merge

Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lowren.h.lawson@intel.com <lowren.h.lawson@intel.com>
Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Also add minor improvements to CPUCharacterization to facilitate debug.

Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
@avilcheslopez avilcheslopez force-pushed the public-lhlawson-system-characterization-validation branch from e95b72c to 61a0386 on November 17, 2023 16:45
lhlawson and others added 8 commits November 17, 2023 15:37
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
- Enable process mode and affinity for AIB and minife
- Don't LD_PRELOAD libgeopm
- Adjust problem size for CPU-CA characterization sweeps
- Misc. clean-up

Signed-off-by: Alejandro Vilches <alejandro.vilches@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Signed-off-by: lhlawson <lowren.h.lawson@intel.com>
Development

Successfully merging this pull request may close these issues.

System Characterization Tests
3 participants