Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

coll/tuned: Extend the collective tuning file to be topology-aware #12321

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Commits on Feb 9, 2024

  1. coll/tuned: Extend the collective tuning file to be topology-aware

    TUNED collectives selection should account for communicator topology
    like HAN. The communicator size and message based algorithm selection logic
    is no longer sufficient to achieve optimal performance when HAN is used.
    The best algorithm differs between inter-node and intra-node for the same
    communicator size and message size based on the tuning results.
    
    This commit introduces topology dimension in both TUNED collective tuning
    file rule and the algorithm selection logic. The topological level can
    be intra-node, internode, or default(mixed).
    
    Specify @inter_node or @intra_node after the message size in the
    dynamic file rules. This is an optional feature so it will not break the
    old file format. See the file example in coll_tuned_dynamic_file.h
    
    Signed-off-by: Jessie Yang <jiaxiyan@amazon.com>
    jiaxiyan committed Feb 9, 2024
    Configuration menu
    Copy the full SHA
    4c6b561 View commit details
    Browse the repository at this point in the history