{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":46153892,"defaultBranch":"master","name":"nccl","ownerLogin":"NVIDIA","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2015-11-14T00:12:04.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/1728152?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1713913650.0","currentOid":""},"activityList":{"items":[{"before":null,"after":"9e76527612e5b86d481740ab174f4635bb5190e8","ref":"refs/heads/ib-router","pushedAt":"2024-04-23T23:07:30.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"spotluri","name":null,"path":"/spotluri","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/4461171?s=80&v=4"},"commit":{"message":"adding support for IB router\n\nuse flid instead of lid if subnets do not match\nwarn when flidq is 0","shortMessageHtmlLink":"adding support for IB router"}},{"before":"e0c0a5897edaa6db5d6c4ea5ad0ac8ca329b574f","after":"ee3d92b8063059813636045697b2704d0b9d4e49","ref":"refs/heads/fix-ack-loss","pushedAt":"2024-04-15T02:46:33.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"KaimingOuyang","name":"Kaiming Ouyang","path":"/KaimingOuyang","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/17588743?s=80&v=4"},"commit":{"message":"Fix peristent mode comm->channels[c].workFifoSent counter\n\nwe cannot increase comm->channels[c].workFifoSent when persistent\nmode is on. Otherwise, comm->channels[c].workFifoSent can mismatch\nack","shortMessageHtmlLink":"Fix peristent mode comm->channels[c].workFifoSent counter"}},{"before":null,"after":"e0c0a5897edaa6db5d6c4ea5ad0ac8ca329b574f","ref":"refs/heads/fix-ack-loss","pushedAt":"2024-04-14T15:28:40.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"KaimingOuyang","name":"Kaiming Ouyang","path":"/KaimingOuyang","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/17588743?s=80&v=4"},"commit":{"message":"Mark fifo ack as volatile","shortMessageHtmlLink":"Mark fifo ack as volatile"}},{"before":"6dd51f15bf57a54d07f5f1282d9133fc6dbd3fed","after":"ab2b89c4c339bd7f816fbc114a4b05d386b66290","ref":"refs/heads/master","pushedAt":"2024-04-04T15:49:36.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"2.21.5-1\n\nAdd support for IB SHARP 1PPN operation with user buffers.\nImprove support for MNNVL, add NVLS support and multi-clique support.\n * Detect the NVLS clique through NVML\n * Exchange XML between peers in the same NVLS clique and fuse XMLs\n before creating the topology graph.\n * Rework bootstrap allgather algorithms to allow for large allgather\n operations intra-node (XML exchange).\nNet/IB: add support for dynamic GID detection.\n * Automatically select RoCEv2/IPv4 interface by default. Allow to\n select IPv6 or even the network/mask.\nReduce NVLS memory usage.\n * Add stepSize as property of a connection to allow for different\n sizes on different peers; set it to 128K for NVLink SHARP.\nImprove tuner loading\n * Look for more paths, be more consistent with the network device\n plugin.\n * Also search for tuner support inside the net plugin.\nImprove tuner API\n * Add context to support multi-device per process.\nAdd magic number around comm object to detect comm corruption.\n * Add some basic check around communicators so that we can report a\n problem when a communicator gets corrupted or a wrong comm pointer\n is passed to NCCL.\nFix net/IB error path. Github PR #1164\nFix collnet rail mapping with split comm.\nFix packet reordering issue causing bootstrap mismatch\n * Use a different tag in ncclTransportP2pSetup for the connectInfo\n exchange and the following barrier.\nFix hang when crossNic is inconsistent between ranks.\nFix minCompCap/maxCompCap computation. Github issue #1184","shortMessageHtmlLink":"2.21.5-1"}},{"before":"48bb7fec7953112ff37499a272317f6663f8f600","after":"6dd51f15bf57a54d07f5f1282d9133fc6dbd3fed","ref":"refs/heads/master","pushedAt":"2024-03-18T17:12:26.000Z","pushType":"pr_merge","commitsCount":2,"pusher":{"login":"jbachan","name":null,"path":"/jbachan","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2293859?s=80&v=4"},"commit":{"message":"Merge pull request #1217 from crazy-JiangDongHua/bugfix_undo_plan\n\nBug in plan enqueue logic where plans could be silently not launched for some communicators. Triggered when both are true:\r\n1. Multiple communicators per ncclGroup.\r\n2. Communicators within a group have different plan counts.\r\n2. Intra-process launch barrier disabled.","shortMessageHtmlLink":"Merge pull request #1217 from crazy-JiangDongHua/bugfix_undo_plan"}},{"before":"b6475625fbcaa2c3c0e50eed2fa1255d7514d4a2","after":"48bb7fec7953112ff37499a272317f6663f8f600","ref":"refs/heads/master","pushedAt":"2024-03-05T13:03:44.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"2.20.5-1\n\nFix UDS connection failure when using ncclCommSplit. Issue #1185","shortMessageHtmlLink":"2.20.5-1"}},{"before":null,"after":"fba92421939a343cb39c6c485eb1044b0a691800","ref":"refs/heads/gid_dynamic_selection","pushedAt":"2024-02-15T15:49:12.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"Net/IB: dynamically detect the IB port GID\n\nInstead of using a default GID index of zero, in case the user has not\nset the NCCL_IB_GID_INDEX variable, detect the first available GID.\nGID selection in NCCL can be guided by the user through three variables:\n\n - NCCL_IB_ADDR_FAMILY : AF_INET (default) or AF_INET6\n - NCCL_IB_ADDR_RANGE : ipv{4,6}_prefix/prefixlen, default ::/0\n - NCCL_IB_ROCE_VERSION_NUM : 1 or 2 (default), for RoCE v1 or RoCE v2\n\nAdditional changes:\n - Pass GID index to ncclIbRtrQp as argument","shortMessageHtmlLink":"Net/IB: dynamically detect the IB port GID"}},{"before":"b6d7438d3145a619f924dbbca6c96db21fab716e","after":"b6475625fbcaa2c3c0e50eed2fa1255d7514d4a2","ref":"refs/heads/master","pushedAt":"2024-02-13T12:24:35.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"2.20.3-1\n\nAdd support for alternating rings, allow for cross-nic rings without\ncross-rail communication.\nAdd support for user buffer registration for network send/recv.\nOptimize aggregated operations to better utilize all channels.\nAdd flattening for BCM PCI gen5 switches.\nAdd support for inter-node NVLink communication\nAdd support for port fusion in NET/IB.\nAdd support for ReduceScatter and AllGather using Collnet.\nUpdate net API to v8.\nFix hang during A2A connection.","shortMessageHtmlLink":"2.20.3-1"}},{"before":"fcf0b4115685f65064d9ede916e0de419664a07c","after":"e5b9be991500707e3a3a3b0ec92310aa0010be11","ref":"refs/heads/net_device_plugin_unpack_nchannels","pushedAt":"2024-02-09T14:30:02.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Moved back to 2+ nodes","shortMessageHtmlLink":"Moved back to 2+ nodes"}},{"before":null,"after":"fcf0b4115685f65064d9ede916e0de419664a07c","ref":"refs/heads/net_device_plugin_unpack_nchannels","pushedAt":"2024-02-07T22:19:05.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Variable name","shortMessageHtmlLink":"Variable name"}},{"before":"6ede65fbf4274a28b1c228d6d74972479ab430f5","after":"5d0dffa97e38cfec49b12546112601b79060aa8e","ref":"refs/heads/v2.19-bcmgen5flatten","pushedAt":"2024-01-31T09:20:08.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"Add suffix to version","shortMessageHtmlLink":"Add suffix to version"}},{"before":null,"after":"6ede65fbf4274a28b1c228d6d74972479ab430f5","ref":"refs/heads/v2.19-bcmgen5flatten","pushedAt":"2024-01-31T09:19:19.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"Flatten BCM Gen5 switches\n\nDetect when we have a BCM PCI Gen5 switch reporting a two-level\nhierarchy with bogus speeds, and flatten it in the topology graph.\nThat will ensure the graph search will see the PCI switch as it is\nin reality (bandwidth-wise) and find the best solution.","shortMessageHtmlLink":"Flatten BCM Gen5 switches"}},{"before":"f9e6a64d7e5a128cd53febd0c7e9f73827af1472","after":"74212aac36f8da461aca23aac1c216880b1f9de5","ref":"refs/heads/github-abort-hang-fix","pushedAt":"2024-01-16T21:51:34.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"KaimingOuyang","name":"Kaiming Ouyang","path":"/KaimingOuyang","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/17588743?s=80&v=4"},"commit":{"message":"Move abort/destroy INFO to commFree","shortMessageHtmlLink":"Move abort/destroy INFO to commFree"}},{"before":"cc621941f2610cc65258237d7d6ba543e71420fa","after":"53396471d27aa6be49b0f33ee30030dcb8dec390","ref":"refs/heads/net_device_plugin_unpack_github","pushedAt":"2023-11-20T22:44:24.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Affinity logging for pre-set, post-set, and progress init","shortMessageHtmlLink":"Affinity logging for pre-set, post-set, and progress init"}},{"before":"8b6b65078e2eb00e9ca555a97ad641a16b60ba83","after":"cc621941f2610cc65258237d7d6ba543e71420fa","ref":"refs/heads/net_device_plugin_unpack_github","pushedAt":"2023-11-20T21:38:07.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Added logging to proxyProgress init for affinity / gpu matching","shortMessageHtmlLink":"Added logging to proxyProgress init for affinity / gpu matching"}},{"before":null,"after":"f9e6a64d7e5a128cd53febd0c7e9f73827af1472","ref":"refs/heads/github-abort-hang-fix","pushedAt":"2023-11-20T21:35:40.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"KaimingOuyang","name":"Kaiming Ouyang","path":"/KaimingOuyang","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/17588743?s=80&v=4"},"commit":{"message":"Fix abort hang issues\n\nRevert proxy detach design\nSupport cuMem* based cudaHostAlloc memory\nAdd heartbeat to proxy thread","shortMessageHtmlLink":"Fix abort hang issues"}},{"before":"f9e6a64d7e5a128cd53febd0c7e9f73827af1472","after":null,"ref":"refs/heads/github-abort-meta","pushedAt":"2023-11-20T21:35:40.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"KaimingOuyang","name":"Kaiming Ouyang","path":"/KaimingOuyang","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/17588743?s=80&v=4"}},{"before":"16b5be19f6eb82ae0b70bef2f602ac30ab588fdb","after":"b6d7438d3145a619f924dbbca6c96db21fab716e","ref":"refs/heads/master","pushedAt":"2023-11-20T13:07:51.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"sjeaugey","name":"Sylvain Jeaugey","path":"/sjeaugey","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/12857445?s=80&v=4"},"commit":{"message":"Merge remote-tracking branch 'origin/master'","shortMessageHtmlLink":"Merge remote-tracking branch 'origin/master'"}},{"before":"0e35f5d3902f4e30c4c97b9700a6a64b25ba7f7f","after":"16b5be19f6eb82ae0b70bef2f602ac30ab588fdb","ref":"refs/heads/master","pushedAt":"2023-11-18T19:05:42.000Z","pushType":"pr_merge","commitsCount":2,"pusher":{"login":"AddyLaddy","name":"David Addison","path":"/AddyLaddy","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2538330?s=80&v=4"},"commit":{"message":"Merge pull request #1070 from Flamefire/fix-cpuid2\n\nFix use of CPUID overwriting registers in use","shortMessageHtmlLink":"Merge pull request #1070 from Flamefire/fix-cpuid2"}},{"before":null,"after":"2931cc2bc5bff5d170ab883f7222bcef50c99b14","ref":"refs/heads/net_device_plugin_no_unpack_github","pushedAt":"2023-11-16T23:01:16.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Gating unpack","shortMessageHtmlLink":"Gating unpack"}},{"before":"1ac4f59941012790fa1be5fff4e135c2b8ac96d5","after":"8b6b65078e2eb00e9ca555a97ad641a16b60ba83","ref":"refs/heads/net_device_plugin_unpack_github","pushedAt":"2023-11-09T23:40:42.000Z","pushType":"push","commitsCount":3,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"nconnections logging","shortMessageHtmlLink":"nconnections logging"}},{"before":"406a1eb700948969c3c72923fffd368326ae1342","after":"c944b2eb2739d51ed117a94247de328f41ff6eda","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-11-08T20:14:20.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Per-proxyArg trace","shortMessageHtmlLink":"Per-proxyArg trace"}},{"before":null,"after":"f9e6a64d7e5a128cd53febd0c7e9f73827af1472","ref":"refs/heads/github-abort-meta","pushedAt":"2023-11-06T17:22:19.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"KaimingOuyang","name":"Kaiming Ouyang","path":"/KaimingOuyang","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/17588743?s=80&v=4"},"commit":{"message":"Fix abort hang issues\n\nRevert proxy detach design\nSupport cuMem* based cudaHostAlloc memory\nAdd heartbeat to proxy thread","shortMessageHtmlLink":"Fix abort hang issues"}},{"before":"383245a9d3051ae63a96e2214a1b6d590774f233","after":"406a1eb700948969c3c72923fffd368326ae1342","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-11-02T21:48:08.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Included category in initNvtx, tracing for debug","shortMessageHtmlLink":"Included category in initNvtx, tracing for debug"}},{"before":"18b5a7bd38363e253fe620643df8febf852ae6a0","after":"383245a9d3051ae63a96e2214a1b6d590774f233","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-11-01T18:38:21.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Trying to force category","shortMessageHtmlLink":"Trying to force category"}},{"before":"e284c1aae1985c6c9f5f2fd7d820a9dd67c3be2a","after":"18b5a7bd38363e253fe620643df8febf852ae6a0","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-10-31T18:41:47.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Accidentally started two ranges at once","shortMessageHtmlLink":"Accidentally started two ranges at once"}},{"before":"7f1defb85870f795e68a9fe84cdbc51c0ebf8859","after":"e284c1aae1985c6c9f5f2fd7d820a9dd67c3be2a","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-10-26T20:02:52.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Added opCount to collective NVTX traces","shortMessageHtmlLink":"Added opCount to collective NVTX traces"}},{"before":null,"after":"7f1defb85870f795e68a9fe84cdbc51c0ebf8859","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-10-26T19:49:48.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Updated version","shortMessageHtmlLink":"Updated version"}},{"before":"1ac4f59941012790fa1be5fff4e135c2b8ac96d5","after":null,"ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-10-26T19:48:57.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"}},{"before":null,"after":"1ac4f59941012790fa1be5fff4e135c2b8ac96d5","ref":"refs/heads/net_device_plugin_unpack_github_nvtx","pushedAt":"2023-10-26T19:43:46.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"wenbilliams","name":"Ben Williams","path":"/wenbilliams","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/88295938?s=80&v=4"},"commit":{"message":"Fix cudaMemcpyAsync bug\n\nWe are trying to use the copy result of first cudaMemcpyAsync in the\nsecond cudaMemcpyAsync without sync in between. This patch fixes it\nby allocating a CPU side array to cache device side addr so that we\ncan avoid this consecutive cuda mem copy.\n\nFixes #957","shortMessageHtmlLink":"Fix cudaMemcpyAsync bug"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEOJIK9AA","startCursor":null,"endCursor":null}},"title":"Activity ยท NVIDIA/nccl"}