[BUG] Distributed Training With (NVTabular + Pytorch DDP), I got this error: RuntimeError: parallel_for: failed to synchronize: cudaErrorIllegalAddress: an illegal memory access was encountered #236

Workflow file for this run

name: triage_issues
on:
  issues:
    types: [opened, reopened]
jobs:
  triage_issue:
    uses: nvidia-merlin/.github/.github/workflows/triage.yaml@main
    secrets:
      TRIAGE_APP_ID: ${{ secrets.TRIAGE_APP_ID }}
      TRIAGE_APP_PEM: ${{ secrets.TRIAGE_APP_PEM }}