Skip to content

Adding CUDA graph capture feature for supervised learning.#98

Open
romerojosh wants to merge 13 commits intomasterfrom
cuda_graphs_2
Open

Adding CUDA graph capture feature for supervised learning.#98
romerojosh wants to merge 13 commits intomasterfrom
cuda_graphs_2

Conversation

@romerojosh
Copy link
Collaborator

This PR adds CUDA graphs support for supervised learning problems. The feature is enabled via a new general configuration entry: enable_cuda_graphs, see updated documentation.

Since we are targeting high-performance use cases, this functionality is made to be fairly minimal in terms of features. In particular, we do not maintain internal static entry points to the captured graphs, allow graph recapture for dynamic shapes, etc. Instead, we expect users to provide consistent input data (memory locations, shapes) to be compatible with the CUDA graphs operating model.

Marking this as a draft for now as I still need to implement some tests.

@romerojosh
Copy link
Collaborator Author

/build_and_test

@github-actions
Copy link

github-actions bot commented Dec 3, 2025

🚀 Build workflow triggered! View run

@github-actions
Copy link

github-actions bot commented Dec 4, 2025

✅ Build workflow passed! View run

Copy link
Collaborator

@azrael417 azrael417 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good, thanks a lot. much cleaner but I have a few comments still.

Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Josh Romero <joshr@nvidia.com>
@romerojosh romerojosh marked this pull request as ready for review February 6, 2026 00:11
Signed-off-by: Josh Romero <joshr@nvidia.com>
@romerojosh
Copy link
Collaborator Author

/build_and_test

@github-actions
Copy link

github-actions bot commented Feb 6, 2026

🚀 Build workflow triggered! View run

@github-actions
Copy link

github-actions bot commented Feb 6, 2026

✅ Build workflow passed! View run

Signed-off-by: Josh Romero <joshr@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants