Skip to content

Commit

Permalink
Deploy latest docs: 527f137
Browse files Browse the repository at this point in the history
  • Loading branch information
leofang committed Jan 2, 2025
1 parent 11d8531 commit b01fb60
Show file tree
Hide file tree
Showing 4 changed files with 25 additions and 25 deletions.
50 changes: 25 additions & 25 deletions docs/cuda-bindings/jupyter_execute/overview.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
"cells": [
{
"cell_type": "markdown",
"id": "89cc298e",
"id": "21df139c",
"metadata": {},
"source": [
"# Overview\n",
Expand Down Expand Up @@ -50,7 +50,7 @@
{
"cell_type": "code",
"execution_count": 1,
"id": "fbbf48f8",
"id": "cbc9440b",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -60,7 +60,7 @@
},
{
"cell_type": "markdown",
"id": "e7856b1c",
"id": "1252e44f",
"metadata": {},
"source": [
"Error checking is a fundamental best practice in code development and a code\n",
Expand All @@ -72,7 +72,7 @@
{
"cell_type": "code",
"execution_count": 2,
"id": "a15ca753",
"id": "f2567e98",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -98,7 +98,7 @@
},
{
"cell_type": "markdown",
"id": "2f6edb25",
"id": "ddb35620",
"metadata": {},
"source": [
"It’s common practice to write CUDA kernels near the top of a translation unit,\n",
Expand All @@ -112,7 +112,7 @@
{
"cell_type": "code",
"execution_count": 3,
"id": "ad3b35ea",
"id": "f7f41840",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -130,7 +130,7 @@
},
{
"cell_type": "markdown",
"id": "3b497b44",
"id": "902254ea",
"metadata": {},
"source": [
"Go ahead and compile the kernel into PTX. Remember that this is executed at runtime using NVRTC. There are three basic steps to NVRTC:\n",
Expand All @@ -147,7 +147,7 @@
{
"cell_type": "code",
"execution_count": 4,
"id": "183f49bc",
"id": "1bea9665",
"metadata": {},
"outputs": [],
"source": [
Expand Down Expand Up @@ -177,7 +177,7 @@
},
{
"cell_type": "markdown",
"id": "0981c1a8",
"id": "82b2efa5",
"metadata": {},
"source": [
"Before you can use the PTX or do any work on the GPU, you must create a CUDA\n",
Expand All @@ -189,7 +189,7 @@
{
"cell_type": "code",
"execution_count": 5,
"id": "0fb562ab",
"id": "1f5e8267",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -199,7 +199,7 @@
},
{
"cell_type": "markdown",
"id": "d8331acd",
"id": "f08fe571",
"metadata": {},
"source": [
"With a CUDA context created on device 0, load the PTX generated earlier into a\n",
Expand All @@ -211,7 +211,7 @@
{
"cell_type": "code",
"execution_count": 6,
"id": "fb3af604",
"id": "9e8bb8c6",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -224,7 +224,7 @@
},
{
"cell_type": "markdown",
"id": "cfda3062",
"id": "8e565ad1",
"metadata": {},
"source": [
"Next, get all your data prepared and transferred to the GPU. For increased\n",
Expand All @@ -236,7 +236,7 @@
{
"cell_type": "code",
"execution_count": 7,
"id": "a7678a2f",
"id": "d94566ab",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -254,7 +254,7 @@
},
{
"cell_type": "markdown",
"id": "0a7b45b0",
"id": "0e6a048f",
"metadata": {},
"source": [
"With the input data `a`, `x`, and `y` created for the SAXPY transform device,\n",
Expand All @@ -271,7 +271,7 @@
{
"cell_type": "code",
"execution_count": 8,
"id": "d459bd0b",
"id": "0da58965",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -291,7 +291,7 @@
},
{
"cell_type": "markdown",
"id": "39ccaa76",
"id": "a19277c7",
"metadata": {},
"source": [
"With data prep and resources allocation finished, the kernel is ready to be\n",
Expand All @@ -308,7 +308,7 @@
{
"cell_type": "code",
"execution_count": 9,
"id": "d50cc757",
"id": "7cee3931",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -324,7 +324,7 @@
},
{
"cell_type": "markdown",
"id": "9137714f",
"id": "5038c1a3",
"metadata": {},
"source": [
"Now the kernel can be launched:"
Expand All @@ -333,7 +333,7 @@
{
"cell_type": "code",
"execution_count": 10,
"id": "2963c1cd",
"id": "b51996bb",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -359,7 +359,7 @@
},
{
"cell_type": "markdown",
"id": "4bc876e6",
"id": "9ee0c808",
"metadata": {},
"source": [
"The `cuLaunchKernel` function takes the compiled module kernel and execution\n",
Expand All @@ -374,7 +374,7 @@
{
"cell_type": "code",
"execution_count": 11,
"id": "ac71e923",
"id": "85614dc2",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -386,7 +386,7 @@
},
{
"cell_type": "markdown",
"id": "ea813e42",
"id": "427e1d0d",
"metadata": {},
"source": [
"Perform verification of the data to ensure correctness and finish the code with\n",
Expand All @@ -396,7 +396,7 @@
{
"cell_type": "code",
"execution_count": 12,
"id": "c29ce423",
"id": "f71ec922",
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -410,7 +410,7 @@
},
{
"cell_type": "markdown",
"id": "a4a86e92",
"id": "f46aa461",
"metadata": {},
"source": [
"## Performance\n",
Expand Down
Binary file modified docs/cuda-bindings/latest/.doctrees/environment.pickle
Binary file not shown.
Binary file modified docs/cuda-core/latest/.doctrees/environment.pickle
Binary file not shown.
Binary file modified docs/latest/.doctrees/environment.pickle
Binary file not shown.

0 comments on commit b01fb60

Please sign in to comment.