1.4.1 doc gen
Former-commit-id: 7daf6e8583ca79b64306c09a44c0fe649dad3bad
dumerrill committed Apr 13, 2015
1 parent cc2e231 commit ebb50a6
Showing 342 changed files with 2,547 additions and 2,480 deletions.
14 changes: 14 additions & 0 deletions CHANGE_LOG.TXT
@@ -1,5 +1,19 @@
//-----------------------------------------------------------------------------

1.4.1 04/13/2015
- Bug fixes:
- Fixes for CUDA 7.0 issues with SHFL-based warp-scan and warp-reduction
on non-primitive data types (e.g., user-defined structs)
- Fixes for minor CUDA 7.0 performance regressions in cub::DeviceScan,
DeviceReduceByKey
- Fixes to allow cub::DeviceRadixSort and cub::BlockRadixSort on bool types
- Remove requirement for callers to define the CUB_CDP macro
when invoking CUB device-wide routines using CUDA dynamic parallelism
- Fix for headers not being included in the proper order (or missing includes)
for some block-wide functions

//-----------------------------------------------------------------------------

1.4.0 03/18/2015
- New Features:
- Support and performance tuning for new Maxwell GPU architectures
12 changes: 1 addition & 11 deletions README.md
@@ -1,7 +1,7 @@
<hr>
<h3>About CUB</h3>

-Current release: v1.4.0 (03/18/2015)
+Current release: v1.4.1 (04/13/2015)

We recommend the [CUB Project Website](http://nvlabs.github.com/cub) and the [cub-users discussion forum](http://groups.google.com/group/cub-users) for further information and examples.

@@ -89,16 +89,6 @@ feature set. At the moment, we do not publicly provide non-stable releases
such as development snapshots, beta releases or rolling releases. (Feel free
to contact us if you would like such things.) See the
[CUB Project Website](http://nvlabs.github.com/cub) for more information.
| Date | Version |
| ---- | ------- |
| 03/18/2015 | [CUB v1.4.0](https://github.com/NVlabs/cub/archive/1.4.0.zip) |
| 05/23/2014 | [CUB v1.3.2](https://github.com/NVlabs/cub/archive/1.3.2.zip) |
| 04/01/2014 | [CUB v1.2.3](https://github.com/NVlabs/cub/archive/1.2.3.zip) |
| 12/10/2013 | [CUB v1.1.1](https://github.com/NVlabs/cub/archive/1.1.1.zip) |
| 08/08/2013 | [CUB v1.0.1](https://github.com/NVlabs/cub/archive/1.0.1.zip) |
| 03/07/2013 | [CUB v0.9.0](https://github.com/NVlabs/cub/archive/0.9.zip) |
<br><hr>
<h3>Contributors</h3>
4 changes: 2 additions & 2 deletions cub/device/device_spmv.cuh
@@ -94,8 +94,8 @@ struct DeviceSpmv
* float beta = 0.0;
*
* float* d_values; // e.g., [1, 1, 1, 1, 1, 1, 1, 1,
* // 1, 1, 1, 1, 1, 1, 1, 1,
* // 1, 1, 1, 1, 1, 1, 1, 1]
* // 1, 1, 1, 1, 1, 1, 1, 1,
* // 1, 1, 1, 1, 1, 1, 1, 1]
*
* int* d_column_indices; // e.g., [1, 3, 0, 2, 4, 1, 5, 0,
* // 4, 6, 1, 3, 5, 7, 2, 4,
6 changes: 3 additions & 3 deletions docs/download_cub.html
@@ -37,14 +37,14 @@
</head>

<body
-onload="downloadURL('https://github.com/NVlabs/cub/archive/1.4.0.zip');"
+onload="downloadURL('https://github.com/NVlabs/cub/archive/1.4.1.zip');"
style="color: rgb(102, 102, 102); font-family: Helvetica, arial, freesans, clean, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: 300; height: 18px;">

<center>
If your download doesn't start in 3s:
<br><br>
-<a href="https://github.com/NVlabs/cub/archive/1.4.0.zip"><img src="download-icon.png" style="position:relative; bottom:-10px; border:0px;"/></a>
-<a href="https://github.com/NVlabs/cub/archive/1.4.0.zip"><em>Download CUB!</em></a>
+<a href="https://github.com/NVlabs/cub/archive/1.4.1.zip"><img src="download-icon.png" style="position:relative; bottom:-10px; border:0px;"/></a>
+<a href="https://github.com/NVlabs/cub/archive/1.4.1.zip"><em>Download CUB!</em></a>
</center>

</body>
16 changes: 15 additions & 1 deletion docs/html/CHANGE_LOG.TXT
@@ -1,5 +1,19 @@
//-----------------------------------------------------------------------------

1.4.1 04/13/2015
- Bug fixes:
- Fixes for CUDA 7.0 issues with SHFL-based warp-scan and warp-reduction
on non-primitive data types (e.g., user-defined structs)
- Fixes for minor CUDA 7.0 performance regressions in cub::DeviceScan,
DeviceReduceByKey
- Fixes to allow cub::DeviceRadixSort and cub::BlockRadixSort on bool types
- Remove requirement for callers to define the CUB_CDP macro
when invoking CUB device-wide routines using CUDA dynamic parallelism
- Fix for headers not being included in the proper order (or missing includes)
for some block-wide functions

//-----------------------------------------------------------------------------

1.4.0 03/18/2015
- New Features:
- Support and performance tuning for new Maxwell GPU architectures
@@ -118,7 +132,7 @@
- Added min, max, arg-min, and arg-max to DeviceReduce
- Added DeviceSelect (select-unique, select-if, and select-flagged)
- Added DevicePartition (partition-if, partition-flagged)
-- Added generic cub::ShuffleUp(), cub::ShuffleDown(), and cub::ShuffleBroadcast() for warp-wide communication of arbitrary data types (SM3x+)
+- Added generic cub::ShuffleUp(), cub::ShuffleDown(), and cub::ShuffleIndex() for warp-wide communication of arbitrary data types (SM3x+)
- Added cub::MaxSmOccupancy() for accurately determining SM occupancy for any given kernel function pointer
- Performance
- Improved DeviceScan and DeviceRadixSort performance for older architectures (SM10-SM30)