Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge of 0.15.1dev2 tag to gold/2021 #1471

Merged
merged 100 commits into from
Nov 9, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
100 commits
Select commit Hold shift + click to select a range
b528b10
Use cmake_minimum_required with min...max
oleksandr-pavlyk Sep 27, 2023
a9aae28
Fix upload_linux/upload_windows steps
oleksandr-pavlyk Sep 27, 2023
b4aabe4
Merge pull request #1424 from IntelPython/address-cmake-warnings
oleksandr-pavlyk Sep 28, 2023
caa0939
Merge pull request #1425 from IntelPython/fix-wheel-uploads
oleksandr-pavlyk Sep 28, 2023
71e891c
Check in of generic reduction templates and some reductions (#1399)
ndgrigorian Oct 3, 2023
dfba436
repeat with `axis=None` repeats flattened array (#1427)
ndgrigorian Oct 4, 2023
2c757d4
Do apt-get update before installing gdb
oleksandr-pavlyk Oct 5, 2023
d4cc465
Merge pull request #1429 from IntelPython/fix-gdb-installation
oleksandr-pavlyk Oct 6, 2023
049cd77
Added a comment line
oleksandr-pavlyk Oct 4, 2023
f73a1c4
Implement Python API for ext_intel_device_info descriptors
oleksandr-pavlyk Oct 4, 2023
d11e64e
Add tests for new function in dpctl.utils
oleksandr-pavlyk Oct 4, 2023
ebf118a
Repeat Python bindings properly pass host task dependencies (#1430)
ndgrigorian Oct 6, 2023
0988dd0
Skip dpt.prod testing for complex result dtype on Gen9
oleksandr-pavlyk Oct 6, 2023
79a9d78
Remove _is_gen9, export intel_device_info
oleksandr-pavlyk Oct 6, 2023
662bc45
Removed test for _is_gen9, as it was removed
oleksandr-pavlyk Oct 6, 2023
cd0ec5d
Skip prod tests for complex output types on Gen9
oleksandr-pavlyk Oct 6, 2023
a1b915a
Swap use of deprecated IntelDPCPP with IntelSYCL cmake script
oleksandr-pavlyk Sep 12, 2023
5d5dcc2
Use IntelSYCL instead of IntelDPCPP cmake script
oleksandr-pavlyk Sep 12, 2023
de644b9
libsyclinterface/tests/test_helper.cpp needs SYCL
oleksandr-pavlyk Sep 12, 2023
ce5e7c8
Make sure to not overwrite -fsycl option in select files
oleksandr-pavlyk Sep 12, 2023
5898fdd
IntelSycl_VERSION -> IntelSyclCompiler_VERSION
oleksandr-pavlyk Sep 12, 2023
72e44ba
Added additional test file needed sycl
oleksandr-pavlyk Sep 12, 2023
679fdc9
Try to make link options add by add_sycl_to_target PRIVATE
oleksandr-pavlyk Sep 12, 2023
c6a9305
Do not override -fsycl
oleksandr-pavlyk Sep 13, 2023
5adfe0e
Fixed typo in examples/pybind11/external_usm_allocation/CMakeLists.txt
oleksandr-pavlyk Sep 13, 2023
b60bc90
Removed setting SKBUILD_LINK_LIBRARIES_KEYWORD since we move to cmake…
oleksandr-pavlyk Sep 24, 2023
ad1f120
Use python_add_library with WITH_SOABI
oleksandr-pavlyk Oct 6, 2023
9f078d9
Set no-fast-math for reduction_over_axis too
oleksandr-pavlyk Oct 6, 2023
30b49b1
Channel OVERRIDE_INTEL_IPO env. variable
oleksandr-pavlyk Oct 8, 2023
550b20e
_device_queries extension needs SYCL
oleksandr-pavlyk Oct 9, 2023
6aa8023
Fixes gh-1432 (#1433)
ndgrigorian Oct 9, 2023
08cc479
Merge pull request #1428 from IntelPython/add-intel-device-info-data
oleksandr-pavlyk Oct 9, 2023
39e0700
Merge pull request #1400 from IntelPython/use-IntelSYCL-cmake-script
oleksandr-pavlyk Oct 10, 2023
e885838
Dedicated kernels for in-place ``dpt.divide`` and ``dpt.floor_divide`…
ndgrigorian Oct 11, 2023
4a0c171
SyclEvent._wait static function to use _Wait, not _WaitAndThrow
oleksandr-pavlyk Oct 11, 2023
058107d
SyclContext and SyclQueue creation now use NULL async_handler pointer
oleksandr-pavlyk Oct 11, 2023
20a74fe
Upload wheel to the same version
ZzEeKkAa Oct 11, 2023
526f410
Merge pull request #1435 from ZzEeKkAa/fix/wheel_upload
oleksandr-pavlyk Oct 12, 2023
e1e2dba
Add clean up job
ZzEeKkAa Oct 12, 2023
67c7dbf
Resolves setuptools "Package would be ignored" warnings
oleksandr-pavlyk Oct 13, 2023
6d681db
Merge pull request #1441 from IntelPython/heed-setuptools-warnings
oleksandr-pavlyk Oct 13, 2023
f486053
Merge pull request #1437 from ZzEeKkAa/feature/add_cleanup
oleksandr-pavlyk Oct 13, 2023
af04d34
Merge pull request #1436 from IntelPython/do-not-use-wait-and-throw
oleksandr-pavlyk Oct 14, 2023
c9cc505
Tweaked test_intel_device_info
oleksandr-pavlyk Oct 17, 2023
2d2f235
Elementwise functions cbrt, exp2, copysign, and rsqrt (#1443)
ndgrigorian Oct 17, 2023
be5fb99
Added SyclQueue._submit_keep_args_alive method
oleksandr-pavlyk Sep 7, 2023
5b5363e
Used _submit_keep_args_alive in a test
oleksandr-pavlyk Sep 7, 2023
0a35e9c
Do not use async_dec_ref in submit method
oleksandr-pavlyk Sep 10, 2023
f30a6a2
Add memcpy_async method
oleksandr-pavlyk Sep 10, 2023
e7ee1d9
Introduce dpctl.SyclQueue.submit_async
oleksandr-pavlyk Sep 15, 2023
b6dd4d2
Use pragma once in _host_task_util.hpp
oleksandr-pavlyk Sep 15, 2023
306ff96
Fixed possible memory leak on error
oleksandr-pavlyk Sep 15, 2023
67a325d
Add DPCTLQueue_MemcpyWithEvents
oleksandr-pavlyk Sep 18, 2023
6949e69
Adds dpctl.SyclQueue.memcpy_async
oleksandr-pavlyk Sep 18, 2023
cf9084d
One of the memcpy_async calls must use events
oleksandr-pavlyk Sep 18, 2023
48cb54d
Make SyclTimer accumulative
oleksandr-pavlyk Sep 18, 2023
0a73764
Make cdef function except * to channel Python exceptions
oleksandr-pavlyk Sep 22, 2023
10722d4
SyclTimer.dt return object with named accessors
oleksandr-pavlyk Oct 5, 2023
1d57614
Merge pull request #1445 from IntelPython/fix-intel-device-test
oleksandr-pavlyk Oct 17, 2023
db331d4
Removed superfluous barrier, moved comment, formatted kernel
oleksandr-pavlyk Oct 22, 2023
b437c47
Merge pull request #1450 from IntelPython/remove-barrier-in-accumulat…
oleksandr-pavlyk Oct 23, 2023
4ac53fd
Use partition_type_property descriptor in DPCTLDevice_GetParentDevice
oleksandr-pavlyk Oct 23, 2023
26b9c92
Merge pull request #1395 from IntelPython/async-ref-count-increment
oleksandr-pavlyk Oct 24, 2023
386bd8b
Merge pull request #1452 from IntelPython/improve-exception-parent-de…
oleksandr-pavlyk Oct 24, 2023
442e46f
Implement kernels for in-place ``pow``, ``remainder``, and bitwise op…
ndgrigorian Oct 25, 2023
2eba93e
Implements ``dpctl.tensor.clip`` (#1444)
ndgrigorian Oct 25, 2023
03fd737
Implements ``dpctl.tensor.logsumexp`` and ``dpctl.tensor.reduce_hypot…
ndgrigorian Oct 26, 2023
b3e9465
Implementations of reductions for contigous case must take offsets in…
oleksandr-pavlyk Oct 26, 2023
c63c545
Expand test to cover non-contig. input that can be simplified into one
oleksandr-pavlyk Oct 26, 2023
e92d1f9
Add tests for strided input where contig implementation is applicable
oleksandr-pavlyk Oct 26, 2023
702b707
Added comments to the test file
oleksandr-pavlyk Oct 26, 2023
dcb566a
Corrected logical error in can_use_reduce_over_group trait implementa…
oleksandr-pavlyk Oct 26, 2023
bfba152
The taper optimization in tree-reduction which causes problem with CUDA
oleksandr-pavlyk Oct 26, 2023
d82f3a9
Merge pull request #1458 from IntelPython/fix-reduction-contig_impl-o…
oleksandr-pavlyk Oct 27, 2023
02e7714
Fix ``axis0`` calls in reduction Python binding (#1459)
ndgrigorian Oct 27, 2023
9131925
Improve raise TypeError by providing exception message (#1460)
oleksandr-pavlyk Oct 31, 2023
11ecba8
Fix search reductions giving incorrect results for F-contiguous input…
ndgrigorian Nov 1, 2023
c742e79
Use SequentialReductionKernel for tree-reduction as well
oleksandr-pavlyk Nov 2, 2023
6a0b09c
Apply SequentialReductionKernel to axis0 reduction
oleksandr-pavlyk Nov 2, 2023
f74eae0
Split _tensor_impl into three extensions
oleksandr-pavlyk Nov 2, 2023
421b270
Used new native extension modules
oleksandr-pavlyk Nov 2, 2023
41ec378
Added docstrings and getter methods for ElementwiseFunc classes
oleksandr-pavlyk Nov 3, 2023
645044a
Instantiate atomic reduction templates for min/max ops for double/flo…
oleksandr-pavlyk Nov 3, 2023
097ecf5
Modified sycl_timer example to use dpctl.tensor function
oleksandr-pavlyk Nov 3, 2023
d4d4992
Fixed misspelled words
oleksandr-pavlyk Nov 3, 2023
af28d98
Improves performance of search reductions for small numbers of elemen…
ndgrigorian Nov 3, 2023
eb21e50
Remove superfluous includes in tensor_ctors.cpp per PR review
oleksandr-pavlyk Nov 3, 2023
9018745
Merge pull request #1463 from IntelPython/optimize-small-size-tree-re…
oleksandr-pavlyk Nov 4, 2023
7e79083
Enable use of np.int64 to specify shape of usm_ndarray
oleksandr-pavlyk Nov 5, 2023
2bc7939
Add a test for shape being np.int64 scalar
oleksandr-pavlyk Nov 5, 2023
aadb6b4
Eliminated multiple uses of same literal constants in test_search_red…
oleksandr-pavlyk Nov 5, 2023
07c075b
Corrected argmin/argmax docstring
oleksandr-pavlyk Nov 7, 2023
80e2f29
Fixed gh-1468
oleksandr-pavlyk Nov 7, 2023
ff9b5eb
Added a test based on gh-1468
oleksandr-pavlyk Nov 7, 2023
da59476
Changed TypeError wording per PR feedback
oleksandr-pavlyk Nov 7, 2023
ca2c6aa
Removed redundant asdtype function call
oleksandr-pavlyk Nov 7, 2023
dbab3fe
Merge pull request #1470 from IntelPython/fix-gh-1468-reduction
oleksandr-pavlyk Nov 8, 2023
f686102
Merge pull request #1467 from IntelPython/fix-usm-ndarray-ctor-when-s…
oleksandr-pavlyk Nov 8, 2023
f2af753
Add array API inspection utilities to ``dpctl.tensor`` (#1469)
ndgrigorian Nov 8, 2023
9afb742
Implements statistical functions ``mean``, ``std``, ``var`` (#1465)
ndgrigorian Nov 8, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
59 changes: 55 additions & 4 deletions .github/workflows/conda-package.yml
Original file line number Diff line number Diff line change
Expand Up @@ -102,6 +102,8 @@ jobs:
run: |
echo "WHEELS_OUTPUT_FOLDER=$GITHUB_WORKSPACE${{ runner.os == 'Linux' && '/' || '\\' }}" >> $GITHUB_ENV
- name: Build conda package
env:
OVERRIDE_INTEL_IPO: 1 # IPO requires more resources that GH actions VM provides
run: conda build --no-test --python ${{ matrix.python }} -c intel -c conda-forge --override-channels conda-recipe
- name: Upload artifact
uses: actions/upload-artifact@v3
Expand Down Expand Up @@ -181,6 +183,7 @@ jobs:
python -c "import dpctl; dpctl.lsplatform(verbosity=2)"
- name: Install gdb
run: |
sudo apt-get update --fix-missing
sudo apt-get install -y gdb
- name: Run test_elementwise under gdb
run: |
Expand Down Expand Up @@ -320,15 +323,22 @@ jobs:
matrix:
python: ['3.9', '3.10', '3.11']
steps:
- name: Download artifact
- name: Download conda artifact
uses: actions/download-artifact@v3
with:
name: ${{ env.PACKAGE_NAME }} ${{ runner.os }} Python ${{ matrix.python }}

- name: Download wheel artifact
uses: actions/download-artifact@v3
with:
name: ${{ env.PACKAGE_NAME }} ${{ runner.os }} Wheels Python ${{ matrix.python }}

- name: Install anaconda-client
run: conda install anaconda-client
- name: Add conda to system path
run: echo $CONDA/bin >> $GITHUB_PATH
- name: Package version
run: echo "PACKAGE_VERSION=$(basename ${{ env.PACKAGE_NAME }}-*.tar.bz2 | sed 's/^${{ env.PACKAGE_NAME }}-\([^-]*\).*/\1/')" >> $GITHUB_ENV

- name: Upload
env:
Expand All @@ -339,7 +349,7 @@ jobs:
- name: Upload Wheels
env:
ANACONDA_TOKEN: ${{ secrets.ANACONDA_TOKEN }}
run: anaconda --token ${{ env.ANACONDA_TOKEN }} upload --user dppy --label dev ${{ env.PACKAGE_NAME }}-*.whl
run: anaconda --token ${{ env.ANACONDA_TOKEN }} upload --user dppy --label dev ${{ env.PACKAGE_NAME }}-*.whl --version ${{ env.PACKAGE_VERSION }}

upload_windows:
needs: test_windows
Expand All @@ -353,13 +363,24 @@ jobs:
uses: actions/download-artifact@v3
with:
name: ${{ env.PACKAGE_NAME }} ${{ runner.os }} Python ${{ matrix.python }}

- name: Download wheel artifact
uses: actions/download-artifact@v3
with:
name: ${{ env.PACKAGE_NAME }} ${{ runner.os }} Wheels Python ${{ matrix.python }}

- uses: conda-incubator/setup-miniconda@v2
with:
auto-activate-base: true
activate-environment: ""

- name: Install anaconda-client
run: conda install anaconda-client

- name: Package version
shell: bash -el {0}
run: echo "PACKAGE_VERSION=$(basename ${{ env.PACKAGE_NAME }}-*.tar.bz2 | sed 's/^${{ env.PACKAGE_NAME }}-\([^-]*\).*/\1/')" >> $GITHUB_ENV

- name: Upload
env:
ANACONDA_TOKEN: ${{ secrets.ANACONDA_TOKEN }}
Expand All @@ -369,7 +390,37 @@ jobs:
- name: Upload Wheels
env:
ANACONDA_TOKEN: ${{ secrets.ANACONDA_TOKEN }}
run: anaconda --token ${{ env.ANACONDA_TOKEN }} upload --user dppy --label dev ${{ env.PACKAGE_NAME }}-*.whl
run: anaconda --token ${{ env.ANACONDA_TOKEN }} upload --user dppy --label dev ${{ env.PACKAGE_NAME }}-*.whl --version ${{ env.PACKAGE_VERSION }}

cleanup_packages:
name: Clean up anaconda packages
needs: [upload_linux, upload_windows]
runs-on: 'ubuntu-latest'
defaults:
run:
shell: bash -el {0}
steps:
- uses: conda-incubator/setup-miniconda@v2
with:
run-post: false
channel-priority: "disabled"
channels: conda-forge
python-version: '3.11'

- name: Install anaconda-client
run: conda install anaconda-client

- name: Checkout repo
uses: actions/checkout@v2
with:
repository: IntelPython/devops-tools
fetch-depth: 0

- name: Cleanup old packages
run: |
python scripts/cleanup-old-packages.py \
--verbose --force --token ${{ secrets.ANACONDA_TOKEN }} \
--package dppy/${{ env.PACKAGE_NAME }} --label dev

test_examples_linux:
needs: build_linux
Expand Down Expand Up @@ -615,7 +666,7 @@ jobs:
python -c "import dpctl; dpctl.lsplatform()"
export ARRAY_API_TESTS_MODULE=dpctl.tensor
cd /home/runner/work/array-api-tests
pytest --ci --json-report --json-report-file=$FILE array_api_tests/ || true
pytest --json-report --json-report-file=$FILE array_api_tests/ || true
- name: Set Github environment variables
shell: bash -l {0}
run: |
Expand Down
3 changes: 2 additions & 1 deletion CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
cmake_minimum_required(VERSION 3.21...3.27 FATAL_ERROR)

project(dpctl
VERSION 0.15
LANGUAGES CXX
DESCRIPTION "Python interface for XPU programming"
)
Expand All @@ -17,7 +18,7 @@ option(DPCTL_GENERATE_COVERAGE
OFF
)

find_package(IntelDPCPP REQUIRED PATHS ${CMAKE_SOURCE_DIR}/cmake NO_DEFAULT_PATH)
find_package(IntelSYCL REQUIRED PATHS ${CMAKE_SOURCE_DIR}/cmake NO_DEFAULT_PATH)

add_subdirectory(libsyclinterface)

Expand Down
23 changes: 0 additions & 23 deletions MANIFEST.in

This file was deleted.

Loading
Loading