145 Commits

Author SHA1 Message Date
Anthony Berg
f2bda95812 feat(common): add a check for hipblas in hip_check 2025-06-24 21:40:56 +02:00
Anthony Berg
189eb42603 feat(common): add a common hip_check function 2025-06-24 21:25:01 +02:00
Anthony Berg
b8836f4fb0 fix(common): change assert to raise in Array2D 2025-06-24 21:17:13 +02:00
Anthony Berg
d1df00267a feat(common): make subpackage for arrays to split hip and cuda 2025-06-24 21:11:19 +02:00
Anthony Berg
bfed972046 refactor(simulator): change from CudaContext to KernelContext 2025-06-24 20:37:24 +02:00
Anthony Berg
3be1b074dd feat(simulator): change to KernelContext for autotuner 2025-06-24 20:28:34 +02:00
Anthony Berg
4da604737a feat(kernel): make handling path and file extension for kernel by respective Context 2025-06-24 20:18:24 +02:00
Anthony Berg
d4607183aa refactor(kernel): follow PEP8 formatting standard and add f strings 2025-06-24 20:15:55 +02:00
Anthony Berg
1469ac1128 refactor(helpers): follow PEP8 formatting standard 2025-06-24 20:04:44 +02:00
Anthony Berg
d4f2ffc493 feat(kernel): add environment condition for importing certain packages 2025-06-24 19:57:59 +02:00
Anthony Berg
3b424d1f5a refactor(autotuner): move models to a separate package 2025-06-24 18:53:33 +02:00
Anthony Berg
ae9d2b1595 refactor(autotuner): replace string format with f strings 2025-06-24 18:39:54 +02:00
Anthony Berg
1786ca979f refactor(ipm): replace string format with f strings 2025-06-24 17:59:32 +02:00
Anthony Berg
0633561bbf refactor(mpi): follow PEP8 scheme and replace .format() with f strings 2025-06-24 17:55:23 +02:00
Anthony Berg
7475d590eb refactor(mpi): follow PEP8 function naming scheme 2025-06-24 17:48:57 +02:00
Anthony Berg
d2544e7c55 fix(kernel): type for defines in get_module 2025-06-24 17:45:50 +02:00
Anthony Berg
ae23145753 refactor(kernel): split Common.py to a separate package 2025-06-24 17:35:42 +02:00
Anthony Berg
c54f08c417 refactor(kernel): split Common.py to a separate package 2025-06-24 17:34:29 +02:00
Anthony Berg
8f24cd45ea feat(kernel): add basic HIPContext 2025-06-24 16:04:48 +02:00
Anthony Berg
0fa04dbcec feat(kernel): add hipified kernels 2025-06-20 11:08:07 +02:00
Anthony Berg
8096f4ce04 build(kernel): add separate CMakeLists 2025-06-18 16:22:12 +02:00
Anthony Berg
bb0f75afea refactor(gpu): fix up formatting and warnings 2025-06-16 17:38:23 +02:00
Anthony Berg
66e0f8024a build(gpu): add CMakeLists for CUDA code 2025-06-16 17:36:56 +02:00
Anthony Berg
19ea8319e8 Start of HIPIFYcation 2025-06-16 16:09:45 +02:00
Anthony Berg
e7cd6ae34a feat: add progress bar for Autotuner 2025-03-26 00:28:16 +01:00
Anthony Berg
c485b37219 fix(GPUSimulators): swap out missing functions in Autotuner 2025-02-18 10:22:00 +01:00
Anthony Berg
aa54d21f0a feat(GPUSimulator): add the extra variables introduced in SWWECommon for all other algorithms 2025-02-18 10:21:12 +01:00
Anthony Berg
71cc5628c8 fix(GPUSimulator): use class function instead of calling hash_kernel separately 2025-02-18 10:16:10 +01:00
Anthony Berg
7c312c255f docs(GPUSimulators): add documentation for CudaContext 2025-02-14 16:29:47 +01:00
Anthony Berg
ef207432db refactor(GPUSimulator): follow PEP 8 style guide 2025-02-14 12:40:31 +01:00
Martin Lilleeng Sætra
55029275e3 Cleanup 2022-08-16 19:01:24 +02:00
Martin Lilleeng Sætra
234c8cb727 Merge branch 'master' of github.com:setmar/ShallowWaterGPU 2022-08-16 14:35:21 +00:00
Martin Lilleeng Sætra
403146cccc Draft of SHMEMSimulator 2022-08-16 14:34:57 +00:00
Martin Lilleeng Sætra
a260230002 Removed nvtx and MPI options for job script 2022-05-25 14:52:44 +02:00
Martin Lilleeng Sætra
a588948e77 Bugfix for ghost cell exchange and nc atts 2022-05-25 11:25:10 +00:00
Martin Lilleeng Sætra
e5a8486501 Periodic BC now working with domain decomposition again. 2022-05-09 13:50:59 +00:00
Martin Lilleeng Sætra
c7e6f17445 Adjusting order of substep ops and benchmark scripts 2022-04-26 15:39:21 +02:00
Martin Lilleeng Sætra
acb7d2ab39 Async mem ops 2022-04-26 12:10:22 +00:00
Martin Lilleeng Sætra
6c8bac6f7b Async mem ops 2022-04-26 11:34:29 +00:00
Martin Lilleeng Sætra
dcef56a1b9 More time counters and separation of experiments. 2022-04-11 13:57:33 +00:00
Martin Lilleeng Sætra
d83d620512 Profiling code for DGX-2. Added fixed dt. 2022-04-08 15:09:09 +02:00
Martin Lilleeng Sætra
8fd9d3d850 Added more profiling timers 2022-04-07 09:07:20 +00:00
Martin Lilleeng Sætra
124e033ff5 More profiling 2022-04-01 13:42:56 +00:00
Martin Lilleeng Sætra
11223f0890 Checkpoint. Next: Compose domain before writing 2021-05-23 18:26:18 +00:00
Martin Lilleeng Sætra
ad2be4b5a5 Added sync, some comments, and done some tinkering 2021-05-18 21:19:07 +00:00
Martin Lilleeng Sætra
b6424b84fb First running version. 2021-05-13 15:25:34 +00:00
Martin Lilleeng Sætra
5b6c4f682d Checkpoint SHMEMSimulatorGroup. Needs debugging. 2021-05-12 16:09:29 +00:00
Martin Lilleeng Sætra
2da4640840 Added class for managing SHMEMSimulators 2021-05-05 15:24:34 +00:00
Martin Lilleeng Sætra
98aff206dc Running multiple CUDA contexts per process/thread 2021-02-21 22:57:50 +01:00
Martin Lilleeng Sætra
239c1fdcd9 Refactoring. 2019-08-16 15:27:46 +00:00