Commit Graph

  • d112dc98bd fix(common): allow for empty cpu_data to be parsed hipify Anthony Berg 2025-07-01 01:30:20 +02:00
  • a59c0f073d fix(common): bad import of hip_check Anthony Berg 2025-07-01 01:14:06 +02:00
  • 86821d659c feat(common): add hiprtc to hip_check Anthony Berg 2025-07-01 00:33:01 +02:00
  • 2241da1532 fix(simulator): add default value for no parameters Anthony Berg 2025-07-01 00:11:31 +02:00
  • a77c3786e5 fix(simulator): circular import with BoundaryCondition Anthony Berg 2025-06-30 23:37:20 +02:00
  • 86b56741e2 feat(gpu): make CudaContext inherit Context Anthony Berg 2025-06-30 20:36:02 +02:00
  • 1343cfd8c1 fix(gpu): remove unused import in HIPContext Anthony Berg 2025-06-30 20:26:25 +02:00
  • 8f90ec6291 feat(gpu): add compiler to HIPContext Anthony Berg 2025-06-30 14:11:57 +02:00
  • ecfdaaa39e feat(gpu): add function to find includes in C/C++ files Anthony Berg 2025-06-30 13:53:18 +02:00
  • 5eab354968 fix(gpu): add directory to includes Anthony Berg 2025-06-30 13:51:46 +02:00
  • 0f0329cf36 fix(gpu): escape sequence in regex Anthony Berg 2025-06-29 23:13:07 +02:00
  • 3aedef93cf feat(gpu): add common cache directory Anthony Berg 2025-06-25 15:08:10 +02:00
  • 4df7b9b6b7 fix(common): cyclical imports for arrays Anthony Berg 2025-06-25 13:57:56 +02:00
  • f4fff25539 feat(common): add a function to get root directory Anthony Berg 2025-06-25 13:34:15 +02:00
  • 1375743e21 fix(simulator): prevent import causing recursive inheritance Anthony Berg 2025-06-25 13:19:19 +02:00
  • c549556c36 feat(common): add imports for HIP 2d arrays Anthony Berg 2025-06-25 13:12:55 +02:00
  • 01b39cc9b4 feat(common): add HIP array2d and arkawaa2d Anthony Berg 2025-06-25 13:11:52 +02:00
  • 985e774979 feat(simulator): add CUDA and HIP BaseSimulators Anthony Berg 2025-06-25 12:58:11 +02:00
  • 0d3227303a refactor(simulator): move Simulator to its own package Anthony Berg 2025-06-25 12:37:27 +02:00
  • 3ea511f445 fix(helpers): add init for helpers Anthony Berg 2025-06-25 10:25:43 +02:00
  • f2bda95812 feat(common): add a check for hipblas in hip_check Anthony Berg 2025-06-24 21:40:56 +02:00
  • 189eb42603 feat(common): add a common hip_check function Anthony Berg 2025-06-24 21:25:01 +02:00
  • b8836f4fb0 fix(common): change assert to raise in Array2D Anthony Berg 2025-06-24 21:17:13 +02:00
  • d1df00267a feat(common): make subpackage for arrays to split hip and cuda Anthony Berg 2025-06-24 21:11:19 +02:00
  • bfed972046 refactor(simulator): change from CudaContext to KernelContext Anthony Berg 2025-06-24 20:37:24 +02:00
  • 3be1b074dd feat(simulator): change to KernelContext for autotuner Anthony Berg 2025-06-24 20:28:34 +02:00
  • 4da604737a feat(kernel): make handling path and file extension for kernel by respective Context Anthony Berg 2025-06-24 20:18:24 +02:00
  • d4607183aa refactor(kernel): follow PEP8 formatting standard and add f strings Anthony Berg 2025-06-24 20:15:34 +02:00
  • 1469ac1128 refactor(helpers): follow PEP8 formatting standard Anthony Berg 2025-06-24 20:04:44 +02:00
  • d4f2ffc493 feat(kernel): add environment condition for importing certain packages Anthony Berg 2025-06-24 19:57:59 +02:00
  • 3b424d1f5a refactor(autotuner): move models to a separate package Anthony Berg 2025-06-24 18:53:33 +02:00
  • ae9d2b1595 refactor(autotuner): replace string format with f strings Anthony Berg 2025-06-24 18:39:54 +02:00
  • 2726a4dfce refactor: replace string format with f strings and follow PEP8 formatting Anthony Berg 2025-06-24 18:27:35 +02:00
  • 1786ca979f refactor(ipm): replace string format with f strings Anthony Berg 2025-06-24 17:59:32 +02:00
  • 0633561bbf refactor(mpi): follow PEP8 scheme and replace .format() with f strings Anthony Berg 2025-06-24 17:55:23 +02:00
  • 7475d590eb refactor(mpi): follow PEP8 function naming scheme Anthony Berg 2025-06-24 17:48:57 +02:00
  • d2544e7c55 fix(kernel): type for defines in get_module Anthony Berg 2025-06-24 17:45:50 +02:00
  • ae23145753 refactor(kernel): split Common.py to a separate package Anthony Berg 2025-06-24 17:35:42 +02:00
  • c54f08c417 refactor(kernel): split Common.py to a separate package Anthony Berg 2025-06-24 17:34:29 +02:00
  • 8f24cd45ea feat(kernel): add basic HIPContext Anthony Berg 2025-06-24 16:04:48 +02:00
  • 0fa04dbcec feat(kernel): add hipified kernels Anthony Berg 2025-06-20 11:08:07 +02:00
  • 8096f4ce04 build(kernel): add separate CMakeLists Anthony Berg 2025-06-18 16:22:12 +02:00
  • bb0f75afea refactor(gpu): fix up formatting and warnings Anthony Berg 2025-06-16 17:38:23 +02:00
  • 66e0f8024a build(gpu): add CMakeLists for CUDA code Anthony Berg 2025-06-16 17:36:56 +02:00
  • 19ea8319e8 Start of HIPIFYcation Anthony Berg 2025-06-16 16:09:45 +02:00
  • e7cd6ae34a feat: add progress bar for Autotuner master Anthony Berg 2025-03-26 00:28:16 +01:00
  • 079bad08b8 feat: add having run jupyter notebooks with fixes Anthony Berg 2025-03-25 23:25:23 +01:00
  • c485b37219 fix(GPUSimulators): swap out missing functions in Autotuner Anthony Berg 2025-02-18 10:22:00 +01:00
  • aa54d21f0a feat(GPUSimulator): add the extra variables introduced in SWWECommon for all other algorithms Anthony Berg 2025-02-18 10:21:12 +01:00
  • 71cc5628c8 fix(GPUSimulator): use class function instead of calling hash_kernel separately Anthony Berg 2025-02-18 10:16:10 +01:00
  • 23837b5828 build(conda): update Python to 3.9 Anthony Berg 2025-02-18 10:12:54 +01:00
  • 7c312c255f docs(GPUSimulators): add documentation for CudaContext Anthony Berg 2025-02-14 16:29:47 +01:00
  • ef207432db refactor(GPUSimulator): follow PEP 8 style guide Anthony Berg 2025-02-14 12:40:31 +01:00
  • ce8e834771 feat(gitignore): add generated files, Python, CUDA, and JetBrains to .gitignore Anthony Berg 2025-02-13 16:48:10 +01:00
  • e43d7de73b
    Update README.md André R. Brodtkorb 2023-02-23 08:57:38 +01:00
  • 9613269c45
    Merge pull request #10 from setmar/master André R. Brodtkorb 2023-02-23 08:55:50 +01:00
  • 55029275e3 Cleanup Martin Lilleeng Sætra 2022-08-16 19:01:24 +02:00
  • 4216a7d530 Added setup instructions Martin Lilleeng Sætra 2022-08-16 18:24:21 +02:00
  • d7fbc18cdd Merge branch 'master' of github.com:setmar/ShallowWaterGPU Martin Lilleeng Sætra 2022-08-16 14:48:53 +00:00
  • 02b4467b36 Separate notebook for Simula Figures. Martin Lilleeng Sætra 2022-08-16 16:48:05 +02:00
  • 96a9f0faad Separate notebook for Saga Figures Martin Lilleeng Sætra 2022-08-16 16:41:38 +02:00
  • 234c8cb727 Merge branch 'master' of github.com:setmar/ShallowWaterGPU Martin Lilleeng Sætra 2022-08-16 14:35:21 +00:00
  • 403146cccc Draft of SHMEMSimulator Martin Lilleeng Sætra 2022-08-16 14:34:57 +00:00
  • a7a723aca6 Added benchmarks for Saga and updated figures. Martin Lilleeng Sætra 2022-06-24 17:14:10 +02:00
  • 528b738dda Added benchmarks and updated figures. Martin Lilleeng Sætra 2022-06-24 14:33:16 +02:00
  • aa693a9a46 Revised DGX-2 benchmarks and added HGX benchmarks. Martin Lilleeng Sætra 2022-06-20 09:28:45 +02:00
  • 0f0cbad2dd Tweaked MPI job params Martin Lilleeng Sætra 2022-06-09 14:09:10 +02:00
  • ff27e14217 Adjusted dt Martin Lilleeng Sætra 2022-05-25 17:04:44 +02:00
  • 286a5dd872 Adjusting job script Martin Lilleeng Sætra 2022-05-25 15:50:24 +02:00
  • 9749acb6fe Added numactl Martin Lilleeng Sætra 2022-05-25 14:58:05 +02:00
  • a260230002 Removed nvtx and MPI options for job script Martin Lilleeng Sætra 2022-05-25 14:52:44 +02:00
  • a588948e77 Bugfix for ghost cell exchange and nc atts Martin Lilleeng Sætra 2022-05-25 11:25:10 +00:00
  • 4d03e6cfbc Updated job/experiment scripts Martin Lilleeng Sætra 2022-05-25 11:23:22 +00:00
  • 5faf334c24 Updated plots Martin Lilleeng Sætra 2022-05-09 13:55:05 +00:00
  • b5ce0142a0 Fixed logging and matplotlib commands Martin Lilleeng Sætra 2022-05-09 13:54:35 +00:00
  • e5a8486501 Periodic BC now working with domain decomposition again. Martin Lilleeng Sætra 2022-05-09 13:50:59 +00:00
  • b4d204c9c9 Scripts for single-GPU testing and running on PPI. Martin Lilleeng Sætra 2022-05-09 13:50:12 +00:00
  • 28f73c20af Benchmark script for weak scaling Martin Lilleeng Sætra 2022-04-26 15:39:54 +02:00
  • c7e6f17445 Adjusting order of substep ops and benchmark scripts Martin Lilleeng Sætra 2022-04-26 15:39:21 +02:00
  • acb7d2ab39 Async mem ops Martin Lilleeng Sætra 2022-04-26 12:10:22 +00:00
  • 6c8bac6f7b Async mem ops Martin Lilleeng Sætra 2022-04-26 11:34:29 +00:00
  • dcef56a1b9 More time counters and separation of experiments. Martin Lilleeng Sætra 2022-04-11 13:57:33 +00:00
  • 42aff3e095 Added profiling run script for seymour. Martin Lilleeng Sætra 2022-04-11 08:57:40 +00:00
  • 91291ac19d Updated graphs. Martin Lilleeng Sætra 2022-04-08 17:33:39 +02:00
  • d83d620512 Profiling code for DGX-2. Added fixed dt. Martin Lilleeng Sætra 2022-04-08 15:09:09 +02:00
  • 80d84e0489 Merge branch 'master' of github.com:setmar/ShallowWaterGPU Martin Lilleeng Sætra 2022-04-07 14:23:02 +00:00
  • 286ead0fbe Added notebook for plotting (scalability) figures. Martin Lilleeng Sætra 2022-04-07 14:22:42 +00:00
  • f8baa365d8 Merge branch 'master' of github.com:setmar/ShallowWaterGPU Martin Lilleeng Sætra 2022-04-07 13:52:28 +02:00
  • 353056d5b1 Added arguments for job script Martin Lilleeng Sætra 2022-04-07 13:52:16 +02:00
  • 8fd9d3d850 Added more profiling timers Martin Lilleeng Sætra 2022-04-07 09:07:20 +00:00
  • 124e033ff5 More profiling Martin Lilleeng Sætra 2022-04-01 13:42:56 +00:00
  • bf8dc33b28 Merge remote-tracking branch 'upstream/master' Martin Lilleeng Sætra 2022-01-24 20:15:37 +01:00
  • 5b6e145d80 Merge branch 'master' of github.com:setmar/ShallowWaterGPU Martin Lilleeng Sætra 2022-01-24 19:58:12 +01:00
  • 8f565f57f6 Tweaked DGX-2 run setup. Martin Lilleeng Sætra 2022-01-24 19:57:58 +01:00
  • bc39170efb Tweaked settings for DGX-2 run. Martin Lilleeng Sætra 2022-01-24 19:56:07 +01:00
  • 689cf14202
    Update README.md Martin Lilleeng Sætra 2022-01-24 18:17:41 +01:00
  • 55499546be
    Update README.md Martin Lilleeng Sætra 2022-01-24 18:04:55 +01:00
  • ab0b3d29a1
    Update README.md Martin Lilleeng Sætra 2022-01-24 17:26:40 +01:00
  • b48701bfe9 Added separate conda env for use with jupyter Martin Lilleeng Sætra 2022-01-24 16:10:17 +01:00
  • bf959d15a9
    Update README.md Martin Lilleeng Sætra 2022-01-24 12:42:29 +01:00