-
Notifications
You must be signed in to change notification settings - Fork 38
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #16 from LLNL/release/v1.2.4
Release/v1.2.4
- Loading branch information
Showing
58 changed files
with
567 additions
and
1,542 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
07/14/2019 v1.2.4: Updated to BSD-3-Clause license, Caliper support | ||
|
||
10/12/2018 v1.2.3: CUDA support, updated policy selection/dispatch mechanisms, updated build system | ||
|
||
04/04/2018 v1.2.2-CORAL2: Fixed bug in Population edit | ||
|
||
03/26/2018 v1.2.1-CORAL2: Updated to RAJA-0.6.0rc2, fixed FOM calculation and updated docs | ||
|
||
11/02/2017 v1.2.0-CORAL2: Initial release for CORAL2 | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,53 @@ | ||
Intellectual Property Notice | ||
------------------------------ | ||
|
||
Kripke is licensed under the BSD 3-Clause license, | ||
(BSD-3-Clause or https://opensource.org/licenses/BSD-3-Clause). | ||
|
||
See the files LICENSE and NOTICE for details. | ||
|
||
Copyrights and patents in the Kripke project are retained by contributors. | ||
No copyright assignment is required to contribute to Kripke. | ||
|
||
|
||
SPDX usage | ||
------------ | ||
|
||
Individual files contain SPDX tags instead of the full license text. | ||
This enables machine processing of license information based on the SPDX | ||
License Identifiers that are available here: https://spdx.org/licenses/ | ||
|
||
Files that are licensed as BSD 3-Clause contain the following | ||
text in the license header: | ||
|
||
SPDX-License-Identifier: (BSD-3-Clause) | ||
|
||
|
||
External Packages | ||
------------------- | ||
Kripke has some external dependencies, some of which are included as Git | ||
submodules and others are bundled in the Kripke repository. These | ||
packages are covered by various permissive licenses. A summary listing | ||
follows. See the license included with each package for full details. | ||
|
||
|
||
PackageName: BLT | ||
PackageHomePage: https://github.com/LLNL/blt/ | ||
PackageLicenseDeclared: BSD-3-Clause | ||
|
||
PackageName: CHAI | ||
PackageHomePage: http://github.com/LLNL/CHAI/ | ||
PackageLicenseDeclared: BSD-3-Clause | ||
|
||
PackageName: CUB | ||
PackageHomePage: http://nvlabs.github.com/cub/ | ||
PackageLicenseDeclared: BSD-3-Clause | ||
|
||
PackageName: RAJA | ||
PackageHomePage: https://github.com/LLNL/RAJA/ | ||
PackageLicenseDeclared: BSD-3-Clause | ||
|
||
PackageName: Umpire | ||
PackageHomePage: http://github.com/LLNL/Umpire/ | ||
PackageLicenseDeclared: MIT | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,30 @@ | ||
BSD 3-Clause License | ||
|
||
Copyright (c) 2018, Lawrence Livermore National Security, LLC | ||
All rights reserved. | ||
|
||
Redistribution and use in source and binary forms, with or without | ||
modification, are permitted provided that the following conditions are met: | ||
|
||
* Redistributions of source code must retain the above copyright notice, this | ||
list of conditions and the following disclaimer. | ||
|
||
* Redistributions in binary form must reproduce the above copyright notice, | ||
this list of conditions and the following disclaimer in the documentation | ||
and/or other materials provided with the distribution. | ||
|
||
* Neither the name of the copyright holder nor the names of its | ||
contributors may be used to endorse or promote products derived from | ||
this software without specific prior written permission. | ||
|
||
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" | ||
AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE | ||
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE | ||
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE | ||
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL | ||
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR | ||
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER | ||
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, | ||
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE | ||
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
This work was produced under the auspices of the U.S. Department of | ||
Energy by Lawrence Livermore National Laboratory under Contract | ||
DE-AC52-07NA27344. | ||
|
||
This work was prepared as an account of work sponsored by an agency of | ||
the United States Government. Neither the United States Government nor | ||
Lawrence Livermore National Security, LLC, nor any of their employees | ||
makes any warranty, expressed or implied, or assumes any legal liability | ||
or responsibility for the accuracy, completeness, or usefulness of any | ||
information, apparatus, product, or process disclosed, or represents that | ||
its use would not infringe privately owned rights. | ||
|
||
Reference herein to any specific commercial product, process, or service | ||
by trade name, trademark, manufacturer, or otherwise does not necessarily | ||
constitute or imply its endorsement, recommendation, or favoring by the | ||
United States Government or Lawrence Livermore National Security, LLC. | ||
|
||
The views and opinions of authors expressed herein do not necessarily | ||
state or reflect those of the United States Government or Lawrence | ||
Livermore National Security, LLC, and shall not be used for advertising | ||
or product endorsement purposes. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,42 +1,9 @@ | ||
KRIPKE | ||
====== | ||
|
||
Version 1.2.3 | ||
|
||
Release Date 10/12/2018 | ||
|
||
LLNL-CODE-658597 | ||
|
||
|
||
Authors | ||
======= | ||
* Adam J. Kunen [[email protected]](mailto:[email protected]) (Primary point of contact) | ||
* Peter N. Brown [[email protected]](mailto:[email protected]) | ||
* Teresa S. Bailey [[email protected]](mailto:[email protected]) | ||
* Peter G. Maginot [[email protected]](mailto:[email protected]) | ||
|
||
|
||
License | ||
======= | ||
See included file NOTICE.md | ||
|
||
|
||
Changes | ||
======= | ||
|
||
* 10/12/2018 v1.2.3: CUDA support, updated policy selection/dispatch mechanisms, updated build system | ||
* 04/04/2018 v1.2.2-CORAL2: Fixed bug in Population edit | ||
* 03/26/2018 v1.2.1-CORAL2: Updated to RAJA-0.6.0rc2, fixed FOM calcuation and updated docs | ||
* 11/02/2017 v1.2.0-CORAL2: Initial release for CORAL2 | ||
|
||
|
||
|
||
|
||
Overview | ||
======== | ||
Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. A main goal of Kripke is investigating how different data-layouts affect instruction, thread and task level parallelism, and what the implications are on overall solver performance. | ||
|
||
Kripkie supports storage of angular fluxes (Psi) using all six striding orders (or "nestings") of Directions (D), Groups (G), and Zones (Z), and provides computational kernels specifically written for each of these nestings. Most Sn transport codes are designed around one of these nestings, which is an inflexibility that leads to software engineering compromises when porting to new architectures and programming paradigms. | ||
Kripke supports storage of angular fluxes (Psi) using all six striding orders (or "nestings") of Directions (D), Groups (G), and Zones (Z), and provides computational kernels specifically written for each of these nestings. Most Sn transport codes are designed around one of these nestings, which is an inflexibility that leads to software engineering compromises when porting to new architectures and programming paradigms. | ||
|
||
Early research has found that the problem dimensions (zones, groups, directions, scattering order) and the scaling (number of threads and MPI tasks), can make a profound difference in the performance of each of these nestings. To our knowledge this is a capability unique to Kripke, and should provide key insight into how data-layout effects Sn solver performance. An asynchronous MPI-based parallel sweep algorithm is provided, which employs the concepts of Group Sets (GS) Zone Sets (ZS), and Direction Sets (DS), borrowed from the [Texas A&M code PDT](https://parasol.tamu.edu/asci/). | ||
|
||
|
@@ -56,7 +23,7 @@ A major challenge of achieving high-performance in an Sn transport (or any physi | |
|
||
Parallel sweep algorithms can be explored with Kripke in multiple ways. The core MPI algorithm could be modified or rewritten to explore other approaches, domain overloading, or alternate programming models (such as Charm++). The effect of load-imbalance is an understudied aspect of Sn transport sweeps, and could easily be studied with Kripke by artificially adding more work (ie unknowns) to a subset of MPI tasks. Block-AMR could be added to Kripke, which would be a useful way to explore the cost-benefit analysis of adding AMR to an Sn code, and would be a way to further study load imbalances and AMR effects on sweeps. | ||
|
||
The coupling of on-node sweep kernel, the parallel sweep algorithm, and the choices of decomposing the problem phase space into GS's, ZS's and DS's impact the performance of the overall sweep. The tradeoff between large and small "units of work" can be studied. Larger "units of work" provide more opportunity for on-node parallelism, while creating larger messages, less "sends", and less efficient parallel sweeps. Smaller "units of work" make for less efficient on-node kernels, but more efficient parallel sweeps. | ||
The coupling of on-node sweep kernel, the parallel sweep algorithm, and the choices of decomposing the problem phase space into GS's, ZS's and DS's impact the performance of the overall sweep. The trade off between large and small "units of work" can be studied. Larger "units of work" provide more opportunity for on-node parallelism, while creating larger messages, less "sends", and less efficient parallel sweeps. Smaller "units of work" make for less efficient on-node kernels, but more efficient parallel sweeps. | ||
|
||
We can also study trading MPI tasks for threads, and the effects this has on our programming models and cache efficiency. | ||
|
||
|
@@ -116,6 +83,9 @@ Basic requirements: | |
|
||
* (Optional) OpenMP 3 or later | ||
|
||
* (Optional) [Caliper](https://github.com/LLNL/Caliper): a performance profiling/analysis library. | ||
|
||
|
||
Submodule dependencies: | ||
|
||
* [BLT](https://github.com/LLNL/blt) v0.1: a CMake based build system (required) | ||
|
@@ -165,7 +135,7 @@ The easiest way to get Kripke running, is to directly invoke CMake and take what | |
cd build | ||
cmake .. | ||
|
||
For a number of platforms, we have cache inits file that makes things easier: | ||
For a number of platforms, we have CMake cache files that make things easier: | ||
|
||
cd build | ||
cmake .. -C../host-configs/llnl-bgqos-clang.cmake | ||
|
@@ -181,7 +151,7 @@ The easiest way to get Kripke running, is to directly invoke CMake and take what | |
|
||
There are a number of cache init files for LLNL machines and operating systems. | ||
These might not meet your needs, but can be a very good starting point for developing your own. | ||
The current list of cache init files (located in the ./host-confgs/ directory) are: | ||
The current list of cache init files (located in the ./host-configs/ directory) are: | ||
|
||
* llnl-bgqos-clang.cmake | ||
|
||
|
@@ -202,11 +172,16 @@ The current list of cache init files (located in the ./host-confgs/ directory) a | |
Running Kripke | ||
============== | ||
|
||
Environment Variabes | ||
Environment Variables | ||
-------------------- | ||
|
||
If Kripke is built with OpenMP support, then the environment variables ``OMP_NUM_THREADS`` is used to control the number of OpenMP threads. Kripke does not attempt to modify the OpenMP runtime in anyway, so other ``OMP_*`` environment variables should also work as well. | ||
|
||
If Kripke is built with OpenMP support, then the environment variables ``OMP_NUM_THREADS`` is used to control the number of OpenMP threads. Kripke does not attempt to modify the OpenMP runtime in any way, so other ``OMP_*`` environment variables should also work as well. | ||
|
||
If Kripke is built with Caliper support, Caliper performance measurements can be configured through Caliper environment variables. For example, | ||
|
||
CALI_CONFIG_PROFILE=runtime-report ./kripke ... | ||
|
||
will print a time profile of annotated code regions in Kripke. For more information, see https://llln.github.io/Caliper. | ||
|
||
Command Line Options | ||
-------------------- | ||
|
@@ -250,10 +225,10 @@ Command line option help can also be viewed by running "./kripke --help" | |
|
||
* **``--layout <LAYOUT>``** | ||
|
||
Data layout selection. This determines the data layout and kernel implementation details (such as loop nesting order). The layouts are determined by the order of unknwons in the angular flux: Direction, Group, and Zone. Available layouts are DGZ, DZG, GDZ, GZD, ZDG, and ZGD. The order is specified left-to-right in longest-to-shortes stride. For example: DGZ means that Directions are the longest stride, and Zones are stride-1. (Default: --nest DGZ) | ||
Data layout selection. This determines the data layout and kernel implementation details (such as loop nesting order). The layouts are determined by the order of unknowns in the angular flux: Direction, Group, and Zone. Available layouts are DGZ, DZG, GDZ, GZD, ZDG, and ZGD. The order is specified left-to-right in longest-to-shortest stride. For example: DGZ means that Directions are the longest stride, and Zones are stride-1. (Default: --nest DGZ) | ||
|
||
|
||
###Parallel Decomposition Options: | ||
### Parallel Decomposition Options: | ||
|
||
* **``--pdist <lout>``** | ||
|
||
|
@@ -276,7 +251,7 @@ Command line option help can also be viewed by running "./kripke --help" | |
Number of zone-sets in x, y, and z. (Default: --zset 1:1:1) | ||
|
||
|
||
###Solver Options: | ||
### Solver Options: | ||
|
||
* **``--niter <NITER>``** | ||
|
||
|
@@ -302,12 +277,6 @@ Some ideas for future study: | |
|
||
|
||
|
||
Retirement | ||
========== | ||
|
||
Retirement of this Mini-App should be considered when it is no longer a representative of state-of-the-art transport codes, or when it becomes too cumbersome to adapt to advanced architectures. Also, at the point of retirement it should be clear how to design its successor. | ||
|
||
|
||
Links | ||
===== | ||
|
||
|
@@ -316,4 +285,20 @@ Links | |
|
||
Release | ||
======= | ||
LLNL-CODE-658597 | ||
|
||
Copyright (c) 2014-2019, Lawrence Livermore National Security, LLC. | ||
|
||
Produced at the Lawrence Livermore National Laboratory. | ||
|
||
All rights reserved. | ||
|
||
`LLNL-CODE-775068` | ||
|
||
Unlimited Open Source - BSD Distribution | ||
|
||
For release details and restrictions, please read the COPYRIGHT, LICENSE, | ||
and NOTICE files, also linked here: | ||
- [RELEASE](./RELEASE) | ||
- [COPYRIGHT](./COPYRIGHT) | ||
- [LICENSE](./LICENSE) | ||
- [NOTICE](./NOTICE) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
******************************************************************************* | ||
|
||
Kripke: ................................, version 1.2.4 | ||
|
||
Copyright (c) 2014-19, Lawrence Livermore National Security, LLC. | ||
Produced at the Lawrence Livermore National Laboratory. | ||
All rights reserved. See details in the LICENSE file. | ||
|
||
Unlimited Open Source - BSD Distribution | ||
LLNL-CODE-775068 | ||
|
||
The Kripke development team and contributors are: | ||
|
||
Adam J. Kunen ([email protected]) | ||
Peter N. Brown ([email protected]) | ||
Teresa S. Bailey ([email protected]) | ||
Peter G. Maginot ([email protected]) | ||
|
Oops, something went wrong.