WIP: Add duplicated memory to force_lj #21

stanmoore1 · 2017-12-08T16:33:31Z

Don't merge yet. This is an example of using the new duplicated memory feature in Kokkos as an alternative to thread atomics, see kokkos/kokkos#1225 and kokkos/kokkos#825. For OpenMP or PThreads it uses a duplicated non-atomic view, for CUDA it still uses a non-duplicated atomic view, and for Serial it uses a non-duplicated, non-atomic view. On my Linux box with OpenMP it does give speedup over atomics. The API/naming may change a bit before it is formally released into Kokkos.

@crtrott

stanmoore1 · 2017-12-08T21:16:02Z

Here is some performance data for ExaMiniMD on a 4 core Linux box
LJ benchmark, 256,000 atoms, 1 MPI x 4 OpenMP threads

Method	Performance (atom-steps/s)
thread atomic	4.86e+05
work duplication (full neigh list)	6.15e+05
data duplication	7.98e+05
data duplication, persistent memory	7.99e+05

sslattery · 2018-05-04T21:35:38Z

@stanmoore1 any comments on differences in memory usage for OpenMP and Pthreads vs. using atomics?

stanmoore1 · 2018-05-04T21:40:49Z

There is some memory overhead because the force array is duplicated. The force array is the second largest data structure, after the neighbor list, however typically each atom has many neighbors, so the neighbor list is much larger than the force array, and typically we only duplicate 8 or less times.

stanmoore1 · 2018-05-04T21:41:44Z

Also the numbers for data duplication may be a little better because we fixed this bug: #22.

janciesko · 2024-06-12T22:02:57Z

What's the status of this?

Add duplicated memory to force_lj

1955d31

stanmoore1 added the enhancement label Dec 8, 2017

stanmoore1 self-assigned this Dec 8, 2017

Rename ReductionView to ScatterView

a77a1b9

stanmoore1 mentioned this pull request Jan 5, 2018

Add LJ variant for HalfNeighbor list which uses data replication #8

Open

Merge branch 'bugfix' into duplicated_f

652c168

Merge branch 'master' into duplicated_f

c02d8ae

stanmoore1 mentioned this pull request May 8, 2019

User ScatterView ECP-copa/Cabana#124

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP: Add duplicated memory to force_lj #21

WIP: Add duplicated memory to force_lj #21

Uh oh!

stanmoore1 commented Dec 8, 2017 •

edited

Loading

Uh oh!

stanmoore1 commented Dec 8, 2017

Uh oh!

sslattery commented May 4, 2018

Uh oh!

stanmoore1 commented May 4, 2018

Uh oh!

stanmoore1 commented May 4, 2018

Uh oh!

janciesko commented Jun 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

WIP: Add duplicated memory to force_lj #21

Are you sure you want to change the base?

WIP: Add duplicated memory to force_lj #21

Uh oh!

Conversation

stanmoore1 commented Dec 8, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stanmoore1 commented Dec 8, 2017

Uh oh!

sslattery commented May 4, 2018

Uh oh!

stanmoore1 commented May 4, 2018

Uh oh!

stanmoore1 commented May 4, 2018

Uh oh!

janciesko commented Jun 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

stanmoore1 commented Dec 8, 2017 •

edited

Loading