The developerWorks Connections platform will be sunset on December 31, 2019. On January 1, 2020, this blog will no longer be available. More details available on our FAQ.
Blogs
Blogs
Matching:
upc_forall
X
Display: 
Implementing a Scalable Parallel Reduction in Unified Parallel C Community Blog
NancyWang
Tags:
upc
reduction
upc_forall
parallel_performance
parallel
upc_programming
cppcafe
parallel_computing
10,584 Views
A reduction is the process of combining elements of a vector (or array) to yield a single aggregate element. It is commonly used in scientific computations. For instance the inner product of two ndimensional vectors x, y is given by: This computation requires...
from Blog: C/C++ compilers for IBM Z Blog


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 3) Community Blog
NancyWang
Tags:
cppcafe
parallel_performance
parallel_computing
upc_forall
upc
parallel
reduction
parallel_programming
8,161 Views
continue from the second parallel reduction blog . To get better scalability (increased program performance as the number of threads increases), it is critical to remove the lock in the upc_forall loop. This can be done by accumulating the partial sum...
from Blog: C/C++ compilers for IBM Z Blog


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 2) Community Blog
NancyWang
Tags:
cppcafe
parallel_performance
upc_forall
parallel_programming
upc
reduction
parallel
parallel_computing
1 Comment
10,274 Views
continue from the previous parallel reduction blog The result is obvious wrong, but what is the problem? The keen reader might point out that the program as written contains a race condition. Multiple threads can write into shared variable "sum" concurrently,...
from Blog: C/C++ compilers for IBM Z Blog
