The developerWorks Connections platform will be sunset on January 2, 2020. This blog will no longer be available unless an extension has been requested. More details available on our FAQ.
Blogs
Blogs
Matching:
parallel_computing
X
Display: 
Assist Threads Part 1 Community Blog
stan_kvasov
Tags:
parallel
parallel_computing
parallel_performance
cppcafe
concurrency
3 Comments
10,909 Views
Today’s multicore processors support many threads of execution and can provide substantial performance when running multithreaded applications. Unfortunately, multithreaded programming is difficult, and as a result, a lot of today’s software is still...
from Blog: C/C++ compilers for IBM Z Blog


Implementing a Scalable Parallel Reduction in Unified Parallel C Community Blog
NancyWang
Tags:
upc
reduction
upc_forall
parallel_performance
parallel
upc_programming
cppcafe
parallel_computing
10,617 Views
A reduction is the process of combining elements of a vector (or array) to yield a single aggregate element. It is commonly used in scientific computations. For instance the inner product of two ndimensional vectors x, y is given by: This computation requires...
from Blog: C/C++ compilers for IBM Z Blog


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 3) Community Blog
NancyWang
Tags:
cppcafe
parallel_performance
parallel_computing
upc_forall
upc
parallel
reduction
parallel_programming
8,169 Views
continue from the second parallel reduction blog . To get better scalability (increased program performance as the number of threads increases), it is critical to remove the lock in the upc_forall loop. This can be done by accumulating the partial sum...
from Blog: C/C++ compilers for IBM Z Blog


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 2) Community Blog
NancyWang
Tags:
cppcafe
parallel_performance
upc_forall
parallel_programming
upc
reduction
parallel
parallel_computing
1 Comment
10,297 Views
continue from the previous parallel reduction blog The result is obvious wrong, but what is the problem? The keen reader might point out that the program as written contains a race condition. Multiple threads can write into shared variable "sum" concurrently,...
from Blog: C/C++ compilers for IBM Z Blog
