Blogs with Tags: parallel_computing
Assist Threads Part 1 Community Blog
stan_kvasov
Tags:
parallel_performance
parallel
cppcafe
concurrency
parallel_computing
3 Comments
2,753 Visits
Today’s multicore processors support many threads of execution and can deliver substantial performance gains when running multithreaded applications. Unfortunately, multithreaded programming is difficult, and as a result, a lot of today’s software is still...
from Blog: C/C++ Cafe


Implementing a Scalable Parallel Reduction in Unified Parallel C Community Blog
NancyWang
Tags:
cppcafe
upc_programming
parallel
upc_forall
parallel_performance
reduction
parallel_computing
upc
2,917 Visits
A reduction is the process of combining the elements of a vector (or array) to yield a single aggregate element. It is commonly used in scientific computations. For instance, the inner product of two n-dimensional vectors x and y is given by: x · y = Σ_{i=1}^{n} x_i y_i. This computation requires...
from Blog: C/C++ Cafe
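
The reduction described in the excerpt above can be illustrated with a minimal serial sketch in plain C. This is not the post's UPC code; the function name `inner_product` and its signature are assumptions chosen for illustration only.

```c
#include <stddef.h>

/* Serial inner product: combines n element-wise products into a
   single aggregate value. Name and signature are illustrative. */
double inner_product(const double *x, const double *y, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += x[i] * y[i];   /* accumulate one term of the reduction */
    return sum;
}
```

Calling `inner_product` on the vectors (1, 2, 3) and (4, 5, 6) sums the products 4, 10 and 18. A parallel version has to split this loop across threads and then combine their contributions safely.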


Implementing a Scalable Parallel Reduction in Unified Parallel C
NancyWang
Tags:
parallel_performance
reduction
upc_programming
upc
parallel_computing
parallel
upc_forall
1,372 Visits
A reduction is the process of combining the elements of a vector (or array) to yield a single aggregate element. It is commonly used in scientific computations. For instance, the inner product of two n-dimensional vectors x and y is given by: x · y = Σ_{i=1}^{n} x_i y_i. This computation requires...
from Blog: Parallel and MultiCore Computing with C/C++


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 3) Community Blog
NancyWang
Tags:
parallel
reduction
upc
upc_forall
cppcafe
parallel_computing
parallel_performance
parallel_programming
2,162 Visits
Continued from the second parallel reduction blog post. To get better scalability (increased program performance as the number of threads increases), it is critical to remove the lock in the upc_forall loop. This can be done by accumulating the partial sum...
from Blog: C/C++ Cafe
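
The lock-free idea in the excerpt above, where each thread accumulates its own partial sum, can be sketched roughly as follows. The post itself uses UPC and `upc_forall`; this sketch swaps in POSIX threads in plain C, and every name here (`NTHREADS`, `chunk_t`, `dot_chunk`, `dot_partial_sums`) is a hypothetical chosen for illustration.

```c
#include <pthread.h>
#include <stddef.h>

#define NTHREADS 4   /* hypothetical fixed thread count */

typedef struct {
    const double *x, *y;
    size_t begin, end;   /* half-open index range owned by one thread */
    double partial;      /* private result, written only by its owner */
} chunk_t;

static void *dot_chunk(void *arg)
{
    chunk_t *c = arg;
    double s = 0.0;
    for (size_t i = c->begin; i < c->end; i++)
        s += c->x[i] * c->y[i];   /* no lock: s is thread-local */
    c->partial = s;
    return NULL;
}

double dot_partial_sums(const double *x, const double *y, size_t n)
{
    pthread_t tid[NTHREADS];
    int started[NTHREADS];
    chunk_t chunk[NTHREADS];

    for (size_t t = 0; t < NTHREADS; t++) {
        chunk[t] = (chunk_t){ x, y, n * t / NTHREADS,
                              n * (t + 1) / NTHREADS, 0.0 };
        started[t] = pthread_create(&tid[t], NULL, dot_chunk, &chunk[t]) == 0;
        if (!started[t])
            dot_chunk(&chunk[t]);   /* fall back to computing inline */
    }

    double sum = 0.0;
    for (size_t t = 0; t < NTHREADS; t++) {
        if (started[t])
            pthread_join(tid[t], NULL);
        sum += chunk[t].partial;    /* combine after the worker is done */
    }
    return sum;
}
```

Because each worker writes only to its own `partial` field, the inner loop needs no synchronization. The only coordination left is the join before the final, short combining loop.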


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 3)
NancyWang
Tags:
upc_forall
parallel
upc
reduction
parallel_performance
parallel_computing
parallel_programming
1,200 Visits
Continued from the second parallel reduction blog post. To get better scalability (increased program performance as the number of threads increases), it is critical to remove the lock in the upc_forall loop. This can be done by accumulating the partial sum...
from Blog: Parallel and MultiCore Computing with C/C++


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 2) Community Blog
NancyWang
Tags:
parallel_computing
reduction
cppcafe
upc
parallel_programming
upc_forall
parallel
parallel_performance
1 Comment
2,493 Visits
Continued from the previous parallel reduction blog post. The result is obviously wrong, but what is the problem? The keen reader might point out that the program as written contains a race condition. Multiple threads can write into the shared variable "sum" concurrently,...
from Blog: C/C++ Cafe
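
The race on the shared variable "sum" mentioned in the excerpt above, and the usual lock-based remedy, can be sketched as follows. This is an assumption-laden illustration using POSIX threads in plain C rather than the post's UPC code; `NWORKERS`, `range_t`, `add_range_locked` and `dot_locked` are hypothetical names.

```c
#include <pthread.h>
#include <stddef.h>

#define NWORKERS 4   /* hypothetical fixed thread count */

static double sum;   /* shared accumulator, updated by every worker */
static pthread_mutex_t sum_lock = PTHREAD_MUTEX_INITIALIZER;

typedef struct {
    const double *x, *y;
    size_t begin, end;   /* half-open index range owned by one thread */
} range_t;

static void *add_range_locked(void *arg)
{
    const range_t *r = arg;
    for (size_t i = r->begin; i < r->end; i++) {
        /* Without the lock, two threads could read the same old value
           of sum and one of the updates would be lost. */
        pthread_mutex_lock(&sum_lock);
        sum += r->x[i] * r->y[i];
        pthread_mutex_unlock(&sum_lock);
    }
    return NULL;
}

double dot_locked(const double *x, const double *y, size_t n)
{
    pthread_t tid[NWORKERS];
    int started[NWORKERS];
    range_t range[NWORKERS];

    sum = 0.0;
    for (size_t t = 0; t < NWORKERS; t++) {
        range[t] = (range_t){ x, y, n * t / NWORKERS,
                              n * (t + 1) / NWORKERS };
        started[t] = pthread_create(&tid[t], NULL,
                                    add_range_locked, &range[t]) == 0;
        if (!started[t])
            add_range_locked(&range[t]);   /* fall back to running inline */
    }
    for (size_t t = 0; t < NWORKERS; t++)
        if (started[t])
            pthread_join(tid[t], NULL);
    return sum;   /* safe to read: all workers have finished */
}
```

The lock makes the result correct, but every iteration now serializes on `sum_lock`, so adding threads brings little speedup.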


Implementing a Scalable Parallel Reduction in Unified Parallel C (part 2)
NancyWang
Tags:
parallel_programming
upc_forall
reduction
parallel_performance
parallel_computing
parallel
upc
1,229 Visits
Continued from the previous parallel reduction blog post. The result is obviously wrong, but what is the problem? The keen reader might point out that the program as written contains a race condition. Multiple threads can write into the shared variable "sum" concurrently,...
from Blog: Parallel and MultiCore Computing with C/C++
