A new parallelisation technique for heterogeneous CPUs

Gdura, Youssef Omran (2012) A new parallelisation technique for heterogeneous CPUs. PhD thesis, University of Glasgow.

Full text available as:
[img]
Preview
PDF
Download (6MB) | Preview

Abstract

Parallelization has moved in recent years into the mainstream compilers, and the demand
for parallelizing tools that can do a better job of automatic parallelization is higher than
ever. During the last decade considerable attention has been focused on developing programming
tools that support both explicit and implicit parallelism to keep up with the
power of the new multiple core technology. Yet the success to develop automatic parallelising
compilers has been limited mainly due to the complexity of the analytic process
required to exploit available parallelism and manage other parallelisation measures such
as data partitioning, alignment and synchronization.
This dissertation investigates developing a programming tool that automatically parallelises
large data structures on a heterogeneous architecture and whether a high-level programming
language compiler can use this tool to exploit implicit parallelism and make use
of the performance potential of the modern multicore technology. The work involved the
development of a fully automatic parallelisation tool, called VSM, that completely hides
the underlying details of general purpose heterogeneous architectures. The VSM implementation
provides direct and simple access for users to parallelise array operations on the
Cell’s accelerators without the need for any annotations or process directives. This work
also involved the extension of the Glasgow Vector Pascal compiler to work with the VSM
implementation as a one compiler system. The developed compiler system, which is called
VP-Cell, takes a single source code and parallelises array expressions automatically.
Several experiments were conducted using Vector Pascal benchmarks to show the validity
of the VSM approach. The VP-Cell system achieved significant runtime performance
on one accelerator as compared to the master processor’s performance and near-linear
speedups over code runs on the Cell’s accelerators. Though VSM was mainly designed for
developing parallelising compilers it also showed a considerable performance by running
C code over the Cell’s accelerators.

Item Type: Thesis (PhD)
Qualification Level: Doctoral
Keywords: parallel programming tools, heterogeneous multi-core archirecture, virtual machine, parallelising array operations, implicit model, parallelising compilers
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Colleges/Schools: College of Science and Engineering > School of Computing Science
Supervisor's Name: Cockshott, Dr. W.P.
Date of Award: 2012
Depositing User: Mr Youssef Omran Gdura
Unique ID: glathesis:2012-3406
Copyright: Copyright of this thesis is held by the author.
Date Deposited: 01 Jun 2012
Last Modified: 10 Dec 2012 14:06
URI: http://theses.gla.ac.uk/id/eprint/3406

Actions (login required)

View Item View Item