A vector floating point processing unit design

Thesis (M.Eng.)--Memorial University of Newfoundland, 2008. Engineering and Applied Science Includes bibliographical references (leaves 101-104) The main contribution of this thesis is the successful development of a vector floating point processing unit for high accuracy science computing. For thes...

Full description

Bibliographic Details
Main Author: Chen, Shi, 1976-
Other Authors: Memorial University of Newfoundland. Faculty of Engineering and Applied Science
Format: Thesis
Language:English
Published: 2008
Subjects:
Online Access:http://collections.mun.ca/cdm/ref/collection/theses4/id/29765
id ftmemorialunivdc:oai:collections.mun.ca:theses4/29765
record_format openpolar
spelling ftmemorialunivdc:oai:collections.mun.ca:theses4/29765 2023-05-15T17:23:33+02:00 A vector floating point processing unit design Chen, Shi, 1976- Memorial University of Newfoundland. Faculty of Engineering and Applied Science 2008 xii, 104 leaves : ill. Image/jpeg; Application/pdf http://collections.mun.ca/cdm/ref/collection/theses4/id/29765 Eng eng Electronic Theses and Dissertations (9.88 MB) -- http://collections.mun.ca/PDFs/theses/Chen_Shi.pdf a2542352 http://collections.mun.ca/cdm/ref/collection/theses4/id/29765 The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission. Paper copy kept in the Centre for Newfoundland Studies, Memorial University Libraries Field programmable gate arrays Floating-point arithmetic Vector processing (Computer science) Text Electronic thesis or dissertation 2008 ftmemorialunivdc 2015-08-06T19:21:53Z Thesis (M.Eng.)--Memorial University of Newfoundland, 2008. Engineering and Applied Science Includes bibliographical references (leaves 101-104) The main contribution of this thesis is the successful development of a vector floating point processing unit for high accuracy science computing. For these numerically-intensive applications, vector processing offers simple and straightforward parallelism by executing mathematical operations on multiple data elements simultaneously. The simple control and datapath structures of vector processing enable the embedded computing system to attain high performance at low power. -- This vector floating point processing unit includes: a vector register file, vector floating point arithmetic units, and vector memory units. The central module, a vector register file, is divided into twelve lanes. One lane contains 16 vector registers, each including 32x32-bit elements, and is connected to a floating point adder and a floating point multiplier. By modeling the multi-port register file using configurable block RAM on Field Programmable Gate Arrays (FPGA) target, vector register files can efficiently obtain data from external memory and feed data to different arithmetic units simultaneously. Utilizing the quick carry out path and embedded multiplier macro unit, the vector floating point arithmetic units can run at over 200 MHz. A flag register is used to indicate the calculation sequence for the specific computing model. Moreover, the embedded Power PC processor not only can easily control the calculation flow, but also can support an embedded operating system to extend a broad range of applications. The prototype is implemented on Xilinx Virtex II Pro devices, and a peak performance of 4.530 GFLOPS at 188.768 MHz has been achieved. -- First, we present a brief introduction to the floating point arithmetic operations, including addition, multiplication, and multiplier-adder-fused. Second, the architecture of the vector processing unit and a detailed description of vector function units are introduced. Moreover, for a specific computing application, the appropriate overlap execution scheme is discussed. In the end, the performance of each component is analyzed, and the time and area analysis of whole system is provided. Thesis Newfoundland studies University of Newfoundland Memorial University of Newfoundland: Digital Archives Initiative (DAI) Lanes ENVELOPE(18.933,18.933,69.617,69.617)
institution Open Polar
collection Memorial University of Newfoundland: Digital Archives Initiative (DAI)
op_collection_id ftmemorialunivdc
language English
topic Field programmable gate arrays
Floating-point arithmetic
Vector processing (Computer science)
spellingShingle Field programmable gate arrays
Floating-point arithmetic
Vector processing (Computer science)
Chen, Shi, 1976-
A vector floating point processing unit design
topic_facet Field programmable gate arrays
Floating-point arithmetic
Vector processing (Computer science)
description Thesis (M.Eng.)--Memorial University of Newfoundland, 2008. Engineering and Applied Science Includes bibliographical references (leaves 101-104) The main contribution of this thesis is the successful development of a vector floating point processing unit for high accuracy science computing. For these numerically-intensive applications, vector processing offers simple and straightforward parallelism by executing mathematical operations on multiple data elements simultaneously. The simple control and datapath structures of vector processing enable the embedded computing system to attain high performance at low power. -- This vector floating point processing unit includes: a vector register file, vector floating point arithmetic units, and vector memory units. The central module, a vector register file, is divided into twelve lanes. One lane contains 16 vector registers, each including 32x32-bit elements, and is connected to a floating point adder and a floating point multiplier. By modeling the multi-port register file using configurable block RAM on Field Programmable Gate Arrays (FPGA) target, vector register files can efficiently obtain data from external memory and feed data to different arithmetic units simultaneously. Utilizing the quick carry out path and embedded multiplier macro unit, the vector floating point arithmetic units can run at over 200 MHz. A flag register is used to indicate the calculation sequence for the specific computing model. Moreover, the embedded Power PC processor not only can easily control the calculation flow, but also can support an embedded operating system to extend a broad range of applications. The prototype is implemented on Xilinx Virtex II Pro devices, and a peak performance of 4.530 GFLOPS at 188.768 MHz has been achieved. -- First, we present a brief introduction to the floating point arithmetic operations, including addition, multiplication, and multiplier-adder-fused. Second, the architecture of the vector processing unit and a detailed description of vector function units are introduced. Moreover, for a specific computing application, the appropriate overlap execution scheme is discussed. In the end, the performance of each component is analyzed, and the time and area analysis of whole system is provided.
author2 Memorial University of Newfoundland. Faculty of Engineering and Applied Science
format Thesis
author Chen, Shi, 1976-
author_facet Chen, Shi, 1976-
author_sort Chen, Shi, 1976-
title A vector floating point processing unit design
title_short A vector floating point processing unit design
title_full A vector floating point processing unit design
title_fullStr A vector floating point processing unit design
title_full_unstemmed A vector floating point processing unit design
title_sort vector floating point processing unit design
publishDate 2008
url http://collections.mun.ca/cdm/ref/collection/theses4/id/29765
long_lat ENVELOPE(18.933,18.933,69.617,69.617)
geographic Lanes
geographic_facet Lanes
genre Newfoundland studies
University of Newfoundland
genre_facet Newfoundland studies
University of Newfoundland
op_source Paper copy kept in the Centre for Newfoundland Studies, Memorial University Libraries
op_relation Electronic Theses and Dissertations
(9.88 MB) -- http://collections.mun.ca/PDFs/theses/Chen_Shi.pdf
a2542352
http://collections.mun.ca/cdm/ref/collection/theses4/id/29765
op_rights The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.
_version_ 1766113222154256384