Harmonic-Summing Module of SKA on FPGA—Optimizing the Irregular Memory Accesses

Wang, Haomiao; Prabu, T.; Sinnen, Oliver

Please use this identifier to cite or link to this item: http://hdl.handle.net/2289/7352

Full metadata record

DC Field	Value	Language
dc.contributor.author	Wang, Haomiao	-
dc.contributor.author	Prabu, T.	-
dc.contributor.author	Sinnen, Oliver	-
dc.date.accessioned	2019-11-18T05:57:34Z	-
dc.date.available	2019-11-18T05:57:34Z	-
dc.date.issued	2018-03	-
dc.identifier.citation	IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2018, Vol.27, p624-636	en_US
dc.identifier.issn	1063-8210	-
dc.identifier.issn	1557-9999(Online)	-
dc.identifier.uri	http://hdl.handle.net/2289/7352	-
dc.description	Restricted Access	en_US
dc.description.abstract	The Square Kilometer Array, which will be the world's largest radio telescope, will enhance and boost a large number of science projects, including the search for pulsars. The frequency-domain acceleration search is an efficient approach to searching for binary pulsars. A significant part of it is the harmonic-summing module, which is the research subject of this paper. Most of the operations in the harmonic-summing module are relatively cheap for field-programmable gate arrays (FPGAs). The main challenge is the large number of point accesses to off-chip memory, which are not consecutive but irregular. Performing the harmonic summing on the FPGA avoids off-board communication with other pulsar search modules, which could otherwise negate other acceleration benefits. Two types of harmonic-summing approaches are investigated in this paper: (1) storing intermediate data in off-chip memory and (2) processing the input signals directly without storing. For the second type, two approaches to caching data are proposed and evaluated: (1) preloading points that are frequently touched and (2) preloading all necessary points that are used to generate a chunk of output points. Open Computing Language (OpenCL) is adopted to implement the proposed approaches. In an extensive experimental evaluation, the same OpenCL kernel codes are evaluated on FPGA boards and GPU cards. Among the proposed preloading methods, preloading all necessary points while reordering the input signals is faster than all the other methods. In raw performance, however, a single FPGA board cannot compete with a GPU. Regarding energy dissipation, the GPU consumes up to 2.6 times more energy than the FPGAs when executing the same NDRange kernels.	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.relation.uri	https://doi.org/10.1109/TVLSI.2018.2882238	en_US
dc.rights	2018, IEEE	en_US
dc.subject	Field-programmable gate arrays (FPGAs)	en_US
dc.subject	harmonic summing	en_US
dc.subject	irregular memory access optimization	en_US
dc.subject	Open Computing Language (OpenCL)	en_US
dc.title	Harmonic-Summing Module of SKA on FPGA—Optimizing the Irregular Memory Accesses	en_US
dc.type	Article	en_US
Appears in Collections:	Research Papers (A&A)

Files in This Item:

File	Description	Size	Format
2019-IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION-Vol.27 p 624-636.pdf	Restricted Access	3.02 MB	Adobe PDF
