dc.contributor.author | Papaphilippou, Philippos
dc.date.accessioned | 2023-11-10T15:12:15Z
dc.date.available | 2023-11-10T15:12:15Z
dc.date.created | 2023 | en
dc.date.issued | 2023
dc.date.submitted | 2023 | en
dc.identifier.citation | Philippos Papaphilippou, Zhiqiang Que, Wayne Luk, Efficiently Removing Sparsity for High-Throughput Stream Processing, The International Conference on Field-Programmable Technology (FPT) 2023, Yokohama, Japan, 2023 | en
dc.identifier.other | Y
dc.description.abstract | Big data analytics and machine learning are increasingly targeted by FPGAs due to their significant amount of computing capabilities and internal parallelism. Different programming models are used to distribute the workload to the internals of the FPGAs at different granularities. While the memory bandwidth has been steadily increasing, there are some challenges in the way system-on-chips use this bandwidth. One way system-on-chip architects exploit the increasing memory bandwidth is by widening the datapath width. This is reflected at various points in the system including the widening of vector instructions. On FPGAs, many analytics accelerators are memory-bound, and would benefit from making the most of the available bandwidth. In this paper we present a scalable and highly-efficient building block for building high-throughput streaming accelerators, which removes sparsity on-the-fly without backpressure. | en
dc.language.iso | en | en
dc.rights | Y | en
dc.subject | Prefix scan | en
dc.subject | Interconnects | en
dc.subject | FPGA | en
dc.subject | Stream compaction | en
dc.subject | Aggregation | en
dc.subject | High-throughput computation | en
dc.subject | Analytics | en
dc.title | Efficiently Removing Sparsity for High-Throughput Stream Processing | en
dc.title.alternative | The International Conference on Field-Programmable Technology (FPT) 2023 | en
dc.type | Conference Paper | en
dc.type.supercollection | scholarly_publications | en
dc.type.supercollection | refereed_publications | en
dc.identifier.peoplefinderurl | http://people.tcd.ie/papaphip
dc.identifier.rssinternalid | 260091
dc.rights.ecaccessrights | openAccess
dc.subject.TCDTag | Computer Architecture | en
dc.subject.TCDTag | Computer Engineering | en
dc.subject.TCDTag | Computer Science | en
dc.subject.TCDTag | Parallel Computer Architecture | en
dc.subject.TCDTag | Parallel Programming | en
dc.subject.TCDTag | Parallel Systems | en
dc.identifier.orcid_id | 0000-0002-7452-7150
dc.status.accessible | N | en
dc.identifier.uri | http://hdl.handle.net/2262/104146

