
dc.contributor.author: Shanker, Shreejith
dc.date.accessioned: 2023-08-22T09:00:12Z
dc.date.available: 2023-08-22T09:00:12Z
dc.date.created: August, 2023
dc.date.issued: 2023
dc.date.submitted: 2023
dc.identifier.citation: Emmet Murphy, Shashwat Khandelwal, Shanker Shreejith, "Custom precision accelerators for energy-efficient image-to-image transformations in motion picture workflows", Applications of Digital Image Processing XLV, San Diego, USA, August 2023, SPIE
dc.identifier.other: Y
dc.description: PUBLISHED
dc.description: San Diego, USA
dc.description.abstract: Image-to-Image (I2I) transformations have been an integral part of video processing workflows, with applications in image synthesis for virtual productions, segmentation, and matting, among others. Over the years, deep learning-based approaches have enabled new methods and tools for automating parts of the processing pipeline, reducing the human effort involved in post-production workflows. These compute-intensive models are often accelerated through on-premise or in-cloud GPU instances to improve responsiveness and latency, while expending large amounts of energy on these complex transformations. In this work, we present an approach for optimising the energy efficiency of I2I deep-learning models using quantised neural networks accelerated on a server-style FPGA. We use deep learning-based alpha background matting as the I2I application, implemented as a U-Net conditional Generative Adversarial Network. The model is trained and quantised using the Vitis-AI flow from AMD/Xilinx and deployed on a data centre class Alveo U50 FPGA device. Our results show that the quantised model on the FPGA achieves 1.14× higher inference throughput while consuming 11× less energy per inference than a GPU-accelerated version of the model on a 3080-Ti, while generating nearly identical results with an average IoU > 0.95 across multiple user images at 1080p and 4K resolutions. Additionally, offloads to the FPGA device can be seamlessly integrated into widely used motion picture tools like NUKE with minimal effort. With most cloud providers integrating heterogeneous platforms (including FPGAs) into their systems, we envision that this work paves the way for more efficient utilisation of custom precision deep-learning models and FPGA acceleration in deep learning-based motion picture workflows.
dc.language.iso: en
dc.publisher: SPIE
dc.rights: Y
dc.subject: Quantised Deep Learning
dc.subject: Image to Image Transformations
dc.subject: U-Net
dc.title: Custom precision accelerators for energy-efficient image-to-image transformations in motion picture workflows
dc.title.alternative: Applications of Digital Image Processing XLV.
dc.type: Conference Paper
dc.type.supercollection: scholarly_publications
dc.type.supercollection: refereed_publications
dc.identifier.peoplefinderurl: http://people.tcd.ie/shankers
dc.identifier.rssinternalid: 257819
dc.rights.ecaccessrights: openAccess
dc.subject.TCDTheme: Creative Technologies
dc.subject.TCDTheme: Making Ireland
dc.subject.TCDTheme: Smart & Sustainable Planet
dc.subject.TCDTag: Field Programmable Gate Arrays (FPGAs)
dc.subject.TCDTag: Image Processing
dc.subject.TCDTag: MACHINE LEARNING
dc.subject.TCDTag: Reconfigurable Computing
dc.subject.TCDTag: VHDL, FPGA, DIGITAL DESIGN
dc.subject.TCDTag: VIDEO PROCESSING
dc.identifier.orcid_id: 0000-0002-9717-1804
dc.status.accessible: N
dc.identifier.uri: http://hdl.handle.net/2262/103756
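
Note: the train-quantise-deploy flow named in the abstract can be illustrated with a minimal sketch, assuming the standard Vitis-AI PyTorch quantiser (pytorch_nndct). The stand-in network and single-frame calibration below are hypothetical placeholders, not the paper's actual U-Net cGAN generator or calibration set.

import torch
import torch.nn as nn
from pytorch_nndct.apis import torch_quantizer  # Vitis-AI PyTorch quantiser

# Hypothetical stand-in for the paper's U-Net cGAN generator.
model = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 1, 3, padding=1),
).eval()
dummy = torch.randn(1, 3, 1080, 1920)  # one 1080p RGB frame

# Pass 1 ("calib"): run representative inputs through the wrapped model so
# the quantiser can derive fixed-point scales, then export the quant config.
quantizer = torch_quantizer("calib", model, (dummy,), output_dir="quant_out")
quantizer.quant_model(dummy)  # the real flow loops over a calibration set
quantizer.export_quant_config()

# Pass 2 ("test"): evaluate the quantised model and emit an .xmodel, which
# the Vitis-AI compiler then targets at the Alveo U50 DPU overlay.
quantizer = torch_quantizer("test", model, (dummy,), output_dir="quant_out")
quantizer.quant_model(dummy)
quantizer.export_xmodel(output_dir="quant_out")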

