Regional Information Center for Science and Technology - CPGAN: An Efficient Architecture Designing for Text-to-Image Generative Adversarial Networks Based on Canonical Polyadic Decomposition

Title of article :

CPGAN: An Efficient Architecture Designing for Text-to-Image Generative Adversarial Networks Based on Canonical Polyadic Decomposition

Author/Authors :

Ma, Ruixin (School of Software, Dalian University of Technology, China); Lou, Junying (School of Software, Dalian University of Technology, China)

Abstract :

Text-to-image synthesis is an important and challenging application of computer vision. Many interesting and meaningful text-to-image synthesis models have been proposed. However, most of this work focuses on the quality of the synthesized images and rarely considers model size. Large models have many parameters and high latency, which makes them difficult to deploy in mobile applications. To solve this problem, we propose CPGAN, an efficient architecture for text-to-image generative adversarial networks (GANs) based on canonical polyadic decomposition (CPD). It is a general method for designing lightweight text-to-image GAN architectures. To improve the stability of CPGAN, we introduce conditioning augmentation and the idea of an autoencoder during training. Experimental results show that CPGAN maintains the quality of generated images while reducing parameters and FLOPs by at least 20%.
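
To give a rough sense of why a CP factorization can shrink a network, the sketch below counts the weights in a standard k x k convolution and in a generic rank-R CP form of the same 4-way kernel (a 1x1 convolution, two one-dimensional depthwise convolutions, and another 1x1 convolution). The abstract does not say which layers are decomposed or which ranks are used, so the function names, layer sizes, and rank here are illustrative assumptions and not the authors' configuration.

```python
def conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias ignored)."""
    return c_in * c_out * k * k


def cp_conv_params(c_in, c_out, k, rank):
    """Weight count of a rank-`rank` CP form of the same kernel.

    The 4-way kernel K[o, i, y, x] is approximated by
    sum_r A[o, r] * B[i, r] * C[y, r] * D[x, r],
    so the factors hold rank * (c_out + c_in + k + k) weights in total.
    """
    return rank * (c_in + c_out + k + k)


def relative_saving(c_in, c_out, k, rank):
    """Fraction of weights removed by the CP form (can be negative for large ranks)."""
    full = conv_params(c_in, c_out, k)
    return 1.0 - cp_conv_params(c_in, c_out, k, rank) / full


if __name__ == "__main__":
    # Hypothetical layer: 256 -> 256 channels, 3 x 3 kernel, rank 64.
    full = conv_params(256, 256, 3)
    low_rank = cp_conv_params(256, 256, 3, 64)
    print(full, low_rank, round(relative_saving(256, 256, 3, 64), 3))
```

The saving depends strongly on the chosen rank: a rank that is too high removes the benefit, while a rank that is too low may hurt image quality, which is presumably why the abstract reports a conservative figure of at least 20% for the whole model.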

Keywords :

CPGAN, Canonical Polyadic Decomposition, Adversarial Networks

Journal title :

Scientific Programming

Serial Year :

2021

Full Text URL :

downloads.hindawi.com/journals/sp/2021/5573751.pdf

Record number :

2612947

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2612947