Layout and Context Understanding for Image Synthesis with Scene Graphs

碩士 === 國立臺灣科技大學 === 資訊工程系 === 107 === Advancements on text-to-image synthesis generate remarkable images from textual descriptions. However, these methods are designed to generate only one object with varying attributes. They face difficulties with complex descriptions having multiple arbitrary obje...

Full description

Bibliographic Details
Main Author: Arces A. Talavera
Other Authors: Kai-Lung Hua
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/z3f7su
id ndltd-TW-107NTUS5392016
record_format oai_dc
spelling ndltd-TW-107NTUS53920162019-05-16T01:40:46Z http://ndltd.ncl.edu.tw/handle/z3f7su Layout and Context Understanding for Image Synthesis with Scene Graphs Layout and Context Understanding for Image Synthesis with Scene Graphs Arces A. Talavera Arces A. Talavera 碩士 國立臺灣科技大學 資訊工程系 107 Advancements on text-to-image synthesis generate remarkable images from textual descriptions. However, these methods are designed to generate only one object with varying attributes. They face difficulties with complex descriptions having multiple arbitrary objects since it would require information on the placement and sizes of each object in the image. Recently, a method that infers object layouts from scene graphs has been proposed as a solution to this problem. However, their method uses only object labels in describing the layout, which fail to capture the appearance of some objects. Moreover, their model is biased towards generating rectangular shaped objects in the absence of ground-truth masks. In this paper, we propose an object encoding module to capture object features and use it as additional information to the image generation network. We also introduce a graph-cuts based segmentation method that can infer the masks of objects from bounding boxes to better model object shapes. Our method produces more discernable images with more realistic shapes as compared to the images generated by the current state-of-the-art method. Kai-Lung Hua 花凱龍 2019 學位論文 ; thesis 43 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 107 === Advancements on text-to-image synthesis generate remarkable images from textual descriptions. However, these methods are designed to generate only one object with varying attributes. They face difficulties with complex descriptions having multiple arbitrary objects since it would require information on the placement and sizes of each object in the image. Recently, a method that infers object layouts from scene graphs has been proposed as a solution to this problem. However, their method uses only object labels in describing the layout, which fail to capture the appearance of some objects. Moreover, their model is biased towards generating rectangular shaped objects in the absence of ground-truth masks. In this paper, we propose an object encoding module to capture object features and use it as additional information to the image generation network. We also introduce a graph-cuts based segmentation method that can infer the masks of objects from bounding boxes to better model object shapes. Our method produces more discernable images with more realistic shapes as compared to the images generated by the current state-of-the-art method.
author2 Kai-Lung Hua
author_facet Kai-Lung Hua
Arces A. Talavera
Arces A. Talavera
author Arces A. Talavera
Arces A. Talavera
spellingShingle Arces A. Talavera
Arces A. Talavera
Layout and Context Understanding for Image Synthesis with Scene Graphs
author_sort Arces A. Talavera
title Layout and Context Understanding for Image Synthesis with Scene Graphs
title_short Layout and Context Understanding for Image Synthesis with Scene Graphs
title_full Layout and Context Understanding for Image Synthesis with Scene Graphs
title_fullStr Layout and Context Understanding for Image Synthesis with Scene Graphs
title_full_unstemmed Layout and Context Understanding for Image Synthesis with Scene Graphs
title_sort layout and context understanding for image synthesis with scene graphs
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/z3f7su
work_keys_str_mv AT arcesatalavera layoutandcontextunderstandingforimagesynthesiswithscenegraphs
AT arcesatalavera layoutandcontextunderstandingforimagesynthesiswithscenegraphs
_version_ 1719178358677307392