Wang, Zepeng, Li, Ping, Zhang, Luming and Shao, Ling (2019) Community-aware photo quality evaluation by deeply encoding human perception. IEEE Transactions on Multimedia. pp. 1-11. ISSN 1520-9210
Full text not available from this repository.
Abstract
Computational photo quality evaluation is a useful technique in many tasks of computer vision and graphics, e.g., photo retargeting, 3D rendering, and fashion recommendation. Conventional photo quality models are designed by characterizing pictures from all communities (e.g., "architecture" and "colorful") indiscriminately, wherein community-specific features are not encoded explicitly. In this work, we develop a new community-aware photo quality evaluation framework. It uncovers the latent community-specific topics by a regularized latent topic model (LTM), and captures human visual quality perception by exploring multiple attributes. More specifically, given massive-scale online photos from multiple communities, a novel ranking algorithm is proposed to measure the visual/semantic attractiveness of regions inside each photo. Meanwhile, three attributes (photo quality scores, weak semantic tags, and inter-region correlations) are seamlessly and collaboratively incorporated during ranking. Subsequently, we construct a gaze shifting path (GSP) for each photo by sequentially linking its top-ranking regions, and an aggregation-based deep CNN calculates the deep representation for each GSP. Based on this, an LTM is proposed to model the GSP distribution from multiple communities in the latent space. To mitigate the overfitting problem caused by communities with very few photos, a regularizer is added to our LTM. Finally, given a test photo, we obtain its deep GSP representation, and its quality score is determined by the posterior probability of the regularized LTM. Comprehensive comparative studies on four image sets have shown the competitiveness of our method. In addition, eye-tracking experiments demonstrated that our ranking-based GSPs are highly consistent with real human gaze movements.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | community, deep feature, gaze behavior, machine learning, quality model, topic model, signal processing, media technology, computer science applications, electrical and electronic engineering |
Faculty \ School: | Faculty of Science > School of Computing Sciences |
Depositing User: | LivePure Connector |
Date Deposited: | 07 Apr 2020 00:44 |
Last Modified: | 22 Oct 2022 06:01 |
URI: | https://ueaeprints.uea.ac.uk/id/eprint/74720 |
DOI: | 10.1109/TCYB.2019.2937319 |