Smart Image Retargeting Techniques

Presentation

This website is intended to promote and present the progress of work on the dissertation of the Master in Electrical and Computer Engineering, course of the Faculty of Engineering, University of Porto.
The dissertation is titled "Smart Image Retargeting Techniques" and is developed at the INESC TEC company.

Introduction

With a crescent use of mobile devices, such as mobile phones and PC tablets, the access to information, social networking and other applications, has also been increased. Due to the rate of its use it became necessary to organize information according to the device’s screen size, and sequentially resizing images automatically. It was also concluded that the methods used for resizing, like scaling (which resizes the images homogeneously), did not provide an image with quality. Then other resizing techniques had appeared, considering the image’s content.

Problem formulation

Given:
An image with size n x m.

Find:
1. A cost function that efficiently identify the important regions of the image.
2. A path function computationally fast.

Such that:
The image can be resized, in real time and with high quality, to a n' x m' size.

Goals

The initial goal is to develop a technique which preserves the important content of an image, limiting visual artifacts resulting from the resizing process, and preserving intern structures of the image, given the small computing power of some mobile devices. As an aditional goal, after the algorithm developement, it is also intended to develop a web application with this method.

Methodology

This work aims to use image retargeting techniques in applications such as web and mobile. In order to achieve such goal, the work will be divided into several stages:
1. Cost function phase: Detection of an image structure and attraente parts with importance map - a combination of gradient with saliency maps.
2. Path function phase: Create an algorithm of image retargeting, using seam carving or seam merging, or a possibility of interligation between these techniques.
To support this method accuracy, some high-level semantic correlation computational measures will be used.
Given the time, the overall solution will be implemented in a web application.

Planning

Cost Function (from 16/Febr/205 to 13/Mar/2015):
- implementation of some approaches
- evaluation of the implementations
Path Function (from 16/Mar/2015 to 10/Apr/2015):
- implementation of some approaches
- evaluation of the implementations
Final solution development (from 13/Apr/2015 to 15/May/2015):
- implementation of the developed methodology
- analysis and testing
Developing of an web applicationCost Function (from 18/May/2015 to 29/May/2015):
- implementation
- analysis and testing
Dissertation writing (from 1/Jun/2015 to 31/Jun/2015)
Dissertation website - creation and management (from 16/Fev/2015 to 31/Jun/2015)

Execution

Week 1

- Planing of the work for the testing phase of the thesis.

- Research for existing codes in the web.

Week 2

- Implementation of some cost function methods in matlab

Week 3

- Building of a matlab UI for the cost functions implemented until the moment

- Incorportion of seam carving in the matlab UI

- Metting with supervisor and co-supervisor, at 6/Mar, to show what was done until the date, and to receive feedback

Week 4

- Inclusion of another cost function to the matlab UI

- Inclusion of DTW algorithm to the matlab UI (still need a few enhancements)

- Sending emails to some article authors in order to know if they can provide their code

Week 5

- Corrections in DTW code

- Meeting with supervisor and co-supervisor, at 18/Mar, to show what was done until now and to talk about some problems along the dissertation

Week 6

- Inclusion of seams selection tool in matlab UI

- Meeting with supervisor, at 25/Mar, to talk about some doubts in the DTW algorithm

Week 7

- Corrections of the DTW, discussed in the 25/Mar metting, made

Week 8

- Original DTW code already working

- Meeting with supervisor, at 10/Abr, to show the DTW algorithm, and discussion of ways to inovate it

Week 9

- Begining of inovating the DTW algorithm with mask

- Search of ways to use metrics in the resizing case

- Meeting with supervisor and co-supervisor, at 17/Abr, to talk about the metric and some inovation viability.

Week 10

- Initiation of the implementation of objective metrics in the work

- Meeting with supervisor, at 24/Abr, to find ways to correct a problem in the DTW (when not having stable paths, how to chose the paths to be taken)

Week 11

- Attempt to implement object removal in the algorithm (not working)

- Enhancement of the metrics implementation

- Meeting with supervisor, at 30/Abr, to talk about the INESC demo on the 6/May and to find ways to make the DTW algorithm faster

Week 12

- Participation in INESC demo day at 6/May

- Adition of new code to the DTW algorithm

Week 13

- Final enhancements of the metrics code and inovations to the DTW algorithm

- Structuring of the monography

- Writing of the some metodology in the monography

Week 14

- Continuation of the monography metodology writing

- Writing of the results in the monography

- Meeting with supervisor, at 22/May, to talk about the monography structuration and the final algorithm

Week 15

- Continuation of the monography results writing

Week 16

- changes in the monography metodology writing

Week 17

- Continuation of the monography metodology writing

- Meeting with supervisor and co-supervisor to a brief analysis of the monography writing

Week 18

- Reewriting of a Results section

- Writing of abstract, conclusion and future work

Week 19

- Meeting with supervisor for analysis of the monography

- Final changes in the monography

Week 20

- Delivery of the CD with the monography

Team

Laura Figueiredo Ângelo

(Student)

Institution:
Faculdade de Engenharia da Universidade do Porto

Course:
Mestrado Integrado em Engenharia Electrotécnica e de Computadores

E-mail:
[email protected]

Hélder Pinto de Oliveira

(Supervisor)

Institution:
Faculdade de Engenharia da Universidade do Porto;
Instituto de Engenharia de Sistemas e Computadores - Tecnologia e Ciência

E-mail:
[email protected]

Personal page

Ana Maria Rebelo

(Co-supervisor)

Institution:
Instituto de Engenharia de Sistemas e Computadores - Tecnologia e Ciência

E-mail:
[email protected]

Personal page

Conclusions

With the appearing of several display devices, such as mobile phones and computer monitor, a necessity to resize images emerged. As the typical resizing methods, like the simple scaling and cropping, bring undesired essential content elimination, details softening, stretching and squeezing, the retargeting methods appeared. These methods methodology seeks to change an image size while preserving the important content.

The main goal of this research was the implementation of an accurate image retargeting algorithm that reduces and enlarges images without causing the feeling of a damaged image, maintaining the core elements of the image and eliminating others with least importance. The first step for this work, was the development of an intuitive Matlab application, essentially for the testing process. This process passed through two stages: finding the best cost function, that allows to change the image size while preserving its main content, and improvement of the Stable Paths algorithm accompanied with the enlargement tool adding.

In order to make the fairest possible subjective selection and taking into account the limited existing time, a database, from Rubinstein et al. [50], was used as a starting point, aiming for an image resizing with the same reducing/enlarging factors used by these authors. The cost function was selected, having as parameters the complete retarget of the image to the new specified measure, with the minimal artifacts and distortions, using the human perceiving system. With this, our methodology was applied to several images for both reducing and enlarging, followed by the application of the metrics measures to all, including on some of the results provided by Rubinstein et al. [50], that were also addressed in the Literature Review.

Then, after an analysis of the obtained metrics data, it was concluded that our method, Seam Carving and the Nonhomogeneous Warping methods gave the closest result values to the ideals of translation, scaling and rotation, between an object in the retargeted and original images. It was also observed, that, by little, our method was the one closest to the ideal values of object features preservation.

Despite the relatively upbeat results, the method still lacks precision, causing some images retargets areas to lack structure, especially in aleatory lines that appear in backgrounds, where less important image contents are, and in more geometric structures. That appends, because similarly to seam carving, these images content is laid out in a way that prevents seams from bypassing those structures.

Documentação

References

[1] RobertKeys.Cubicconvolutioninterpolationfordigitalimageprocessing.volume29,pages 1153–1160. IEEE, 1981.
[2] Li-Qun Chen, Xing Xie, Xin Fan, Wei-Ying Ma, Hong-Jiang Zhang, and He-Qin Zhou. A visual attention model for adapting images on small displays. volume 9, pages 353–364. Springer, 2003.
[3] Matthew Ma and Jinhong K Guo. Automatic image cropping for mobile device with built- in camera. In Consumer Communications and Networking Conference, 2004. CCNC 2004. First IEEE, pages 710–711. IEEE, 2004.
[4] Vidya Setlur, Saeko Takagi, Ramesh Raskar, Michael Gleicher, and Bruce Gooch. Auto- matic image retargeting. In Proceedings of the 4th international conference on Mobile and ubiquitous multimedia, pages 59–68. ACM, 2005.
[5] Ran Gal, Olga Sorkine, and Daniel Cohen-Or. Feature-aware texturing. In Proceedings of the 17th Eurographics conference on Rendering Techniques, pages 297–303. Eurographics Association, 2006.
[6] Shai Avidan and Ariel Shamir. Seam carving for content-aware image resizing. In ACM Transactions on graphics (TOG), volume 26, page 10. ACM, 2007.
[7] Wikipedia.SeamCarving.URL:http://en.wikipedia.org/wiki/Seam_carving.
[8] Paul Viola and Michael Jones. Rapid object detection using a boosted cascade of simple features. In Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on, volume 1, pages I–511. IEEE, 2001.
[9] Laurent Itti, Christof Koch, and Ernst Niebur. A model of saliency-based visual attention for rapid scene analysis. volume 20, pages 1254–1259. IEEE Computer Society, 1998.
[10] Michael Rubinstein, Ariel Shamir, and Shai Avidan. Improved seam carving for video retar- geting. In ACM transactions on graphics (TOG), volume 27, page 16. ACM, 2008.
[11] Lior Wolf, Moshe Guttmann, and Daniel Cohen-Or. Non-homogeneous content-driven video-retargeting. In Computer Vision, 2007. ICCV 2007. IEEE 11th International Con- ference on, pages 1–6. IEEE, 2007.
[12] Yu-Shuen Wang, Chiew-Lan Tai, Olga Sorkine, and Tong-Yee Lee. Optimized scale-and- stretch for image resizing. In ACM Transactions on Graphics (TOG), volume 27, page 118. ACM, 2008.
[13] Chris Harris and Mike Stephens. A combined corner and edge detector. In Alvey vision conference, volume 15, page 50. Manchester, UK, 1988.
[14] Navneet Dalal and Bill Triggs. Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Con- ference on, volume 1, pages 886–893. IEEE, 2005.
[15] Hélder Filipe Pinto de Oliveira and Jaime dos Santos Cardoso. Image retargeting using stable paths. In Proceedings of the International Conference on Computer Vision Theory and Applications, 2009.
[16] Sebastiano Battiato, G Farinella, Giovanni Puglisi, and Daniele Ravi. Saliency based se- lection of gradient vector flow paths for content aware image resizing. volume 23, pages 2081–2095. IEEE, 2014.
[17] Qi Wang and Yuan Yuan. Learning to resize image. volume 131, pages 357–367. Elsevier, 2014.
[18] Radhakrishna Achanta, Sheila Hemami, Francisco Estrada, and Sabine Susstrunk. Frequency-tuned salient region detection. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 1597–1604. IEEE, 2009.
[19] Radhakrishna Achanta and Sabine Susstrunk. Saliency detection for content-aware image resizing. In Image Processing, 2009. ICIP 2009. 2009 16th IEEE International Conference on, pages 1005–1008. IEEE, 2009.
[20] Yuming Fang, Zhenzhong Chen, Weisi Lin, and C-W Lin. Saliency detection in the com- pressed domain for adaptive image retargeting. volume 21, pages 3888–3901. IEEE, 2012.
[21] Stas Goferman, Lihi Zelnik-Manor, and Ayellet Tal. Context-aware saliency detection. vol- ume 34, pages 1915–1926. IEEE, 2012.
[22] Dirk Walther and Christof Koch. Modeling attention to salient proto-objects. Neural net- works, 19(9):1395–1407, 2006.
[23] Chenlei Guo, Qi Ma, and Liming Zhang. Spatio-temporal saliency detection using phase spectrum of quaternion fourier transform. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1–8. IEEE, 2008.
[24] Xiaodi Hou and Liqing Zhang. Saliency detection: A spectral residual approach. In Com- puter Vision and Pattern Recognition, 2007. CVPR 2007. IEEE Conference on, pages 1–8. IEEE, 2007.
[25] Tie Liu, Zejian Yuan, Jian Sun, Jingdong Wang, Nanning Zheng, Xiaoou Tang, and Heung- Yeung Shum. Learning to detect a salient object. volume 33, pages 353–367. IEEE, 2011.
[26] Jin Sun and Haibin Ling. Scale and object aware image thumbnailing. volume 104, pages 135–153. Springer, 2013.
[27] James Mannos and David J Sakrison. The effects of a visual fidelity criterion of the encoding of images. volume 20, pages 525–536. IEEE, 1974.
[28] Floris L Van Nes, Maarten A Bouman, et al. Spatial modulation transfer in the human eye. volume 57, pages 401–406. Optical Society of America, 1967.
[29] Bogdan Alexe, Thomas Deselaers, and Vittorio Ferrari. Measuring the objectness of image windows. volume 34, pages 2189–2202. IEEE, 2012.
[30] Lingqiang Ran and Xiangxu Meng. Fast seam carving using gaussian pyramid. In Intelligent Human-Machine Systems and Cybernetics, 2014. IHMSC 2014. 2014 Sixth International Conference on, volume 2, pages 59–63. IEEE, 2014.
[31] Radhakrishna Achanta, Francisco Estrada, Patricia Wils, and Sabine Süsstrunk. Salient re- gion detection and segmentation. In Computer Vision Systems, pages 66–75. Springer, 2008.
[32] Zhiwei He, Mingyu Gao, Haibin Yu, Xueyi Ye, Li Zhang, and Guojin Ma. A new improved seam carving content aware image resizing method. In Industrial Electronics and Applica- tions, 2013. ICIEA 2013. 2013 8th IEEE Conference on, pages 738–741. IEEE, 2013.
[33] Xiao Lin, Bin Sheng, LiZhuang Ma, Yang Shen, and ZhiHua Chen. Seamlet carving for shape-aware image resizing. volume 55, pages 1073–1081. Springer, 2012.
[34] Taeg Sang Cho, Moshe Butman, Shai Avidan, and William T Freeman. The patch transform and its applications to image editing. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1–8. IEEE, 2008.
[35] Yael Pritch, Eitam Kav-Venaki, and Shmuel Peleg. Shift-map image editing. In Computer Vision, 2009 IEEE 12th International Conference on, pages 151–158. IEEE, 2009.
[36] Feng Liu and Michael Gleicher. Automatic image retargeting with fisheye-view warping. In Proceedings of the 18th annual ACM symposium on User interface software and technology, pages 153–162. ACM, 2005.
[37] YongJin,LigangLiu,andQingbiaoWu.Nonhomogeneousscalingoptimizationforrealtime image resizing. volume 26, pages 769–778. Springer, 2010.
[38] Yi-Fei Zhang, Shi-Min Hu, and Ralph R Martin. Shrinkability maps for content-aware video resizing. In Computer Graphics Forum, volume 27, pages 1797–1804. Wiley Online Library, 2008.
[39] Guo-Xin Zhang, Ming-Ming Cheng, Shi-Min Hu, and Ralph R Martin. A shape-preserving approach to image resizing. In Computer Graphics Forum, volume 28, pages 1897–1906. Wiley Online Library, 2009.
[40] Sunghyun Cho, Hanul Choi, Yasuyuki Matsushita, and Seungyong Lee. Image retarget- ing using importance diffusion. In Image Processing (ICIP), 2009 16th IEEE International Conference on, pages 977–980. IEEE, 2009.
[41] Kazu Mishiba and Masaaki Ikehara. Seam merging for image resizing with structure preser- vation. In Acoustics, Speech and Signal Processing, 2011. ICASSP 2011. 2011 IEEE Inter- national Conference on, pages 1001–1004. IEEE, 2011.
[42] KazuMishibaandMasaakiIkehara.Imageresizingusingimprovedseammerging.InAcous- tics, Speech and Signal Processing, 2012. ICASSP 2012. IEEE International Conference on, pages 1261–1264. IEEE, 2012.
[43] Weiming Dong, Ning Zhou, Jean-Claude Paul, and Xiaopeng Zhang. Optimized image re- sizing using seam carving and scaling. In ACM Transactions on Graphics (TOG), volume 28, page 125. ACM, 2009.
[44] Jianbing Shen, Dapeng Wang, and Xuelong Li. Depth-aware image seam carving. vol- ume 43, pages 1453–1461. IEEE, 2013.
[45] Denis Simakov, Yaron Caspi, Eli Shechtman, and Michal Irani. Summarizing visual data using bidirectional similarity. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1–8. IEEE, 2008.
[46] Connelly Barnes, Eli Shechtman, Adam Finkelstein, and Dan Goldman. Patchmatch: a randomized correspondence algorithm for structural image editing. volume 28, page 24, 2009.
[47] Michael Rubinstein, Ariel Shamir, and Shai Avidan. Multi-operator media retargeting. In ACM Transactions on Graphics (TOG), volume 28, page 23. ACM, 2009.
[48] Ce Liu, Jenny Yuen, Antonio Torralba, Josef Sivic, and William T Freeman. Sift flow: Dense correspondence across different scenes. In Computer Vision–ECCV 2008, pages 28– 42. Springer, 2008.
[49] Ofir Pele and Michael Werman. Fast and robust earth mover’s distances. In Computer vision, 2009 IEEE 12th international conference on, pages 460–467. IEEE, 2009.
[50] Michael Rubinstein, Diego Gutierrez, Olga Sorkine, and Ariel Shamir. A comparative study of image retargeting. volume 29, pages 160:1–160:10, 2010.
[51] Liwei Wang, Yan Zhang, and Jufu Feng. On the euclidean distance of images. volume 27, pages 1334–1339. IEEE, 2005.
[52] Bangalore S Manjunath, Philippe Salembier, and Thomas Sikora. Introduction to MPEG-7: multimedia content description interface, volume 1. John Wiley & Sons, 2002.
[53] BangaloreSManjunath,J-ROhm,VinodVVasudevan,andAkioYamada.Colorandtexture descriptors. volume 11, pages 703–715. IEEE, 2001.
[54] Eiji Kasutani and Akio Yamada. The mpeg-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval. In Image Processing, 2001. Proceedings. 2001 International Conference on, volume 1, pages 674–677. IEEE, 2001.