Research Article

Reappearance Layout based Web Page Segmentation for Small Screen Devices

by V. Kalaivani, K. Rajkumar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 49 - Number 20
Year of Publication: 2012
Authors: V. Kalaivani, K. Rajkumar
DOI: 10.5120/7884-0801

V. Kalaivani, K. Rajkumar. Reappearance Layout based Web Page Segmentation for Small Screen Devices. International Journal of Computer Applications. 49, 20 (July 2012), 1-8. DOI=10.5120/7884-0801

@article{ 10.5120/7884-0801,
author = { V. Kalaivani and K. Rajkumar },
title = { Reappearance Layout based Web Page Segmentation for Small Screen Devices },
journal = { International Journal of Computer Applications },
issue_date = { July 2012 },
volume = { 49 },
number = { 20 },
month = { July },
year = { 2012 },
issn = { 0975-8887 },
pages = { 1-8 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume49/number20/7884-0801/ },
doi = { 10.5120/7884-0801 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:46:42.020107+05:30
%A V. Kalaivani
%A K. Rajkumar
%T Reappearance Layout based Web Page Segmentation for Small Screen Devices
%J International Journal of Computer Applications
%@ 0975-8887
%V 49
%N 20
%P 1-8
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Web sites are normally designed for large-screen devices, so their pages are hard to browse on devices with limited user interfaces, such as palmtop and mobile devices. Web page segmentation is an important technology for both search engines and web browsers on mobile devices. It breaks the structure of a web page down into logical blocks, which is an important step in identifying informative blocks for efficient information extraction and for convenient display on devices with small screens. The previous repetition based segmentation method is not suitable for segmenting blocks when the web page contains no reappearance tags. To improve segmentation accuracy, a new method, Reappearance Layout based Web Page Segmentation (RLSE), is introduced. It segments a web page using either a reappearance based segmentation (RSE) scheme or a layout based segmentation (LSE) scheme that relies on tags such as table, div, span and frame, depending on the key pattern detected in the page. If the key pattern contains a reappearance tag, the page is segmented by RSE; otherwise it is segmented using web layout information. The hyperlinks of the segmented blocks are displayed on the mobile device first, and the user then selects the hyperlinks of the blocks that match their area of interest. Only the information of interest is then displayed to the user.
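
The choice between RSE and LSE described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a "key pattern" with a reappearance tag means a tag that occurs at least `min_repeat` times among the children of one element, and that LSE takes the outermost table, div, span and frame elements as blocks. The names `segment`, `find_reappearance`, `outermost_layout` and `min_repeat` are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser

LAYOUT_TAGS = {"table", "div", "span", "frame"}  # layout tags named in the abstract
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link", "frame"}  # no end tag


class Node:
    def __init__(self, tag, parent=None):
        self.tag, self.parent, self.children, self.text = tag, parent, [], ""

    def full_text(self):
        # Own text first, then the text of all descendants.
        parts = [self.text.strip()] + [c.full_text() for c in self.children]
        return " ".join(p for p in parts if p)


class TreeBuilder(HTMLParser):
    """Builds a very small DOM-like tree; enough for this illustration."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        self.current.text += data


def walk(node):
    yield node
    for child in node.children:
        yield from walk(child)


def find_reappearance(root, min_repeat=3):
    """Assumed key-pattern test: the most repeated sibling tag, if any."""
    best = None
    for node in walk(root):
        counts = Counter(c.tag for c in node.children)
        if counts:
            tag, n = counts.most_common(1)[0]
            if n >= min_repeat and (best is None or n > best[2]):
                best = (node, tag, n)
    return best and best[:2]


def outermost_layout(node):
    for child in node.children:
        if child.tag in LAYOUT_TAGS:
            yield child
        else:
            yield from outermost_layout(child)


def segment(html, min_repeat=3):
    """Return (scheme, blocks): RSE if a reappearance tag is found, else LSE."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    key = find_reappearance(builder.root, min_repeat)
    if key:
        parent, tag = key
        return "RSE", [c.full_text() for c in parent.children if c.tag == tag]
    return "LSE", [n.full_text() for n in outermost_layout(builder.root)]
```

Calling `segment` on a page whose list items repeat would take the RSE branch, while a page built from a single div and a table would fall back to LSE. A real system would need a more careful key-pattern detector than the sibling count assumed here.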

Index Terms

Computer Science
Information Sciences

Keywords

DOM (Document Object Model)
Layout based segmentation (LSE)
Reappearance based segmentation (RSE)
RLSE (Reappearance Layout based Segmentation)