Towards XML schema extraction from deep web

TitreTowards XML schema extraction from deep web
Publication TypeConference Paper
Year of Publication2017
AuthorsSaissi, Y, Zellou, A, Idri, A
Conference NameColloquium in Information Science and Technology, CIST
Abstract

Today, not all the web is fully accessible by the web search engines. There is a hidden and inaccessible part of the web called the deep web. Many methods exist in the literature to access and to integrate the huge structured data contained in the deep web. In this paper, we propose our approach to extract the XML schema describing a selected deep web source. Our approach is based on the static and the dynamic analysis of the HTML forms giving access to the selected deep web source. Our approach uses two knowledge database during its process: our proprietary identification tables and Wordnet. The XML schema extracted will be used to integrate the associated deep web source into a mediation system without extracting all its information. © 2016 IEEE.

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85010190133&doi=10.1109%2fCIST.2016.7805022&partnerID=40&md5=0c383caab22e43a339e6b52acca8947b
DOI10.1109/CIST.2016.7805022
Revues: 

Partenaires

Localisation

Suivez-nous sur

         

    

Contactez-nous

ENSIAS

Avenue Mohammed Ben Abdallah Regragui, Madinat Al Irfane, BP 713, Agdal Rabat, Maroc

  Télécopie : (+212) 5 37 68 60 78

  Secrétariat de direction : 06 61 48 10 97

        Secrétariat général : 06 61 34 09 27

        Service des affaires financières : 06 61 44 76 79

        Service des affaires estudiantines : 06 62 77 10 17 / n.mhirich@um5s.net.ma

        CEDOC ST2I : 06 66 39 75 16

        Résidences : 06 61 82 89 77

Contacts

    

Education - This is a contributing Drupal Theme
Design by WeebPal.