Towards XML schema extraction from deep web

TitreTowards XML schema extraction from deep web
Publication TypeConference Paper
Year of Publication2017
AuthorsSaissi, Y, Zellou, A, Idri, A
Conference NameColloquium in Information Science and Technology, CIST

Today, not all the web is fully accessible by the web search engines. There is a hidden and inaccessible part of the web called the deep web. Many methods exist in the literature to access and to integrate the huge structured data contained in the deep web. In this paper, we propose our approach to extract the XML schema describing a selected deep web source. Our approach is based on the static and the dynamic analysis of the HTML forms giving access to the selected deep web source. Our approach uses two knowledge database during its process: our proprietary identification tables and Wordnet. The XML schema extracted will be used to integrate the associated deep web source into a mediation system without extracting all its information. � 2016 IEEE.




Suivez-nous sur





Avenue Mohammed Ben Abdallah Regragui, Madinat Al Irfane, BP 713, Agdal Rabat, Maroc

  Télécopie : (+212) 5 37 68 60 78

  Secrétariat de direction : 06 61 48 10 97

        Secrétariat général : 06 61 34 09 27

        Service des affaires financières : 06 61 44 76 79

        Service des affaires estudiantines : 06 62 77 10 17 /

        Résidences : 06 61 82 89 77



    Compteur de visiteurs:479,930
    Education - This is a contributing Drupal Theme
    Design by WeebPal.