Logo
Benutzer: Gast  Login
Autoren:
Lee, Yeong Su; Geierhos, Michaela 
Dokumenttyp:
Konferenzbeitrag / Conference Paper 
Titel:
Business Specific Online Information Extraction from German Websites 
Herausgeber Sammlung:
Aly, Robin; Hauff, Claudia; Hiemstra, Djoerd; Huibers, Theo W. C.; de Jong, Franciska M. G. 
Titel Konferenzpublikation:
Proceedings of the 9th Dutch-Belgian Information Retrieval Workshop 
Konferenztitel:
Dutch-Belgian Information Retrieval Workshop (9., 2009, Enschede) 
Tagungsort:
Enschede, The Netherlands 
Jahr der Konferenz:
2009 
Verlagsort:
Twente 
Verlag:
Centre for Telematics and Information Technology (CTIT), University of Twente 
Jahr:
2009 
Seiten von - bis:
79-86 
Sprache:
Englisch 
Stichwörter:
company search ; information extraction ; sublanguage 
Abstract:
This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifies business specific information. We therefore concentrate on the extraction of characteristic vocabulary like company names, addresses, contact details, CEOs, etc. Above all, we interpret the HTML structure of documents and analyze some contextual facts to transform the unstructured web pages into structured forms. Ou...    »
 
ISSN:
0929-0672 
Open Access ja oder nein?:
Nein / No