Logo
User: Guest  Login
Authors:
Lee, Yeong Su; Geierhos, Michaela 
Document type:
Sammelbandbeitrag / Paper in Collective Volume 
Title:
Business Specific Online Information Extraction from German Websites 
Collection editors:
Gelbukh, Alexander 
Title of conference publication:
Computational Linguistics and Intelligent Text Processing 
Subtitle of conference publication:
10th International Conference, CICLing 2009, Mexico City, Mexico, March 1-7, 2009. Proceedings 
Volume:
5449 
Conference title:
International Conference on Computational Linguistics and Intelligent Text Processing (10., 2009, Mexiko City) 
Conference title:
CICLing 2009 
Venue:
Mexico City, Mexico 
Year of conference:
2009 
Date of conference beginning:
01.03.2009 
Date of conference ending:
07.03.2009 
Place of publication:
Berlin ; Heidelberg 
Publisher:
Springer 
Year:
2009 
Pages from - to:
369-381 
Language:
Englisch 
Abstract:
This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifies business specific information. We therefore concentrate on the extraction of characteristic vocabulary like company names, addresses, contact details, CEOs, etc. Above all, we interpret the HTML structure of documents and analyze some contextual facts to transform the unstructured web pages into structured forms. Our...    »
 
ISBN:
978-3-642-00381-3 ; 978-3-642-00382-0 
Open Access yes or no?:
Nein / No