AtheneForschung - Informationsportal der UniBw M

Home / Alle InhaltePublikationen (Universitätsbibliografie)Forschungszentren und -initiativenCODE1116

Autoren:

Lee, Yeong Su; Geierhos, Michaela

Dokumenttyp:

Sammelbandbeitrag / Paper in Collective Volume

Titel:

Business Specific Online Information Extraction from German Websites

Herausgeber Sammlung:

Gelbukh, Alexander

Titel Konferenzpublikation:

Computational Linguistics and Intelligent Text Processing

Untertitel Konferenzpublikation:

10th International Conference, CICLing 2009, Mexico City, Mexico, March 1-7, 2009. Proceedings

Jahrgang:

5449

Konferenztitel:

International Conference on Computational Linguistics and Intelligent Text Processing (10., 2009, Mexiko City)

Konferenztitel:

CICLing 2009

Tagungsort:

Mexico City, Mexico

Jahr der Konferenz:

2009

Datum Beginn der Konferenz:

01.03.2009

Datum Ende der Konferenz:

07.03.2009

Verlagsort:

Berlin ; Heidelberg

Verlag:

Springer

Jahr:

2009

Seiten von - bis:

369-381

Sprache:

Englisch

Abstract:

This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifies business specific information. We therefore concentrate on the extraction of characteristic vocabulary like company names, addresses, contact details, CEOs, etc. Above all, we interpret the HTML structure of documents and analyze some contextual facts to transform the unstructured web pages into structured forms. Our... »

ISBN:

978-3-642-00381-3 ; 978-3-642-00382-0

DOI:

10.1007/978-3-642-00382-0_30

URL zum Inhalt:

https://doi.org/10.1007/978-3-642-00382-0_30

Open Access ja oder nein?:

Nein / No

BibTeX

Vorkommen:

Home / Alle Inhalte Publikationen (Universitätsbibliografie)Forschungszentren und -initiativen CODE

Home / Alle Inhalte Publikationen (Universitätsbibliografie)Fakultäten (univ.)Fakultät für Informatik INF 7 - Institut für Datensicherheit