ER - Intl Conf on Conceptual Modeling

Automatic Web Information Extraction in the RoadRunner System

October 20, 2020

138

Authors: Giansalvatore Mecca, Paolo Merialdo, Valter Crescenzi

Tags: 2001, conceptual modeling

This paper presents Road Runner, a research project that aims at developing solutions for automatically extracting data from large HTML data sources. The target of our research are data-intensive Web sites, i.e., HTML-based sites with a fairly complex structure, that publish large amounts of data. The paper describes the top-level software architecture of the Road Runner System, and the novel research challenges posed by the attempt to automate the information extraction process.

Read the full paper here: https://link.springer.com/chapter/10.1007/3-540-46140-X_21

Automatic Web Information Extraction in the RoadRunner System

EDITOR PICKS

Roger H.L. Chiang – 2023 ASOCA Winner

Join us in the magical Miami for the 2023 AIS SIGSAND!

Participate in SAND sessions at AMCIS 2023 – August 10 –...

POPULAR POSTS

Participate in SAND sessions at AMCIS 2023 – August 10 –...

Conceptual Modelling in the “Digital First” Era — A Joint AIS...

TheoryOn: A Design Framework and System for Unlocking Behavioral Knowledge through...

POPULAR CATEGORY

Share this:

EDITOR PICKS

POPULAR POSTS

POPULAR CATEGORY