Platform for Web Data Extraction and Retrieval in the Context of Big Data

  • Luan Silveira Pontolio, Fundação Eurípides Soares da Rocha

Abstract

Data of interest to businesses and organizations is dispersed across the Web in many domains and formats, so the ability to obtain it is increasingly necessary. This requires means of extracting the data and of ensuring its reliability so that it can be stored correctly. Data extraction techniques, in particular Web Scraping (search robots), allow such data to be captured. This project studies data extraction techniques for the Web domain and applies them in the development of a platform that extracts this information through parameterized search robots, giving users the autonomy to create their own robots.

Published
2015-10-06
How to Cite
PONTOLIO, Luan Silveira. Plataforma de extração e recuperação de dados na Web no contexto de Big Data. Journal on Advances in Theoretical and Applied Informatics, [S.l.], v. 1, n. 1, p. 22-29, oct. 2015. ISSN 2447-5033. Available at: <https://revista.univem.edu.br/jadi/article/view/1038>. Date accessed: 21 nov. 2024. doi: https://doi.org/10.26729/jadi.v1i1.1038.