A Web Crawling Environment to Support Financial Strategies and Trend Correlation

Autori

Giovanni Ponti, Giuseppe Santomauro, Fiorenzo Ambrosino, Giovanni Bracco, Antonio Colavincenzo, Matteo De Rosa, Agostino Funel, Dante Giammattei, Guido Guarnieri, Silvio Migliori

Parole chiave (Tematica)

big data machine learning market trends web crawling

Data pubblicazione

07/02/2019

Fonte

ECML PKDD 2018 Workshops MIDAS 2018 and PAP 2018, Dublin, Ireland, September 10-14, 2018, Proceedings

Part of the Lecture Notes in Computer Science book series (LNCS, volume 11054)

https://link.springer.com/chapter/10.1007/978-3-030-13463-1_8

Abstract

We provide an overview on the development and the integration in ENEAGRID of a web crawling tool to retrieve data from the Web, manage and display it, and extract relevant information. We collected all these instruments in a collaborative environment called Web Crawling Virtual Laboratory, offering a GUI to operate remotely. Finally, we describe an ongoing activity on semantic crawling and data analysis to discover trends and correlations in finance.

Acknowledgements

The computing resources and the related technical support used for this work have been provided by ENEAGRID/CRESCO High Performance Computing infrastructure and its staff. ENEAGRID/CRESCO High Performance Computing infrastructure is funded by ENEA, the Italian National Agency for New Technologies, Energy and Sustainable Economic Development and by Italian and European research programmes, see http://www.cresco.enea.it/english for information.