An Analytical Tool for Georeferenced Sensor Data based on ELK Stack

Authors
  • Ngo, Thi Thu Trang
  • Sarramia, David
  • Kang, Myoung-Ah
  • Pinet, François
Publication Date
Apr 23, 2021
Identifiers
DOI: 10.5220/0010439200820089
OAI: oai:HAL:hal-03313080v1
Source
HAL
Language
English
License
Unknown

Abstract

As part of the French CAP 2025 I-Site project, an environmental data lake called CEBA is being built at the regional level in Auvergne. Its goals are to integrate data from heterogeneous sensors, to provide end users with tools to query and analyse georeferenced environmental data, and to make the data open. The sensors collect different environmental measures depending on their location (air and soil temperature, water quality, etc.), and different research laboratories use these measures to analyse the environment. The main component for shipping and storing data is the ELK stack: data are collected from sensors through Beats and streamed by Logstash to Elasticsearch, and scientists can query the data through Kibana. In this paper, we propose a data warehouse frontend to CEBA based on the ELK stack. We also propose an additional component for the ELK stack that performs streaming ETL: it integrates and aggregates streaming data from different sensors and sources according to a user-supplied configuration, giving end users more analytical capabilities on the data. We describe the architecture of the system, illustrate the functionalities of the data lake through examples, and present an example Kibana dashboard built on the data.
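To make the streaming ETL idea concrete, here is a minimal sketch of a configurable aggregation step. It is not the component described in the paper: the `Reading` record, the `aggregate` function, and the field names are all hypothetical, and a fixed (tumbling) time window is assumed as the aggregation rule. It groups readings for one measure by sensor and time window and emits one summary document per group, keeping the coordinates as a `lat`/`lon` object so the result stays georeferenced.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Reading:
    """One georeferenced sensor reading (hypothetical schema)."""
    sensor_id: str
    measure: str      # e.g. "air_temperature"
    timestamp: int    # seconds since the epoch
    lat: float
    lon: float
    value: float


def aggregate(readings: Iterable[Reading], measure: str,
              window_seconds: int) -> List[dict]:
    """Average one measure per sensor over fixed-size time windows.

    `measure` and `window_seconds` stand in for the user configuration;
    the real configuration format is not given in the abstract.
    """
    buckets: Dict[Tuple[str, int], List[Reading]] = defaultdict(list)
    for r in readings:
        if r.measure != measure:
            continue  # ignore measures the user did not ask for
        # Align the timestamp to the start of its window.
        window_start = r.timestamp - (r.timestamp % window_seconds)
        buckets[(r.sensor_id, window_start)].append(r)

    docs = []
    for (sensor_id, window_start), group in sorted(buckets.items()):
        docs.append({
            "sensor_id": sensor_id,
            "measure": measure,
            "window_start": window_start,
            # Assumes a fixed sensor position: take it from the first reading.
            "location": {"lat": group[0].lat, "lon": group[0].lon},
            "mean_value": mean(r.value for r in group),
            "count": len(group),
        })
    return docs
```

Each returned dictionary could then be indexed as one document. A real streaming component would also have to handle late or out-of-order readings and windows that are still open, which this batch-style sketch ignores.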