Click on “Download PDF” for the PDF version or on the title for the HTML version.


If you are not an ASABE member or if your employer has not arranged for access to the full-text, Click here for options.

An Open-Source Infrastructure for Real-Time Automatic Agricultural Machine Data Processing

Published by the American Society of Agricultural and Biological Engineers, St. Joseph, Michigan www.asabe.org

Citation:  2017 ASABE Annual International Meeting  1701022.(doi:10.13031/aim.201701022)
Authors:   Yang Wang, Andrew D Balmos, Alexander W Layton, Samuel Noel, Aaron Ault, James V Krogmeier, Dennis R Buckmaster
Keywords:   Big data, Cloud, digital agriculture, ISOBUS, CAN, machine data, real-time, infrastructure, Apache, open-source

Abstract. Real-time big data technologies have seen an unprecedented growth due to their crucial impacts on different scenarios, such as error detection, live monitoring and metrics analytics. Recently, the agricultural industry has begun leveraging some of those technologies for advancements in digital agriculture; however, it largely ignores one of the richest data sources – the machine data generated from modern agricultural machinery. This machine data set contains critical as-harvested and as-applied data that lead to powerful insights and predictions of a farm‘s operational plan, current state and productivity. Furthermore, farm operators will be able to make adaptive logistics decisions based on predictive analytics. Recent attempts to automate the machine data collection process have been made. The challenge is to construct a scalable ecosystem that ingests and processes the collected machine data efficiently and quickly.

This paper presents an open-source infrastructure for collecting and processing agricultural machine data to produce useful data analytics and visualizations in real-time. The paper consists of three main parts. The first part of the paper discusses the making of ISOBlue2, a hardware device that serves as the starting point of the infrastructure for automatic agricultural machine data collection. The second part of the paper gives a detailed explanation of the Cloud components within the infrastructure. Specifically, this part analyzes how Apache Kafka and Apache Storm can work concurrently to enable a real-time, robust and scalable processing platform for agricultural machine data. The next part of the paper provides evaluations of a test infrastructure. Finally, the last part of the paper demonstrates a couple of scenarios that would benefit from applying this infrastructure.

(Download PDF)    (Export to EndNotes)