*Result*: A Statistical Approach to Hyperparameter Tuning of Deep Learning for Construction Machine Classification.

Title:
A Statistical Approach to Hyperparameter Tuning of Deep Learning for Construction Machine Classification.
Authors:
Ottoni, André Luiz C.1 (AUTHOR) andre.ottoni@ufrb.edu.br, Novo, Marcela S.2 (AUTHOR), Oliveira, Marcos S.3 (AUTHOR)
Source:
Arabian Journal for Science & Engineering (Springer Science & Business Media B.V. ). Apr2024, Vol. 49 Issue 4, p5117-5128. 12p.
Database:
Academic Search Index

*Further Information*

*Deep learning methods have shown significant potential in computer vision applications within the field of civil engineering. One important area of research is the detection and classification of construction machines using convolutional neural networks (CNNs). However, a major challenge in adopting machine learning models for vision systems lies in hyperparameter optimization. Therefore, the objective of this paper is to propose a meticulous statistical approach to hyperparameter tuning for deep learning in the visual classification of construction machines. It is noteworthy that the method for selecting hyperparameters utilizes statistical concepts, including analysis of variance and the Tukey test. Moreover, three research questions were formulated to address statistical differences in performance when varying the CNN architecture, optimizer and configuration (architecture and optimizer). Two additional research questions were posed to determine the recommended hyperparameters for the task and assess performance differences for non-selected hyperparameters. For this purpose, 18 hyperparameter combinations were analyzed, encompassing three CNN architectures (DenseNet, Mobilenet and VGG16) and six optimizers (adadelta, adagrad, rmsprop, adam, adamax and sgd). The Alberta Construction Image Dataset, consisting of 2850 images of three construction machines (excavator, dump truck and concrete mixer truck), was used in the experiments. The results demonstrate that the hyperparameter configurations exhibit statistically significant differences in image classification. Furthermore, the recommended combination (DenseNet × adagrad) yielded the highest average accuracy results in both binary tests (90.0%) and multiclass classification (77.8%). [ABSTRACT FROM AUTHOR]*