A leaf-level spectral library to support high throughput plant phenotyping: Predictive accuracy and model transfer.

Wijewardane NK, Zhang H, Yang J, Schnable JC, Schachtman DP, Ge Y

Published: 6 April 2023 in Journal of Experimental Botany
Keywords: biochemical traits, camelina, extra-weighted spiking, high-throughput phenotyping, hyperspectral reflectance, machine-learning, maize, partial least squares regression, sorghum, soybean
Pubmed ID: 37018460
DOI: 10.1093/jxb/erad129

Leaf-level hyperspectral reflectance has become an effective tool for high-throughput phenotyping of plant leaf traits due to its rapid, low-cost, multi-sensing, and non-destructive nature. However, collecting samples for model calibration can still be expensive, and models show poor transferability among different datasets. This study had three specific objectives: (i) assemble a large library of leaf hyperspectral data (n=2460) from maize and sorghum, (ii) evaluate two machine-learning approaches to estimate nine leaf properties (chlorophyll, thickness, water content, nitrogen, phosphorus, potassium, calcium, magnesium, and sulfur), and (iii) investigate the usefulness of this spectral library for predicting external datasets (n=445) including soybean and camelina using extra-weighted spiking. Internal cross-validation showed satisfactory performance of the spectral library in estimating all nine traits (average R² = 0.688), with Partial Least Squares Regression outperforming Deep Neural Network models. Models calibrated solely using the spectral library showed degraded performance on external datasets (average R² = 0.159 for camelina, 0.337 for soybean). Models improved significantly when a small portion of external samples (n=20) was added to the library via extra-weighted spiking (average R² = 0.574 for camelina, 0.536 for soybean). The leaf-level spectral library greatly benefits plant physiological and biochemical phenotyping, whereas extra-weighted spiking improves model transferability and extends its utility.
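
The spiking step described in the abstract can be illustrated with a minimal sketch. It assumes that "extra-weighted" means replicating the small external (spiking) subset several times before appending it to the library, so that the few local samples are not swamped by the much larger library. The abstract does not state the weighting scheme or the number of replicates, so the function name `extra_weighted_spike`, the `n_copies` parameter, the value 10, and the 50-band random spectra are all hypothetical and chosen only for illustration.

```python
import numpy as np


def extra_weighted_spike(X_lib, y_lib, X_spike, y_spike, n_copies):
    """Append n_copies replicas of the spiking samples to the library.

    Hypothetical helper: the replication count is an assumption,
    not a value taken from the paper.
    """
    if n_copies < 1:
        raise ValueError("n_copies must be at least 1")
    # Stack the spiking spectra n_copies times, row-wise.
    X_rep = np.tile(X_spike, (n_copies, 1))
    # Repeat the matching trait values in the same order.
    y_rep = np.tile(y_spike, n_copies)
    X_cal = np.vstack([X_lib, X_rep])
    y_cal = np.concatenate([y_lib, y_rep])
    return X_cal, y_cal


# Synthetic stand-ins for real spectra (random numbers, 50 made-up bands).
rng = np.random.default_rng(0)
X_lib = rng.random((2460, 50))   # library size from the abstract
y_lib = rng.random(2460)
X_spike = rng.random((20, 50))   # spiking subset size from the abstract
y_spike = rng.random(20)

X_cal, y_cal = extra_weighted_spike(X_lib, y_lib, X_spike, y_spike, n_copies=10)
print(X_cal.shape, y_cal.shape)  # (2660, 50) (2660,)
```

The augmented arrays `X_cal` and `y_cal` would then be used to calibrate a regression model (for example, Partial Least Squares Regression) that is evaluated on the remaining external samples.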