Abstract
The objective of this study was to evaluate the potential of big data technology for spectral feature extraction rice plants under different water management. In this study, we performed reflectance spectral analysis of rice grown under different water management schedules based on big data and machine learning approaches. We applied water management at the tillering, jointing, and heading stages of rice growth and collected visible to near-infrared reflectance spectra to analyze the spectral characteristics of rice grown under different water management schedules. Then, we constructed a data mining model based on the spectral analysis results using the Hadoop and Spark frameworks, and analyzed the characteristic bands of rice grown under each schedule using a parallel machine learning algorithm run in both local and cluster modes. The ChiSqSelector algorithm showed characteristic bands in the range of 400–1000 nm, with bands at approximately 474, 558, 632, 735, and 855 nm for water management in the tillering stage; 472, 551, 627, 721, and 843 nm in the jointing stage; and 471, 545, 642, 725, and 849 nm in the heading stage. By contrast, the UnivariateFeatureSelector algorithm showed characteristic bands at approximately 462, 665, 755, 833, and 937 nm in the tillering stage; 475, 671, 744, 848, and 932 nm in the jointing stage; and 470, 678, 757, 857, and 942 nm in the heading stage. These results demonstrate the feasibility and efficiency of applying big data and data mining technologies for spectral analysis of rice grown under different water management schedules.
Similar content being viewed by others
References
Rubaiyath Bin Rahman ANM, Zhang J (2016) Flood and drought tolerance in rice: opposite but may coexist. Food Energy Secur 5(2):76–88
Gobin A (2018) Weather related risks in Belgian arable agriculture. Agric Syst 159:225–236
Noreen Z, Muhammad BH, Ahmad N et al (2022) Rice production systems and grain quality. J Cereal Sci 105:103463
Mallikarjuna Swamy BP, Arvind K (2013) Genomics-based precision breeding approaches to improve drought tolerance in rice. Biotechnol Adv 31:1308–1318
Kamarudin ZS, Yusop MR, Tengku Muda Mohamed M et al (2018) Growth performance and antioxidant enzyme activities of advanced mutant rice genotypes under drought stress condition. Agronomy 8(12):1–15
Yang HS, Zhai SL, Li YF et al (2017) Waterlogging reduction and wheat yield increase through long-term ditch-buried straw return in a rice-wheat rotation system. Field Crop Res 209:189–197
Muehe EM, Wang T, Kerl CF et al (2019) Rice production threatened by coupled stresses of climate and soil arsenic. Nat Commun 10(4985):1–10
Lee A, Shim J, Kim B et al (2022) Non-destructive prediction of soluble solid contents in Fuji apples using visible near-infrared spectroscopy and various statistical methods. J Food Eng 321:110945
Savoia S, Albera A, Brugiapaglia A (2019) Prediction of meat quality traits in the abattoir using portable and hand-held near-infrared spectrometers. Meat Sci 161:108017
Beghi R, Giovenzana V, Tugnolo A et al (2018) Application of visible/near infrared spectroscopy to quality control of fresh fruits and vegetables in large-scale mass distribution channels: a preliminary test on carrots and tomatoes. J Sci Food Agr 98(7):2729–2734
Rajeev S, Khot LR, Rathnayakea AP et al (2019) Visible-near infrared spectroradiometry-based detection of grapevine leafroll-associated virus 3 in a red-fruited wine grape cultivar. Comput Electron Agr 162(165):173
Fan YY, Zhang C, Liu ZY et al (2019) Cost-sensitive stacked sparse auto-encoder models to detect striped stem borer infestation on rice based on hyperspectral imaging. Knowl-Based Syst 168(49):58
Lin H, Jiang H, Lin JJ et al (2021) Rice freshness identification based on visible near-infrared spectroscopy and colorimetric sensor array. Food Anal Method 14(7):1305–1314
Songyot N (2014) Internal damage inspection of almond nuts using optimal near-infrared waveband selection technique. J Food Eng 126:173–177
Bahareh J, Ezeddin M, Hossein F et al (2019) Pattern recognition-based optical technique for non-destructive detection of Ectomyelois ceratoniae infestation in pomegranates during hidden activity of the larvae. Spectrochim Acta A 206:552–557
Armstrong P, Maghirang E, Ozulu M (2019) Determining damage levels in wheat caused by Sunn pest (Eurygaster integriceps) using visible and near-infrared spectroscopy. J Cereal Sci 86:102–107
Johnson JB (2020) An overview of near-infrared spectroscopy (NIRS) for the detection of insect pests in stored grains. J Stored Prod Res 86:101558
Ip RHL, Ang LM, Kah PS et al (2018) Big data and machine learning for crop protection. Comput Electron Agr 151(376):383
Rob L, Rob K, Sander J et al (2016) Analysis of big data technologies for use in agro-environmental science. Environ Modell Softw 84:494–504
Ang KLM, Seng JKP (2021) Big data and machine learning with hyperspectral information in agriculture. IEEE Access 9:36699–36718
Melgar-García L, Gutierrez-Aviles D, Godinho MT et al (2022) A new big data triclustering approach for extracting three-dimensional patterns in precision agriculture. Neurocomputing 500(268):278
Liakos KG, Busato P, Moshou D et al (2018) Machine Learning in agriculture: a review. Sensors-Basel 18(8):2674
Tao Q, Ding H, Wang H et al (2021) Application research: big data in food industry. Foods 10(9):2203
Cravero A, Pardo S, Sepulveda S et al (2022) Challenges to use machine learning in agricultural big data: a systematic literature review. Agronomy-Basel 12(3):748
Liu B, He S, He D et al (2019) A spark-based parallel fuzzy c-means segmentation algorithm for agricultural image big data. IEEE Access 7(42169):42180
Zhang L, Xie L, Wang Z (2022) Cascade parallel random forest algorithm for predicting rice diseases in big data analysis. Electronics-Switz 11(7):1079
Jharna M, Sneha N, Shilpa A (2017) Analysis of agriculture data using data mining techniques: application of big data. J Big Data 4(20):1–15
Xia D, Bai Y, Zheng Y et al (2022) A parallel SP-DBSCAN algorithm on spark for waiting spot recommendation. Multimed Tools Appl 81(4015):4038
Misra NN, Dixit Y, Al-Mallahi A et al (2022) IoT, big data, and artificial intelligence in agriculture and food industry. IEEE Int Things 9(9):6305–6324
Jaber MM, Ali MH, Abd SK et al (2022) Predicting climate factors based on big data analytics based agricultural disaster management. Phys Chem Earth 128:103243
Shahhosseini M, Hu GP, Archontoulis SV (2020) Forecasting corn yield with machine learning ensembles. Front Plant Sci 11:1120
Nyeki A, Nemenyi M (2022) Crop yield prediction in precision agriculture. Agronomy-basel 12(10):2460
Zhai ZY, Martinez JF, Beltran V (2020) Decision support systems for agriculture 4.0: survey and challenges. Comput Electron Agr 170:105256
Jie Q (2022) Precision and intelligent agricultural decision support system based on big data analysis. Acta Agr Scand B-S P 72(1):401–414
Xia D, Yang N, Jiang S et al (2022) A parallel NAW-DBLSTM algorithm on Spark for traffic flow forecasting. Neural Comput Appl 34(1557):1575
Xia D, Bai Y, Geng J et al (2022) A distributed EMDN-GRU model on Spark for passenger waiting time forecasting. Neural Comput Appl 34(19035):19050
Xia D, Yang N, Jian S et al (2022) SW-BiLSTM: a Spark-based weighted BiLSTM model for traffic flow forecasting. Multimed Tools Appl 81(23589):23614
Rehman A, Javed K, Babri HA et al (2018) Selection of the most relevant terms based on a max-min ratio metric for text classification. Expert Syst Appl 114:78–96
Keles MK, Kilic U (2022) Classification of brain volumetric data to determine alzheimer’s disease using artificial bee Colony algorithm as feature selector. IEEE Access 10:82989–83001
Pes B (2020) Ensemble feature selection for high-dimensional data: a stability analysis across multiple domains. Neural Comput Appl 32(10):5951–5973
Chang Y, Lafata K, Sun W (2020) An investigation of machine learning methods in delta-radiomics feature analysis. PLoS One 14(12):1–14
Shiri I, Maleki H, Hajianfar G et al (2020) Next-generation radiogenomics sequencing for prediction of EGFR and KRAS mutation status in NSCLC patients using multimodal imaging and machine learning algorithms. Mol Imaging Biol 22(4):1132–1148
Han Y, Liu HJ, Zhang XL et al (2021) Prediction model of rice panicles blast disease degree based on canopy hyperspectral reflectance. Spectrosc Spect Anal 41(4):1220–1226
Shao YN, He Y (2013) Visible/Near infrared spectroscopy and chemometrics for the prediction of trace element (Fe and Zn) levels in rice leaf. Sensors-Basel 13:1872–1883
Miao XX, Miao Y, Gong HR et al (2019) Determination of moisture content in rice by near infrared spectroscopy combined with characteristic wavenumber. Food Sci Tech-Brazil 44(10):335–341
Li Z, Tan Y, Song SZ et al (2022) Spectral characteristic band analysis of japonica rice. Agricult Technol 42(7):11–14
Xu HC, Yao B, Wang Q et al (2021) Determination of suitable band width for estimating rice nitrogen nutrition index based on leaf reflectance spectra. Sci Agric Sin 54(21):4525–4538
Cao YL, Jiang KL, Liu YD et al (2021) Rice chlorophyll inversion based on hyperspectral red edge retrieval. J Shenyang Agricult Univ 52(6):718–728
Yu ZY, Wang X, Meng XT et al (2019) SPAD prediction model of Rice leaves considering the characteristics of water spectral absorption. Spectrosc Spect Anal 39(8):2528–2532
Du L, Yang J, Chen BW et al (2020) Novel combined spectral indices derived from hyperspectral and laser-induced fluorescence LiDAR spectra for leaf nitrogen contents estimation of rice. Remote Sens-Basel 12(1):185
Acknowledgements
The authors gratefully acknowledge the financial support from the National Key R&D Program of China (No. 2016YFD0300604, 2017YFD0300409-4). In part by the Introduction of talent research fund (YK20-05-01, YK20-05-06). In part by the Key Research and Development Program of Jiangsu Province under Grant BE2022351 (Modern Agriculture). In part by the Jiangsu Agricultural Science and Technology Innovation Found (No. CX(21)2042). In part by the Science and Technology Development Center Project of the Ministry of Education of China (No. 2020HYB02005). In part by the Jiangsu Industrial Software Engineering Technology Research and Development Center Project(No. ZK20-04-12).
Author information
Authors and Affiliations
Contributions
Xia Ji’An: Writing—original draft. WenYu Zhang: Methodology. WeiXin Zhang: Formal analysis. Mu WenTao, Xu RongWang, Yuan WangHao: Software. Ge DaoKuo, Zhang Qian, Ge SiJun: Investigation. HongXin Cao: Corresponding author.
Corresponding author
Ethics declarations
Competing interest
No conflict of interest exits in the submission of this manuscript (“Analysis of visible–near infrared spectral characteristics for water layer management of rice based on the big data platform”), and manuscript is approved by all authors for publication. I would like to declare on behalf of my co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere. All the authors listed have approved the manuscript that is enclosed.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xia, J., Zhang, W., Zhang, W. et al. Analysis of visible–near infrared spectral characteristics for water layer management of rice based on the big data platform. Multimed Tools Appl 83, 53279–53292 (2024). https://doi.org/10.1007/s11042-023-17593-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17593-y