首页> 中文期刊> 《光谱学与光谱分析》 >Vis-NIR光谱模式识别结合SG平滑用于转基因甘蔗育种筛查

Vis-NIR光谱模式识别结合SG平滑用于转基因甘蔗育种筛查

         

摘要

Based on Savitzky-Golay (SG) smoothing screening ,principal component analysis (PCA) combined with separately supervised linear discriminant analysis (LDA) and unsupervised hierarchical clustering analysis (HCA) were used for non-de-structive visible and near-infrared (Vis-NIR) detection for breed screening of transgenic sugarcane .A random and stability-de-pendent framework of calibration ,prediction ,and validation was proposed .A total of 456 samples of sugarcane leaves planting in the elongating stage were collected from the field ,which was composed of 306 transgenic (positive) samples containing Bt and Bar gene and 150 non-transgenic (negative) samples .A total of 156 samples (negative 50 and positive 106) were randomly se-lected as the validation set ;the remaining samples (negative 100 and positive 200 ,a total of 300 samples) were used as the mod-eling set ,and then the modeling set was subdivided into calibration (negative 50 and positive 100 ,a total of 150 samples) and prediction sets (negative 50 and positive 100 ,a total of 150 samples) for 50 times .The number of SG smoothing points was ex-panded ,while some modes of higher derivative were removed because of small absolute value ,and a total of 264 smoothing modes were used for screening .The pairwise combinations of first three principal components were used ,and then the optimal combination of principal components was selected according to the model effect .Based on all divisions of calibration and predic-tion sets and all SG smoothing modes ,the SG-PCA-LDA and SG-PCA-HCA models were established ,the model parameters were optimized based on the average prediction effect for all divisions to produce modeling stability .Finally ,the model validation was performed by validation set .With SG smoothing ,the modeling accuracy and stability of PCA-LDA ,PCA-HCA were signif-icantly improved .For the optimal SG-PCA-LDA model ,the recognition rate of positive and negative validation samples were 94.3% ,96.0% ;and were 92.5% ,98.0% for the optimal SG-PCA-LDA model ,respectively .Conclusion:Vis-NIR spectro-scopic pattern recognition combined with SG smoothing could be used for accurate recognition of transgenic sugarcane leaves ,and provided a convenient screening method for transgenic sugarcane breeding .%以Savitzky-Golay (SG)平滑筛选,主成分分析(PCA)分别结合有监督的线性判别分析(LDA)、无监督的系统聚类分析(HCA),应用于转基因甘蔗育种筛查的可见-近红外(Vis-NIR)无损检测。提出兼顾随机性、稳定性的定标、预测、检验框架;取田间种植处于伸长期甘蔗叶样品456个,具有Bt基因和Bar基因的转基因样品(阳)306个,非转基因样品(阴)150个;随机选取156个为检验集(阴性50、阳性106),余下为建模集(阴性100、阳性200,共300),建模集再随机划分为定标集(阴性50、阳性100,共150)、预测集(阴性50、阳性100,共150)共50次;扩充SG平滑点数,同时删除绝对值偏小的高阶导数模式,共264个平滑模式用于模型筛选;采用前3个主成分两两组合,再根据模型效果选出最优主成分组合;基于所有定标、预测集划分和SG平滑模式,建立SG-PCA-LDA和SG-PCA-HCA模型,根据平均预测效果优选参数,使模型具有稳定性;最后用检验集进行模型检验。经SG平滑后,PCA-LDA和PCA-HCA的建模精度、稳定性均显著改善;最优SG-PCA-LDA模型阳性、阴性样品检验识别率分别达到94.3%和96.0%;最优SG-PCA-HCA模型阳性、阴性样品检验识别率分别达到92.5%和98.0%。结果表明:Vis-NIR光谱模式识别结合SG平滑可用于转基因甘蔗叶的准确识别,提供了一种简便的转基因甘蔗育种筛查方法。

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号