高等学校化学学报 ›› 2009, Vol. 30 ›› Issue (4): 697.doi:

• 研究论文 • 上一篇    下一篇

烟草组分的近红外光谱和支持向量机分析

张勇1,2, 丛茜1, 谢云飞3, 赵冰3   

    1. 吉林大学地面机械仿生技术教育部重点实验室, 长春 130025;
    2. 吉林工程技术师范学院, 长春 130052;
    3. 吉林大学超分子结构与材料国家重点实验室, 长春 130012
  • 收稿日期:2008-07-03 出版日期:2009-04-10 发布日期:2009-04-10
  • 通讯作者: 赵冰, E-mail:zhaobing@jlu.edu.cn
  • 基金资助:

    国家自然科学基金(批准号: 20473029, 20573041)和吉林省科技厅项目(批准号: 20060902-02, 200705C07)资助.

Analysis of Tobacco by Near-infrared Spectroscopy and Support Vector Machine

ZHANG Yong1,2, CONG Qian1, XIE Yun-Fei3, ZHAO Bing3*   

    1. Key Laboratory for Terrain-Machine Bionics Engineering, Ministry of Education, Jilin University, Changchun 130025, China;
    2. Jilin Teachers′ Institute of Engineering and Technology, Changchun 130052, China;
    3. State Key Laboratory for Supramolecular Structure and Material, Jilin University, Changchun 130012, China
  • Received:2008-07-03 Online:2009-04-10 Published:2009-04-10
  • Contact: ZHAO Bing, E-mail:zhaobing@jlu.edu.cn

摘要:

测定了120个产自福建、安徽和云南烟草样品的近红外光谱. 在利用支持向量机(SVM)技术建立其定量、定性分析模型之前, 用小波变换技术对光谱变量进行了有效的压缩, 然后采用径向基核函数建立了75个烟草样品的分类模型, 同时建立了总糖、还原糖、烟碱和总氮4个组分的定量分析模型, 并利用45个烟草样品对模型进行了检验. 仿真实验表明, 建立的SVM分类模型分类准确率达到100%, 而4个组分的定量分析模型的预测决定系数(R2)、预测均方差(RMSEP)和平均相对误差(RME)3个指标值显示其模型泛化能力非常强, 预测效果良好, 可见这是一种有效的近红外光谱的建模分析方法.

关键词: 近红外光谱, 支持向量机, 小波变换, 烟草

Abstract:

In this study, in order to establish analysis models of near-infrared spectroscopy(NIR) of tobacco, 120 samples of tobacco from different cultivation area were surveyed by near-infrared(NIR) spectroscopy. As the new pattern recognition, support vector machine(SVM) can avoid over-fitting problem and owns the superior generalization ability and prediction accuracy, were applied in this study. The quantitative and qualitative analysis models of tobacco samples were studied separately in this experiment using radial basis function(RBF) SVM. For reducing dimension and moving noise, the spectrum variables were highly effectively compressed using the wavelet transformation(WT) technology and the haar wavelet was selected to decompose the spectroscopy signals. Simultaneously, the parameters of the models were also discussed in detail. The best experimental results were obtained using the RBF SVM regression with kernel parameter σ=1.0, 1.2, 1.4, 0\^6, separately corresponds to total-sugar, reducing sugar, nicotine, total-nitrogen, and RBF SVM classifier with kernel parameter σ=1.6. Meanwhile, the values of appraisal index, namely coefficient of determination(R2), root mean squared error of prediction(RMSEP) and mean relative error(RME), indicate its excellent generalization for quantitative and qualitative analysis results and high prediction accuracy. The overall results show that NIR spectroscopy combined with SVM can be efficiently utilized for rapid and accurate analysis of chemical compositions in tobacco and discrimination of tobacco of different origins. On the other hand, the research can show that SVM is effective modeling tools to NIR spectroscopy and can provide technical support for quantitative and quantitative analysis of other NIR applications.

Key words: Near-infrared spectroscopy, Support vector machine, Wavelet transformation, Tobacco

中图分类号: 

TrendMD: