高等学校化学学报 ›› 2016, Vol. 37 ›› Issue (10): 1792.doi: 10.7503/cjcu20160344

• 分析化学 • 上一篇    下一篇

基于最小二乘支持向量机的油页岩含油率近红外光谱分析

张福东, 刘杰, 王智宏()   

  1. 吉林大学仪器科学与电气工程学院, 长春 130021
  • 收稿日期:2016-05-16 出版日期:2016-10-10 发布日期:2016-09-20
  • 作者简介:联系人简介: 王智宏, 女, 博士, 教授, 博士生导师, 主要从事近红外光谱仪器研制及应用技术研究.E-mail:zhwang@jlu.edu.cn.
  • 基金资助:
    国家潜在油气资源(油页岩勘探开发利用)产学研用合作创新子课题(批准号: OSR-02-04)和吉林省科技发展计划项目重大科技专项(批准号: 20116014)资助

Analysis of Oil Yield from Oil Shale Minerals Based on Near-infrared Spectroscopy with Least Squares Support Vector Machines

ZHANG Fudong, LIU Jie, WANG Zhihong*()   

  1. College of Instrumentation Science & Electrical Engineering, Jilin University, Changchun 130021, China
  • Received:2016-05-16 Online:2016-10-10 Published:2016-09-20
  • Contact: WANG Zhihong E-mail:zhwang@jlu.edu.cn.

摘要:

为了提高油页岩含油率近红外光谱分析建模的预测精度和稳定性, 开展了基于最小二乘支持向量机(LS-SVM)建模方法的对比研究. 采用主成分-马氏距离(PCA-MD)和基于蒙特卡洛采样(MCS)2种方法进行了奇异样本的检测, 采用径向基核函数的LS-SVM、 偏最小二乘(PLS)和反向传播神经网络(BPANN)3种方法进行建模方法对比. 结果表明, 对于64个油页岩岩芯样本, 与PCA-MD方法相比, 采用MCS方法剔除奇异样本后所建PLS模型的预测精度提高了28%. 对于MCS方法剔除奇异样本后的58个样品, 采用Kennard-Stone法划分了44个样品的校正集和14个样品的预测集, 采用2阶导数和标准化预处理方法, 建立了100个LS-SVM的校正模型, 模型的预测决定系数R2平均值达到0.90以上, 高于PLS和BPANN模型的对应值; 且R2的变化量(0.02)小于BPANN模型的对应值(0.32). 因此, MCS奇异样本检测结合LS-SVM方法可提高油页岩含油率样本建模的精度和稳定性.

关键词: 最小二乘支持向量机, 油页岩, 含油率, 近红外光谱分析, 奇异样本

Abstract:

In order to improve the prediction accuracy and precision of near-infrared(NIR) spectroscopy model for analyzing the oil yield from oil shale, sixty-four oil shale samples from the No.2 well drilling of Fuyu oil shale base were analyzed based on least squares support vector machines(LS-SVM) calibration models. The Principal component-mahalanobis distance(PCA-MD) method and the Monte-Carlo sampling-based detection of outliers(MCS) method were investigated as means of removing the outliers. The modeling methods of radial basis function-based LS-SVM, partial least squares(PLS) and back propagation neural network (BPANN) were compared. The results showed that, compared with PCA-MD, the prediction accuracy of PLS models based on MCS was improved by 28%. The samples after eliminating the outliers were divided into the calibration set with 44 samples and the prediction set with 14 samples using the Kennard Stone method. One hundred LS-SVM calibration models were established based on preprocessing method of second-derivative and autoscaling. The mean determination coefficient(R2) were more than 90% and higher than PLS and BPANN models, and the fluctuation of R2 were less than BPANN models. Thus, LS-SVM regression with MCS method can improve the accuracy and precision of oil yield of oil shale modeling.

Key words: Least squares support vector machine, Oil shale, Oil yield, Near-infrared spectroscopy, Outlier

中图分类号: 

TrendMD: