Chem. J. Chinese Universities ›› 2020, Vol. 41 ›› Issue (1): 94.doi: 10.7503/cjcu20190400

• Analytical Chemistry • Previous Articles     Next Articles

Research on Boold Species Ide.pngication Algorithm Based on RF_AdaBoost Model

WEI Manman1,LU Haoxiang2,YANG Huihua1,3,*()   

  1. 1. School of Computer and Information Security
    2. School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin 541004, China
    3. School of Automation, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2019-07-19 Online:2020-01-10 Published:2019-10-11
  • Contact: Huihua YANG E-mail:yhh@bupt.edu.cn
  • Supported by:
    ? Supported by the National Natural Science Foundation of China Nos(21365008);? Supported by the National Natural Science Foundation of China Nos(61105004);and the Guangxi Key Laboratory of Automatic Testing Technology and Instrumentation Director Fund Project, China No(YQ18108)

Abstract: Aim ing at the requirements of non-destructive and high-efficiency analysis methods for ide.pngying human and non-human blood species, a method of blood species ide.pngication based on Random Forest combined Adaptive Boosting Algorithm(RF_AdaBoost) was proposed. This method uses RF as the weak classifier of AdaBoost to improve the ide.pngication accuracy and enhance the robustness of the model. RF, Extreme Learning Machine(ELM), Kernel Extreme Learning Machine(KELM), Stacked Auto-Encoder(SAE), Back Propagation(BP), Principal Component Analysis and Linear Discriminant Analysis(PCA-LDA), Partial Least Squares Discriminant Analysis(PLS-DA) are used to compare with the RF_AdaBoost model, and the training sets of different scales of blood Raman spectroscopy data were used for ide.pngication experiments to evaluate its performance. With the increase of training samples in the experiment, the ide.pngication accuracy of RF_AdaBoost is up to 100%, and the prediction standard deviation tends to zero. The results show that RF_AdaBoost has higher classification accuracy and stronger stability than other models, which provides an effective new method for the ide.pngication of blood species.

Key words: Raman spectroscopy, Random forest, AdaBoost algorithm, Ensemble learning, Boold species ide.pngication

CLC Number: 

TrendMD: