基于POT的多元统计过程核电数据异常检测方法

张大志; 罗骁域; 郑胜

doi:10.16516/j.ceec.2024-099

基于POT的多元统计过程核电数据异常检测方法

DOI: 10.16516/j.ceec.2024-099

CSTR: 32391.14.j.ceec.2024-099

张大志^1,,
罗骁域^2, ,,
郑胜^3,

1.
中核武汉核电运行技术股份有限公司, 湖北武汉 443074
2.
三峡大学电气与新能源学院, 湖北宜昌 443002
3.
三峡大学理学院, 湖北宜昌 443002

基金项目: 中核集团核工业仿真技术重点实验室对外开放基金项目“基于半监督/自监督的深度学习异常检测方法研究”（B220631）

详细信息

作者简介:
张大志，1977-，男，总工程师，博士，主要研究方向核电仿真及数据同化及核电厂运行数据异常检测（e-mail）zhangdz02@cnnp.com.cn

罗骁域，1994-，男，博士研究生，主要从事核电厂数据同化及核电厂运行数据异常检测工作（e-mail）lxy@ctgu.edu.cn

郑胜，1965-，男，博士生导师，教授，主要研究方向为图像处理，核电厂运行数据异常检测（e-mail）zsh@ctgu.edu.cn

通讯作者:
罗骁域，（e-mail）lxy@ctgu.edu.cn。

中图分类号: TL67；TL48

An Anomaly Detection Method for Multivariate Statistical Process Based on POT

ZHANG Dazhi^1
,,
LUO Xiaoyu^{2
, ,},
ZHENG Sheng^3
,

1.
China Nuclear Power Operation Technology Corporation, Ltd., Wuhan 443074, Hubei, China
2.
College of Electrical Engineering and New Energy, China Three Gorges University, Yichang 443002, Hubei, China
3.
College of Science, China Three Gorges University, Yichang 443002, Hubei, China

摘要: 目的核电设备的安全运行对核电厂至关重要，发生事故所带来的损失是不可估量的。因此，对核电设备进行有效的异常检测十分必要。针对固定阈值和人为检测方法的局限性，这些方法难以适应时序数据的动态变化，文章提出一种基于POT的多元统计过程的异常检测方法。方法文章采用主成分分析方法构建异常检测模型，将模型的SPE统计量作为POT算法的初始阈值，然后将超过初始阈值的部分进行广义帕累托分布拟合，从而确定最终的动态阈值。当异常分数超过最终阈值则发出异常警告。通过将多元统计过程控制和极值理论相结合，该方法利用多元统计过程控制快速发现核电厂运行数据中的异常情况，并结合极值理论对极端事件的建模与分析来提高异常检测的灵敏度和可靠性，能够快速发现核电厂高维运行数据中存在的异常情况。结果在仿真实验结果中，文章提出的方法相较于常规的多元统计方法和POT方法，具有更高的准确率、召回率。在核电厂不同设备上的实际运行数据的实验中，证明了该方法在异常检测上的有效性。结论将多元统计过程控制和极值理论结合，提出的异常检测方法不仅能检测到由数据相互关系改变引起的异常，而且能利用POT方法确定最终阈值避免传统多元统计过程控制中出现的误检。该方法能处理核电厂高维时序运行数据，提高异常发现的效率，确保了核电厂安全高效地运行从而提高核电厂的经济效益。
- 多元统计过程控制 /
- 主成分分析 /
- 动态阈值 /
- 极值理论
Abstract: Objective The safe operation of nuclear power equipment is crucial for nuclear power plants (NPPs), and the losses caused by accidents are immeasurable. Therefore, effective anomaly detection for nuclear power equipment is necessary. Considering the limitations of fixed thresholds and manual detection methods, which are difficult to adapt to the dynamic changes in time series data, this paper proposes an anomaly detection method based on POT for multivariate statistical processes. Method This paper adopted PCA to construct an anomaly detection model, where the SPE statistic of the model served as the initial threshold for the POT algorithm. Subsequently, the portion exceeding the initial threshold was fitted with a generalized Pareto distribution to determine the final dynamic threshold. An anomaly warning was issued when the anomaly score exceeded the final threshold. By combining multivariate statistical process control (MSPC) with extreme value theory (EVT), this method used MSPC to discover anomalies in the operating data of NPPs quickly and improved the sensitivity and reliability of anomaly detection by modeling and analyzing extreme events, so that it can quickly detect anomalies in high-dimensional operating data of NPPs. Result In the simulation experiment results, the proposed method has a higher accuracy and recall rate than conventional multivariate statistical and POT methods. In experiments with actual operating data from different equipment in NPPs, the method's effectiveness in anomaly detection has been demonstrated. Conclusion By combining MPSC with EVT, the anomaly detection method proposed in this paper can not only detect anomalies caused by changes in data relationships but also avoid false detection in traditional MSPC by determining the final threshold using the POT method. This method can handle high-dimensional time series operating data of NPPs, improve the efficiency of anomaly detection, ensure the safe and efficient operation of NPPs, and improve their economic benefits.
- multivariate statistical process control (MSPC) /
- principal component analysis (PCA) /
- peaks-over-threshold (POT) /
- extreme value theory (EVT)

图 1 基于多元统计过程控制的异常检测流程图

Fig. 1 Flowchart of anomaly detection based on MSPC.

下载: 全尺寸图片幻灯片

图 2 异常检测算法流程图

Fig. 2 Flowchart of anomaly detection algorithm

下载: 全尺寸图片幻灯片

图 3 仿真数据1的3个传感器数据

Fig. 3 Data from three sensors for simulated data 1

下载: 全尺寸图片幻灯片

图 4 仿真数据1中添加异常的测试数据

Fig. 4 Test data with anomalies added in simulated data 1

下载: 全尺寸图片幻灯片

图 5 仿真数据2的3个传感器数据

Fig. 5 Data from three sensors for simulated data 2

下载: 全尺寸图片幻灯片

图 6 仿真数据2中添加异常的测试数据

Fig. 6 Test data with anomalies added in simulated data 2

下载: 全尺寸图片幻灯片

图 7 对仿真数据1的数据检测结果

Fig. 7 Data detection results of simulated data 1

下载: 全尺寸图片幻灯片

图 8 三种方法检测的结果还原到仿真数据1，上子图为SPE-POT方法的检测结果，中间子图为MSPC方法的检测结果，下子图展示的是POT方法的检测结果

Fig. 8 Detection results of the three methods are restored to simulated data 1: the upper subplot shows the detection results using the SPE-POT method, and the middle subplot shows the detection results using the MSPC method, and the lower subplot shows the detection results using the POT method

下载: 全尺寸图片幻灯片

图 9 对仿真数据2的测试数据检测结果

Fig. 9 Detection results for the test data of simulated data 2

下载: 全尺寸图片幻灯片

图 10 SPE-POT方法异常检测结果（上、中、下子图分别为将检测结果在Data 4、5、6数据上标记的结果，红点表示检出的异常点）

Fig. 10 Anomaly detection results of the SPE-POT method: the upper, middle, and lower subplots respectively display the results marked on data 4, data 5, and data 6. red dots represent the detected anomalous points

下载: 全尺寸图片幻灯片

图 11 MSPC方法异常检测结果，上、中、下子图分别为将检测结果在Data4、Data5、Data6数据上标记的结果，红点表示检出的异常点

Fig. 11 Anomaly detection results of the MSPC method: the upper, middle, and lower subplots respectively display the results marked on data 4, data 5, and data 6. red dots represent the detected anomalous points

下载: 全尺寸图片幻灯片

图 12 POT方法检测结果还原到原始数据上的情况，上、中、下子图分别为将检测结果在Data 4、5、6的测试数据上标记的结果，红点表示检出的异常点

Fig. 12 Detection results of the POT method restored to the original data: the upper, middle, and lower subplots respectively display the results marked on data 4, data 5, and data 6 test data. red dots represent the detected anomalous points

下载: 全尺寸图片幻灯片

图 13 表4中数据的统计量SPE值及SPE-POT方法确定的阈值

Fig. 13 Statistical SPE values of the data in table 4 and the threshold determined by the SPE-POT method

下载: 全尺寸图片幻灯片

图 14 将检测结果分别还原到传感器RCV200MT的数据上，上、下子图分别展示的是SPE-POT和MSPC方法的检测结果

Fig. 14 Detection results are restored to the data of sensor RCV200MT respectively: the upper and lower subplots display the detection results of the SPE-POT and MSPC methods, respectively

下载: 全尺寸图片幻灯片

图 15 表5中数据的统计量SPE值及SPE-POT方法确定的阈值

Fig. 15 Statistical SPE values of the data in table 5 and the threshold determined by the SPE-POT method

下载: 全尺寸图片幻灯片

图 16 将2个方法的检测结果分别还原到传感器GRE014MM的数据上，上、下子图分别展示的是SPE-POT方法和MSPC方法的检测结果

Fig. 16 Detection results of the two methods are respectively restored to the data of sensor GRE014MM: the upper and lower subplots display the detection results of the SPE-POT method and the MSPC method, respectively.

下载: 全尺寸图片幻灯片

图 17 除氧器设备监测数据的SPE值及其阈值和SPE-POT方法确定的阈值

Fig. 17 SPE values of the deaerator equipment monitoring data and their thresholds, as well as the thresholds determined by the SPE-POT method.

下载: 全尺寸图片幻灯片

图 18 将3个方法的检测结果分别还原到传感器的数据上的情况，上、中、下子图分别展示的是SPE-POT方法、MSPC方法和POT方法的检测结果

Fig. 18 Detection results of the three methods are respectively restored to the situation of the sensor data: the upper, middle, and lower subplots display the detection results of the SPE-POT, MSPC, and POT method, respectively.

下载: 全尺寸图片幻灯片

表 1 SPE-POT方法的参数设置

Tab. 1. Parameter setting for the SPE-POT method

编号	参数	设定值	描述
1	c_α	0.05	标准正态分布的置信极限
2	η	0.95	特征值累计贡献率
3	q	0.000 1	广义帕累托分布的概率

下载: 导出CSV

表 2 SPE-POT和MSPC方法检测的统计结果

Tab. 2. Statistical results of detection by SPE-POT and MSPC methods

评价指标	R	P	F₁
SPE-POT	100%	100%	100%
POT	100%	100%	100%
MSPC	100%	97.4%	98.7%

下载: 导出CSV

表 3 SPE-POT和POT方法检测的统计结果

Tab. 3. Statistical results of detection by SPE-POT and POT methods

评价指标	R	P	F₁
SPE-POT	92.67%	89.68%	91.15%
POT	2.67%	2.86%	2.76%
MSPC	98.67%	11.80%	21.08%

下载: 导出CSV

表 4 真实Data 1的传感器信息

Tab. 4. Sensor information from real data 1

序号	传感器	描述
1	RCV200MT	电机非驱动端轴承温度
2	RCV201MT	电机驱动端轴承温度
3	RCV202MT	电机定子绕组温度
4	RCV206MT	增速箱LS轴承温度
5	RCV210MT	泵推力轴承温度
6	RCV223MT	电机定子绕组温度
7	RCV231MV	推力轴承箱振动(X方向)
8	RCV232MV	推力轴承箱振动(Y方向)
9	RCV233MV	径向轴承箱振动(X方向)
10	RCV234MV	径向轴承箱振动(X方向)

下载: 导出CSV

表 5 真实Data 2的传感器信息

Tab. 5. Sensor information from real data 2

序号	传感器	描述
1	GRE014MM	汽轮机轴向位移
2	GRE015MM	汽轮机轴向位移
3	GRE016MM	汽轮机轴向位移
4	GRE018MM	汽轮机转子偏心度
5	GRE019MY	高中压转子绝对胀差
6	GRE020MV	高中压缸绝对胀差
7	GRE315MV	低压转子绝对胀差

下载: 导出CSV

[1]	王浩然, 冯天天, 崔茗莉, 等. 碳交易政策下绿氢交易市场与电力市场耦合效应分析 [J]. 南方能源建设, 2023, 10(3): 32-46. DOI: 10.16516/j.gedi.issn2095-8676.2023.03.004. WANG H R, FENG T T, CUI M L, et al. Analysis of coupling effect between green hydrogen trading market and electricity market under carbon trading policy [J]. Southern energy construction, 2023, 10(3): 32-46. DOI: 10.16516/j.gedi.issn2095-8676.2023.03.004.
[2]	王鑫, 吴继承, 朴磊. “双碳”目标下核能发展形势思考 [J]. 核科学与工程, 2022, 42(2): 241-245. DOI: 10.3969/j.issn.0258-0918.2022.02.001. WANG X, WU J C, PU L. Consideration of the development situation of nuclear power under the goal of carbon peaking and carbon neutraulity [J]. Nuclear science and engineering, 2022, 42(2): 241-245. DOI: 10.3969/j.issn.0258-0918.2022.02.001.
[3]	蔡绍宽. 双碳目标的挑战与电力结构调整趋势展望 [J]. 南方能源建设, 2021, 8(3): 8-17. DOI: 10.16516/j.gedi.issn2095-8676.2021.03.002. CAI S K. Challenges and prospects for the trends of power structure adjustment under the goal of carbon peak and neutrality [J]. Southern energy construction, 2021, 8(3): 8-17. DOI: 10.16516/j.gedi.issn2095-8676.2021.03.002.
[4]	吴铮, 张悦, 董泽. 基于改进高斯混合模型的热工过程异常值检测 [J]. 系统仿真学报, 2023, 35(5): 1020-1033. DOI: 10.16182/j.issn1004731x.joss.22-0047. WU Z, ZHANG Y, DONG Z. Outlier detection during thermal processes based on improved gaussian mixture model [J]. Journal of system simulation, 2023, 35(5): 1020-1033. DOI: 10.16182/j.issn1004731x.joss.22-0047.
[5]	崔文浩, 郑胜, 秦雄杰, 等. 基于多尺度时间窗口的核电运行数据关联性分析方法研究 [J]. 南方能源建设, 2023, 10(2): 143-150. DOI: 10.16516/j.gedi.issn2095-8676.2023.02.019. CUI W H, ZHENG S, QIN X J, et al. Research on correlation analysis method for nuclear power operation data based on multi-scale time window [J]. Southern energy construction, 2023, 10(2): 143-150. DOI: 10.16516/j.gedi.issn2095-8676.2023.02.019.
[6]	YIN S, DING S X, XIE X C, et al. A review on basic data-driven approaches for industrial process monitoring [J]. IEEE Transactions on industrial electronics, 2014, 61(11): 6418-6428. DOI: 10.1109/TIE.2014.2301773.
[7]	JIN X H, FAN J C, CHOW T W S. Fault detection for rolling-element bearings using multivariate statistical process control methods [J]. IEEE transactions on instrumentation and measurement, 2019, 68(9): 3128-3136. DOI: 10.1109/TIM.2018.2872610.
[8]	ZHANG Y W, LI S, HU Z Y. Improved multi-scale kernel principal component analysis and its application for fault detection [J]. Chemical engineering research and design, 2012, 90(9): 1271-1280. DOI: 10.1016/j.cherd.2011.11.015.
[9]	JIANG Q C, YAN X F, HUANG B. Performance-driven distributed PCA process monitoring based on fault-relevant variable selection and bayesian inference [J]. IEEE transactions on industrial electronics, 2016, 63(1): 377-386. DOI: 10.1109/TIE.2015.2466557.
[10]	CHANDOLA V, BANERJEE A, KUMAR V. Anomaly detection: a survey [J]. ACM computing surveys (CSUR), 2009, 41(3): 15. DOI: 10.1145/1541880.1541882.
[11]	WANG H, PENG M J, WESLEY HINES J, et al. A hybrid fault diagnosis methodology with support vector machine and improved particle swarm optimization for nuclear power plants [J]. ISA transactions, 2019, 95: 358-371. DOI: 10.1016/j.isatra.2019.05.016.
[12]	NOMIKOS P. Detection and diagnosis of abnormal batch operations based on multi-way principal component analysis world batch forum, Toronto, May 1996 [J]. ISA transactions, 1996, 35(3): 259-266. DOI: 10.1016/S0019-0578(96)00035-3.
[13]	CHEN S Y, JIN G, MA X Y. Satellite on-orbit anomaly detection method based on a dynamic threshold and causality pruning [J]. IEEE access, 2021, 9: 86751-86758. DOI: 10.1109/ACCESS.2021.3088439.
[14]	卢培, 李小宝, 郑晨旭, 等. 350 MW余热锅炉变工况运行特性分析 [J]. 南方能源建设, 2022, 9(3): 41-49. DOI: 10.16516/j.gedi.issn2095-8676.2022.03.005. LU P, LI X B, ZHENG C X, et al. Analysis on operation characteristics of 350 MW waste heat boiler under variable working conditions [J]. Southern energy construction, 2022, 9(3): 41-49. DOI: 10.16516/j.gedi.issn2095-8676.2022.03.005.
[15]	SIFFER A, FOUQUE P A, TERMIER A, et al. Anomaly detection in streams with extreme value theory [C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, Canada, August 13-17, 2017. Halifax: ACM, 2017: 1067-1075. DOI: 10.1145/3097983.3098144.
[16]	CHARRAS-GARRIDO M, DEVILLE Y, LEZAUD P. Corrective to the article: extreme value analysis - an introduction Journal de la SFdS Vol. 154 No2, 66-97 [J]. Journal de la société française de statistique, 2017, 158(3): 27-28.
[17]	YU X L, ZHAO Z B, ZHANG X W, et al. Deep-learning-based open set fault diagnosis by extreme value theory [J]. IEEE transactions on industrial informatics, 2022, 18(1): 185-196. DOI: 10.1109/TII.2021.3070324.
[18]	BEIRLANT J, GOEGEBEUR Y, TEUGELS J, et al. Statistics of extremes: theory and applications [M]. Hoboken: Wiley, 2004. DOI: 10.1002/0470012382.
[19]	HUNDMAN K, CONSTANTINOU V, LAPORTE C, et al. Detecting spacecraft anomalies using LSTMs and nonparametric dynamic thresholding [C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, August 19-23, 2018. London: ACM, 2018: 387-395. DOI: 10.1145/3219819.3219845.
[20]	LUO X Y, ZHENG S, HUANG Y, et al. Molecular clump extraction algorithm based on local density clustering [J]. Research in astronomy and astrophysics, 2021, 22(1): 015003. DOI: 10.1088/1674-4527/ac321d.
[21]	WANG H, PENG M J, YU Y, et al. Fault identification and diagnosis based on KPCA and similarity clustering for nuclear power plants [J]. Annals of nuclear energy, 2021, 150: 107786. DOI: 10.1016/j.anucene.2020.107786.

[1]	王宇轩, 张羽丰, 李连生. 小型先进绝热压缩空气储能系统建模仿真与动态分析 . 南方能源建设, doi: 10.16516/j.ceec.2024-173
[2]	张冬清, 张国华, 徐玲铃, 高晟辅. 调相机在电力系统中的发展应用与动态特性 . 南方能源建设, doi: 10.16516/j.ceec.2024.4.04
[3]	冯国平, 李娟, 解文艳, 吉小恒, 古明生, 黄翔. 基于模糊理论的数字电网发展指数评估 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2023.S1.001
[4]	孙潇, 蔡春荣, 罗志斌, 王小博, 朱光涛, 裴爱国. 70 MPa加氢站动态模拟与能耗分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2023.03.017
[5]	刘华全, 李元松, 潘胜平, 张鑫, 赵勇. 导向架平台吸力桶基础施工过程控制关键技术 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2023.01.012
[6]	魏厚俊, 谢研, 孙亚坤, 韩玉鑫, 宋民航. 燃煤机组调峰过程中污染物排放特性及控制技术 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2022.03.006
[7]	任灏, 马兆荣, 李聪, 徐璐. 海上风电多筒导管架基础湿拖过程稳性控制研究 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2021.S1.010
[8]	杨锋斌, 张先提, 王春磊, 孙文龙. 海上升压站正压送风系统计算及控制研究 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2021.01.008
[9]	刘金平, 滕林, 陈向阳. 区域供冷与蓄冷技术发展动态 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2020.03.001
[10]	邱金鹏. 基于SA-PSO的风电消纳经济性动态规划分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2019.03.016
[11]	肖颍涛, 王化全, 俞海峰, 胡晓侠, 柴贤东. 基于主成分分析法和模糊综合评价法的配电网评估 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2019.03.018
[12]	刘海喆, 田亮. 主汽压力控制品质与燃料量变化约束关系定量分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2018.03.007
[13]	胡金政, 张洁, 陈宏智, 王贺, 郑文棠. 基于随机场理论的强降雨条件下花岗岩残积土边坡的稳定性分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2017.03.020
[14]	田帅, 章严韬. 基于产业组织理论的中国水电建设行业与市场分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2017.03.025
[15]	彭兴虎. 350 MW机组主厂房布置优化分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2015.01.011
[16]	蔡彦枫, 张灿亨, 黄勇. 广东沿海地区极值风速空间插值方法的对比研究 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2015.S1.042
[17]	王若愚, 谢莹华. 深圳电网负荷分类及构成分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2015.03.008
[18]	梁赟, 冯永青. 基于可信性理论的有源配电网可靠性分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2015.02.011
[19]	王焕然. 一种新型压缩空气储能系统的理论分析 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2015.02.003
[20]	郭祚刚, 邓广义, 范永春, 陈光明. 压缩空气储能系统的理论分析及性能研究 . 南方能源建设, doi: 10.16516/j.gedi.issn2095-8676.2014.01.007

图(18) / 表 (5)

计量

文章访问数: 335
HTML全文浏览量: 242
PDF下载量: 18
被引次数: 0

全文HTML

0. 引言

作为一种低碳、高效的清洁能源^[1-2]，核能在应对全球气候变化中发挥着积极的作用^[3]，2021年的核电厂发电占比达到了4.77%。核电厂所需要的安全性和可靠性使得异常检测技术^[4]在核电领域受到了更广泛的关注。随着监测技术的发展，工业过程中积累了大量高度相关的过程变量，如压力、振动、液位、温度等。通过观察大量的过程变量可发现或预测系统的故障，但对于操作人员而言是巨大的挑战，且对于过程变量间相关性^[5]发生变化而过程变量本身的均值、方差不发生变化的情况，操作人员更难以及时发现故障。随着生产过程的现代化和监测规模的增加，传统的基于模型的方法难以及时发现异常变化，而基于数据驱动的方法，尤其是多元统计过程控制（Multivariate Statistical Process Control，MSPC）方法^[6]能有效地处理庞大且高度相关的数据，而得以有效发展。

基于MSPC的异常检测方法通常遵循2个阶段：（1）利用历史正常数据训练多元统计模型；（2）利用训练的模型对数据实现异常监测。在MSPC中，模型通常由两部分组成：降维模型和监控指标及其控制限^[7]。研究表明，基于主成分分析（Principal Component Analysis，PCA）的MSPC方法非常适合过程故障监测^[8-9]，PCA用相互独立的变量来表示测量过程中的相关向量，将高维数据用低维特征表示，并在子空间中定义监控指标及其控制限^[7]。

异常检测^[10-11]依赖于用户定义的先验阈值，使得异常检测算法对阈值的选取非常敏感。设定较高的阈值会使得部分异常漏检，反之会增加误检率。MSPC方法通过设定置信度来确定监测指标的误差限，以避免虚假警报^[12]。然而，最终的检测效果依然受置信度选择影响。阈值的选取常见的有固定阈值和动态阈值^[13]。固定阈值由于无法适应变化^[14]的数据，使用范围有限。因而研究的重心更多在动态阈值方面，L. Haan^[15]等基于极值理论（Extreme Value Theory，EVT）^[16]提出了（Peak Over Threshold，POT）方法来自动确定阈值，将超过初始阈值的极值进行广义帕累托分布（Generalized Pareto Distribution，GPD）拟合，将极值建模为参数化分布，根据给定的风险值得到最终的阈值。此外，GPD对数据分布不做任何的假设，因而在异常检测中被广泛应用^[17-18]。

POT方法能很好的处理单变量和单峰分布的极值，但是处理多变量或更丰富分布的数据时，特别是对于一个系统中的过程变量间相关性改变而过程变量本身的均值或方差不发生变化的情况，对该算法是一个严峻的挑战。目前通常采用将阈值设置为样本的分位数98%，存在求解困难、计算不稳定，有时无法获得合适的阈值等问题^[19]。

为适应多源数据且各变量相互关联的时变系统的故障监测和预警，文章将MSPC和EVT相结合，提出基于POT方法的多元统计异常检测方法（SPE-POT）。该方法首先基于PCA在残差空间上定义平方预报误差（Squared Predication Error，SPE）统计量及其误差限。然后，对超过其误差限的SPE值的近似广义帕累托分布，通过极大似然估计对估计该分布的参数。最后，根据给定的风险值确定最终阈值实现异常检测。在核电厂仿真数据和历史数据的实验中，该算法实现了对异常的有效检测并降低了虚假警报。极值理论专注于极端事件的建模和分析，这些事件可能对系统的运行产生重大影响。通过将多元统计过程控制和极值理论相结合，可以综合考虑系统中的常规变化和极端事件，从而提高异常检测的灵敏度和可靠性。该方法能够更全面地捕获核电高维运行数据中的异常情况，为核电厂运行和安全提供更可靠的保障。

综上所述，文章的贡献如下：（1）提出了一种基于多元统计和极值理论的异常检测方法，该方法能够适用于高维时序数据；（2）对异常检测采用分级报警，超过SPE误差限时发出预警，超过阈值z_q的发出报警；（3）以一定的风险值来控制报警事故的比率，允许操作人员根据不同级别的设备根据风险值动态调节阈值。

3. 结论

文章从多元统计过程控制出发，提出了基于极值理论的SPE-POT方法。该方法基于PCA模型对历史正常数据建模，在残差空间中得到统计量SPE及其误差限，并引入基于极值理论的POT方法对估计最终阈值进行异常检测，从而提高了方法的性能。在仿真数据集和真实数据集上的实验均表明，提出的方法在准确率误检率优于传统的MSPC方法。同时方法还具有对参数依赖较小的优点，具有良好的实用性。

由于提出的SPE-POT方法利用PCA模型构建历史正常数据模型，是通过发现数据间的相关关系来实现异常检测的，故对数据中存在的噪声具有一定的抑制作用。同时，当设备出现异常后，反映到运行数据上的表现是传感器之间的耦合关系发生明显差异，这是明显区别于噪声的情况，通过多元统计过程控制能迅速捕获这样的异常。然而，对核电厂中存在非线性过程的工况，如设备状态切换、升降负荷等工况的适应性较不足。在后续工作中准备将KPCA^[21]引入本工作中，对核电运行数据中的非线性过程建模，将本方法应用于核电厂瞬态运行工况下，进一步提升该方法的适应场景。

参考文献 (21)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于POT的多元统计过程核电数据异常检测方法

DOI: 10.16516/j.ceec.2024-099

CSTR: 32391.14.j.ceec.2024-099

通讯作者:
罗骁域，（e-mail）lxy@ctgu.edu.cn。

An Anomaly Detection Method for Multivariate Statistical Process Based on POT

计量

基于POT的多元统计过程核电数据异常检测方法

DOI: 10.16516/j.ceec.2024-099

CSTR: 32391.14.j.ceec.2024-099

1. 中核武汉核电运行技术股份有限公司, 湖北武汉 443074

2. 三峡大学电气与新能源学院, 湖北宜昌 443002

3. 三峡大学理学院, 湖北宜昌 443002

通讯作者: 罗骁域，（e-mail）lxy@ctgu.edu.cn。

English Abstract

An Anomaly Detection Method for Multivariate Statistical Process Based on POT

1. China Nuclear Power Operation Technology Corporation, Ltd., Wuhan 443074, Hubei, China

2. College of Electrical Engineering and New Energy, China Three Gorges University, Yichang 443002, Hubei, China

3. College of Science, China Three Gorges University, Yichang 443002, Hubei, China

全文HTML

1.1. 基于PCA的MSPC异常检测方法

1.2. 基于极值理论的POT异常检测方法

1.3. 基于POT的多元统计过程异常检测算法

2.1. 实验设置及评价指标

2.2. 仿真数据实验

2.3. 真实数据实验

目录

留言板

基于POT的多元统计过程核电数据异常检测方法

DOI: 10.16516/j.ceec.2024-099

CSTR: 32391.14.j.ceec.2024-099

通讯作者: 罗骁域，（e-mail）lxy@ctgu.edu.cn。

An Anomaly Detection Method for Multivariate Statistical Process Based on POT

计量

出版历程

基于POT的多元统计过程核电数据异常检测方法

DOI: 10.16516/j.ceec.2024-099

CSTR: 32391.14.j.ceec.2024-099

1. 中核武汉核电运行技术股份有限公司, 湖北 武汉 443074 2. 三峡大学电气与新能源学院, 湖北 宜昌 443002 3. 三峡大学理学院, 湖北 宜昌 443002

通讯作者: 罗骁域，（e-mail）lxy@ctgu.edu.cn。

English Abstract

An Anomaly Detection Method for Multivariate Statistical Process Based on POT

1. China Nuclear Power Operation Technology Corporation, Ltd., Wuhan 443074, Hubei, China 2. College of Electrical Engineering and New Energy, China Three Gorges University, Yichang 443002, Hubei, China 3. College of Science, China Three Gorges University, Yichang 443002, Hubei, China

全文HTML

1.1. 基于PCA的MSPC异常检测方法

1.2. 基于极值理论的POT异常检测方法

1.3. 基于POT的多元统计过程异常检测算法

2.1. 实验设置及评价指标

2.2. 仿真数据实验

2.3. 真实数据实验

目录

通讯作者:
罗骁域，（e-mail）lxy@ctgu.edu.cn。

1. 中核武汉核电运行技术股份有限公司, 湖北武汉 443074

2. 三峡大学电气与新能源学院, 湖北宜昌 443002

3. 三峡大学理学院, 湖北宜昌 443002

1. China Nuclear Power Operation Technology Corporation, Ltd., Wuhan 443074, Hubei, China

2. College of Electrical Engineering and New Energy, China Three Gorges University, Yichang 443002, Hubei, China

3. College of Science, China Three Gorges University, Yichang 443002, Hubei, China