Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Xu, Yanpinga | Ye, Tingconga | Wang, Xinb; * | Lai, Yupingc | Qiu, Jiand | Zhang, Lingjune | Zhang, Xiaa
Affiliations: [a] School of Cyberspace Security, Hangzhou Dianzi University, Xiasha Higher Education Zone, Hangzhou, Zhejiang Province, China | [b] School of Business and Management, Shanghai International Studies University, Shanghai, China | [c] School of Information Science and Technology, North China University of Technology, Shijingshan District, Beijing, China | [d] Center for Undergraduate Education, Westlake University, Xihu District, Hangzhou, China | [e] School of Computer Science and Technology, Hangzhou Dianzi University, Xiasha Higher Education Zone, Hangzhou, Zhejiang, China
Correspondence: [*] Corresponding author. Xin Wang, School of Business and Management, Shanghai International Studies University, 1550 Wenxiang Road, Shanghai PC201620, China. E-mail: [email protected].
Abstract: In the field of security, the data labels are unknown or the labels are too expensive to label, so that clustering methods are used to detect the threat behavior contained in the big data. The most widely used probabilistic clustering model is Gaussian Mixture Models(GMM), which is flexible and powerful to apply prior knowledge for modelling the uncertainty of the data. Therefore, in this paper, we use GMM to build the threat behavior detection model. Commonly, Expectation Maximization (EM) and Variational Inference (VI) are used to estimate the optimal parameters of GMM. However, both EM and VI are quite sensitive to the initial values of the parameters. Therefore, we propose to use Singular Value Decomposition (SVD) to initialize the parameters. Firstly, SVD is used to factorize the data set matrix to get the singular value matrix and singular matrices. Then we calculate the number of the components of GMM by the first two singular values in the singular value matrix and the dimension of the data. Next, other parameters of GMM, such as the mixing coefficients, the mean and the covariance, are calculated based on the number of the components. After that, the initialization values of the parameters are input into EM and VI to estimate the optimal parameters of GMM. The experiment results indicate that our proposed method performs well on the parameters initialization of GMM clustering using EM and VI for estimating parameters.
Keywords: Network threat detection, gaussian mixture models, expectation maximization, variational inference, singular value decomposition, parameters initialization
DOI: 10.3233/JIFS-200066
Journal: Journal of Intelligent & Fuzzy Systems, vol. 40, no. 1, pp. 477-490, 2021
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]