
Information Sciences 681 (2024) 121210

Contents lists available at ScienceDirect

Information Sciences
journal homepage: www.elsevier.com/locate/ins

A novel multi-modal incremental tensor decomposition for anomaly detection in large-scale networks
Rongqiao Fan a,1, Qiyuan Fan a,1, Xue Li b, Puming Wang a,∗, Jing Xu a, Xin Jin a, Shaowen Yao a, Peng Liu c
a School of Software, Yunnan University, Kunming, 650091, China
b School of Electronic Information Engineering, Henan University of Science and Technology, Xinxiang, 453003, China
c Guangxi Power Grid Limited Liability Company, Guangxi, 450100, China

ARTICLE INFO

Keywords: Multi-modal incremental tensor; Tensor decomposition; Machine learning; Anomaly detection

ABSTRACT

Network traffic anomaly detection is a crucial task in today's network monitoring and maintenance. However, with the rapid growth of network data volume, the data structure has become increasingly complex and shows multi-modal characteristics, which poses a great challenge for traffic anomaly detection. Earlier anomaly detection methods have the following deficiencies: (i) most of them are static, or are dynamic methods that only handle growth along the temporal mode; (ii) they have a lower detection rate or a higher computational cost. To address these deficiencies, this article proposes a traffic anomaly detection framework based on multi-modal incremental tensor decomposition, which has three highlights: (i) traffic data is constructed as a tensor model to fully mine the correlation between data, and the framework applies when traffic data grows dynamically along multiple modes; (ii) the multi-modal incremental tensor decomposition method processes dynamically growing data without decomposing all the data, greatly reducing computational cost and improving data quality; (iii) the XGBoost classification algorithm is used for anomaly detection to improve detection accuracy. Finally, experiments on two real network traffic datasets, NSL-KDD and CICDDOS 2019, show that the proposed framework achieves a detection rate of 99.21% and has good scalability and fast detection speed.

1. Introduction

Abnormal traffic, such as port scanning, denial-of-service (DoS) attacks, distributed denial-of-service (DDoS) attacks, and the
spread of worms, can lead to network congestion, network paralysis, and information leakage, with severe consequences.
In addition, with the continuous expansion of network scale and the increasing complexity of network structure, it is becoming
increasingly difficult to accurately and quickly detect and diagnose abnormal traffic in large-scale network data. In particular, the
wave of telecommuting and cloud migration caused by COVID-19 in recent years has led to a sharp increase in global cyber attacks
and frequent abnormal traffic. Therefore, new technologies and methods are urgently needed to improve the accuracy of anomaly
detection and reduce the damage caused by network attacks.

* Corresponding author.
E-mail address: [email protected] (P. Wang).
1 The two authors contributed equally to this work and are listed as co-first authors.

https://doi.org/10.1016/j.ins.2024.121210
Received 13 March 2024; Received in revised form 2 July 2024; Accepted 18 July 2024
Available online 23 July 2024
0020-0255/© 2024 Elsevier Inc. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
R. Fan, Q. Fan, X. Li et al. Information Sciences 681 (2024) 121210

Early literature on anomaly detection [1] usually constructed traffic data as a matrix model, and used matrix-based decomposition
algorithms such as Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) to process the data, and then
performed anomaly detection by setting thresholds. This method achieves better detection performance when the volume of data is
small and the structure of the data is relatively simple.
In recent years, machine learning (ML) algorithms have developed rapidly and have been widely used in various fields [2]. ML
algorithms have higher detection rates and faster speeds than earlier anomaly detection methods. However, when we perform anomaly
detection, data inevitably has redundancy and noise, which will reduce data quality and affect detection performance. Moreover, with
the rapid growth of data volume, the data structure becomes complex and the data shows multi-modal characteristics, which makes
data processing more and more difficult. To address this, anomaly detection methods based on tensor decomposition have been
proposed and have achieved good detection performance: modeling the data as a tensor retains the correlation between
the data, and tensor decomposition [3] reduces the dimensionality of the data and extracts features, which
improves data quality and, in turn, detection accuracy. Some anomaly detection methods combining
tensor decomposition and ML algorithms have therefore also been proposed.
However, most of the anomaly detection methods are static and do not meet the real-time requirements of real-world applications.
Some dynamic detection methods only consider the data growth along one mode. Therefore, when the data grows along multiple
modes, how to efficiently detect anomalies is still a serious challenge.
Aiming at solving the above problems, this paper proposes an anomaly detection framework based on multi-modal incremental
tensor decomposition, which combines the advantages of tensor decomposition and ML to detect abnormal traffic accurately and
quickly. The framework is mainly divided into three steps: data preprocessing, multi-modal incremental tensor decomposition, and
anomaly detection. The multi-modal incremental tensor decomposition method is an improvement of the method proposed
in [4]. The anomaly detection step adopts the extreme gradient boosting (XGBoost) classification algorithm, an ML classification
algorithm that is fast and scalable.
The key contributions of this paper can be summarized as follows.

• This paper proposes an anomaly detection framework that is dynamic and applicable to large-scale network data that grows
along multiple modes.
• The proposed detection framework performs traffic anomaly detection at a faster speed and lower computational cost. The multi-
modal incremental tensor decomposition method calculates the Tucker decomposition results at the current time only based on
the decomposition results of the historical tensor data, and does not decompose all the tensor data or perform expensive SVD
calculations. For massive data in large-scale networks, we have significantly reduced storage and computation costs.
• The dynamically growing data is processed using incremental tensor decomposition, which can reduce data redundancy and
noise and improve data quality. In addition, the XGBoost classification algorithm has better classification performance compared
to other ML methods. Therefore, overall, the detection framework proposed in this paper has better detection performance.

The remainder of the paper is organized as follows. Section 2 describes related work. Section 3
introduces the preliminaries. Section 4 introduces the proposed framework for network traffic anomaly detection based
on multi-modal incremental tensor decomposition. The performance evaluation and conclusion are given
in Section 5 and Section 6, respectively.

2. Related work

Network traffic anomaly detection is an important barrier to prevent network attacks. It judges whether there is abnormal traffic
by detecting some characteristics of network traffic or changes in traffic size.
In this section, we mainly review some network traffic anomaly detection methods, which are divided into statistical-based,
machine learning-based, and tensor decomposition-based detection methods.

2.1. Statistical-based

The statistical-based detection method is relatively mature and has been widely used. It is used to identify normal and abnormal
traffic in the network by setting a threshold.
Huang et al.[5] and Lakhina et al. [6] use the PCA method to project the traffic data constructed as a matrix model into the normal
subspace and abnormal subspace, and then use Q statistical analysis method in the abnormal subspace to identify normal traffic and
abnormal traffic. Yeh et al.[7] and Lee et al. [8] proposed the method of oversampling PCA. This method obtains the principal direction
by using the PCA method on the data, and then uses the “Leave One Out” method to check the influence of each data point on the
change of the principal direction to identify whether the traffic is abnormal. Udhayan et al. [9] proposed a Statistical Segregation
Method (SSM) for DDoS attack detection. This method samples traffic data and compares it with attack status conditions, and then
performs correlation analysis to identify abnormal traffic. Fortunati et al. [10] proposed an improved covariance-based anomaly detection
method. This method constructs a covariance matrix from network traffic data to obtain a norm distribution,
and then detects abnormal traffic by setting a threshold.
Although statistics-based anomaly detection methods are widely used and can detect unknown anomalous traffic, the thresholds
in this method are difficult to balance, resulting in low detection rates.


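Table 1
Description of the notation.

Symbol | Description
$a$ | Scalar (lowercase letter)
$\mathbf{a}$ | Vector (bold lowercase letter)
$\mathbf{A}$ | Matrix (bold capital letter)
$\mathbf{A}_{(i:j,:)}$ | Submatrix of matrix $\mathbf{A}$
$\mathbf{A}^{T}$ | Transpose of matrix $\mathbf{A}$
$\mathcal{X}$ | Tensor (calligraphic letter)
$\mathcal{X}^{\dagger}$ | Pseudo-inverse of tensor $\mathcal{X}$
$\mathbf{X}_{(:,:,k)}$ | The frontal slices of a 3-order tensor
$\mathbf{A}_{(n)}$ | The factor matrix of Tucker decomposition
$\Theta \triangleq \{0,1\}^{N}$ | $N$-tuple of binary indices, with $(u_1, \cdots, u_N) \in \Theta$
$\mathcal{X}_{u_1,\cdots,u_N}$ | Subtensor of tensor $\mathcal{X}$
$(\mathcal{X})_{(k)}$ | The unfolding of the tensor $\mathcal{X}$ along the $k$-mode
$x_{i_1,i_2,\cdots,i_N}$ | The element in tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$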

2.2. Machine learning-based

Anomaly detection based on ML has been a popular research direction in recent years. The detection performance is relatively
improved compared to statistical-based methods.
Han et al.[11] proposed a naive Bayesian network intrusion detection method based on PCA. The method extracts the main
features by PCA and calculates the contribution rate, and then uses the contribution rate as the weight to form a new Bayesian
classification model. Compared with the traditional method, this method reduces the data dimension and improves the detection
performance. Peng et al. [12] proposed a Software-Defined Network (SDN) based DDoS attack detection method (the DPTCM-KNN
algorithm), which combines KNN and double P-values for DDoS attack detection. Hwang et al. [13] proposed the D-PACK method,
which uses a convolutional neural network (CNN) to automatically learn features, and then uses an unsupervised deep learning (DL)
model to identify abnormal traffic. Li et al. [14] proposed a novel algorithm called Adaptive Label Propagation (ALP). ALP identifies
overlapping anomaly groups through label propagation and belonging coefficients, and deals with the particularity of different types of
nodes and edges in heterogeneous networks through an adaptive neighbor weighting mechanism. Wu et al. [15] proposed an intrusion
detection model based on CNN, which can automatically select features to solve the imbalanced data problem. The model can achieve
better detection performance on the NSL-KDD dataset. Garg et al.[16] proposed a hybrid anomaly detection method based on DL in
SDN. This method uses a restricted Boltzmann machine and a support vector machine for anomaly detection.
However, most ML-based methods do not consider the multi-modal characteristics of data, and do not improve data quality by
adopting more efficient algorithms to remove data redundancy and noise.

2.3. Tensor decomposition-based

Tensor decomposition is a method for processing large-scale data. It preserves the spatial structure and latent
information of the original data, and is more robust to noise. Refs. [17–22] introduced a variety of tensor decomposition methods,
all of which achieved good experimental results in their respective application fields.
Sun et al.[23] applied the incremental tensor analysis (ITA) method to anomaly detection, which can effectively reveal hidden
correlations in high-dimensional data and improve the anomaly detection rate. Wang et al.[24] used tensor principal component
analysis (TPCA) to detect network attacks in SDNs and proposed a framework for big data-driven network attack detection. Li et
al.[25] proposed an online anomaly detection method based on tensor decomposition. The method uses incremental CP decomposition
for dynamically growing data, which reduces the computational and storage costs. Xie et al.[26] proposed the TensorDet method,
which applies two new techniques, sequential tensor truncation and two-phase anomaly detection, to improve detection accuracy and
speed. Huang et al.[27] proposed a Dynamic Sequence Tensor Recovery (DSTR) algorithm, which uses the incremental High Order
Singular Value Decomposition (HOSVD) method to process dynamic data to improve detection accuracy and reduce cost. Maranhão
et al.[28] combined Higher Order Orthogonal Iteration (HOOI) and Multiple Denoising (MuDe) methods to improve data quality, and
then used supervised machine learning algorithms for anomaly detection. Xu et al.[29] proposed a DDoS attack detection framework
that combines multi-modal denoising algorithms based on tensor SVD and ML classification algorithms. Compared with statistical
detection methods and traditional ML detection methods, the detection performance is improved.
Although tensor decomposition-based methods have achieved good detection performance in the field of anomaly detection, most
dynamic detection methods only consider the case when the tensor data grows along one mode, and there is a lack of research when
the data grows along multiple modes.

3. Preliminaries

This section introduces the concepts represented by the mathematical notation associated with the tensor. For brevity, the nota-
tional descriptions used in this article are presented in Table 1.


Fig. 1. The horizontal, lateral, and frontal slices of a 3-order tensor $\mathcal{X}$.

Fig. 2. Unfolding of a 3-order tensor $\mathcal{X} \in \mathbb{R}^{I \times J \times K}$.

Definition 1. A tensor is an extension of vectors and matrices to higher dimensions. An $N$-order tensor is denoted as $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$,
where $N$ is the order (number of modes) of $\mathcal{X}$. The elements of the tensor $\mathcal{X}$ are denoted as $x_{i_1,\cdots,i_n,\cdots,i_N}$, where $i_n \in \{1, 2, \cdots, I_n\}$ and $1 \le n \le N$.

Definition 2. Slices are two-dimensional sections of a tensor, defined by fixing all but two indices. Fig. 1 shows the horizontal
slices $\mathbf{X}_{(i,:,:)}$, the lateral slices $\mathbf{X}_{(:,j,:)}$ and the frontal slices $\mathbf{X}_{(:,:,k)}$ of a 3-order tensor.

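Definition 3. Unfolding is the process of transforming a tensor into a matrix. For a tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$, the mode-$k$ unfolding
is denoted as $(\mathcal{X})_{(k)} \in \mathbb{R}^{I_k \times \prod_{i \neq k} I_i}$. Fig. 2 shows the unfolding of the 3-order tensor $\mathcal{X} \in \mathbb{R}^{I \times J \times K}$ into three matrices, where
$(\mathcal{X})_{(1)} \in \mathbb{R}^{I \times (JK)}$, $(\mathcal{X})_{(2)} \in \mathbb{R}^{J \times (IK)}$ and $(\mathcal{X})_{(3)} \in \mathbb{R}^{K \times (IJ)}$ are the matrices of the tensor unfolded along the first, second and third
modes.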
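The unfolding in Definition 3 can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the helper name `unfold` is hypothetical, modes are 0-based, and the column ordering follows NumPy's row-major layout, which may differ from the ordering drawn in Fig. 2 (only the matrix shapes are guaranteed to match).

```python
import numpy as np

def unfold(tensor, mode):
    """Mode-`mode` unfolding (0-based): rows index the chosen mode."""
    # Bring the chosen mode to the front, then flatten all remaining modes.
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

# A small 3-order tensor with I=2, J=3, K=4 (sizes chosen for illustration).
X = np.arange(24).reshape(2, 3, 4)
shapes = [unfold(X, k).shape for k in range(3)]
print(shapes)
```

The three shapes correspond to $I \times (JK)$, $J \times (IK)$ and $K \times (IJ)$.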

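Definition 4. The product of a tensor in the $k$-th mode with a matrix or vector is called the $k$-mode product. The $k$-mode product of
$\mathcal{X} \in \mathbb{R}^{I_1 \times \cdots \times I_k \times \cdots \times I_N}$ and $\mathbf{A} \in \mathbb{R}^{P \times I_k}$ is denoted as

$$\mathcal{Y} = \mathcal{X} \times_k \mathbf{A} \iff (\mathcal{Y})_{(k)} = \mathbf{A}(\mathcal{X})_{(k)}, \tag{1}$$

which can also be expressed in the form of a tensor unfolding. The elements of the result $\mathcal{Y} \in \mathbb{R}^{I_1 \times \cdots \times I_{k-1} \times P \times I_{k+1} \times \cdots \times I_N}$ of the $k$-mode
product are denoted as

$$(\mathcal{X} \times_k \mathbf{A})_{i_1,\cdots,i_{k-1},p,i_{k+1},\cdots,i_N} = \sum_{i_k=1}^{I_k} x_{i_1,\cdots,i_{k-1},i_k,i_{k+1},\cdots,i_N}\, a_{p,i_k}. \tag{2}$$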

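Fig. 3 shows the 1-mode product of a third-order tensor $\mathcal{X} \in \mathbb{R}^{5 \times 3 \times 3}$ and a matrix $\mathbf{A} \in \mathbb{R}^{3 \times 5}$; the result is a tensor of size
$3 \times 3 \times 3$.
The $k$-mode product has the property that the order of multiplication is irrelevant when the modes are different. If
the modes are different (i.e., $k \neq k'$)

$$\mathcal{X} \times_k \mathbf{A} \times_{k'} \mathbf{A}' = \mathcal{X} \times_{k'} \mathbf{A}' \times_k \mathbf{A}. \tag{3}$$

If the modes are the same (i.e., $k = k'$)

$$\mathcal{X} \times_k \mathbf{A} \times_k \mathbf{A}' = \mathcal{X} \times_k (\mathbf{A}'\mathbf{A}). \tag{4}$$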
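The $k$-mode product and its two properties can be checked numerically. The following is a minimal NumPy sketch; the helper name `mode_product`, the random test data, and the 0-based mode indices are assumptions made for illustration only.

```python
import numpy as np

def mode_product(tensor, matrix, mode):
    """k-mode product X x_k A for A of shape (P, I_k); `mode` is 0-based."""
    # Contract the matrix's second axis with the tensor's `mode` axis,
    # then move the new axis of size P back to position `mode`.
    out = np.tensordot(matrix, tensor, axes=(1, mode))
    return np.moveaxis(out, 0, mode)

rng = np.random.default_rng(0)
X = rng.standard_normal((5, 3, 3))   # same sizes as the Fig. 3 example
A = rng.standard_normal((3, 5))      # acts on the first mode (size 5)
B = rng.standard_normal((4, 3))      # acts on the second mode (size 3)
A2 = rng.standard_normal((2, 3))     # acts on the first mode after A

Y = mode_product(X, A, 0)            # result has size 3 x 3 x 3

# Different modes: the order of the two products does not matter.
lhs_diff = mode_product(mode_product(X, A, 0), B, 1)
rhs_diff = mode_product(mode_product(X, B, 1), A, 0)

# Same mode: two products collapse into one product with A2 @ A.
lhs_same = mode_product(mode_product(X, A, 0), A2, 0)
rhs_same = mode_product(X, A2 @ A, 0)
```

Using `np.tensordot` avoids forming the unfolded matrices explicitly; the unfolding form of the product gives the same result.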


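Fig. 3. The 1-mode product of $\mathcal{X} \in \mathbb{R}^{5 \times 3 \times 3}$ and $\mathbf{A} \in \mathbb{R}^{3 \times 5}$.

Fig. 4. Tucker decomposition of a 3-order tensor $\mathcal{X} \in \mathbb{R}^{I \times J \times K}$.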

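Definition 5. The $N$-rank of a tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$ is described as an $N$-tuple

$$(\mathrm{rank}_1(\mathcal{X})_{(1)}, \mathrm{rank}_2(\mathcal{X})_{(2)}, \cdots, \mathrm{rank}_N(\mathcal{X})_{(N)}), \tag{5}$$

where $\mathrm{rank}_n(\mathcal{X})_{(n)}$ is the rank of the mode-$n$ unfolding matrix of the tensor $\mathcal{X}$.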

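Definition 6. The Tucker decomposition decomposes the tensor $\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$ into the product of a core tensor
$\mathcal{G} \in \mathbb{R}^{R_1 \times R_2 \times \cdots \times R_N}$ and $N$ factor matrices $\mathbf{A}_{(n)} \in \mathbb{R}^{I_n \times R_n}$, which is defined as

$$\mathcal{X} = \mathcal{G} \times_1 \mathbf{A}_{(1)} \times_2 \mathbf{A}_{(2)} \cdots \times_N \mathbf{A}_{(N)}. \tag{6}$$

Eq. (6) can also be written as $\mathcal{X} = \mathcal{G} \times \{\mathbf{A}_{(n)}\}$, where the factor matrices $\{\mathbf{A}_{(n)}\}$ are usually orthogonal. When $R_n < I_n$, the core tensor
$\mathcal{G}$ can be regarded as a compression of the tensor $\mathcal{X}$. In Fig. 4, the 3-order tensor $\mathcal{X} \in \mathbb{R}^{I \times J \times K}$ is decomposed into three factor matrices
$\mathbf{A} \in \mathbb{R}^{I \times R_1}$, $\mathbf{B} \in \mathbb{R}^{J \times R_2}$, $\mathbf{C} \in \mathbb{R}^{K \times R_3}$ and a 3-order core tensor $\mathcal{G} \in \mathbb{R}^{R_1 \times R_2 \times R_3}$.
The Tucker decomposition of the 3-order tensor can be constructed from the SVD of matrices. First, the SVD is performed on the
matrices $(\mathcal{X})_{(1)}$, $(\mathcal{X})_{(2)}$ and $(\mathcal{X})_{(3)}$. Then the three left singular matrices $\mathbf{A}$, $\mathbf{B}$ and $\mathbf{C}$ are obtained, and finally the core tensor $\mathcal{G}$ is
calculated according to Eq. (7).

$$\mathcal{G} = \mathcal{X} \times_1 \mathbf{A}^{T} \times_2 \mathbf{B}^{T} \times_3 \mathbf{C}^{T}. \tag{7}$$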
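The SVD-based construction just described can be sketched as follows. This is an illustrative NumPy version under stated assumptions: the helper names (`unfold`, `mode_product`, `hosvd`, `reconstruct`) and the example ranks are hypothetical, and the truncation simply keeps the leading left singular vectors of each unfolding.

```python
import numpy as np

def unfold(tensor, mode):
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def mode_product(tensor, matrix, mode):
    return np.moveaxis(np.tensordot(matrix, tensor, axes=(1, mode)), 0, mode)

def hosvd(X, ranks):
    """Truncated Tucker decomposition built from the SVD of each unfolding."""
    factors = []
    for n, r in enumerate(ranks):
        # Left singular vectors of the mode-n unfolding; keep the first r.
        U, _, _ = np.linalg.svd(unfold(X, n), full_matrices=False)
        factors.append(U[:, :r])
    core = X
    for n, U in enumerate(factors):
        core = mode_product(core, U.T, n)   # project onto each factor
    return core, factors

def reconstruct(core, factors):
    """Multiply the core by every factor matrix along its own mode."""
    X = core
    for n, U in enumerate(factors):
        X = mode_product(X, U, n)
    return X

rng = np.random.default_rng(0)
X = rng.standard_normal((4, 5, 6))
core, factors = hosvd(X, ranks=(2, 3, 3))      # compressed core of size 2 x 3 x 3
X_full = reconstruct(*hosvd(X, ranks=X.shape)) # no truncation: exact recovery
```

With the ranks set to the full mode sizes the factors are square orthogonal matrices, so the reconstruction reproduces the input up to rounding error; smaller ranks give a low-rank approximation.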

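Definition 7. Divide the $N$-th order tensor $\mathcal{X}$ into $2^N$ sub-tensors $\mathcal{X}_{u_1,\cdots,u_N}$, $(u_1,\cdots,u_N) \in \Theta$, and divide the matrix $\mathbf{A}_{(n)} \in \mathbb{R}^{I_n \times R_n}$ into $\mathbf{A}_{(n),1} \in \mathbb{R}^{I_{n,1} \times R_n}$
and $\mathbf{A}_{(n),2} \in \mathbb{R}^{I_{n,2} \times R_n}$, where $I_{n,1} + I_{n,2} = I_n$ and $\mathbf{A}_{(n)}^{T} = [\mathbf{A}_{(n),1}^{T} \; \mathbf{A}_{(n),2}^{T}] \in \mathbb{R}^{R_n \times (I_{n,1} + I_{n,2})}$. Then, the block tensor matrix multiplication
is defined as

$$\mathcal{X} \times \{\mathbf{A}_{(n)}\} = \sum_{(u_1,\cdots,u_N) \in \Theta} \mathcal{X}_{u_1,\cdots,u_N} \times \{\mathbf{A}_{(n),u_n}\}. \tag{8}$$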

4. Proposed traffic anomaly detection framework

This section introduces the proposed framework for network traffic anomaly detection based on multi-modal incremental tensor
decomposition. As shown in Fig. 5, the detection framework is divided into three modules: data preprocessing,
multi-modal incremental tensor decomposition (MMITD), and anomaly detection.

4.1. Data preprocessing

4.1.1. Numericalization and feature selection

The method proposed in this paper operates on numerical data, but network traffic data often contains other data types,
which need to be converted into numerical data. For instance, the label data “normal” and “abnormal” are converted into the values “0”
and “1” respectively.
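A minimal sketch of this numericalization step is shown below. The label mapping follows the text; the integer coding of categorical features, the helper names, and the sample values are assumptions for illustration, since the paper does not state which encoding it uses for non-label fields.

```python
def encode_labels(labels, normal_name="normal"):
    """Map the 'normal' label to 0 and every other label to 1."""
    return [0 if label == normal_name else 1 for label in labels]

def encode_categorical(values):
    """Assign each distinct category an integer code in order of appearance.

    Integer coding is an assumed choice; one-hot coding is another option.
    """
    mapping = {}
    codes = [mapping.setdefault(value, len(mapping)) for value in values]
    return codes, mapping

# Hypothetical records: a label column and a categorical protocol column.
labels = encode_labels(["normal", "abnormal", "normal"])
codes, mapping = encode_categorical(["tcp", "udp", "tcp", "icmp"])
```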


Fig. 5. Proposed network traﬃc anomaly detection framework.

Fig. 6. Constructing the matrix as a 3-order tensor.

Moreover, the existing datasets contain a large number of features, some of which have little effect on the results of anomaly
detection [30]. Removing these features and retaining the meaningful ones improves the detection accuracy.

4.1.2. Standardization and tensor modeling

After numericalization and feature selection of the traffic data, we standardize them to improve the detection speed and detection
rate of the model.
Then the traffic data is modeled as a tensor. Given a data matrix $\mathbf{M} \in \mathbb{R}^{I_N \times I}$, where $I_N$ is the number of data records and $I$ is the number of features per record, we
fold each record $\mathbf{M}_{(i,:)}$ into a tensor of size $I_1 \times I_2 \times \cdots \times I_{N-1}$, where $I_1 \times I_2 \times \cdots \times I_{N-1} = I$. Finally, the $N$-order tensor
$\mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_{N-1} \times I_N}$ is obtained. Fig. 6 illustrates the process of constructing a third-order tensor from a matrix.
The tensor data we obtain usually contains noise. As shown in Fig. 7, tensor decomposition can decompose tensor data into a low-rank tensor
(noise-free tensor) and a sparse tensor (noise tensor). Compared with the original data, the low-rank tensor is cleaner, which helps
to improve the accuracy of anomaly detection.
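The standardization and folding steps can be sketched as below. Several details are assumptions: z-score standardization is used because the text does not name the scheme, the split of 40 features into a 5 × 8 grid is a hypothetical example, and the helper names are not from the paper.

```python
import numpy as np

def standardize(M, eps=1e-12):
    """Column-wise z-score; eps guards against constant features."""
    return (M - M.mean(axis=0)) / (M.std(axis=0) + eps)

def fold_records(M, feature_shape):
    """Fold each row of M into `feature_shape`; records form the last mode."""
    n_records, n_features = M.shape
    if int(np.prod(feature_shape)) != n_features:
        raise ValueError("feature_shape must multiply to the feature count")
    folded = M.reshape((n_records, *feature_shape))
    return np.moveaxis(folded, 0, -1)   # shape: I_1 x ... x I_{N-1} x I_N

# Six synthetic records with 40 features each, folded into a 5 x 8 x 6 tensor.
M = np.arange(6 * 40, dtype=float).reshape(6, 40)
M_std = standardize(M)
X = fold_records(M_std, (5, 8))
```

Flattening the slice `X[..., i]` recovers record `i`, which is how the low-rank tensor can later be turned back into a two-dimensional form.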

4.2. MMITD method

The main idea of the MMITD method is to calculate the Tucker decomposition result of the tensor at time $t+1$ from the
Tucker decomposition result at time $t$, and finally obtain low-rank and clean tensor data at time $t+1$.
First, the truncated Tucker decomposition is performed on the tensor data $\mathcal{X}^{(t)} \in \mathbb{R}^{I_1^{(t)} \times I_2^{(t)} \times \cdots \times I_{N-1}^{(t)} \times I_N^{(t)}}$ to obtain $\mathcal{G}^{(t)} \in \mathbb{R}^{R_1 \times R_2 \times \cdots \times R_N}$
and $\mathbf{A}_{(n)}^{(t)} \in \mathbb{R}^{I_n^{(t)} \times R_n}$, and the truncated rank $(R_1, R_2, \cdots, R_N)$ is calculated by Eq. (9),


Fig. 7. Decomposing tensor into low-rank tensor (noise-free tensor) and sparse tensor (noise tensor).

$$Q \ge \frac{\sum_{i_n=1}^{R_n} \left(\sigma_{i_n}^{(n)}\right)^2}{\sum_{i_n=1}^{I_n} \left(\sigma_{i_n}^{(n)}\right)^2}, \quad n \in \{1, 2, \cdots, N\}, \tag{9}$$

where $\sigma_{i_n}^{(n)}$ is the singular value of the tensor data along the $n$-th modal unfolded matrix and $Q$ is the ratio. Compared with directly
fixing the rank of the tensor in [4], our method allows the tensor data to retain some variability to obtain the main features of the
data.
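One possible reading of the rank rule in Eq. (9) is sketched below: for each mode, take the largest rank whose retained share of squared singular values does not exceed $Q$. This interpretation, the floor of one component, and the function name `truncated_rank` are assumptions; a common alternative is to take the smallest rank whose share reaches $Q$.

```python
import numpy as np

def truncated_rank(unfolded, q):
    """Largest R with (sum of first R sigma^2) / (sum of all sigma^2) <= q."""
    s = np.linalg.svd(unfolded, compute_uv=False)
    cumulative = np.cumsum(s ** 2)
    energy = cumulative / cumulative[-1]      # non-decreasing, last entry is 1.0
    r = int(np.searchsorted(energy, q, side="right"))
    return max(r, 1)                          # assumed floor: keep one component

# Toy unfolding with singular values 3, 2, 1 (energy shares 9/14, 13/14, 1).
unfolded = np.diag([3.0, 2.0, 1.0])
ranks = [truncated_rank(unfolded, q) for q in (0.5, 0.95, 1.0)]
```

Applying the function to each mode-$n$ unfolding yields the tuple $(R_1, \cdots, R_N)$ used for the truncated decomposition.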
Then, $\mathcal{G}^{(t+1)}$ and $\mathbf{A}_{(n)}^{(t+1)}$ are calculated from the newly added tensor data at time $t+1$ and the decomposition results $\mathcal{G}^{(t)}$ and
$\mathbf{A}_{(n)}^{(t)}$ at time $t$, which includes partitioning the incremental tensor and updating the factor matrices and the core tensor.
Finally, the tensor $\mathcal{X}_{new}^{(t+1)}$ is calculated according to Eq. (6) and used for anomaly detection.

4.2.1. Incremental tensor partitioning

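For the $N$-order tensor data $\mathcal{X}^{(t)} \in \mathbb{R}^{I_1^{(t)} \times I_2^{(t)} \times \cdots \times I_N^{(t)}}$ at time $t$, if the data grows along $N$ modes and the size of the growth is $d_n$,
the tensor data $\mathcal{X}^{(t+1)} \in \mathbb{R}^{(I_1^{(t)} + d_1) \times (I_2^{(t)} + d_2) \times \cdots \times (I_N^{(t)} + d_N)}$ at time $t+1$ is obtained, where $(I_n^{(t)} + d_n) = I_n^{(t+1)} \ge I_n^{(t)}$.

Then, we divide $\mathcal{X}^{(t+1)}$ into $2^N$ sub-tensors, denoted as $\mathcal{X}_{u_1,\cdots,u_N}^{(t+1)}$, where $(u_1, \cdots, u_N) \in \Theta \triangleq \{0,1\}^N$ is an $N$-tuple of binary indices. When
every $u_n = 0$, the sub-tensor $\mathcal{X}_{0,\cdots,0}^{(t+1)} = \mathcal{X}^{(t)}$, and the remaining sub-tensors are the newly added data at time $t+1$.
Finally, we divide the newly added $2^N - 1$ sub-tensors $(\mathcal{X}_{u_1,\cdots,u_N}^{(t+1)})_{(u_1,\cdots,u_N) \neq (0,\cdots,0)}$ into $N$ categories according to the number of
indices $u_n = 1$, denoted as $\mathbb{C}_n$.
In Fig. 5, taking the 3-order tensor as an example, we divide the newly added $2^3 - 1$ sub-tensors into three categories: $\mathbb{C}_1$, $\mathbb{C}_2$ and
$\mathbb{C}_3$.

$$\begin{aligned}
\mathbb{C}_1 &= \{\mathcal{X}_{1,0,0}^{(t+1)}, \mathcal{X}_{0,1,0}^{(t+1)}, \mathcal{X}_{0,0,1}^{(t+1)}\}, \\
\mathbb{C}_2 &= \{\mathcal{X}_{1,1,0}^{(t+1)}, \mathcal{X}_{1,0,1}^{(t+1)}, \mathcal{X}_{0,1,1}^{(t+1)}\}, \\
\mathbb{C}_3 &= \{\mathcal{X}_{1,1,1}^{(t+1)}\}.
\end{aligned} \tag{10}$$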
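The partitioning step can be illustrated with a short NumPy sketch. The function name `partition`, the dictionary layout, and the example sizes are assumptions for illustration; the tuples of 0/1 indices play the role of $(u_1, \cdots, u_N)$.

```python
import itertools
import numpy as np

def partition(X_new, old_shape):
    """Split X_new into 2^N sub-tensors and group the new ones by category."""
    n_modes = X_new.ndim
    subtensors = {}
    for u in itertools.product((0, 1), repeat=n_modes):
        # u_n = 0 selects the old index range, u_n = 1 the newly grown range.
        idx = tuple(slice(0, i) if bit == 0 else slice(i, None)
                    for bit, i in zip(u, old_shape))
        subtensors[u] = X_new[idx]
    # Category c holds the sub-tensors with exactly c indices equal to 1.
    categories = {c: [u for u in subtensors if sum(u) == c]
                  for c in range(1, n_modes + 1)}
    return subtensors, categories

# Old tensor of size 3 x 3 x 4 growing to 4 x 5 x 6 (d = 1, 2, 2).
X_new = np.arange(4 * 5 * 6, dtype=float).reshape(4, 5, 6)
subtensors, categories = partition(X_new, old_shape=(3, 3, 4))
```

The entry with key `(0, 0, 0)` is the data already present at time $t$; the other seven entries are the increments.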

4.2.2. Updating factor matrix

Every subtensor $\mathcal{X}_{u_1,\cdots,u_N}^{(t+1)}$ in the $N$ categories is used one or more times to update a matrix that we call the extension
matrix. Specifically, we update the extension matrix $\mathbf{A}'_{(n)}$ if the index $u_n = 1$ for the subtensor. The update steps are described below.
If the index $u_n = 1$ for the subtensor, the tensor $\mathcal{Z}_{-u_n}$ is constructed according to Eq. (11),

$$\mathcal{Z}_{-u_n} \triangleq \mathcal{Z}_{u_1,\cdots,u_{n-1},u_{n+1},\cdots,u_N} = \mathcal{G}^{(t)} \times_1 \mathbf{A}_{(1)} \cdots \times_{n-1} \mathbf{A}_{(n-1)} \times_{n+1} \mathbf{A}_{(n+1)} \cdots \times_N \mathbf{A}_{(N)}, \tag{11}$$

where, on the right-hand side, for each mode $m \neq n$, $\mathbf{A}_{(m)} = \mathbf{A}_{(m)}^{(t)} \in \mathbb{R}^{I_m^{(t)} \times R_m}$ if $u_m = 0$ and $\mathbf{A}_{(m)} = \mathbf{A}'_{(m)} \in \mathbb{R}^{d_m \times R_m}$ if $u_m = 1$.

Then, the pseudo-inverse of the mode-$n$ unfolding of the tensor $\mathcal{Z}_{-u_n}$ is calculated, and the extension matrix
$\mathbf{A}'_{(n)}$ is obtained according to Eq. (12),

$$\mathbf{A}'^{\,new}_{(n)} \leftarrow \alpha \mathbf{A}'^{\,old}_{(n)} + (1 - \alpha)\, (\mathcal{X}_{u_1,\cdots,u_N}^{(t+1)})_{(n)}\, (\mathcal{Z}_{-u_n})_{(n)}^{\dagger}, \tag{12}$$

where $\alpha \in (0, 1)$ indicates the extent to which the information obtained in the previous step is retained.
An extension matrix obtained in this way does not satisfy the properties of a unitary matrix, so it is orthogonalized before being
used in the next step. Compared with the method in [4], where only the final extension matrix is orthogonalized, our method
ensures that the matrix used in the next step to calculate the tensor $\mathcal{Z}_{-u_n}$ is orthogonalized, thus improving the accuracy of the tensor
decomposition and the anomaly detection rate.
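A single extension-matrix update of the kind given in Eq. (12) can be sketched as follows for one sub-tensor and one mode. This is an illustrative NumPy version, not the authors' implementation: the function name `update_extension`, the convention of passing `old=None` for the first update, and the synthetic data are assumptions, and the orthogonalization that follows each update is not shown.

```python
import numpy as np

def unfold(tensor, mode):
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def mode_product(tensor, matrix, mode):
    return np.moveaxis(np.tensordot(matrix, tensor, axes=(1, mode)), 0, mode)

def update_extension(sub_x, core, factors, mode, old=None, alpha=0.5):
    """Blend the previous extension matrix with a pseudo-inverse estimate.

    factors[m] is the time-t factor (u_m = 0) or the current extension
    matrix (u_m = 1) for every mode m != mode; factors[mode] is ignored.
    """
    z = core
    for m, factor in enumerate(factors):
        if m != mode:
            z = mode_product(z, factor, m)   # core times all other factors
    estimate = unfold(sub_x, mode) @ np.linalg.pinv(unfold(z, mode))
    if old is None:                          # first update: nothing to retain
        return estimate
    return alpha * old + (1.0 - alpha) * estimate

# Synthetic check: build a sub-tensor from a known extension matrix.
rng = np.random.default_rng(0)
core = rng.standard_normal((2, 3, 2))
B = rng.standard_normal((5, 3))
C = rng.standard_normal((6, 2))
A_ext_true = rng.standard_normal((3, 2))     # d_1 = 3 new rows, R_1 = 2
z = mode_product(mode_product(core, B, 1), C, 2)
X_100 = mode_product(z, A_ext_true, 0)       # plays the role of sub-tensor (1,0,0)
A_ext = update_extension(X_100, core, [None, B, C], mode=0)
```

Because the synthetic sub-tensor is generated exactly from `A_ext_true`, the pseudo-inverse estimate recovers it; with real, noisy data the estimate is only a least-squares fit.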
The above update steps are repeated in each category until the final extension matrix $\mathbf{A}'_{(n)}$ is obtained. Then, the matrix $\mathbf{A}'_{(n)}$
is concatenated with the matrix $\mathbf{A}_{(n)}^{(t)}$ along the second mode according to Eq. (13) to obtain $\mathbf{V}_{(n)}^{(t+1)}$.

$$(\mathbf{V}_{(n)}^{(t+1)})^{T} = [(\mathbf{A}_{(n)}^{(t)})^{T} \; (\mathbf{A}'_{(n)})^{T}]. \tag{13}$$

Finally, $\mathbf{V}_{(n)}^{(t+1)}$ is orthogonalized, giving the factor matrix $\mathbf{A}_{(n)}^{(t+1)} \in \mathbb{R}^{(I_n^{(t)} + d_n) \times R_n}$ at time $t+1$.
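The concatenation and orthogonalization step can be sketched as below. The use of a reduced QR factorization is an assumption, since the text does not say which orthogonalization is applied; the function name `extend_factor` and the sizes are illustrative.

```python
import numpy as np

def extend_factor(A_old, A_ext):
    """Stack the time-t factor on top of the extension matrix, then orthogonalize."""
    V = np.vstack([A_old, A_ext])   # equivalent to V^T = [A_old^T  A_ext^T]
    Q, _ = np.linalg.qr(V)          # reduced QR: Q has orthonormal columns
    return Q

rng = np.random.default_rng(0)
A_old = rng.standard_normal((6, 3))   # I_n = 6 rows at time t, R_n = 3
A_ext = rng.standard_normal((2, 3))   # d_n = 2 new rows
A_new = extend_factor(A_old, A_ext)
```

The result keeps the rank $R_n$ while growing the row dimension from $I_n^{(t)}$ to $I_n^{(t)} + d_n$.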
Next, we take the third-order tensors $\mathcal{X}^{(t)} \in \mathbb{R}^{I \times J \times K}$ and $\mathcal{X}^{(t+1)} \in \mathbb{R}^{(I + d_1) \times (J + d_2) \times (K + d_3)}$ as an example to illustrate the update
process of the factor matrix. The previous subsection divided the newly added tensor data at time $t+1$ into three categories: $\mathbb{C}_1$,
$\mathbb{C}_2$, and $\mathbb{C}_3$. We now use the sub-tensors in the three categories to update the extension matrices $\mathbf{A}'$, $\mathbf{B}'$ and $\mathbf{C}'$ respectively.
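(i): Updating Extension Matrix with $\mathbb{C}_1$.
There are three sub-tensors in $\mathbb{C}_1$, and each sub-tensor is associated with only one extension matrix. According to Eq. (12),
the sub-tensor $\mathcal{X}_{1,0,0}^{(t+1)}$ is used to update $\mathbf{A}'$. The update steps are as follows.
First, we construct the tensor $\mathcal{Z}_{\cdot,0,0} = \mathcal{G}^{(t)} \times_2 \mathbf{B}^{(t)} \times_3 \mathbf{C}^{(t)}$ and then compute the pseudo-inverse of the mode-1 unfolding of
$\mathcal{Z}_{\cdot,0,0}$. Finally, $\mathbf{A}'$ is updated according to Eq. (14).

$$\mathbf{A}' \leftarrow (\mathcal{X}_{1,0,0}^{(t+1)})_{(1)}\, (\mathcal{Z}_{\cdot,0,0})_{(1)}^{\dagger}. \tag{14}$$

Since this is the first update of the extension matrix, there is no previous estimate to retain, so the term weighted by $\alpha$ in Eq. (12) is absent. Similarly, the two sub-tensors $\mathcal{X}_{0,1,0}^{(t+1)}$ and $\mathcal{X}_{0,0,1}^{(t+1)}$ in $\mathbb{C}_1$ are used to
update $\mathbf{B}'$ and $\mathbf{C}'$ according to Eq. (15),

$$\begin{aligned}
\mathbf{B}' &\leftarrow (\mathcal{X}_{0,1,0}^{(t+1)})_{(2)}\, (\mathcal{Z}_{0,\cdot,0})_{(2)}^{\dagger}, \\
\mathbf{C}' &\leftarrow (\mathcal{X}_{0,0,1}^{(t+1)})_{(3)}\, (\mathcal{Z}_{0,0,\cdot})_{(3)}^{\dagger},
\end{aligned} \tag{15}$$

where

$$\begin{aligned}
\mathcal{Z}_{0,\cdot,0} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}^{(t)} \times_3 \mathbf{C}^{(t)}, \\
\mathcal{Z}_{0,0,\cdot} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}^{(t)} \times_2 \mathbf{B}^{(t)}.
\end{aligned} \tag{16}$$

Finally, $\mathbf{A}'$, $\mathbf{B}'$ and $\mathbf{C}'$ are orthogonalized. Although all the extension matrices have now been updated once, the update process in $\mathbb{C}_2$ and
$\mathbb{C}_3$ also depends on the extension matrices. Therefore, the results of this update are used in the next step.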
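(ii): Updating Extension Matrix with $\mathbb{C}_2$.
There are three sub-tensors in $\mathbb{C}_2$, and each sub-tensor is used to update two extension matrices. For example, the
sub-tensor $\mathcal{X}_{1,1,0}^{(t+1)}$ is used to update $\mathbf{A}'$ and $\mathbf{B}'$ in Eq. (17),

$$\begin{aligned}
\mathbf{A}'^{\,new} &\leftarrow \alpha \mathbf{A}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,1,0}^{(t+1)})_{(1)}\, (\mathcal{Z}_{\cdot,1,0})_{(1)}^{\dagger}, \\
\mathbf{B}'^{\,new} &\leftarrow \alpha \mathbf{B}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,1,0}^{(t+1)})_{(2)}\, (\mathcal{Z}_{1,\cdot,0})_{(2)}^{\dagger},
\end{aligned} \tag{17}$$

where

$$\begin{aligned}
\mathcal{Z}_{\cdot,1,0} &= \mathcal{G}^{(t)} \times_2 \mathbf{B}' \times_3 \mathbf{C}^{(t)}, \\
\mathcal{Z}_{1,\cdot,0} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}' \times_3 \mathbf{C}^{(t)}.
\end{aligned} \tag{18}$$

Since the indices $u_1 = u_2 = 1$, we use the extension matrices $\mathbf{A}'$ and $\mathbf{B}'$ updated in $\mathbb{C}_1$ instead of the factor matrices $\mathbf{A}^{(t)}$ and $\mathbf{B}^{(t)}$ at time
$t$. Since $u_3 = 0$, we use the factor matrix $\mathbf{C}^{(t)}$ at time $t$.
Similarly, the update equations using the sub-tensors $\mathcal{X}_{1,0,1}^{(t+1)}$ and $\mathcal{X}_{0,1,1}^{(t+1)}$ are shown in Eq. (19),

$$\begin{aligned}
\mathbf{A}'^{\,new} &\leftarrow \alpha \mathbf{A}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,0,1}^{(t+1)})_{(1)}\, (\mathcal{Z}_{\cdot,0,1})_{(1)}^{\dagger}, \\
\mathbf{C}'^{\,new} &\leftarrow \alpha \mathbf{C}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,0,1}^{(t+1)})_{(3)}\, (\mathcal{Z}_{1,0,\cdot})_{(3)}^{\dagger}, \\
\mathbf{B}'^{\,new} &\leftarrow \alpha \mathbf{B}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{0,1,1}^{(t+1)})_{(2)}\, (\mathcal{Z}_{0,\cdot,1})_{(2)}^{\dagger}, \\
\mathbf{C}'^{\,new} &\leftarrow \alpha \mathbf{C}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{0,1,1}^{(t+1)})_{(3)}\, (\mathcal{Z}_{0,1,\cdot})_{(3)}^{\dagger},
\end{aligned} \tag{19}$$

where

$$\begin{aligned}
\mathcal{Z}_{\cdot,0,1} &= \mathcal{G}^{(t)} \times_2 \mathbf{B}^{(t)} \times_3 \mathbf{C}', \\
\mathcal{Z}_{1,0,\cdot} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}' \times_2 \mathbf{B}^{(t)}, \\
\mathcal{Z}_{0,\cdot,1} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}^{(t)} \times_3 \mathbf{C}', \\
\mathcal{Z}_{0,1,\cdot} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}^{(t)} \times_2 \mathbf{B}'.
\end{aligned} \tag{20}$$

Finally, as in the previous steps, we orthogonalize the extension matrix obtained each time.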
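(iii): Updating Extension Matrix with $\mathbb{C}_3$.
There is only one sub-tensor $\mathcal{X}_{1,1,1}^{(t+1)}$ in $\mathbb{C}_3$, and $u_1 = u_2 = u_3 = 1$. The extension matrices are updated according to Eq. (21),

$$\begin{aligned}
\mathbf{A}'^{\,new} &\leftarrow \alpha \mathbf{A}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,1,1}^{(t+1)})_{(1)}\, (\mathcal{Z}_{\cdot,1,1})_{(1)}^{\dagger}, \\
\mathbf{B}'^{\,new} &\leftarrow \alpha \mathbf{B}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,1,1}^{(t+1)})_{(2)}\, (\mathcal{Z}_{1,\cdot,1})_{(2)}^{\dagger}, \\
\mathbf{C}'^{\,new} &\leftarrow \alpha \mathbf{C}'^{\,old} + (1 - \alpha)\, (\mathcal{X}_{1,1,1}^{(t+1)})_{(3)}\, (\mathcal{Z}_{1,1,\cdot})_{(3)}^{\dagger},
\end{aligned} \tag{21}$$

where

$$\begin{aligned}
\mathcal{Z}_{\cdot,1,1} &= \mathcal{G}^{(t)} \times_2 \mathbf{B}' \times_3 \mathbf{C}', \\
\mathcal{Z}_{1,\cdot,1} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}' \times_3 \mathbf{C}', \\
\mathcal{Z}_{1,1,\cdot} &= \mathcal{G}^{(t)} \times_1 \mathbf{A}' \times_2 \mathbf{B}'.
\end{aligned} \tag{22}$$

Similarly, orthogonalizing the extension matrices gives the final results $\mathbf{A}'$, $\mathbf{B}'$ and $\mathbf{C}'$. Then, $\mathbf{A}^{(t)}$, $\mathbf{B}^{(t)}$ and
$\mathbf{C}^{(t)}$ are concatenated with them along the second mode according to Eq. (13). Finally, orthogonalizing the results gives the factor matrices $\mathbf{A}^{(t+1)}$, $\mathbf{B}^{(t+1)}$ and $\mathbf{C}^{(t+1)}$
at time $t+1$.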

4.2.3. Updating core tensor

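After obtaining all the factor matrices $\mathbf{A}_{(n)}^{(t+1)} \in \mathbb{R}^{(I_n^{(t)} + d_n) \times R_n}$ of the tensor at time $t+1$, we split them into $\mathbf{A}_{(n),0}^{(t+1)} = (\mathbf{A}_{(n)}^{(t+1)})_{(0:I_n^{(t)},:)} \in \mathbb{R}^{I_n^{(t)} \times R_n}$
and $\mathbf{A}_{(n),1}^{(t+1)} = (\mathbf{A}_{(n)}^{(t+1)})_{(I_n^{(t)}:I_n^{(t)}+d_n,:)} \in \mathbb{R}^{d_n \times R_n}$. After that, the core tensor $\mathcal{G}^{(t+1)}$ is calculated according to Eq. (23), where $\Theta'$ is
the remaining part of $\Theta$ except $(0, \cdots, 0)$.

$$\begin{aligned}
\mathcal{G}^{(t+1)} &= \mathcal{X}^{(t+1)} \times \{(\mathbf{A}_{(n)}^{(t+1)})^{T}\} \\
&= \sum_{(u_1,\cdots,u_N) \in \Theta} \mathcal{X}_{u_1,\cdots,u_N}^{(t+1)} \times \{(\mathbf{A}_{(n),u_n}^{(t+1)})^{T}\} \\
&= \mathcal{X}_{0,\cdots,0}^{(t+1)} \times \{(\mathbf{A}_{(n),0}^{(t+1)})^{T}\} + \sum_{(u_1,\cdots,u_N) \in \Theta'} \mathcal{X}_{u_1,\cdots,u_N}^{(t+1)} \times \{(\mathbf{A}_{(n),u_n}^{(t+1)})^{T}\} \\
&= \mathcal{G}^{(t)} \times \{(\mathbf{A}_{(n),0}^{(t+1)})^{T} \mathbf{A}_{(n)}^{(t)}\} + \sum_{(u_1,\cdots,u_N) \in \Theta'} \mathcal{X}_{u_1,\cdots,u_N}^{(t+1)} \times \{(\mathbf{A}_{(n),u_n}^{(t+1)})^{T}\}.
\end{aligned} \tag{23}$$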

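Next, the update process of the core tensor is illustrated with the third-order tensor as an example. First, the factor matrices $\mathbf{A}^{(t+1)}$,
$\mathbf{B}^{(t+1)}$ and $\mathbf{C}^{(t+1)}$ are divided into $\mathbf{A}_0^{(t+1)}$, $\mathbf{A}_1^{(t+1)}$, $\mathbf{B}_0^{(t+1)}$, $\mathbf{B}_1^{(t+1)}$, $\mathbf{C}_0^{(t+1)}$, $\mathbf{C}_1^{(t+1)}$, respectively. Then the core tensor is calculated according
to Eq. (24)

$$\mathcal{G}^{(t+1)} = (\mathcal{X}_{0,0,0}^{(t+1)}) \times_1 (\mathbf{A}_0^{(t+1)})^{T} \times_2 (\mathbf{B}_0^{(t+1)})^{T} \times_3 (\mathbf{C}_0^{(t+1)})^{T} + \sum_{(u_1,u_2,u_3) \in \Theta'} \mathcal{X}_{u_1,u_2,u_3}^{(t+1)} \times_1 (\mathbf{A}_{u_1}^{(t+1)})^{T} \times_2 (\mathbf{B}_{u_2}^{(t+1)})^{T} \times_3 (\mathbf{C}_{u_3}^{(t+1)})^{T}. \tag{24}$$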
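The block-wise core update can be checked numerically with the sketch below. It reuses the time-$t$ core for the old block and only touches the newly added sub-tensors. The function names, sizes, and random data are assumptions; the old data is generated exactly from its Tucker factors so that the incremental result can be compared against a direct projection of the full tensor.

```python
import itertools
import numpy as np

def multi_mode_product(tensor, matrices):
    """Apply matrices[n] (shape P_n x I_n) along mode n, for every mode."""
    out = tensor
    for mode, matrix in enumerate(matrices):
        out = np.moveaxis(np.tensordot(matrix, out, axes=(1, mode)), 0, mode)
    return out

def update_core(core_old, factors_old, factors_new, X_new, old_shape):
    # Split each new factor into rows for old indices (block 0) and new ones (block 1).
    blocks = [(f[:i], f[i:]) for f, i in zip(factors_new, old_shape)]
    # Old block: reuse the time-t decomposition instead of the raw old data.
    core = multi_mode_product(
        core_old, [b0.T @ a for (b0, _), a in zip(blocks, factors_old)])
    for u in itertools.product((0, 1), repeat=X_new.ndim):
        if not any(u):
            continue                      # (0, ..., 0) is handled above
        idx = tuple(slice(0, i) if bit == 0 else slice(i, None)
                    for bit, i in zip(u, old_shape))
        mats = [blocks[m][bit].T for m, bit in enumerate(u)]
        core = core + multi_mode_product(X_new[idx], mats)
    return core

rng = np.random.default_rng(0)
old_shape, new_shape, ranks = (3, 4, 5), (4, 6, 6), (2, 2, 2)
core_old = rng.standard_normal(ranks)
factors_old = [np.linalg.qr(rng.standard_normal((i, r)))[0]
               for i, r in zip(old_shape, ranks)]
X_new = rng.standard_normal(new_shape)
X_new[:3, :4, :5] = multi_mode_product(core_old, factors_old)  # exact old data
factors_new = [np.linalg.qr(rng.standard_normal((i, r)))[0]
               for i, r in zip(new_shape, ranks)]

core_inc = update_core(core_old, factors_old, factors_new, X_new, old_shape)
core_direct = multi_mode_product(X_new, [f.T for f in factors_new])
```

In this synthetic setting the two cores agree to rounding error. With a truncated decomposition of real data, the old block is only approximated by its factors, so the incremental core is an approximation of the direct one.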

Finally, the multi-modal incremental tensor decomposition method is summarized in Algorithm 1.

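Algorithm 1: MMITD.
Input: $N$-order tensors $\mathcal{X}^{(t)}$ and $\mathcal{X}^{(t+1)}$
Output: Low-rank $\mathcal{X}_{new}^{(t+1)}$, $\mathcal{G}^{(t+1)}$, $\mathbf{A}_{(n)}^{(t+1)}$
Rank $(R_1, R_2, \cdots, R_N)$ of $\mathcal{X}^{(t)}$ ← according to Eq. (9)
$\mathcal{G}^{(t)}$, $\mathbf{A}_{(n)}^{(t)}$ ← truncated Tucker decomposition of $\mathcal{X}^{(t)}$
$\mathbb{C}_n$ ← partition the tensor $\mathcal{X}^{(t+1)}$
for $\mathbb{C}_n$, $n = \{1, 2, \cdots, N\}$ do
    for $\mathcal{X}_{u_1,\cdots,u_N}^{(t+1)} \in \mathbb{C}_n$ do
        for $u_n \in \{u_1, u_2, \cdots, u_N\}$ do
            if $u_n = 1$ then
                Calculate the extension matrix $\mathbf{A}'_{(n)}$ according to Eq. (12)
                Orthogonalize the matrix $\mathbf{A}'_{(n)}$
            end
        end
    end
end
for $\mathbf{A}'_{(n)}$, $n = \{1, 2, \cdots, N\}$ do
    $\mathbf{V}_{(n)}^{(t+1)}$ ← concatenate $\mathbf{A}'_{(n)}$ and $\mathbf{A}_{(n)}^{(t)}$
    $\mathbf{A}_{(n)}^{(t+1)}$ ← orthogonalize the matrix $\mathbf{V}_{(n)}^{(t+1)}$
end
Partition $\mathbf{A}_{(n)}^{(t+1)}$ into $\mathbf{A}_{(n),0}^{(t+1)}$ and $\mathbf{A}_{(n),1}^{(t+1)}$
$\mathcal{X}_{new}^{(t+1)}$, $\mathcal{G}^{(t+1)}$ ← according to Eq. (6) and Eq. (23)
return $\mathcal{X}_{new}^{(t+1)}$, $\mathcal{G}^{(t+1)}$, $\mathbf{A}_{(n)}^{(t+1)}$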


Table 2
Description of features in dataset NSL-KDD.

Order | Feature | Order | Feature
(1) | duration | (21) | is_guest_login
(2) | protocol_type | (22) | count
(3) | service | (23) | srv_count
⋯ | ⋯ | ⋯ | ⋯
(19) | num_outbound_cmds | (39) | dst_host_rerror_rate
(20) | is_host_login | (40) | dst_host_srv_rerror_rate

Table 3
Description of features in dataset CICDDOS2019.

Order | Feature | Order | Feature
(1) | Source-Port | (33) | Packet-Length-Min
(2) | Destination-Port | (34) | Packet-Length-Max
(3) | Total-Fwd-Packet | (35) | Packet-Length-Avg
⋯ | ⋯ | ⋯ | ⋯
(31) | Fwd-Packet/s | (63) | Std-Dev-Time-Idle-Flow
(32) | Bwd-Packet/s | (64) | Min-Time-Idle-Flow

4.3. Anomaly detection

Completing the above steps yields the results $\mathcal{G}^{(t+1)}$ and $\mathbf{A}_{(n)}^{(t+1)}$ of the Tucker decomposition at time $t+1$, and thus the approximate low-rank tensor $\mathcal{X}_{new}^{(t+1)}$. The low-rank tensor is then transformed into a two-dimensional form for anomaly detection. Our detection method uses XGBoost to classify the traffic as normal or abnormal.
XGBoost is a gradient-boosting-based ML algorithm that is efficient, flexible, and scalable. It can handle large-scale and high-dimensional data, provides fast and accurate classification results, and mitigates overfitting through regularization. For these reasons, XGBoost is widely used in various fields.
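
The transformation of the low-rank tensor into a two-dimensional form can be sketched as below. This is a hedged illustration, not the authors' code: it assumes that the third mode indexes the traffic records and that the $I \times J$ entries of each record are read out in row-major order, and the function name `unfold_samples` is hypothetical.

```python
def unfold_samples(tensor):
    """Flatten an I x J x K tensor (nested lists) into K rows of I * J features."""
    I, J, K = len(tensor), len(tensor[0]), len(tensor[0][0])
    # Row k collects every (i, j) entry of the k-th frontal slice in row-major order.
    return [[tensor[i][j][k] for i in range(I) for j in range(J)] for k in range(K)]


# Toy 2 x 3 x 2 tensor whose entries encode their own indices as 100*k + 10*i + j.
toy = [[[100 * k + 10 * i + j for k in range(2)] for j in range(3)] for i in range(2)]
rows = unfold_samples(toy)
# rows[0] holds the six features of sample 0, rows[1] those of sample 1.
```

Each row of the result is one sample, so the matrix can be passed together with the 0/1 labels to a classifier such as `xgboost.XGBClassifier`. The classifier call is omitted here to keep the sketch free of third-party dependencies.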

5. Performance evaluation

This section evaluates the performance of the proposed anomaly detection framework. It contains three subsections: the first describes the two datasets used in the experiments, the second introduces the evaluation metrics, and the third presents the experimental results and analysis.

5.1. Datasets description

The datasets used for anomaly detection in the experiments are the widely used NSL-KDD benchmark dataset [31] and the more recent CICDDOS2019 dataset [32]. Both datasets are described in detail below.

5.1.1. NSL-KDD
There are 41 features in the NSL-KDD dataset. After converting the non-numeric features to numerical values and removing features that contribute little to the classification, the final data contains 40 features and class labels. Then, we change the label of normal data to ‘0’ and the label of abnormal data to ‘1’. Part of the feature information is shown in Table 2.

5.1.2. CICDDOS2019
The CICDDOS2019 dataset contains normal traffic and recent DDoS attack traffic, and resembles real-world data. The dataset is very large, and each record contains 87 features. In the experiments, we remove the features that have no impact on anomaly detection, leaving 64 features [28] and the labels. As before, we change the label of normal data to ‘0’ and the label of abnormal data to ‘1’. Part of the feature information is shown in Table 3.

5.2. Evaluation metrics

This subsection describes the evaluation metrics, including CPU running time, Accuracy, Precision, Recall, False Alarm Rate, and
F1-Score.
CPU running time: Measures the running efficiency of the detection algorithm, including the tensor decomposition time and the classification time.
Accuracy: Accuracy is a widely used evaluation metric for classification models. It is the proportion of correctly classified samples among all samples, and is defined as
$$Accuracy = \frac{TP + TN}{TP + FP + TN + FN}, \tag{25}$$
where $TP$, $TN$, $FP$ and $FN$ denote the numbers of true positives, true negatives, false positives and false negatives, respectively.

Fig. 8. 5-fold cross-validation technology.

Precision: Precision is judged from the predictions and indicates the proportion of samples predicted as positive that are actually positive. It is defined as
$$Precision = \frac{TP}{TP + FP}. \tag{26}$$
Recall: Recall is judged from the actual samples and indicates the proportion of actual positive samples that are correctly predicted. It is defined as
$$Recall = \frac{TP}{TP + FN}. \tag{27}$$
False Alarm Rate: The False Alarm Rate (FAR), also known as the False Positive Rate or False Detection Rate, is the proportion of actual negative samples that are incorrectly predicted as positive. A lower value indicates better model performance. The FAR is defined as
$$FAR = \frac{FP}{TN + FP}. \tag{28}$$
F1-Score: The F1-Score combines precision and recall to evaluate the model. It is high, indicating a more robust model, only when both precision and recall are high. The F1-Score is defined as
$$F1\text{-}Score = \frac{2 \cdot Precision \cdot Recall}{Precision + Recall}. \tag{29}$$
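
As a worked example, Eqs. (25)-(29) can be evaluated directly from the four confusion-matrix counts. The function name and the counts below are hypothetical and are not taken from the experiments. The sketch does not guard against empty classes, where a denominator would be zero.

```python
def classification_metrics(tp, tn, fp, fn):
    """Evaluate Eqs. (25)-(29) from confusion-matrix counts."""
    precision = tp / (tp + fp)                    # Eq. (26)
    recall = tp / (tp + fn)                       # Eq. (27)
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),            # Eq. (25)
        "precision": precision,
        "recall": recall,
        "far": fp / (tn + fp),                                  # Eq. (28)
        "f1": 2 * precision * recall / (precision + recall),    # Eq. (29)
    }


# Hypothetical counts: 100 positive and 100 negative test samples.
m = classification_metrics(tp=90, tn=95, fp=5, fn=10)
for name, value in m.items():
    print(f"{name}: {value:.4f}")
```

With these counts the accuracy is 185/200 = 0.925, the recall is 0.9, the FAR is 0.05, and the F1-Score equals 180/195, which lies between the precision and the recall, as expected for a harmonic mean.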

5.3. Experimental results and analysis

For the NSL-KDD dataset in Table 2, the data matrix is denoted as $\mathbf{X} \in R^{N \times K}$. In the detection framework, $\mathbf{X}$ is modeled as a third-order tensor $\mathcal{X} \in R^{I \times J \times K}$, where $I = 5$, $J = 8$, $I \times J = N$, and $K$ is the total data volume. The CICDDOS2019 dataset in Table 3 is likewise folded into a third-order tensor $\mathcal{X} \in R^{I \times J \times K}$, where $I = J = 8$. The experimental process is as follows: (i) different proportions of data are randomly selected to obtain $\mathcal{X} \in R^{I \times J \times K'}$, where $K' = K \times f$ and $f$ is the selected proportion; (ii) different initial and incremental tensors are set for each proportion of data, and the multi-modal incremental tensor decomposition is performed using Algorithm 1; (iii) 5-fold cross-validation is used to generate the training and testing sets. As shown in Fig. 8, the dataset is divided into 5 blocks. Each block serves once as the testing set and four times as part of the training set, yielding 5 sets of experimental results. The final evaluation metrics are the averages over these 5 sets of results.
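
The 5-fold procedure described above can be sketched as follows. This is a minimal illustration with hypothetical helper names. It assumes the samples have already been randomly selected, so the blocks are taken as contiguous index ranges without further shuffling.

```python
def k_fold_indices(n_samples, k=5):
    """Yield (train_idx, test_idx) pairs; each block is the testing set exactly once."""
    # Spread any remainder over the first blocks so that sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if r < n_samples % k else 0) for r in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


def mean_over_folds(scores):
    """Average one evaluation metric over the per-fold results."""
    return sum(scores) / len(scores)


folds = list(k_fold_indices(10, k=5))
for train_idx, test_idx in folds:
    print("test:", test_idx, "train size:", len(train_idx))
```

In an experiment, a classifier would be trained on `train_idx` and scored on `test_idx` for each of the five pairs, and `mean_over_folds` would then give the reported value of each metric.
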
Then, we explore the effect of the impact factor $\alpha$ on the detection performance in the incremental tensor decomposition part. For the NSL-KDD dataset, the initial tensor size is set to $4 \times 6 \times (K' \times 0.98)$, and the total tensor size is $5 \times 8 \times K'$ after the dimensions are increased. For the CICDDOS2019 dataset, the initial tensor size is set to $6 \times 6 \times (K' \times 0.98)$ and the total tensor size is $8 \times 8 \times K'$. The proportion $f$ is set to 0.2 in this experiment, and the results are shown in Fig. 9. The value of the impact factor $\alpha$ has little effect on the accuracy. Therefore, $\alpha = 0.6$ is used in the subsequent experiments.
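
To make the tensor sizes above concrete, the shapes of the incremental sub-tensors can be enumerated for the NSL-KDD setting. The sketch rests on assumptions that should be checked against the partitioning section: each sub-tensor is indexed by $(u_1, u_2, u_3) \in \{0,1\}^3$, $u_n = 1$ marks the newly added range along mode $n$, and the all-zero index is the initial tensor. The value of `K_prime` and the function name are hypothetical.

```python
from itertools import product


def incremental_blocks(old_shape, new_shape):
    """Map each non-zero index (u1, ..., uN) in {0,1}^N to the shape of its sub-tensor."""
    blocks = {}
    for u in product((0, 1), repeat=len(old_shape)):
        if not any(u):
            continue  # (0, ..., 0) is the initial tensor itself
        blocks[u] = tuple(new - old if bit else old
                          for bit, old, new in zip(u, old_shape, new_shape))
    return blocks


K_prime = 1000                       # hypothetical number of selected samples
old = (4, 6, K_prime * 98 // 100)    # initial tensor: 4 x 6 x (0.98 K')
new = (5, 8, K_prime)                # total tensor after growth: 5 x 8 x K'
blocks = incremental_blocks(old, new)
for u, shape in sorted(blocks.items()):
    print(u, shape)
```

Under these assumptions there are $2^3 - 1 = 7$ incremental sub-tensors. For example, index `(0, 0, 1)` has shape `(4, 6, 20)`, covering the new samples over the original feature layout, and index `(1, 1, 1)` has shape `(1, 2, 20)`.
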
In addition, when the factor matrix is updated in the detection framework of this paper, each extension matrix is orthogonalized immediately after it is updated. Compared with orthogonalizing only the final extension matrix, this achieves better detection performance on both the NSL-KDD and CICDDOS2019 datasets, as shown in Fig. 10.
Finally, a series of comparative experiments is presented in the following subsections.


Fig. 9. Change in ACC with the impact factor $\alpha$.

Fig. 10. ACC when each extension matrix is orthogonalized versus when only the last extension matrix is orthogonalized, on (a) the NSL-KDD dataset and (b) the CICDDOS2019 dataset.

5.3.1. Different decomposition algorithms

This section compares the proposed method with existing tensor decomposition techniques, namely TSSD [29], HOSVD, and CP. First, different sizes of data are randomly selected from the NSL-KDD and CICDDOS2019 datasets. Then each tensor decomposition technique is applied to both datasets, and the XGBoost classification algorithm is used for anomaly detection. The detection performance achieved by the different decomposition algorithms is shown in Fig. 11 and Fig. 12, from which the following conclusions can be drawn.

• The detection framework proposed in this paper achieves better detection performance for network traffic data growing along multiple modes.
• Compared with the other methods, the framework in this paper has higher Accuracy, Precision, Recall, and F1-Score, and a lower False Alarm Rate on the two network datasets.
• In Fig. 11(f) and Fig. 12(f), where the running time is plotted on a logarithmic scale, the detection framework of this paper has a greatly reduced running time compared to the other methods. As the volume of data increases, it runs tens, hundreds, or even thousands of times faster than the other techniques.

5.3.2. Different classification algorithms

First, different proportions of data are randomly selected from the two network traffic datasets and divided into initial and incremental tensors, which are then decomposed using the multi-modal incremental tensor decomposition algorithm in this paper to obtain the low-rank tensor. Finally, different machine learning classification algorithms are applied to the low-rank tensor data to compare classification performance. The ML classification algorithms include XGBoost, Gradient Boosting Decision Tree (GBDT), Support Vector Machine (SVM), Logistic Regression (LR), Linear Discriminant Analysis (LDA), and Random Forest (RF). The classification performance of the different algorithms on the two datasets is shown in Fig. 13 and Fig. 14, from which the following conclusions can be drawn.


Fig. 11. Comparing the detection performance of different tensor decomposition algorithms on the NSL-KDD dataset.

• After being processed by Algorithm 1, data growing along multiple modes achieve better detection performance across different classification algorithms and dataset sizes.
• Compared to the other classification methods, the XGBoost classification technique has faster detection speed and better detection performance on different proportions of data.
• The traffic anomaly detection framework proposed in this paper is highly robust. Better detection performance can be achieved on different categories of datasets and different dataset sizes.


Fig. 12. Comparing the detection performance of different tensor decomposition algorithms on the CICDDOS2019 dataset.

6. Conclusion

In this paper, we propose an anomaly detection framework for large-scale networks based on multi-modal incremental tensor decomposition. The framework is suitable for dynamic anomaly detection systems and considers the case where network data grows along multiple modes. For this type of data, Tucker decomposition of the multi-modal incremental tensor is used to reduce the computational cost and to remove data redundancy and noise, thereby improving data quality. Compared with other tensor decomposition-based methods, our detection method only uses low-cost tensor-matrix multiplication and matrix pseudo-inversion, and does not perform a complete tensor decomposition on large-scale data. Therefore, the framework proposed in this paper is more suitable for large-scale data and achieves higher detection speed and accuracy.
In addition, the XGBoost classification technique is used in the anomaly detection module, which further improves the performance of our detection framework. In future work, we intend to apply this detection framework to SDN network architectures to avoid serious harm due to network attacks.

Fig. 13. Comparing the detection performance of different ML algorithms on the NSL-KDD dataset.


Fig. 14. Comparing the detection performance of different ML algorithms on the CICDDOS2019 dataset.

CRediT authorship contribution statement

Rongqiao Fan: Writing – original draft, Software, Methodology. Qiyuan Fan: Writing – review & editing, Writing – original draft,
Supervision, Software, Resources, Project administration, Methodology, Formal analysis, Data curation, Conceptualization. Xue Li:
Resources, Investigation. Puming Wang: Writing – review & editing, Supervision, Conceptualization. Jing Xu: Validation, Formal
analysis. Xin Jin: Visualization. Shaowen Yao: Project administration. Peng Liu: Data curation.


Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

The authors do not have permission to share data.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Project Nos. 62166047, 62101481, 62002313) and the 15th Graduate Research Innovation Project of Yunnan University (No. KC-23234593).

References

[1] H. Ringberg, A. Soule, J. Rexford, et al., Sensitivity of PCA for traffic anomaly detection, in: Proceedings of the 2007 ACM SIGMETRICS International Conference
on Measurement and Modeling of Computer Systems, 2007, pp. 109–120.
[2] P. Mignone, R. Corizzo, M. Ceci, Distributed and explainable GHSOM for anomaly detection in sensor networks, Mach. Learn. (2024) 1–42.
[3] M. Wang, D. Hong, Z. Han, et al., Tensor decompositions for hyperspectral data processing in remote sensing: a comprehensive review, IEEE Geosci. Remote
Sens. Mag. 11 (1) (2023) 26–72.
[4] H. Xiao, F. Wang, F. Ma, et al., eOTD: an efficient online Tucker decomposition for higher order tensors, in: 2018 IEEE International Conference on Data Mining
(ICDM), IEEE, 2018, pp. 1326–1331.
[5] L. Huang, X.L. Nguyen, M. Garofalakis, et al., In-network PCA and anomaly detection, Adv. Neural Inf. Process. Syst. (2006) 19.
[6] A. Lakhina, M. Crovella, C. Diot, Diagnosing network-wide traffic anomalies, ACM SIGCOMM Comput. Commun. Rev. 34 (4) (2004) 219–230.
[7] Y.R. Yeh, Z.Y. Lee, Y.J. Lee, Anomaly detection via over-sampling principal component analysis, in: New Advances in Intelligent Decision Technologies, Springer,
Berlin, Heidelberg, 2009, pp. 449–458.
[8] Y.J. Lee, Y.R. Yeh, Y.C.F. Wang, Anomaly detection via online oversampling principal component analysis, IEEE Trans. Knowl. Data Eng. 25 (7) (2012) 1460–1470.
[9] J. Udhayan, T. Hamsapriya, Statistical segregation method to minimize the false detections during ddos attacks, Int. J. Netw. Secur. 13 (3) (2011) 152–160.
[10] S. Fortunati, F. Gini, M.S. Greco, et al., An improvement of the state-of-the-art covariance-based methods for statistical anomaly detection algorithms, in: Signal,
Image and Video Processing, vol. 10, 2016, pp. 687–694.
[11] X. Han, L. Xu, M. Ren, et al., A Naive Bayesian network intrusion detection algorithm based on Principal Component Analysis, in: 2015 7th International
Conference on Information Technology in Medicine and Education (ITME), IEEE, 2015, pp. 325–328.
[12] H. Peng, Z. Sun, X. Zhao, et al., A detection method for anomaly flow in software defined network, IEEE Access 6 (2018) 27809–27817.
[13] R.H. Hwang, M.C. Peng, C.W. Huang, et al., An unsupervised deep learning model for early network traffic anomaly detection, IEEE Access 8 (2020) 30387–30399.
[14] Z. Li, X. Chen, J. Song, et al., Adaptive label propagation for group anomaly detection in large-scale networks, IEEE Trans. Knowl. Data Eng. (2022).
[15] K. Wu, Z. Chen, W. Li, A novel intrusion detection model for a massive network using convolutional neural networks, IEEE Access 6 (2018) 50850–50859.
[16] S. Garg, K. Kaur, N. Kumar, et al., Hybrid deep-learning-based anomaly detection scheme for suspicious flow detection in SDN: a social multimedia perspective,
IEEE Trans. Multimed. 21 (3) (2019) 566–578.
[17] P. Wang, L.T. Yang, G. Qian, et al., HO-OTSVD: a novel tensor decomposition and its incremental decomposition for cyber–physical–social networks (CPSN),
IEEE Trans. Netw. Sci. Eng. 7 (2) (2019) 713–725.
[18] P. Wang, L.T. Yang, J. Li, et al., Data fusion in cyber-physical-social systems: state-of-the-art and perspectives, Inf. Fusion 51 (2019) 42–57.
[19] Q. Song, X. Huang, H. Ge, et al., Multi-aspect streaming tensor completion, in: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge
Discovery and Data Mining, 2017, pp. 435–443.
[20] P. Wang, L.T. Yang, J. Li, et al., MMDP: a mobile-IoT based multi-modal reinforcement learning service framework, IEEE Trans. Serv. Comput. 13 (4) (2020)
675–684.
[21] K. Yang, Y. Gao, Y. Shen, et al., Dismastd: an efficient distributed multi-aspect streaming tensor decomposition, in: 2021 IEEE 37th International Conference on
Data Engineering (ICDE), IEEE, 2021, pp. 1080–1091.
[22] C. Liu, T. Wu, Z. Li, et al., Robust online tensor completion for IoT streaming data recovery, IEEE Trans. Neural Netw. Learn. Syst. (2022).
[23] J. Sun, D. Tao, S. Papadimitriou, et al., Incremental tensor analysis: theory and applications, ACM Trans. Knowl. Discov. Data 2 (3) (2008) 1–37.
[24] P. Wang, L.T. Yang, X. Nie, et al., Data-driven software defined network attack detection: state-of-the-art and perspectives, Inf. Sci. 513 (2020) 65–83.
[25] X. Li, K. Xie, X. Wang, et al., Online Internet anomaly detection with high accuracy: a fast tensor factorization solution, in: IEEE INFOCOM 2019-IEEE Conference
on Computer Communications, IEEE, 2019, pp. 1900–1908.
[26] K. Xie, X. Li, X. Wang, et al., Fast tensor factorization for accurate internet anomaly detection, IEEE/ACM Trans. Netw. 25 (6) (2017) 3794–3807.
[27] W. Huang, K. Xie, J. Li, A novel sequence tensor recovery algorithm for quick and accurate anomaly detection, IEEE Trans. Netw. Sci. Eng. 9 (5) (2022) 3531–3545.
[28] J.P.A. Maranhão, J.P.C.L. da Costa, E. Javidi, et al., Tensor based framework for Distributed Denial of Service attack detection, J. Netw. Comput. Appl. 174
(2021) 102894.
[29] J. Xu, X. Li, P. Wang, et al., Multi-modal noise-robust DDoS attack detection architecture in large-scale networks based on tensor SVD, IEEE Trans. Netw. Sci.
Eng. 10 (1) (2022) 152–165.
[30] X. Sáez-de-Cámara, J.L. Flores, C. Arellano, et al., Clustered federated learning architecture for network anomaly detection in large scale heterogeneous IoT
networks, Comput. Secur. 131 (2023) 103299.
[31] M. Tavallaee, E. Bagheri, W. Lu, et al., A detailed analysis of the KDD CUP 99 data set, in: 2009 IEEE Symposium on Computational Intelligence for Security and
Defense Applications, IEEE, 2009, pp. 1–6.
[32] I. Sharafaldin, A.H. Lashkari, S. Hakak, et al., Developing realistic distributed denial of service (DDoS) attack dataset and taxonomy, in: 2019 International
Carnahan Conference on Security Technology (ICCST), IEEE, 2019, pp. 1–8.
