
1684 IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 17, 2022

A Quantum-Inspired Classifier for Early Web Bot Detection

Alberto Cabri, Francesco Masulli, Senior Member, IEEE, Stefano Rovetta, Senior Member, IEEE, and Grażyna Suchacka, Senior Member, IEEE

Manuscript received December 11, 2021; revised April 9, 2022; accepted April 12, 2022. Date of publication April 25, 2022; date of current version May 6, 2022. This work was supported by the ICT COST Action IC1406 High-Performance Modelling and Simulation for Big Data Applications (cHiPSet). The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Alexey Vinel. (Corresponding author: Alberto Cabri.)
Alberto Cabri and Stefano Rovetta are with the Department of Informatics, Bioengineering, Robotics, and Systems Engineering (DIBRIS), University of Genoa, 16146 Genoa, Italy (e-mail: [email protected]; [email protected]).
Francesco Masulli is with the Department of Informatics, Bioengineering, Robotics, and Systems Engineering (DIBRIS), University of Genoa, 16146 Genoa, Italy, and also with the Sbarro Institute for Cancer Research and Molecular Medicine, Temple University, Philadelphia, PA 19122 USA (e-mail: [email protected]).
Grażyna Suchacka is with the Institute of Informatics, University of Opole, 45-040 Opole, Poland (e-mail: [email protected]).
Digital Object Identifier 10.1109/TIFS.2022.3170237
1556-6021 © 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: National Institute of Technology. Downloaded on July 29,2024 at 10:06:25 UTC from IEEE Xplore. Restrictions apply.

Abstract: This paper introduces a novel approach, inspired by the principles of Quantum Computing, to address web bot detection in terms of real-time classification of an incoming data stream of HTTP request headers, in order to ensure the shortest decision time with the highest accuracy. The proposed approach exploits the analogy between the intrinsic correlation of two or more particles and the dependence of each HTTP request on the preceding ones. Starting from the a-posteriori probability of each request to belong to a particular class, it is possible to assign a Qubit state representing a combination of the aforementioned probabilities for all available observations of the time series. By leveraging the underlying mathematical details of superposition and entanglement on specific subsequences, it is possible to devise a measure of membership to each class, thus enabling the system to take a reliable decision when a sufficient level of confidence is met or to continue with additional observations. The results reported in this paper objectively show the effectiveness of our quantum-inspired algorithm, which outperforms other state-of-the-art approaches, including our own one based on the Sequential Probability Ratio Test.

Index Terms: Quantum-inspired computing, bot detection, sequential classification, early decision, multinomial classification, multivariate sequence classification.

I. INTRODUCTION

In the era of Big Data, huge volumes of varied data are collected at high velocity in several contexts, posing new challenges concerning timely recognition of anomalous or critical events.

Whenever event data are indexed on time, the relevant dataset represents a time series where each observation is somehow related to its temporal neighbors. Being able to automatically classify a sequence is a highly valuable task, and even more important is the ability to label a time series with the fewest possible observations [1], [2].

Many time series applications consider classification accuracy as the essential point and give no particular importance to the speed of decision. An example of such tasks is signature forgery detection, where on-the-fly (OTF) classification is not required whereas high accuracy is a crucial performance metric [3].

Conversely, timely decisions are an essential feature on an extrusion line in order to detect and amend possible defects before the product integrity gets compromised [4].

These simple considerations illustrate the dual aspects of time series analysis, which lead to different approaches for the various problems. As reported in [5], the approaches can be categorized as offline, whenever a complete sequence should be analyzed before labeling, or online (also known as on-the-fly), if a decision must be made as soon as possible, based on incoming observations.

The latter is commonly known as early classification of time series [1]. Examples of such challenging problems can be found in various industrial scenarios, as shown in Table I, often related to the processing of data streams from connected devices or sensors (Internet of Things), which enable harvesting huge amounts of data, most frequently as a sequence of correlated observations or measures. Even video sources can be treated as a sequence of time-related events, where each event is associated with a single video frame.

In all those cases, such as the ones listed in Table I, measures are collected over time and need to be analyzed in a timely manner to extract useful information about potentially critical conditions.

TABLE I
EXAMPLE OF EARLY CLASSIFICATION PROBLEMS FOR TIME SERIES

Time series classification models usually target the recognition rate as their main goal, but this is not sufficient for early classification or prediction, where earliness of decision becomes a mandatory key performance indicator.

A sequence of events that, for whatever reason, may end up compromising a piece of equipment should be detected in the shortest possible time, as any delay could cause damage and unnecessary costs [6].

This paper addresses the problem of on-the-fly early classification for online data streams, where data are usually statistically dependent and inherently correlated over time, as in the case of web bot detection, a highly critical task in cybersecurity applications, where we need to distinguish automatic web robots from human users.

Moreover, we aim at labeling a temporal sequence of events using the smallest number of observations. The task is therefore an early decision problem, based on an incomplete set
of events that requires OTF evaluation and stretches over an undefined time horizon. A critical aspect is finding the optimal trade-off between decision speed, defined in relation to the number of observations required by the trained system to take a decision, and classification accuracy, which are conflicting constraints.

To this aim, we present a new method for early classification of online data streams, inspired by the principles of quantum computing, able to classify a series of HTTP requests with outstanding accuracy and very effective in early decision making without any knowledge of the sequences' time horizon. Note that no physical interpretation of quantum theory is implied by our algorithm, despite the analogical adoption of the underlying mathematical details. The proposed approach is completely myopic and no delay cost estimate is required to force early decision, because it leverages the intrinsic structure of data to propose a class label.

One important remark is that, to the best of our knowledge, no public datasets are available for bot detection, making it difficult to compare the presented results to other relevant studies; hence, the SPRT approach, originally discussed in [7], has been compared with the quantum-inspired algorithm to confirm its efficacy both in terms of classification metrics and decision time.

The remainder of this paper is organized as follows: Section II presents the state of the art on possible approaches to early data stream classification; Section III introduces the theoretical background on quantum computing, which is required to understand the proposed method; Section IV illustrates the validation process of the proposed method using synthetic data; Section VI describes the test problem that has been used to verify this novel approach, while Section VII presents the structure of the dataset used for bot detection and the relevant features. Section VIII describes its application to the chosen classification problem, regarding the analysis of web traffic logs of a real e-commerce portal; in Section IX the experimental results are reported and discussed; lastly, Section X offers concluding remarks and cues for extending the research and the possible areas of future application.

II. STATE OF THE ART

Monitoring natural and industrial processes often produces massive volumes of sequential data (data streams), usually indexed over time.

Several methods are available for modeling sequential data, but statistical models, such as ARMA or ARIMA [8], [9], aimed at time series prediction, assume linearity of the data model, which means that the time series is either stationary or can be converted into a stationary one. Most often, time series are non-stationary because their statistical properties vary over time and thus require data models built on training data [10], such as Artificial Neural Networks (ANN) [9].

Often, machine learning techniques are not suitable for sequential data because these algorithms disregard the statistical structure of a time series and are sensitive to noise, which is always present in data streams.

Many effective time series classification approaches are available in the literature [2], [11], but they are not suitable for early decision: it is worth underlining that early decision is a task for analyzing data streams collected in real time and locating the earliest event that supports a reliable decision, according to a given cost function, from an incomplete set of temporally related data. It is an example of optimal stopping theory [12] because a given action is taken from sequential observations of a random variable, according to misclassification or delay costs.

The authors of [13] present a time series classification strategy from incomplete information, introducing the notion of reliability as the probability required when labeling an incomplete time series as if it were the complete data stream. As an alternative for sequential binary classification, the authors also refer to SPRT [14], which is a Bayes-optimal approach, but highlight the greedy nature of this probabilistic model, where new observations have no impact on the cumulative log-likelihood calculated from previous ones.

SPRT has also been successfully used in [7] as a probability integrator, with reject option, on the same bot detection task proposed in this paper; it outperforms a real-time binomial classification approach, presented in [15], that relies on a first-order Discrete Time Markov Chain (DTMC) [16], [17] to estimate the class conditional probability according to the likelihoods of the initial state and the following transition patterns.

In [18], the authors address early classification for some time-sensitive applications in healthcare by means of an effective 1-Nearest Neighbor (1NN) classifier, whose major advantage is not needing any feature selection, pre-processing, training, or configuration parameters.
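As a point of reference for the SPRT [14] mentioned above, the following minimal Python sketch shows Wald's sequential test used as a probability integrator with a reject option. It is an illustration under stated assumptions, not the implementation used in [7]: the function name `sprt_decide`, the error rates `alpha` and `beta`, and the per-observation likelihood pairs are all hypothetical.

```python
import math

def sprt_decide(likelihoods, alpha=0.05, beta=0.05):
    """Wald's SPRT over a stream of (p_h0, p_h1) likelihood pairs.

    Returns (label, n_observations). The label is None when the stream
    ends before either threshold is crossed (reject option).
    alpha and beta are the tolerated false-positive and false-negative
    rates; the defaults are illustrative and not taken from the paper.
    All likelihoods are assumed to be strictly positive.
    """
    upper = math.log((1 - beta) / alpha)   # accept H1 at or above this value
    lower = math.log(beta / (1 - alpha))   # accept H0 at or below this value
    llr = 0.0                              # cumulative log-likelihood ratio
    n = 0
    for n, (p_h0, p_h1) in enumerate(likelihoods, start=1):
        llr += math.log(p_h1) - math.log(p_h0)
        if llr >= upper:
            return 1, n
        if llr <= lower:
            return 0, n
    return None, n
```

Each new observation only adds its own log-ratio to the running sum, which is the greedy behavior noted above: evidence already accumulated is never revised.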


In [6], early classification is made by means of probabilistic classifiers, named Early Classification framework based on class Discriminativeness and Reliability of Predictions (ECDIRE), that learn the timestamps when accuracy begins to exceed class-defined thresholds. The predictions are released only when timestamps match the learned values. It focuses on a set of time series of equal length, but ECDIRE can be utilized on variable or unknown length sequences with a few minor changes.

Early odor identification by means of electronic nose sensors is addressed in [19], where the authors analyze subsequent signal chunks collected at the sensors to feed an ensemble of serially connected classifiers, with a reject option, and assign a class label when sufficient confidence is attained.

Most early classification approaches in the literature, such as [2], [20], work on univariate time series and need the entire sequence upfront. The approaches for multivariate sequences become more complex because the distance measures must be able to express the correlation among features [21].

Multivariate time series cannot be treated as a collection of univariate ones, because there exists a hidden relationship among features that holds important information for the representation of real processes.

In order to leverage the correlation property in multivariate time series, [22], [23] propose Correlation Based Dynamic Time Warping (CBDTW), which creates a non-overlapping segmentation of a time series by means of:
• Principal Component Analysis (PCA) based similarity measures to segment an unclassified sequence;
• a cost function to map each chunk to a non-negative real number and DTW distance to train the classifier.

Statistical analysis drives an interesting adaptive non-myopic approach [24] that requires the entire sequence to be available upfront and considers a penalty factor, similarly to [19], related to decision delay and a misclassification cost to balance quality of prediction and speed of decision.

Another early classification model suitable for multivariate time series is presented in [25] on biomedical data, specifically in multivariate gene expression. This hybrid approach binds a generative Hidden Markov Model (HMM) [26], which exploits dependencies among observations on temporal segments, and a Support Vector Machine (SVM) [27] for efficient discrimination of sequences.

A totally different approach to early classification of biomedical multivariate time series, based on shapelets, is proposed in [28]. The method, named Multivariate Shapelet Detection (MSD), can achieve highly accurate classification rates analyzing up to 64% of each test sequence.

The strategy proposed in [29] looks for sub-concepts or sub-clusters that characterize the same class label. The feature variables are independently scanned to uncover the inner structure of the MTS by means of core shapelets eligible for the classifier.

In [54], the authors report various quantum algorithms that are equivalent to classical machine learning but use quantum optimization to accelerate the training process or target binary classification problems, such as Quantum SVM [55] or Quantum PCA [56]. They also propose an interesting Quantum Neural Network (QNN) for time series prediction and modeling.

A true quantum algorithm for time series classification is proposed in [57], where the authors make use of quantum computing by formulating the reconstruction task as a quadratic unconstrained binary optimization (QUBO) problem, although it is not quantum-inspired.

To the best of our knowledge, only a very limited number of quantum-inspired classification methods are available, mainly focused on binary problems.

Binary classification is the objective of a very recent quantum-inspired method, proposed by [30], that applies quantum formalism to classical computational problems, confirming a growing interest in the topic and its promising outcomes. A binary classifier is used to solve the quantum state discrimination problem introduced by Helstrom [31], considering that multiple copies of a quantum state can provide more information than the state itself. This supervised algorithm, tested on real-world and simulated binomial datasets from the Penn Machine Learning Benchmark repository [32], outperforms, on average, all the most frequently used classifiers.

Another approach, described in [33], might look similar to the one in this paper: it estimates the density operators for each class and applies projective measurement on quantum states to label each data element. However, it neither addresses time series nor exploits entanglement in classification, which confirms the innovative nature of our work.

The algorithms analyzed so far propose several possible approaches to early time series classification, but they are either too specific for particular tasks, or present some limitations with regard to the number of features in the input stream or the number of classes in the target, or require that the whole time series be available upfront. Our proposal overcomes the aforesaid limitations by introducing a real-time classification approach that, in principle, works with any number of features and classes to determine a reliable decision at the earliest moment in time, never considering the complete sequence.

III. THE QUANTUM CLASSIFIER

Quantum computing applies quantum-mechanical principles to data processing [34].

Those fundamental principles are:
• Superposition, which results from linearity of the solutions of Schrödinger's equation. Adding together multiple quantum states determines another valid state and, conversely, any quantum state can be split up as a sum of any number of valid states.
• Entanglement, which occurs when the state of a composite system cannot be written as a product of states of its component systems [53]. Entangled particles can express a stronger connection than their classical analogues.

The quantum bit or qubit is a two-state quantum system that can be in a superposition of states 0 and 1 at the same time, unlike classical bits.

The quantum equivalent of classical 0 and 1 logic states is defined by the basis states of a qubit, which can be represented
in ket notation by the following column vectors [35]:

$$|0\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \quad \text{and} \quad |1\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}.$$

The state vectors form an orthonormal basis, hence their inner products $\langle x|y\rangle$ are:

$$\langle 0|0\rangle = \langle 1|1\rangle = 1 \quad \text{and} \quad \langle 0|1\rangle = \langle 1|0\rangle = 0,$$

where the bra operator $\langle x|$ is the conjugate-transpose of the ket, defined as $\langle x| = |x\rangle^{\dagger}$.

A pure qubit state $|\psi\rangle$ can be expressed as a superposition of the basis states

$$|\psi\rangle = \alpha |0\rangle + \beta |1\rangle, \tag{1}$$

where $\alpha$ and $\beta$, termed probability amplitudes, are usually complex numbers such that $|\alpha|^2$ and $|\beta|^2$ represent the probability that, after a measurement, the state $|\psi\rangle$ is detected in the state $|0\rangle$ or $|1\rangle$ respectively, thus leading to

$$|\alpha|^2 + |\beta|^2 = 1. \tag{2}$$

The factorization of two or more qubits [36] is called a composite state, computed by means of the tensor product $\otimes$, as in the following example:

$$|011\rangle = |0\rangle \otimes |1\rangle \otimes |1\rangle. \tag{3}$$

As sequential data streams are generally characterized by an intrinsic correlation among nearby samples, entanglement becomes a fundamental property to enforce the interrelationship among observations of a time series.

By definition, a state is considered entangled if it is not separable into its fundamental parts, that is, two distinct particles of a system are entangled if one cannot be described without considering the other. Moreover, they can be entangled even if separated by a considerable distance [37].

As an example,

$$|\psi\rangle = \frac{1}{\sqrt{2}} \left( |00\ldots0\rangle + |11\ldots1\rangle \right)$$

represents $n$ entangled qubits in equal superposition, or CatState; in the example, states $|00\ldots0\rangle$ and $|11\ldots1\rangle$ have equal probabilities $\left|\frac{1}{\sqrt{2}}\right|^2 = \frac{1}{2}$. The above state is not separable because it is impossible to write it as a tensor product of single-qubit states.

The term CatState refers to quantum superposition of two macroscopically distinct states and is derived from the hypothetical Schrödinger's cat experiment.

The behavior of a physical system can be described by a general framework defined by four postulates of quantum mechanics. Two postulates are related to the superposition and measurement principles, whereas another describes the evolution of a closed quantum system in terms of the Schrödinger equation. Finally, the fourth one describes the admissible states for composing two or more subsystems and asserts that the state space of a composite quantum system is the tensor product (symbol $\otimes$) of the state spaces of its components [38].

If $|\psi_1\rangle \ldots |\psi_n\rangle$ describe the states of $n$ isolated quantum systems, the state of the composite system is

$$|\psi\rangle = |\psi_1\rangle \otimes \cdots \otimes |\psi_n\rangle.$$

The last aspect to consider is how to measure the probabilities of each basis state from the resulting composite state: in a real quantum system, the measurement process alters its state, which turns into the pure state corresponding to the outcome of measurement. It can be regarded as an interface between the quantum and the classical domains, being the only way to extract useful information from a quantum system [38].

According to the third postulate of quantum mechanics, a collection of measurement operators acting linearly on the state space of the system can be used to measure a quantum state: this is commonly termed projective measurement.

If a system can have $M$ possible valid outcomes, a set of operators $\{P_m : m \in M\}$ can be identified in order to obtain the probability of measuring $m$ from the system state $|\psi\rangle$, which is

$$p(m) = \langle\psi| P_m^{\dagger} P_m |\psi\rangle,$$

where the symbol $\dagger$ indicates complex conjugation and transposition.

The operators are subject to the following condition:

$$\sum_{m \in M} P_m^{\dagger} P_m = I,$$

which ensures that all probabilities add up to 1, as per:

$$\sum_{m \in M} p(m) = \sum_{m \in M} \langle\psi| P_m^{\dagger} P_m |\psi\rangle = \langle\psi| I |\psi\rangle = 1.$$

For the two basis states $|0\rangle$ or $|1\rangle$, measurement is performed through the projectors $P_0 = |0\rangle\langle 0|$ or $P_1 = |1\rangle\langle 1|$ respectively, gathering the probabilities $p_0$ and $p_1$.

Therefore, the probability $p_0$ of a qubit being in state $|0\rangle$ can be obtained through projective measurement by the following equation

$$p_0 = \langle\psi| P_0 |\psi\rangle. \tag{4}$$

Alternatively, whenever the post-measurement state is not significant, it is possible to define a density operator that describes the whole system [38]

$$\rho = \sum_i P_i |\psi_i\rangle\langle\psi_i|, \tag{5}$$

with the following constraints:
1) Trace condition: $\mathrm{Tr}(\rho) = 1$,
2) Positivity condition: $\rho$ is a positive operator.

The trace is a linear operator, hence in the case of a two-state quantum system, the trace condition can be expanded as

$$\mathrm{Tr}(\rho) = \mathrm{Tr}\left( \sum_{i=0}^{1} P_i |\psi_i\rangle\langle\psi_i| \right) = \mathrm{Tr}(P_0 |\psi\rangle\langle\psi|) + \mathrm{Tr}(P_1 |\psi\rangle\langle\psi|),$$

which leads to the generalized probability $p_i$ of state $|i\rangle$, expressed by

$$p_i = \mathrm{Tr}(P_i |\psi\rangle\langle\psi|). \tag{6}$$
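The relations (1)-(4) and (6) can be checked numerically. The following NumPy sketch is only an illustration: the amplitudes 0.6 and 0.8 and all variable names are assumptions chosen for the example, not values from the paper.

```python
import numpy as np

# Basis kets |0> and |1> as column vectors (stored as 1-D arrays).
ket0 = np.array([1.0, 0.0])
ket1 = np.array([0.0, 1.0])

# Superposition (1): |psi> = alpha|0> + beta|1>, with |alpha|^2 + |beta|^2 = 1 (2).
alpha, beta = 0.6, 0.8          # illustrative amplitudes, not from the paper
psi = alpha * ket0 + beta * ket1

# Composite state (3): |011> = |0> (x) |1> (x) |1>, via the Kronecker product.
ket011 = np.kron(np.kron(ket0, ket1), ket1)

# Projectors P0 = |0><0| and P1 = |1><1|.
P0 = np.outer(ket0, ket0.conj())
P1 = np.outer(ket1, ket1.conj())

# Projective measurement (4): p0 = <psi|P0|psi>, and likewise for p1.
p0 = np.vdot(psi, P0 @ psi).real
p1 = np.vdot(psi, P1 @ psi).real

# Trace form (6): p_i = Tr(P_i |psi><psi|) yields the same values.
rho = np.outer(psi, psi.conj())
p0_trace = np.trace(P0 @ rho).real
p1_trace = np.trace(P1 @ rho).real
```

With these amplitudes the two routes agree, and the composite ket has its single non-zero entry at index 3, the binary value of 011.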


In this paper, we propose a multinomial generalization of this setting, called Quantum Entangled Multinomial Classifier (QEMC), by defining the reference orthonormal basis for $N$ classes as

$$|0\rangle = \begin{pmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix}, \quad |1\rangle = \begin{pmatrix} 0 \\ 1 \\ \vdots \\ 0 \end{pmatrix}, \quad \ldots, \quad |N-1\rangle = \begin{pmatrix} 0 \\ 0 \\ \vdots \\ 1 \end{pmatrix}.$$

A pure qubit state $|\psi\rangle$ derives from the superposition of all basis states, according to the equation

$$|\psi\rangle = \alpha_0 |0\rangle + \alpha_1 |1\rangle + \cdots + \alpha_{N-1} |N-1\rangle, \tag{7}$$

where $|\alpha_i|^2$ is the probability of state $|i\rangle$ and $\sum_i |\alpha_i|^2 = 1$.

At each time step $t$, let $f_i(x_t)$, $i \in [0, N-1]$, be the class conditional probabilities of the current observation $x_t$ in the data stream.

Let

$$\alpha_{i,t} = f_i(x_t);$$

then $T$ subsequent observations of class $i$ can be composed into a $T$-qubit state $|\psi_i\rangle$ by means of:

$$|\psi_i\rangle = \alpha_i |ii \ldots i\rangle = \alpha_{i,0} |i\rangle \otimes \alpha_{i,1} |i\rangle \otimes \cdots \otimes \alpha_{i,T-1} |i\rangle.$$

As an example, the state $|\psi_0\rangle$ for a hypothetical class 0 at the fifth observation can be computed through:

$$|\psi_0\rangle = \alpha_0 |00000\rangle = \alpha_{0,0} |0\rangle \otimes \alpha_{0,1} |0\rangle \otimes \alpha_{0,2} |0\rangle \otimes \alpha_{0,3} |0\rangle \otimes \alpha_{0,4} |0\rangle.$$

The state $|\psi\rangle$ representing a whole data stream after $T$ observations can be expressed as the superposition of $N$ states, each featuring some correlation among collected observations of the relevant class, according to:

$$|\psi\rangle = |\psi_0\rangle + |\psi_1\rangle + \cdots + |\psi_{N-1}\rangle. \tag{8}$$

At every time step $t$, the state of the quantum system $|\psi\rangle$ can be measured to provide the individual class probabilities $p_i(t)$, $i \in [0, N-1]$, and, given a task-dependent level of confidence $C$, make appropriate decisions as:

$$\begin{cases} i & \text{if } p_i(t) \ge C \\ \text{None} & \text{otherwise} \end{cases} \tag{9}$$

If None is still output when the session ends, the session is ultimately classified as undecided (reject option) and considered an error. Undecided sessions appear as a separate indicator to be considered when tuning the appropriate level of confidence $C$. As a matter of fact, undecided sessions represent the inability of our classifier to fulfill its purpose but, even if it is clear that the correct class cannot be designated, none of the wrong ones can be selected as most representative without committing a mistake.

Finally, since the probabilities $p_i(t)$ measured on state $|\psi\rangle$ are normalized, for any value of $C$ greater than 0.5, the condition expressed by (9) becomes necessary and sufficient for a mutually exclusive decision.

QEMC is also characterized as a greedy algorithm, as it tries to achieve the best classification results by analyzing local probability maxima, which are not guaranteed to be optimal overall.

IV. VALIDATION ON SYNTHETIC DATA

A. Generation of Synthetic Data

The applicability of QEMC was first validated on synthetic datasets of probabilities, generated for an increasing number of classes.

The synthetic datasets simulate the results of an element-wise stream classification; therefore, they contain a list of $N$ class probabilities for a specified number of sessions having variable length up to a desired maximum number of samples.

In order to ensure a sensible bias for a specific class, every session is randomly assigned a ground truth value and, for each sample, the probability $p_{true}$ of the True class is randomly taken from a continuous uniform distribution in the $[0, 1)$ interval.

The residual probability value, $p_{res} = 1 - p_{true}$, is then used in combination with a Dirichlet distribution to generate $N$ random values that add up to $p_{res}$: these likelihoods are arbitrarily allotted to each class and $p_{true}$ is added to the True class.

Even if a single event line does not express a clear statement on which is the True class, the session is clearly biased and this is what the algorithm is supposed to exploit in order to make a timely decision.

Table II displays the sample structure of an $N$-class data stream, which is saved as a CSV file.

TABLE II
EXAMPLE OF GENERATED PROBABILITIES. THE Session COLUMN IDENTIFIES THE ELEMENTS IN THE SAME SERIES, WHOSE CLASS PROBABILITIES ARE REPORTED IN THE Class_i COLUMNS; THE Label IS THE GROUND TRUTH

B. Measuring the Quantum State

In Section III, the measurement process for determining the qubit state has been addressed from the theoretical viewpoint, but it is also useful to add some practical considerations about its actual implementation.

Measurement is the only way to extract useful information from a quantum system and, in the real world, it exhibits some peculiar properties that should, in principle, be replicated in software simulations. These are:
1) in a real quantum system, the measurement process alters the state of the system;
2) after measurement, the system turns into the pure state associated with the outcome of measurement.
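The data generation of Section IV-A and the decision rule (9) can be sketched together in Python. This is one possible reading of the method, under loudly stated assumptions, and not the authors' implementation. Because the kets $|ii \ldots i\rangle$ of different classes are mutually orthogonal, the sketch takes the amplitude of class $i$ to be the product of its $\alpha_{i,t}$ over a sliding window and obtains $p_i(t)$ by normalizing the squared amplitudes, without ever building the full tensor-product vector. The function names, the symmetric Dirichlet parameter, the window handling, and the default values are hypothetical.

```python
import numpy as np

def make_session(n_classes, length, true_class, rng):
    """Synthetic class probabilities for one session, following Section IV-A."""
    p_true = rng.uniform(0.0, 1.0, size=length)          # p_true drawn from [0, 1)
    # Assumption: symmetric Dirichlet with all parameters equal to 1.
    # Each sample sums to 1, so scaling by p_res makes a row sum to p_res.
    rest = rng.dirichlet(np.ones(n_classes), size=length)
    probs = rest * (1.0 - p_true)[:, None]
    probs[:, true_class] += p_true                        # rows now sum to 1
    return probs

def qemc_like_decide(probs, confidence=0.9, window_size=4):
    """Simplified, QEMC-like early decision (an interpretation, see lead-in).

    Returns (label, n_observations); label is None if the session ends
    without reaching the confidence level (undecided, reject option).
    """
    probs = np.asarray(probs, dtype=float)
    for t in range(1, len(probs) + 1):
        window = probs[max(0, t - window_size):t]   # most recent observations
        amplitudes = window.prod(axis=0)            # amplitude of |ii...i> per class
        weights = amplitudes ** 2                   # unnormalized probabilities
        total = weights.sum()
        if total == 0.0:
            continue                                # degenerate window, keep observing
        p = weights / total                         # normalized class probabilities
        best = int(np.argmax(p))
        if p[best] >= confidence:                   # decision rule (9)
            return best, t
    return None, len(probs)
```

For instance, a session whose every observation reports probabilities (0.6, 0.3, 0.1) is not decided at the first observation, where the normalized probability of class 0 is about 0.78, but is assigned to class 0 at the second one, where it rises to about 0.94 and exceeds a confidence level of 0.9.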

TABLE III the overall classification scores, which tend to flatten for peep
N UMBER OF S ESSIONS PER C LASS IN S YNTHETICALLY values greater than or equal to 4.
G ENERATED D ATASETS
As an exception, in the binomial case, it is possible to com-
pute the entangled states with a simpler procedure independent
of peep. With more than three classes, experimental evidence
shows that accuracy reaches its upper limit before exceeding
the greatest bearable peep value, which was at the upper limit
of 10 on our machine.

As a consequence, in a real system, it is impossible to B. Results on Synthetic Data

estimate the likelihood of all possible basis states because,
The problem basically aims at optimizing two contrasting
once measured, the qubit no longer contains information about
goals:
the other ones.
Simulation software usually measures the quantum states • maximize classification accuracy,

by generating a random number and reading the associated • minimize the number of observations required to make a

output, which is what quantum theory would require. decision.

Nevertheless, in our quantum-inspired algorithm, we are A possible approach is based on multi-objective optimiza-
not concerned about using a strictly rigorous approach to tion, also known as Pareto optimization [44], to pick the
measurement and, conversely, we utilize the density operator optimal threshold as a function of selected indicators and
defined in (6) to assess the probability, integrated over time, optimization objectives.
of each individual basis state and return the top value and its Possible solutions in the decision space are rated according
associated basis state. to multiple objective functions to find a setting which is
Normalization of the resulting quantum state, before optimal in some sense.
measurement takes place, ensures that all probabilities add Pareto strategy defines a set of non-dominated solutions that
up to one and therefore the classification threshold can be cannot be improved on one objective without degrading at least
constrained within zero and one. one of the others.
With two objective functions, it is possible to plot the
V. E XPERIMENTAL S ETUP solution space and visualize the set of Pareto optimal solutions,
which is also called Pareto frontier.
All experiments were executed on an Intel Core i7 3.4 GHz workstation with 16 GB of RAM, running the Microsoft Windows 10 operating system with no CUDA support.
The software procedures were developed in Python [39], version 3, with additional support from the following standard libraries: NumPy [40], Matplotlib [41], Scikit-Learn [42] and Pandas [43].
Extensive testing was executed on three synthetically generated datasets, containing from two to four well-balanced classes respectively, totaling 10,000 sessions whose individual length does not exceed 100 observations.
The detailed breakdown of sessions by class label is reported in Table III.

A. Complexity Analysis
In its simplest implementation, the proposed algorithm would have an intractable exponential spatial complexity and cubic time complexity due to the use of the tensor product. Specifically, if $N$ is the number of classes and $L_{max}$ is the maximum length of the time series, the spatial complexity is $O(N^{L_{max}+1})$ whereas the temporal one is $O(L_{max}^{3})$. However, the finite-memory property of the addressed application problem can be exploited to bound the spatial requirements and, consequently, the time complexity. A sliding window mechanism was set up to limit the number of observations considered when calculating the entangled states. This technique was termed peep, for it acts as a peephole on the data stream, and it was empirically verified that peep values (window sizes) greater than 8, in most cases, don’t bring any improvement to

The performance indicators required to plot the Pareto frontier are collected by means of a grid search on the following algorithm parameters:
• the confidence level C, or decision threshold, with values C ∈ {0.55, 0.6, . . . , 0.9, 0.95, 0.99, 0.995, 0.998},
• the sliding window size with peep ∈ {4, 8}.
For each configuration of the grid search, the legend for parameters and summary indicators used in this paper is reported in Table IV.
Table V reports, for a peep equal to four, the parameters and their relevant metrics for those points on the Pareto front that maximize classification accuracy, minimize the number of undecided sessions or minimize the length of the decision sequence. In order to consider the worst case, undecided sessions were included in the accuracy score.
It is evident that for low values of the decision threshold we have contrasting results depending on the aim of the Pareto optimization, whereas on more selective thresholds the performance metrics are exactly the same on both sides. At low threshold values, it is possible to zero the number of unclassified sessions, with about a 5% decrease in accuracy to the advantage of decision speed, even if the greatest number of sessions is classified within the second or third observation. At higher thresholds, the accuracy increase exceeds 14.5% at the cost of having 251 undecided sessions, which definitely compensates for the number of erroneously classified ones of the former scenarios. Undecided sessions could be considered a limitation at first glance but, if the algorithm were analyzing a real time data feed instead of a fixed size dump file, further


TABLE IV
PARAMETERS AND SUMMARY INDICATORS LOGGED ON GRID SEARCH

TABLE V
CLASSIFICATION RESULTS FOR 10,000 SESSIONS WITH 3 CLASSES (INCLUDING UNDECIDED SESSIONS)

observations might become available for undecided sessions and sooner or later make a reliable decision possible.
With the same settings, the classifier performance can also be assessed on an increasing number of classes, easily generated with our tool. The metrics reported in Table VI share DT = 0.995 and PEEP = 4 as common settings.
The ADS indicator is defined as the average, over the total number of sequences $N$, of the decision timestamps $t_i$ weighted by the number of sequences $n_i$ classified at a given instant, that is:

$$\mathrm{ADS} = \frac{1}{N} \sum_{i=1}^{N} t_i \cdot n_i .$$

Even if the number of undecided sessions reduces the overall accuracy, its value stays steady above 97%, with very few classification errors in the binary case. If we hadn’t considered unclassified streams, as if we could observe more events to support a trustful decision, we could ideally reach 100% accuracy for three and four classes and 99.98% for the binomial case respectively, with as many as 27 observations analyzed in the single worst case.
Moreover, seventy percent of the classified sessions are correctly labeled within the fifth observation and QEMC needs only 8 steps to classify ninety percent of them.
According to the specific goals of the classification task, it is possible to tune the threshold to favor either accuracy or LDS, given that in all cases the ADS indicator denotes high classification speed on average.
This initial experimental session pointed out an intrinsic limitation of the proposed quantum-inspired approach, allegedly due to the hardware specifications of our machine. Basically, in addition to the exponential complexity related to sequence length, the number of classes also represents a sort of barrier hampering the adoption of QEMC.
On the test machine, whose technical specifications are reported at the beginning of this section, up to 10 classes could be detected simultaneously without compromising overall system performance: alternative hierarchical approaches are possible but major changes to the proposed classification architecture are required to support two or more levels of refinement. For instance, if we were to predict possible component failures on a cyber-physical system, it would be possible to implement a first classification level capable of discriminating among the potentially affected subsystems and then pass only the involved data streams to a specialized classifier that is fine tuned for the given subsystem.
In principle, this hierarchical approach makes it possible to cope with multinomial classification problems of any size, even on edge computers with extremely limited resources.

VI. THE BOT DETECTION PROBLEM

The application area on which we focused our experiments is cyber-security and specifically web robot detection from HTTP request server logs [5], [45], [46], similarly to the work of [47]–[49].

TABLE VI
CLASSIFICATION RESULTS FOR 10,000 SESSIONS WITH 2-3-4 CLASSES
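The ADS indicator defined just before Table VI can be illustrated with a minimal sketch. It assumes that the sum runs over the distinct decision instants and that $N$ counts only the sequences that were actually classified; how undecided sessions enter $N$ is not stated in the text, and the function name is hypothetical.

```python
from collections import Counter
from typing import Iterable


def average_decision_step(decision_steps: Iterable[int]) -> float:
    """ADS = (1/N) * sum_i t_i * n_i.

    decision_steps holds, for every classified sequence, the step at which
    the decision was taken (assumption: undecided sequences are left out).
    """
    counts = Counter(decision_steps)   # n_i: sequences decided at instant t_i
    total = sum(counts.values())       # N: number of classified sequences
    if total == 0:
        raise ValueError("no classified sequences")
    return sum(t * n for t, n in counts.items()) / total


# Two sequences decided at step 1, one at step 2, one at step 3.
ads = average_decision_step([1, 1, 2, 3])
```

Under these assumptions ADS is simply the mean decision step, which is why a low value denotes fast classification on average.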

As evidenced in preceding sections, the multinomial version of our algorithm is a generalization of the binary approach, originally designed for bot classification from real-time HTTP traffic data at the web server, and uses the same dataset for the experimental part in order to compare the results.
It is an early decision, multivariate, sequential classification task on a non-stationary data stream.
Web robots, or simply bots, are software programs capable of autonomously executing specific tasks over the internet, whose aim can be either good or malicious [50], [51].
These autonomous agents are pervading the net and many bots have useful purposes, such as search engine crawlers or price comparers, but some others have malicious goals, like stealing sensitive data, injecting malware or executing other harmful activities, and therefore must be identified as soon as possible to reduce their negative effects.
Usually, bots are detected through offline analysis of web server logs because it allows for a deeper understanding of their behavioral model, thus highlighting the crawling differences between humans and robots [52]. Nevertheless, it would be helpful to enable web servers to tell robots and humans apart in real time and implement specific management policies that ensure the best user experience.
Concerning real time detection, to the best of our knowledge, two methods require special attention and therefore will be analyzed in detail and compared to the present quantum-inspired algorithm. The first method, described in [15], is based on transition maps and hidden Markov models, whereas the second one leverages Wald’s Sequential Probability Ratio Test (SPRT) to gather information from subsequent events and eventually make a decision [7].
The solution proposed by Doran and Gokhale in [15] is an integrated method for real time and offline web robot detection that analyzes the differences between human and software visitors in the resource request patterns, considered time invariant by the authors, and imposes a minimum number of events to be observed before deciding.
Some basic concepts have to be defined for a common understanding of the remainder of this document:
• a session, according to common practice, is a series of requests pertaining to the same IP address and user agent string, separated by a time gap shorter than thirty minutes;
• a request pattern is the ordered sequence of resource requests received at the web server during a session.
Though humans and robots request different specific resources during each visit, it is not possible to characterize visitors by the mere list of requested resources. Conversely, the order in which resource files are accessed by a human visitor is inherently different from that of crawling algorithms, which are unlikely to exhibit human-like behaviors.
These considerations led the authors of [15] to define a sensible taxonomy of possible resource file types, organized into 9 more general aggregations, whence they derived a semantical representation of all resource request patterns, which is capable of expressing the differences between humans and robots.

VII. THE DATASET FOR BOT DETECTION

The dataset used to test the proposed algorithm has already been utilized in [7], [46] to compare DTMC versus SPRT and contains the sequences of HTTP request headers from many different working sessions.
Each session has been manually labeled as bot (label 1) or human (label 0) generated and the classifier tries to take a reliable decision before the session ends or labels the session as undecided. Appropriate actions can then be taken on the undecided sessions according to the specific task objectives.
In order to apply the different classification models to the same bot dataset, no feature selection policy is implemented and all available features are considered, but proper pre-processing transformations are needed on the original features depending on their type.
The features, as shown in Table VII, can be divided into three categories, each requiring different pre-processing actions:
• numerical features (N) are standardized by subtracting the mean and scaling to unit variance;
• categorical features (C) are transformed into the corresponding one-hot encoding;
• boolean features (B) are simply translated to their numerical equivalent: 0 for False and 1 for True.
After each feature has been transformed as explained above, each HTTP request is represented as a 25-feature vector and the corresponding session becomes a series of time related vectors.
The entire dataset contains 13,395 sessions for a total number of 1,397,838 HTTP requests. The session breakdown is as follows: 6,190 sessions are labeled as bots, 7,200 can be associated to human activities and 5 sessions were excluded because it was not possible to allot them to any class with sufficient confidence.
Finally, the dataset was prepared for a 10-fold cross-validation training by manually partitioning the sessions into ten roughly balanced subsets, each consisting of 619 and 720 sessions for the bot and human classes respectively.
The good balance between bot and human sessions implies that either accuracy or F1 score can be indifferently selected as the representative metric to evaluate the performance of the proposed algorithm.
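The session definition adopted in Section VI (same IP address and user agent string, consecutive requests less than thirty minutes apart) can be sketched as follows. This is only an illustration: the tuple layout, the use of timestamps in seconds and the function names are assumptions, not the procedure used to build the dataset.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

SESSION_GAP_SECONDS = 30 * 60  # thirty-minute gap from the session definition

Request = Tuple[float, str, str, str]   # (timestamp_s, ip, user_agent, resource)
Visitor = Tuple[str, str]               # (ip, user_agent)
Hit = Tuple[float, str]                 # (timestamp_s, resource)


def split_sessions(requests: Iterable[Request]) -> List[Tuple[Visitor, List[Hit]]]:
    """Group requests by visitor and cut a new session at each long gap."""
    by_visitor: Dict[Visitor, List[Hit]] = defaultdict(list)
    for ts, ip, agent, resource in sorted(requests):  # chronological order
        by_visitor[(ip, agent)].append((ts, resource))

    sessions: List[Tuple[Visitor, List[Hit]]] = []
    for visitor, hits in by_visitor.items():
        current = [hits[0]]
        for previous, hit in zip(hits, hits[1:]):
            if hit[0] - previous[0] < SESSION_GAP_SECONDS:
                current.append(hit)          # same session continues
            else:
                sessions.append((visitor, current))
                current = [hit]              # gap too long: start a new session
        sessions.append((visitor, current))
    return sessions


def request_pattern(hits: List[Hit]) -> List[str]:
    """Ordered sequence of requested resources within one session."""
    return [resource for _, resource in hits]


log = [
    (0.0, "10.0.0.1", "UA-1", "/index.html"),
    (600.0, "10.0.0.1", "UA-1", "/style.css"),
    (2400.0, "10.0.0.1", "UA-1", "/cart"),       # 1800 s after the previous hit
    (100.0, "10.0.0.2", "UA-2", "/robots.txt"),
]
patterns = [request_pattern(hits) for _, hits in split_sessions(log)]
```

The request pattern of each session keeps the arrival order, which is the property the sequential classifier relies on.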


TABLE VII
FEATURES LIST OF THE FEATURES AVAILABLE FOR MODEL TRAINING BEFORE PRE-PROCESSING
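The three pre-processing rules listed in Section VII for numerical (N), categorical (C) and boolean (B) features can be sketched in plain Python as follows. The example columns and the function names are hypothetical, and the population standard deviation is assumed for the scaling to unit variance; the actual feature list is the one in Table VII.

```python
import statistics
from typing import List, Sequence


def standardize(values: Sequence[float]) -> List[float]:
    """Subtract the mean and scale to unit variance (population std assumed)."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values) or 1.0   # constant column: avoid dividing by 0
    return [(v - mean) / std for v in values]


def one_hot(values: Sequence[str]) -> List[List[float]]:
    """One column per category, in sorted category order."""
    categories = sorted(set(values))
    return [[1.0 if v == c else 0.0 for c in categories] for v in values]


def preprocess(rows: Sequence[Sequence], kinds: str) -> List[List[float]]:
    """kinds has one letter per input column: 'N', 'C' or 'B'."""
    encoded = []
    for column, kind in zip(zip(*rows), kinds):
        if kind == "N":
            encoded.append([[x] for x in standardize(column)])
        elif kind == "C":
            encoded.append(one_hot(column))
        elif kind == "B":
            encoded.append([[1.0 if x else 0.0] for x in column])
        else:
            raise ValueError(f"unknown feature kind: {kind}")
    # Concatenate the encoded pieces of each row into one feature vector.
    return [[x for piece in pieces for x in piece] for pieces in zip(*encoded)]


# Hypothetical columns: response size (N), HTTP method (C), has-referrer flag (B).
rows = [(200.0, "GET", True), (400.0, "POST", False)]
vectors = preprocess(rows, "NCB")
```

One-hot encoding widens the vector, which is how a modest number of raw features can become the 25-feature representation mentioned in the text.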

VIII. THE SOFTWARE MODEL FOR BOT CLASSIFICATION

A. The Two-Stage Classification Model
The classification model can be ideally divided into two logical stages. The first stage, built upon a deep neural network, is responsible for learning the classification model and assigning an a-posteriori conditional class probability estimate to each individual HTTP request, independently of any other entry of the training sequences. It can eventually be replaced by any classifier which best fits the available data size to produce the aforesaid probability estimates: in the present case, the multi-layer perceptron was selected as the best option amongst the models we tested.
The second stage is based on the quantum-inspired entangled classifier described in section III, designed for a two-class setting. It is noteworthy that, even if the problem is intrinsically binary, the classification outcome of the quantum module is three-state valued because a session might end before the system can take any reliable decision. Those sessions are then provisionally labeled as undecided and can be either neglected or included in the performance metrics computation, slightly affecting the overall results.

B. Stage 1: Probability Estimation
The neural network implements supervised learning, setting aside a fraction of the dataset for model validation and using the remaining part for training with 10-fold cross-validation. The neural network is based on the MLPClassifier of the scikit-learn toolkit [42] and it is designed as a sigmoid output unit on top of two 50-unit hidden layers with ReLU activation function. This neural network configuration has heuristically proved to be the most effective among those tested for the dataset under examination. The terminal sigmoid layer has been selected because its output is a real number constrained between zero and one and therefore can be interpreted by the cascade stage as a probability estimate for the relevant class.
In the generalized approach for N classes, the output layer is composed of N Softmax units that calculate probabilities whose sum is always 1.

C. Stage 2: The Quantum Classifier Module
The second stage is the Quantum Entangled Multinomial Classifier proposed in section III for the binomial setting.
The reference orthonormal basis is defined as:

$$|0\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad |1\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}.$$

Let $x_t$ be a sequence of HTTP request samples associated to a specific session and $y \in \{0, 1\}$ be the relevant ground truth, which is obviously the same across each session. The probability of request $i$ being bot or human generated is computed by means of the Multi-Layer Perceptron at stage one and is stored in $p_{ki}$, $k \in \{0, 1\}$.
As explained earlier in this document, quantum entanglement can be used to express a higher level of correlation among quantum states. Therefore, as each request in a session belongs to a specific class across the whole sequence and the requests are reasonably correlated because they are generated by the same entity, it sounds sensible to hypothesize that quantum entanglement is capable of capturing and exposing the intrinsic correlation within each session.
The probabilities of the two classes, estimated by the neural network, can be used to build a quantum entangled representation of all subsequent requests in a session. The multi-layer perceptron classifier does not capture temporal information; here we use it to assign the likelihood of each individual sample to belong to either class. As the request order in each sequence is preserved to reflect the web navigation pattern, QEMC deals with correlation by means of entanglement.
As expressed by (1), given the probabilities of the $i$-th observation in the sequence of length $T$, it can be linked to the two basis states $|0\rangle$ and $|1\rangle$, hence it is possible to compute $\alpha_i$ and $\beta_i$ as

$$\alpha_i = \sqrt{p_{0i}}, \qquad \beta_i = \sqrt{p_{1i}} \tag{10}$$

and then create the $T$-qubit separable states $|\psi_0\rangle$ and $|\psi_1\rangle$, according to (3), from

$$\begin{aligned} |\psi_0\rangle &= \alpha\,|00\ldots0\rangle = \alpha_0|0\rangle \otimes \alpha_1|0\rangle \otimes \ldots \otimes \alpha_{T-1}|0\rangle \\ |\psi_1\rangle &= \beta\,|11\ldots1\rangle = \beta_0|1\rangle \otimes \beta_1|1\rangle \otimes \ldots \otimes \beta_{T-1}|1\rangle \end{aligned} \tag{11}$$

The entangled state represented by a stream of $n$ requests can be then expressed as the superposition of the two states from (11):

$$|\psi\rangle = |\psi_0\rangle + |\psi_1\rangle \tag{12}$$

In order to tell whether the current sample is due to a bot or a human, it is necessary to measure, from the entangled state $|\psi\rangle$ by means of (4) or (6), the probabilities of the basis states $|0\rangle$ and $|1\rangle$ and compare those measurements against a properly tuned threshold C to take a decision, if enough information is contained in the given $|\psi\rangle$.
If no decision can be taken at the current step, then a new $|\psi\rangle$ is computed by adding another observation until one of the measures meets the threshold or the session ends, thus leaving the output as undecided.
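The decision procedure described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes that measuring $|\psi\rangle$ through (4) or (6), which are not reproduced in this section, amounts to normalizing the squared amplitudes of the states built in (11), and it applies the peep window of Section V-A. The function names are hypothetical, and the per-class layout also covers more than two classes.

```python
import math
from typing import List, Optional, Sequence, Tuple


def class_probabilities(window: Sequence[Sequence[float]]) -> List[float]:
    """Normalized squared amplitudes of the states |kk...k> for one window.

    window[i][k] is the stage-1 estimate p_ki for observation i and class k.
    With alpha_i = sqrt(p_ki) as in (10), the squared amplitude of class k is
    the product of the p_ki, computed in log space for numerical stability.
    """
    n_classes = len(window[0])
    log_squared = []
    for k in range(n_classes):
        probs = [obs[k] for obs in window]
        if any(p <= 0.0 for p in probs):
            log_squared.append(-math.inf)    # class k ruled out by one sample
        else:
            log_squared.append(sum(math.log(p) for p in probs))
    top = max(log_squared)
    if top == -math.inf:                     # degenerate input: no information
        return [1.0 / n_classes] * n_classes
    weights = [math.exp(v - top) for v in log_squared]
    total = sum(weights)
    return [w / total for w in weights]


def early_decision(
    session: Sequence[Sequence[float]], threshold: float = 0.95, peep: int = 4
) -> Tuple[Optional[int], int]:
    """Return (label, step) at the first step where a class meets the threshold,
    or (None, len(session)) when the session ends undecided."""
    for step in range(1, len(session) + 1):
        window = session[max(0, step - peep):step]   # last `peep` observations
        probs = class_probabilities(window)
        best = max(range(len(probs)), key=probs.__getitem__)
        if probs[best] >= threshold:
            return best, step
    return None, len(session)


# Hypothetical stage-1 outputs (p_0i, p_1i) for a three-request session.
label, step = early_decision([(0.3, 0.7), (0.2, 0.8), (0.1, 0.9)], threshold=0.95)
```

In this example the bot class (label 1) reaches the threshold only at the third request, while a session made of uninformative estimates such as (0.5, 0.5) ends as undecided.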
Finally, a variation of the approach described above has been tested by computing the probability amplitudes, as of (10), by means of $\alpha_i^{grade}$, $grade \in \mathbb{R}^{+}$. Even if $\alpha_i^{grade}$ cannot be considered a probability amplitude anymore, this option adds a degree of freedom to the proposed quantum-inspired algorithm, acting like a fuzziness index, that is helpful to improve the classification results and tune the output of the classifier, for instance to reduce the number of unclassified sessions. Moreover, when $grade = 0.5$, the solution is equivalent to the formal theory.

IX. EXPERIMENTAL RESULTS AND DISCUSSION

A. The Test Scenarios
The effectiveness of the proposed method can be demonstrated with respect to the most representative performance metrics for the analyzed dataset and it is helpful to compare the algorithm with one that shows optimal results on the same problem.
In this regard, the Sequential Probability Ratio Test from Wald [14] has been compared with QEMC on the same probabilities estimated by the training of stage 1. Presently, to the best of our knowledge, SPRT, proposed in [7], outperforms all other state-of-the-art approaches.
The main focus of the present work is not necessarily showing that the new approach outperforms the best state-of-the-art methods, but proving the effectiveness of a new paradigm that exploits quantum properties to take timely and reliable decisions.
The implemented two-stage model was beneficial to support the deployment of both the Sequential Probability Ratio Test and the Quantum-inspired Entangled Multinomial Classifier along with the synoptical comparison of the respective results.
Three scenarios have been chosen to fairly and extensively compare the proposed and the reference approaches and possibly highlight any weaknesses in the new method. The reported results were computed as the average over more than two hundred runs to provide a reliable assessment of our algorithm.
The number of sessions used for training has been reduced down to 30% of the entire dataset and the peep and grade hyper-parameters are relevant only to the quantum-inspired approach. Moreover, as the SPRT algorithm has been implemented in the logarithmic form [7], the thresholds reported in Table VIII are actually converted into their log. The Validation column represents the portion of the dataset set apart for algorithm performance assessment. It is worth noting that the peep mechanism, though required to limit the computational effort, is a disadvantage for the QEMC algorithm because it bounds the method’s memory.

Fig. 1. Scenario A - accuracy vs grade: classification accuracy at increasing values of grade.

For each scenario described in Table VIII, our aims are the minimization of the number of requests analyzed to make a decision and of the number of unclassified sessions, along with the maximization of classification accuracy, hence the same performance indicators have been considered:
• LDS: length of the decision sequence; the shorter the better,
• ACC: accuracy of classification, defined as the total number of correct assignments divided by the total number of sessions; the higher the better,
• TOTUC: total number of unclassified sessions left; to be minimized.
Pareto front plots have been generated because we need to optimize more than one objective function simultaneously at the time of decision making. These are contrasting goals because we would like to maximize accuracy whereas the length of the decision sequence and the number of unclassified sessions should be minimized. This implies that no single solution exists that can optimize all objectives, but every nondominated solution is Pareto-optimal and represents an acceptable solution for the problem. A solution is said to be nondominated if any improvement on an objective function implies a downgrade on the other ones. For the analysis of our results, only the solutions at the extremes of the value range have been considered.

B. Scenario A
This scenario has been set up to assess the impact of different values of grade on the performance indicators.
The decision thresholds have been set to fixed values, identified as optimal by means of Pareto analysis, and 50% of the available sessions have been set aside for model validation.
The SPRT classification ends with ACC equal to 0.9422, leaving only 4 unclassified sessions and using 3 steps for LDS. As for QEMC, different values of grade have been tested, as shown in figure 1 but, according to the Pareto frontier plot in figure 2, the optimal points to consider for the comparison with SPRT correspond to grade 0.4, which maximizes the accuracy, and 2.6, which minimizes the length of the decision sequence to the same value as SPRT.
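The role of grade can be illustrated with a small sketch. It assumes that the modified amplitude is the class probability raised to the power grade (so that grade = 0.5 reproduces the square root of (10)) and that the measurement normalizes the squared amplitudes; both are assumptions made for illustration, and the function name is hypothetical.

```python
import math
from typing import List, Sequence


def graded_class_probabilities(
    window: Sequence[Sequence[float]], grade: float = 0.5
) -> List[float]:
    """Class scores for one window with amplitudes p_ki ** grade.

    window[i][k] is the stage-1 estimate for observation i and class k.
    grade = 0.5 gives the formal amplitudes sqrt(p_ki); other values only
    produce a score, not a proper probability amplitude.
    """
    n_classes = len(window[0])
    weights = []
    for k in range(n_classes):
        amplitude = math.prod(obs[k] ** grade for obs in window)
        weights.append(amplitude ** 2)
    total = sum(weights)
    if total == 0.0:                      # degenerate input: no information
        return [1.0 / n_classes] * n_classes
    return [w / total for w in weights]


# Hypothetical stage-1 outputs for two consecutive requests.
window = [(0.3, 0.7), (0.2, 0.8)]
flat = graded_class_probabilities(window, grade=0.2)
formal = graded_class_probabilities(window, grade=0.5)
sharp = graded_class_probabilities(window, grade=2.6)
```

Under these assumptions a grade above 0.5 sharpens the scores, so the threshold is met in fewer steps, whereas a grade below 0.5 flattens them and postpones the decision. This is in line with the trade-off between LDS and accuracy reported for grade 2.6 and grade 0.4.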


TABLE VIII
EXPERIMENTAL SCENARIOS

Fig. 2. Scenario A - Pareto front analysis: classification accuracy versus length of the decision sequence at variable values of grade.
Fig. 3. Accuracy vs decision step: classification accuracy achieved versus the number of requests analyzed to make a decision.

At grade 0.4, the ACC value is 0.9585, the highest for the setting, but the number of unclassified sessions is 50, which is extremely high compared to SPRT, and LDS is 10. Conversely, at grade 2.6, accuracy is only slightly lower than in the previous case (ACC = 0.9512, Δ = 0.0073) but LDS is exactly the same as in SPRT and the number of unclassified sessions drops to zero. Nevertheless, in both cases, classification accuracy is greater than in SPRT (worst case Δ = 0.009).

C. Scenario B
This scenario evaluates the classification results with regard to variable threshold values on 70% of sessions used for validation with grade set at 0.5, which is the default value for QEMC.
The best results for SPRT are achieved with lower and upper thresholds set to the logarithm of 0.1 and 0.85 respectively; in this configuration, ACC is 0.9205, TUC is 8 and LDS is 3.
The metrics for QEMC at the best thresholds for SPRT are slightly better in accuracy (0.9302), which means that the overall number of correctly classified sessions is greater, but it might take longer to take a decision (LDS = 5), even if in both cases 90% of sessions are classified at the first step, and the number of unclassified sessions is almost double (TUC = 15).
For the current setting, figure 3 visualizes the rate of correctly classified sessions for the two methods: SPRT identifies a greater percentage at the first two requests but no great improvement is achieved on the third and last step. Conversely, QEMC takes over at the third request and the overall performance is nearly 1% better than SPRT.
The best threshold pair for QEMC is 0.25 for the lower and 0.95 for the upper threshold where, despite even greater values of LDS (7) and TUC (23), the accuracy sensibly rises to 0.9527 (Δ = 0.0322 versus best SPRT) and 90% of sessions are classified within the second step. For these threshold values, the accuracy of SPRT is slightly lower (0.9204) than the best case, but the number of unclassified sessions decreases to 4 while maintaining the same LDS value.

D. Scenario C
The third scenario compares the performance indicators when varying grade in the threshold setting that is best for SPRT and with peep = 6, which should improve the accuracy of QEMC by considering more samples in the decision process. Even in this case the results for SPRT are ACC = 0.9205, TUC = 8 and LDS = 3, because the peep mechanism only applies to QEMC, which conversely improves its classification performance depending on the Pareto optimal values of grade.
The optimal value to maximize accuracy is 0.2, as shown in figure 5, where accuracy is 0.9589, a bit higher (Δ = 0.0004) than in Scenario A with peep at 4, showing that it is possible to achieve better classification rates by considering more samples. This is paid for in terms of LDS, which grows to 15, TUC, which spikes to 91, and the number of steps required to classify 90% of the sessions, which becomes 3.
On the other side, the optimal value of grade to minimize LDS is 2.4, which not only requires at most 2 samples to take a reliable decision but also brings the total number of unclassified sessions to zero. The good point here is that accuracy is only $10^{-4}$ worse than for SPRT, with only 1 request needed to classify 90% of the sessions in both cases.
The three scenarios proposed above are representative of the various combinations of post-training hyper-parameters and expose both the pros and cons of the novel quantum-inspired approach.
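The Pareto analysis used throughout the scenarios can be sketched with a simple nondominated filter over the three indicators (ACC to be maximized, LDS and TUC to be minimized). The helper names are hypothetical; the three points are the Scenario A figures quoted in the text, used here only as an illustration, since the paper applies the analysis to the QEMC configurations of the grid search.

```python
from typing import Dict, Tuple

Point = Tuple[float, int, int]   # (ACC, LDS, TUC)


def dominates(a: Point, b: Point) -> bool:
    """True if a is no worse than b on every objective and better on one."""
    no_worse = a[0] >= b[0] and a[1] <= b[1] and a[2] <= b[2]
    better = a[0] > b[0] or a[1] < b[1] or a[2] < b[2]
    return no_worse and better


def pareto_front(points: Dict[str, Point]) -> Dict[str, Point]:
    """Keep only the configurations that no other configuration dominates."""
    return {
        name: p
        for name, p in points.items()
        if not any(dominates(q, p) for q in points.values())
    }


# Scenario A values reported in the text.
scenario_a = {
    "SPRT": (0.9422, 3, 4),
    "QEMC grade=0.4": (0.9585, 10, 50),
    "QEMC grade=2.6": (0.9512, 3, 0),
}
front = pareto_front(scenario_a)
```

With these figures the two QEMC settings are mutually nondominated (one wins on accuracy, the other on LDS and TUC), while the SPRT point is dominated by QEMC at grade 2.6.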

Fig. 4. Scenario C - accuracy vs grade: classification accuracy at increasing values of grade.
Fig. 5. Scenario C - Pareto front analysis: classification accuracy versus length of the decision sequence at variable values of grade.

Classification accuracy for QEMC can be sensibly boosted by properly selecting the peep and grade values, at the same threshold conditions, by means of Pareto analysis. Moreover, the same parameters can be used to achieve particular objectives, such as zero unclassified sessions or a shorter decision sequence, while preserving the performance indicators that, in the worst case, are fairly equal. In fact, by tuning peep and grade, it is possible to increase the convergence speed of the classification algorithm and reduce the number of requests needed to take a decision to even fewer than SPRT.
It is worth noting that a reduction in the training size of the dataset has a smaller impact on classification accuracy for QEMC than for SPRT, in the same setting: experimental evidence shows that, with a validation ratio of 50%, accuracy is 0.9573 for QEMC and 0.9421 for SPRT whereas, with 30% of the sessions used for training, the relevant values are 0.9535 and 0.9204 respectively. Hence $\Delta_{QEMC} = 0.0038$ and $\Delta_{SPRT} = 0.0217$, the latter being nearly 6 times greater than the former.
Another important consideration is related to the peep value: the adoption of such a mechanism is imposed by the computational performance downgrade on long sequences when the decision process needs to consider many requests to meet the desired confidence level. However, regardless of the length of a session, the number of samples that have to be taken into account, as shown in sections IV and IX, is often limited to 4 to 6 samples. Greater peep values do not bring any benefit to the classification performance but increase the computational effort.
Finally, while SPRT is designed as a binary classifier and requires a modified approach to be applied in a multi-class problem, the QEMC method is natively suited for multinomial problems by simply expanding the orthonormal basis through the addition of further basis states.

X. CONCLUSION
In the present paper, we analyzed the general structure of a temporal sequence of data and pointed out the benefits of real time classification of non-stationary data streams, underlining its application in cyber-security with on-the-fly bot detection.
We introduced QEMC, a novel quantum-inspired multinomial classifier for early detection of significant events on time series, which has been validated in a synthetic experimental setting to confirm the motivating results obtained with its binary version applied to bot detection.
The proposed technique relies on superposition and entanglement to integrate the class probability of each individual event in the time series, estimated by an upstream stage, and produce an overall score, with reject option, capable of supporting trustful decisions even in case of a limited number of events.
Our method has been successfully compared with another effective bot detection approach, namely SPRT, and its results have been analyzed with reference to the contrasting objectives of classification accuracy, number of undecided sessions and speed of decision.
The extensive experimental studies, tested on traffic streams from an actual Polish e-commerce server, showed that SPRT is able to detect, in real time, over 90% of all bots and is especially powerful given a very limited number of observations, although it requires no minimum quantity of HTTP requests to be observed before making a decision.
Nonetheless, our innovative quantum-inspired multinomial classifier for early detection of significant events on time series can produce better overall scores and is similarly capable of supporting trustful decisions even in case of a limited number of events, both in the binary and in the multinomial setting.
The results were analyzed with reference to the contrasting objectives of classification accuracy, number of undecided sessions and speed of decision, showing that the proposed quantum-inspired algorithm, in our opinion, natively covers an area of application (non-stationary data stream classification) that so far has not yet found reliable and well-performing approaches.
This paper demonstrates the effectiveness of the proposed algorithm that, compared to other approaches, was proven to outperform not only SPRT but also, by transitive property, other very powerful state-of-the-art techniques.
Moreover, the proposed approach represents a complete real time classification framework for a critical application, such as bot detection, and can easily be integrated, as a plug-in, in a web architecture.


With regard to the methods analyzed in section II, some additional notes are worth reporting to highlight the advantages and disadvantages of current implementation of the new approach:
1) QEMC is tolerant against non-standardized numerical features, which is usually considered a compelling transformation for machine learning tasks;
2) with QEMC, it is possible to dramatically reduce the number of training sequences with no significant decrease of classification scores;
3) in current configuration of the classification framework, solutions are not interpretable, therefore some areas of application might be precluded to QEMC;
4) no estimate on reliability of decisions is currently available in QEMC;
5) dependencies on grade parameter have not yet been explored in depth, but could open the way to a fuzzy flavor of the classifier.
In our opinion, considering the interesting results achieved with this initial formulation of QEMC, the last three items represent interesting areas of investigation, where near future research should be directed. We also believe that the proposed algorithm might open the way to new approaches for time series prediction and clustering, but so far we do not envisage any sensible evolution.
Replacement of the ANN with explainable ways to compute the probability estimates of observations might also open new perspectives for the quantum-inspired technique, especially if accompanied by a measure of decision reliability.

ACKNOWLEDGMENT
The authors would like to thank Paolo Solinas for his precious support in reviewing some technical aspects of quantum theory.

REFERENCES
[1] T. Santos and R. Kern, “A literature survey of early time series classification and deep learning,” in Proc. SAMI iKNOW, 2016, pp. 1–7.
[2] Z. Xing, J. Pei, and E. Keogh, “A brief survey on sequence classification,” ACM SIGKDD Explorations Newslett., vol. 12, no. 1, pp. 40–48, Jun. 2010.
[3] A. Hassaïne and S. Al-Maadeed, “An online signature verification system for forgery and disguise detection,” in Proc. Neural Inf. Process., vol. 7666. Berlin, Germany: Springer, 2012, pp. 552–559.
[10] G. P. Zhang, “Time series forecasting using a hybrid ARIMA and neural network model,” Neurocomputing, vol. 50, pp. 159–175, Jan. 2003.
[11] S. Laxman and P. S. Sastry, “A survey of temporal data mining,” Sadhana, vol. 31, no. 2, pp. 173–198, Apr. 2006.
[12] G. Peskir and A. N. Shiriaev, Optimal Stopping and Free-Boundary Problems (Lectures in Mathematics ETH Zürich). Boston, MA, USA: Birkhäuser-Verlag, 2006.
[13] N. Parrish, H. S. Anderson, M. R. Gupta, and D. Y. Hsiao, “Classifying with confidence from incomplete information,” J. Mach. Learn. Res., vol. 14, no. 1, pp. 3561–3589, Dec. 2013.
[14] A. Wald, “Sequential tests of statistical hypotheses,” Ann. Math. Statist., vol. 16, no. 2, pp. 117–186, Jun. 1945.
[15] D. Doran and S. S. Gokhale, “An integrated method for real time and offline web robot detection,” Expert Syst., vol. 33, no. 6, pp. 592–606, Dec. 2016.
[16] F. Biagini and M. Campanino, “Discrete time Markov chains,” in Elements Probability and Statistics, vol. 98. Cham, Switzerland: Springer, 2016, pp. 81–87.
[17] N. Privault, Understanding Markov Chains: Examples and Applications (Springer Undergraduate Mathematics Series), 2nd ed. Singapore: Springer, 2018.
[18] Z. Xing, J. Pei, and P. S. Yu, “Early classification on time series,” Knowl. Inf. Syst., vol. 31, no. 1, pp. 105–127, Apr. 2012.
[19] N. Hatami and C. Chira, “Classifiers with a reject option for early time-series classification,” Dec. 2013, arXiv:1312.3989.
[20] Z. Xing, J. Pei, and P. S. Yu, “Early prediction on time series: A nearest neighbor approach,” in Proc. 21st Int. Conf. Artif. Intell. (IJCAI), 2009, pp. 1297–1302.
[21] H. Anderson, N. Parrish, and M. R. Gupta, “Early time-series classification with reliability guarantee,” Sandia National Lab, Albuquerque, NM, USA, Tech. Rep. SAND2012-7379C 480398, 2012.
[22] Z. Bankó and J. Abonyi, “Correlation based dynamic time warping,” in Proc. 8th Int. Symp. Hung. Researchers Comput. Intell. Inf., 2007.
[23] Z. Bankó and J. Abonyi, “Correlation based dynamic time warping of multivariate time series,” Expert Syst. Appl., vol. 39, no. 17, pp. 12814–12823, Dec. 2012.
[24] A. Dachraoui, A. Bondu, and A. Cornuéjols, “Early classification of time series as a non myopic sequential decision making problem,” in Machine Learning and Knowledge Discovery in Databases, vol. 9284. Cham, Switzerland: Springer, 2015, pp. 433–447.
[25] M. F. Ghalwash, D. Ramljak, and Z. Obradović, “Early classification of multivariate time series using a hybrid HMM/SVM model,” in Proc. IEEE Int. Conf. Bioinform. Biomed., Philadelphia, PA, USA, Oct. 2012, pp. 1–6.
[26] L. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE, vol. 77, no. 2, pp. 257–286, Feb. 1989.
[27] T. Hastie, R. Tibshirani, and J. H. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Springer Series in Statistics), 2nd ed. New York, NY, USA: Springer, 2009.
[28] M. F. Ghalwash and Z. Obradovic, “Early classification of multivariate temporal observations by extraction of interpretable shapelets,” BMC Bioinf., vol. 13, no. 1, p. 195, Dec. 2012.
[29] G. He, Y. Duan, R. Peng, X. Jing, T. Qian, and L. Wang, “Early classification on multivariate time series,” Neurocomputing, vol. 149, pp. 777–787, Feb. 2015.
[30] G. Sergioli, R. Giuntini, and H. Freytes, “A new quantum approach to binary classification,” PLoS ONE, vol. 14, no. 5, May 2019,
[4] A. Oleff, B. Küster, M. Stonis, and L. Overmeyer, “Process monitoring Art. no. e0216224.
for material extrusion additive manufacturing: A state-of-the-art review,” [31] C. W. Helstrom, “Quantum detection and estimation theory,” J. Statist.
Prog. Additive Manuf., vol. 6, no. 4, pp. 705–730, May 2021. Phys., vol. 1, no. 2, pp. 231–252, 1969.
[5] S. Rovetta, A. Cabri, F. Masulli, and G. Suchacka, “Bot or not? A case [32] R. S. Olson, W. L. Cava, P. Orzechowski, R. J. Urbanowicz, and J.
study on bot recognition from web session logs,” in Quantifying Process- H. Moore, “PMLB: A large benchmark suite for machine learning
ing Biomedical and Behavioral Signals, vol. 103. Cham, Switzerland: evaluation and comparison,” Mar. 2017, arXiv:1703.00512.
Springer, 2019, pp. 197–206. [33] P. Tiwari and M. Melucci, “Towards a quantum-inspired binary classi-
[6] U. Mori, A. Mendiburu, E. Keogh, and J. A. Lozano, “Reliable early fier,” IEEE Access, vol. 7, pp. 42354–42372, 2019.
classification of time series based on discriminating the classes over [34] E. Rieffel and W. Polak, Quantum Computing: A Gentle Introduc-
time,” Data Mining Knowl. Discovery, vol. 31, no. 1, pp. 233–263, tion (Scientific and Engineering Computation). Cambridge, MA, USA:
Jan. 2017. MIT Press, 2011.
[7] G. Suchacka, A. Cabri, S. Rovetta, and F. Masulli, “Efficient on- [35] V. Moret-Bonillo, “Can artificial intelligence benefit from quantum
the-fly web bot detection,” Knowl.-Based Syst., vol. 223, Jul. 2021, computing?” Prog. Artif. Intell., vol. 3, no. 2, pp. 89–105, Mar. 2015.
Art. no. 107074. [36] A. Ekert, P. M. Hayden, and H. Inamori, “Basic concepts in quantum
[8] P. J. Brockwell and R. A. Davis, Eds., Introduction to Time Series and computation,” in Coherent Atomic Matter Waves, vol. 72, R. Kaiser,
Forecasting (Springer Texts in Statistics). New York, NY, USA: Springer, C. Westbrook, and F. David, Eds. Berlin, Germany: Springer, 2001,
2002. pp. 661–701.
[9] R. Adhikari and R. K. Agrawal, “An introductory study on time series [37] E. G. Rieffel and W. Polak, “An introduction to quantum computing for
modeling and forecasting,” Feb. 2013, arXiv:1302.6613. non-physicists,” Jan. 1998, arXiv:quant-ph/9809016.

[38] E. B. Guedes, F. M. de Assis, and R. A. C. Medeiros, “Fundamentals of quantum information processing,” in Quantum Zero-Error Information Theory. Cham, Switzerland: Springer, 2016, pp. 7–26.
[39] G. Van Rossum and F. L. Drake, Jr., Python Reference Manual. Amsterdam, The Netherlands: Centrum voor Wiskunde en Informatica Amsterdam, 1995.
[40] C. R. Harris et al., “Array programming with NumPy,” Nature, vol. 585, no. 7825, pp. 357–362, Sep. 2020.
[41] J. D. Hunter, “Matplotlib: A 2D graphics environment,” Comput. Sci. Eng., vol. 9, no. 3, pp. 90–95, May 2007.
[42] F. Pedregosa et al., “Scikit-learn: Machine learning in Python,” J. Mach. Learn. Res., vol. 12, pp. 2825–2830, Oct. 2012.
[43] W. McKinney, “Data structures for statistical computing in Python,” in Proc. 9th Python Sci. Conf., Austin, TX, USA, vol. 445, 2010, pp. 51–56.
[44] V. Pareto, Manuel d’économie Politique. Geneva, Switzerland: Librairie Droz, 1981.
[45] G. Suchacka, “Analysis of aggregated bot and human traffic on e-commerce site,” in Proc. Conf. Comput. Sci. Inf. Syst., Sep. 2014, pp. 1123–1130.
[46] A. Cabri, G. Suchacka, S. Rovetta, and F. Masulli, “Online web bot detection using a sequential classification approach,” in Proc. IEEE 20th Int. Conf. High Perform. Comput. Commun., IEEE 16th Int. Conf. Smart City, IEEE 4th Int. Conf. Data Sci. Syst. (HPCC/SmartCity/DSS), Jun. 2018, pp. 1536–1540.
[47] A. Lagopoulos, G. Tsoumakas, and G. Papadopoulos, “Web robot detection in academic publishing,” Nov. 2017, arXiv:1711.05098.
[48] A. Stassopoulou and M. D. Dikaiakos, “Web robot detection: A probabilistic reasoning approach,” Comput. Netw., vol. 53, no. 3, pp. 265–278, Feb. 2009.
[49] P.-N. Tan and V. Kumar, “Discovery of web robot sessions based on their navigational patterns,” Data Mining Knowl. Discovery, vol. 6, no. 1, pp. 9–35, 2002.
[50] I. Zeifman. (Jan. 2017). Bot Traffic Report 2016. [Online]. Available: https://www.incapsula.com/blog/bot-traffic-report-2016.html
[51] G. Buehrer, J. Stokes, K. Chellapilla, and J. Platt, “Classification of automated web traffic,” in Weaving Services and People on the World Wide Web, Berlin, Germany: Springer-Verlag, Jan. 2009.
[52] G. Suchacka and M. Sobkow, “Detection of internet robots using a Bayesian approach,” in Proc. IEEE 2nd Int. Conf. Cybern. (CYBCONF), Jun. 2015, pp. 365–370.
[53] M. Nielsen and I. Chuang, Quantum Computation and Quantum Information. Cambridge, U.K.: Cambridge Univ. Press, 2010, p. 96.
[54] D. Emmanoulopoulos and S. Dimoska, “Quantum machine learning in finance: Time series forecasting,” Feb. 2022, arXiv:2202.00599.
[55] P. Rebentrost, M. Mohseni, and S. Lloyd, “Quantum support vector machine for big data classification,” Phys. Rev. Lett., vol. 113, no. 13, Sep. 2014, Art. no. 130503.
[56] S. Lloyd, M. Mohseni, and P. Rebentrost, “Quantum principal component analysis,” Nature Phys., vol. 10, no. 9, pp. 631–633, Jul. 2014.
[57] S. Yarkoni, A. Kleshchonok, Y. Dzerin, F. Neukart, and M. Hilbert, Semi-Supervised Time Series Classification Method for Quantum Computing (Quantum Machine Intelligence), New York, NY, USA: Springer, Apr. 2021.

Alberto Cabri received the degree in electronic engineering from the University of Genoa, Italy, in 1992, and the Ph.D. degree in computer science and systems engineering. He is currently a qualified Teacher of computer science with the Public Secondary Schools, Genoa, Italy. He is also a Professional Engineer with the University of Genoa. His research focuses on machine learning and he has developed an innovative quantum-inspired algorithm for multivariate time series classification.

Francesco Masulli (Senior Member, IEEE) is currently a Full Professor of computer science with the University of Genoa, Italy, and an Adjunct Professor with Temple University, Philadelphia, PA, USA. He held visiting positions at Radboud University, Nijmegen, The Netherlands; the International Computer Science Institute, Berkeley, CA, USA; and the I3S Laboratory, University of Nice Sophia Antipolis, France. He is the author of more than 250 papers in machine learning, neural networks, clustering, fuzzy systems, and their applications. He serves as the Chair for IEEE Italy Section Computational Intelligence Society Chapter.

Stefano Rovetta (Senior Member, IEEE) is currently an Associate Professor of computer science with the University of Genova, Italy. He has authored more than 170 scientific articles in machine learning, neural networks, clustering, fuzzy systems, and bioinformatics. He is a member of the Italian Neural Network Society, the European Neural Network Society, and the European Society for Fuzzy Logic and Technology. He received the 2008 Pattern Recognition Society Award. He was the chair of international conferences.

Grażyna Suchacka (Senior Member, IEEE) received the M.Sc. degree in computer science, the M.Sc. degree in management, and the Ph.D. degree (Hons.) in computer science from the Wrocław University of Science and Technology, Poland. She is currently an Assistant Professor with the Institute of Informatics, University of Opole, Poland. Her research interests include data analysis and modeling, data mining, machine learning, and quality of web service with special regard to bot detection and electronic commerce support.

