RST Approach For Efficient CARs Mining
RST Approach For Efficient CARs Mining
4, November 2014
34
I.
INTRODUCTION
DOI: 10.9756/BIJDM.10365
35
BACKGROUND
AB
ABC
BCD
Total Trans=4
Support({AB})=3/4=75%
Support({BC})=2/4=50%
(2)
Items
ABD
AB
ABC
BCD
Conf(AB)=3/3=100%
Conf(BD})=2/4=50%
36
represents its domain {sold, not sold}. An attribute Aj is nonbinary domain is represented by j items and =1 binary
vectors such that n is the number of attributes of the nonbinary domain. For example, for the best representation of a
customer wealth level, we associate to the attribute income
the domain constituted by 3 (j=3) items {high, medium, low}
defined as follows: a1 = {high income"}, a2 = {middle
income"} and a3 ={low income}.
B. Bitmap Representation
A relation or table uses as data source for ARM approach,
some attributes are measurable with discrete variable as some
numerical or textual values on behalf of some range. However,
the form of original data representation could be changed
exactly so that, each attribute in the new Bitmap table is an
exact value of one item in the original table, and each attribute
value should be 1 or 0, expressing if it exist there is a 1,
otherwise a 0 in the bitmap table [7].
Let be the example of table 3 where attributes representing
data are {X}, {Y} and {Z}. The attribute X has two values {A
and B} = {Account debited, Account credited}, the attribute Y
has three values {C, D and E} = {low income, high income,
middle income} and the attribute Z has two values {F, G} =
{according loan, not according loan}. There are 7 items for the
resultant Bitmap table {A, B, C, D, E, F and G}.
Table 3: Original Relation Data
Tid
1
2
3
4
5
Account
Debited
Debited
Debited
Debited
Credited
income
middle
low
middle
high
high
According Loan
yes
no
yes
yes
no
A
1
1
1
1
0
B
0
0
0
0
1
C
0
1
0
0
0
D
0
0
0
1
1
E
1
0
1
0
0
F
1
0
1
1
0
G
0
1
0
0
1
any
item(s)
any
item(s)
CAR
Same support
Same confidence
Has only single item. No item from I
appear as consequent
No class label from Y can appear as a
rule condition
III.
ROUGH SET
37
A. Approximations
As defined before, as starting point of the RST, the
indiscernibility relation is intended to express the fact that due
to the lack of knowledge, but it is unable to distinguish some
objects employing the available information. RST includes
another important concept which is Approximations. The
approximation is also associated with the meaning of the
approximations of topological operations [11].
[8].
1.
2.
3.
NEG(B)
B*
BR(B)
POS(B)
B*
38
Transaction
Class
1
2
3
4
5
6
7
Education
Education
Education
Sport
Sport
Sport
Sport
The set represents all the possible cases, the set of all
attributes denoted by A, and the set of all attribute values
denoted by V. An information table defines an information
function I: A V.
{[(, )] | , I(, ) = }
aA
Va
| () ()|
||
| () ()|
| ()|
=7
Then the support of (CR) is 2/7=28%.
The confidence of CR is the S(CR)/support ([]c)=2/2=1.
39
V.
CONCLUSION
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
40