DAC: Deep Autoencoder-Based Clustering, A General Deep Learning Framework of Representation Learning
1 Introduction
Clustering is the task of grouping samples such that the ones in the same group are more similar to each other than to the ones in other groups. Nowadays, clustering serves as a basic and essential pre-processing step in many real-world applications. For example, it can help with fake news identification [6], document analysis [16], marketing and sales, etc. Specifically, clustering algorithms can extract useful information for these applications by grouping data according to a variety of similarity metrics and grouping schemes. For example, similar patches can be used for image denoising [1–3] or depth enhancement [9], and clustering can be used to find good similar patches [8].
To properly assign samples to different groups (called clusters), meaningful feature values of the samples need to be obtained first. However, in real-world applications, the data we get is often high-dimensional [5] and usually contains noise, making clustering difficult. For example, in the MNIST dataset [7], each input hand-written digit image has 784 pixels. While we know that some pixels (e.g. the ones at image corners) might not be as useful as others (e.g. the ones around image centers), it is difficult to manually distinguish them for clustering.
Traditional dimensionality reduction algorithms, namely Principal Component Analysis (PCA) [10], Linear Discriminant Analysis (LDA) [4], and Canonical Correlation Analysis (CCA) [13], can be used to reduce the number of features. In addition, feature selection algorithms can be used to select a set of useful and noiseless values from the original features. These algorithms aim to extract the core information from redundant and correlated high-dimensional input features. However, they often fail for two main reasons. First, most of them require complex mathematical analysis, which is both difficult and time consuming. Second, there is no single approach that works for all types of datasets. Different datasets can have different dimensions and data sizes, and might even be used in totally different applications; some datasets are linear and some are non-linear. As a result, it is difficult to find an approach that generally works on all types of datasets.
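As a point of reference, such a linear baseline takes only a few lines with scikit-learn. This is a sketch, not part of our pipeline; `X` is a placeholder for an (n_samples, 784) feature matrix.

```python
from sklearn.decomposition import PCA

pca = PCA(n_components=10)        # keep the 10 leading principal components
X_reduced = pca.fit_transform(X)  # X: (n_samples, 784) raw feature matrix
```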
Recently, owing to the emergence of powerful deep neural networks, deep learning-based approaches have been introduced to learn better data representations and achieve appealing performance improvements for clustering algorithms. One simple approach is to learn representations using deep autoencoders. Specifically, the original high-dimensional input features are fed into an encoder that generates a low-dimensional output. This output is further fed to a decoder that tries to recover the raw input data as well as possible. However, most existing approaches [11, 15] use images as input and thus rely on convolutional neural networks.
In this paper, we propose Deep Autoencoder-based Clustering (DAC), a simple but more general framework for representation learning that takes feature vectors as input. Thus, our approach can be applied to more general datasets. In addition, we propose a scheme to adaptively weight all input features according to the group labels, and we combine this estimated weight with the loss function during training. Experimental results show that our approach can effectively improve the performance of the K-Means clustering algorithm on different types of datasets, namely MNIST, Fashion-MNIST [14], as well as the Human Activities and Postural Transitions (HAPT) dataset [12].
The rest of the paper is organized as follows: in section 2 we give an overview of our deep autoencoder-based clustering. We then describe the deep autoencoder for representation learning in more detail in section 3. We finally show experimental results in section 4 and conclude in section 5.
Fig. 1. Overview of our Deep Autoencoder-based Clustering on the MNIST dataset. The autoencoder (consisting of an encoder and a decoder) tries to encode and decode the input features such that the decoded output is as close to the input as possible. The input size is 28×28 = 784; the size of the learned low-dimensional representation is 10. In the testing stage, the learned encoder output is fed into the classic K-Means algorithm for clustering.
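For clarity, the test-time pipeline in Figure 1 can be summarized in a few lines. This is a minimal sketch, not the authors' released code: `encoder` stands for the trained encoder module and `X_test` for the normalized test feature matrix.

```python
import torch
from sklearn.cluster import KMeans

def cluster_with_encoder(encoder, X_test, n_clusters=10):
    """Encode the raw features, then run classic K-Means on the learned codes."""
    encoder.eval()
    with torch.no_grad():
        codes = encoder(torch.as_tensor(X_test, dtype=torch.float32))
    return KMeans(n_clusters=n_clusters, n_init=10).fit_predict(codes.numpy())
```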
3.1 Encoder
The encoder aims to encode or compress the input data into a smaller representation while preserving as much key information as possible. As shown in Figure 1, the encoder consists of 8 layers, including the input layer and the learned representation output layer. The input layer is normalized such that all its values lie in the range of (0, 1). Specifically, starting from the input, each larger layer is fully connected to the next smaller layer, followed by a couple of activation layers.
There are mainly two types of activation layers, ReLU and Tanh, as shown in Equations 1 and 2. Adding the ReLU layers introduces non-linearity to our model, making it more robust to non-linear input data. The Tanh layer, on the other hand, transforms the data into a normalized range of (−1, 1) to alleviate the gradient vanishing/exploding problem.
$$\mathrm{ReLU}(x) = \max(0, x) \quad (1)$$

$$\tanh(x) = \frac{\sinh(x)}{\cosh(x)} = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}, \quad \text{where} \quad \cosh(x) = \frac{e^{x} + e^{-x}}{2}, \;\; \sinh(x) = \frac{e^{x} - e^{-x}}{2} \quad (2)$$
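As a concrete illustration, a PyTorch sketch of such an encoder is given below. The 784-to-10 input/output sizes follow Figure 1, but the hidden widths (256, 64) and the exact layer count are our assumptions, since the precise dimensions are only given in the figure.

```python
import torch.nn as nn

class Encoder(nn.Module):
    """Sketch of the encoder: each fully connected layer maps to a smaller
    one and is followed by ReLU and Tanh activations, per the text above."""
    def __init__(self, in_dim=784, code_dim=10):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 256), nn.ReLU(), nn.Tanh(),
            nn.Linear(256, 64), nn.ReLU(), nn.Tanh(),
            nn.Linear(64, code_dim),  # learned low-dimensional representation
        )

    def forward(self, x):
        return self.net(x)
```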
3.2 Decoder
The decoder aims to decode or decompress the encoded output to reconstruct the original input data as well as possible. It contains nine layers, including the input layer, which is the output of the encoder, and the final output layer. Specifically, each smaller layer is fully connected to the next larger layer, followed by a Tanh activation layer. In addition, the decoder has a Sigmoid activation layer (shown in Equation 3) at the final stage to enforce that the output values lie in the range of (0, 1).
$$\mathrm{Sigmoid}(x) = \frac{1}{1 + e^{-x}} \quad (3)$$
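A matching decoder sketch follows, mirroring the encoder with Tanh activations and a final Sigmoid; the hidden widths are again our assumptions.

```python
import torch.nn as nn

class Decoder(nn.Module):
    """Sketch of the decoder: each fully connected layer maps to a larger one
    followed by Tanh, with a final Sigmoid so outputs lie in (0, 1)."""
    def __init__(self, code_dim=10, out_dim=784):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(code_dim, 64), nn.Tanh(),
            nn.Linear(64, 256), nn.Tanh(),
            nn.Linear(256, out_dim), nn.Sigmoid(),  # reconstruction in (0, 1)
        )

    def forward(self, z):
        return self.net(z)
```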
Clustering Weight Intuitively, the weight of an input feature should be large if both of the following conditions are met. First, the sampled values of that feature within the same group/cluster have small differences. Second, the sampled values across different groups/clusters have large differences. Thus, the weight is computed as:
$$w_i = \frac{\sum_{l_p = l_q} e^{-(x_{ip} - x_{iq})^2}}{\sum_{l_p = l_q} 1} \cdot \frac{\sum_{l_p \neq l_q} \left(1 - e^{-(x_{ip} - x_{iq})^2}\right)}{\sum_{l_p \neq l_q} 1} \quad (5)$$
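A direct NumPy rendering of Equation 5 is sketched below, with one assumption flagged in the code: self-pairs (p = q) are excluded from the same-label average, which the equation leaves implicit.

```python
import numpy as np

def clustering_weights(X, labels):
    """Sketch of Equation 5 for an (n_samples, n_features) matrix X and a
    1-D integer label array `labels`; returns one weight per feature."""
    n, d = X.shape
    same = labels[:, None] == labels[None, :]  # (n, n) mask where l_p == l_q
    diff = ~same                               # mask where l_p != l_q
    np.fill_diagonal(same, False)              # drop p == q pairs (assumption)
    w = np.empty(d)
    for i in range(d):
        # pairwise affinity e^{-(x_ip - x_iq)^2} for feature i
        a = np.exp(-(X[:, i][:, None] - X[:, i][None, :]) ** 2)
        w[i] = a[same].mean() * (1.0 - a[diff]).mean()
    return w
```

On MNIST, `X` would hold the 1000 sampled training images flattened to 784-D vectors and `labels` their digit classes, producing the weight map of Figure 2.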
Fig. 2. A map of the clustering weights computed for the MNIST dataset using 1000 samples from the training set. Pixels at boundaries and corners are less important than the ones around image centers.
Figure 2 shows a map of the clustering weights computed for the MNIST dataset using 1000 samples from the training set. Pixels at boundaries and corners are less important than the ones around image centers and thus have smaller weights (white means larger weights).
Final Objective Function The final objective function combines the clustering-weighted MSE loss and a standard L2 norm regularization, as shown in Equation 6. Here the L2 norm regularization Lr is computed over all parameters of the autoencoder, and β is a balancing factor with a default value of 0.00001.
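Based on this description, a plausible form of the objective is the simple sum L = L_w + β·L_r; the sketch below uses our own symbol names (`l_w` for the clustering-weighted MSE, `l_r` for the regularizer), not necessarily the paper's exact notation.

```python
import torch

def dac_loss(x, x_hat, w, model, beta=1e-5):
    """Sketch of the final objective: clustering-weighted MSE plus an L2
    penalty over all autoencoder parameters. w is the 1-D tensor of
    per-feature weights from Equation 5."""
    l_w = torch.mean(w * (x - x_hat) ** 2)                 # weighted MSE
    l_r = sum(p.pow(2).sum() for p in model.parameters())  # L2 regularization Lr
    return l_w + beta * l_r
```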
4 Experimental Results
4.1 Dataset
We evaluate our approach on the classic MNIST hand-written digits dataset. This dataset has 50,000 images in the training set and 10,000 images in the testing set, covering 10 groups in total. We show some samples of the MNIST dataset in Figure 3.
To evaluate our framework, we apply our trained encoder to the testing dataset. We then compare the representations generated by our trained encoder against the raw input features by feeding both to the K-Means algorithm. To measure the performance of the clustering algorithms, we use the Adjusted Rand Index (ARI). Specifically, this metric computes a similarity between two clustering results by considering all pairs of samples and counting pairs that are assigned to the same or different clusters in the predicted and ground-truth clustering results. The proposed approach is denoted as DAC.
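ARI is available off the shelf in scikit-learn; for example:

```python
from sklearn.metrics import adjusted_rand_score

# ARI is 1.0 for identical partitions and close to 0.0 for random labelings;
# y_true and y_pred are 1-D arrays of ground-truth and predicted cluster labels.
ari = adjusted_rand_score(y_true, y_pred)
```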
We implement our framework in Python and PyTorch and test it on a desktop with an RTX 2080-Ti GPU. We train the autoencoder for 200 epochs using the Adam optimization algorithm. The initial learning rate is set to 0.003 and decreases with the number of epochs during training.
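A minimal training sketch under these settings is shown below. The exact decay schedule is not specified, so the exponential decay here is purely illustrative; `encoder`, `decoder`, `weights`, `dac_loss`, and `train_loader` refer to the earlier sketches and are placeholders.

```python
import torch
import torch.nn as nn

model = nn.Sequential(encoder, decoder)  # encoder/decoder from the sketches above
optimizer = torch.optim.Adam(model.parameters(), lr=0.003)
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.98)  # assumed decay

for epoch in range(200):
    for x in train_loader:  # batches of normalized 784-D feature vectors
        loss = dac_loss(x, model(x), weights, model)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    scheduler.step()  # shrink the learning rate as training progresses
```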
Table 1 shows the quantitative performance of the proposed approach in terms of ARI. Compared to the raw K-Means algorithm, our approach (DAC) boosts K-Means performance from 0.3477 to 0.6624, a 90.5% improvement. We also show some reconstruction results of our trained autoencoder in Figure 4, which show that it can properly reconstruct the raw input hand-written digits.
Table 1. Clustering performance (ARI) on the MNIST testing set.

        K-Means   DAC
ARI     0.3477    0.6624
Fig. 4. Sample results of our trained autoencoder on the MNIST dataset. Top: raw input images. Bottom: reconstructed images.
To test the robustness of our approach to different data types, we apply our method to two other datasets: Fashion-MNIST [14] and the Human Activities and Postural Transitions (HAPT) dataset [12].
Fashion-MNIST is a dataset similar to MNIST, with the same image format and image size. It has 60,000 training images and 10,000 testing images. The only difference is the content: it contains images of 10 types of clothes. The ten categories are shown in Table 2. We show some samples of this dataset in Figure 5.
The Human Activities and Postural Transitions dataset was captured with a smartphone's sensors [12]. The authors recorded 3-axial linear acceleration and 3-axial angular velocity at a constant rate of 50 Hz using the embedded accelerometer and gyroscope of the device, a Samsung Galaxy S II smartphone. There are 30 volunteers whose ages are in the range of 19-48 years. In the data capturing experiment, each volunteer performed one of twelve activities. There are six basic activities: three static postures (standing, sitting, lying) and three dynamic activities (walking, walking downstairs, and walking upstairs). Another six postural transitions that occur between the static postures have also been added to the dataset: stand-to-sit, sit-to-stand, sit-to-lie, lie-to-sit, stand-to-lie, and lie-to-stand. All twelve types of activities are shown in Table 3.
The sensor signals (accelerometer and gyroscope) were then denoised with noise filters. The authors then sampled the signals in fixed-width sliding windows of 2.56 sec with 50% overlap (128 readings per window), leading to a sample size of 561 features. Each sample is captured while the volunteer performs one type of activity. 70% of the volunteers were randomly selected to generate the training set and 30% were selected to generate the testing set. In total, this dataset has 7,767 samples for training and 3,162 samples for testing.
Fig. 6. Sample results of our trained autoencoder on the Fashion-MNIST dataset. Top: raw input images. Bottom: reconstructed images.
We apply our method to the Fashion-MNIST dataset and report the results in Table 4. As Fashion-MNIST is a more complex dataset, we modified the autoencoder; the modified architecture is shown in Figure 7. Compared to using the raw input features in K-Means clustering, our method boosts ARI from 0.3039 to 0.4702, an improvement of 54.7%.
We then apply our method to the HAPT dataset and report the results in Table 5. As this dataset's inputs are of lower dimension than MNIST's, we modified the autoencoder accordingly; the modified architecture is shown in Figure 8. Even on this temporal sequence dataset, our method effectively improves the K-Means algorithm's performance by 30%. These results also show that our method can be generally applied to other data types. We also show some reconstruction results of our trained autoencoder in Figure 6, which show that it can properly reconstruct the raw input fashion images.
Table 4. Clustering performance (ARI) on the Fashion-MNIST testing set.

        K-Means   DAC
ARI     0.3039    0.4702
Table 5. Clustering performance (ARI) on the HAPT testing set.

        K-Means   DAC
ARI     0.4290    0.5594
5 Conclusion
References
1. Buades, A., Coll, B., Morel, J.: A non-local algorithm for image denoising. In: IEEE Con-
ference on Computer Vision and Pattern Recognition (CVPR), vol. 2, pp. 60–65 (2005)
2. Chen, F., Zhang, L., Yu, H.: External patch prior guided internal clustering for image denois-
ing. In: IEEE International Conference on Computer Vision (ICCV), pp. 603–611 (2015)
3. Dabov, K., Foi, A., Katkovnik, V., Egiazarian, K.: Image denoising by sparse 3-d transform-
domain collaborative filtering. IEEE Transactions on Image Processing 16(8), 2080–2095
(2007)
4. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. John Wiley & Sons, New York (2001)
5. Han, J., Pei, J., Kamber, M.: Data mining: concepts and techniques. Elsevier (2011)
6. Hosseinimotlagh, S., Papalexakis, E.E.: Unsupervised content-based identification of fake
news articles with tensor decomposition ensembles. In: Proceedings of the Workshop on
Misinformation and Misbehavior Mining on the Web (MIS2) (2018)
7. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document
recognition. Proceedings of the IEEE 86(11), 2278–2324 (1998)
8. Lu, S.: Good similar patches for image denoising. In: 2019 IEEE Winter Conference on
Applications of Computer Vision (WACV), pp. 1886–1895. IEEE (2019)
9. Lu, S., Ren, X., Liu, F.: Depth enhancement via low-rank matrix completion. In: Proceedings
of the IEEE conference on computer vision and pattern recognition, pp. 3390–3397 (2014)
10. Pearson, K.: LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2(11), 559–572 (1901)
11. Pu, Y., Gan, Z., Henao, R., Yuan, X., Li, C., Stevens, A., Carin, L.: Variational autoencoder
for deep learning of images, labels and captions. Advances in neural information processing
systems 29, 2352–2360 (2016)
12. Reyes-Ortiz, J.L., Oneto, L., Samà, A., Parra, X., Anguita, D.: Transition-aware human ac-
tivity recognition using smartphones. Neurocomputing 171, 754–767 (2016)
13. Sun, Q.S., Zeng, S.G., Liu, Y., Heng, P.A., Xia, D.S.: A new method of feature fusion and its
application in image recognition. Pattern Recognition 38(12), 2437–2448 (2005)
14. Xiao, H., Rasul, K., Vollgraf, R.: Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747 (2017)
15. Yang, X., Deng, C., Zheng, F., Yan, J., Liu, W.: Deep spectral clustering using dual au-
toencoder network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and
Pattern Recognition (CVPR) (2019)
16. Zhao, Y., Karypis, G.: Evaluation of hierarchical clustering algorithms for document datasets.
In: Proceedings of the eleventh international conference on Information and knowledge man-
agement, pp. 515–524 (2002)