The goal is to augment the TacNet II architecture for tactile material classification with self-supervised learning. This will be done by integrating TacNet II into the Joint-Embedding Predictive Architecture (JEPA) framework to learn representations from unlabeled data and improve classification performance. The adapted model and an autoencoder baseline will be evaluated on tactile classification tasks.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
28 views2 pages
Proposal 06
The goal is to augment the TacNet II architecture for tactile material classification with self-supervised learning. This will be done by integrating TacNet II into the Joint-Embedding Predictive Architecture (JEPA) framework to learn representations from unlabeled data and improve classification performance. The adapted model and an autoencoder baseline will be evaluated on tactile classification tasks.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2
Enhancing Tactile Material Classification with Self-
Supervised TacNet Architectures
Mohamad Marwan El Sidani Linus Eck Felix Kroll (Supervisor) Technische Universität München Technische Universität München Technische Universität München Munich, Germany Munich, Germany Munich, Germany [email protected][email protected][email protected]
I. OBJECTIVE: III. TECHNICAL OUTLINE:.
The primary goal of this project is to augment the TacNet II A. Approach: architecture, developed by Tulbure and Bäuml [1], which was initially crafted for tactile material classification, by • Architecture Initialization and Baseline Training: integrating self-supervised learning techniques. This project Initialize the TacNet II model to establish a performance baseline, focusing on understanding its integrates TacNet II within a Joint-Embedding Predictive architecture and optimizing its performance with Architecture (JEPA) to assess Assran et al.'s [2] claim on labeled tactile data. This stage lays the groundwork for JEPA's proficiency in learning robust and abstract further adaptations and comparisons. representations from unlabeled data, potentially outperforming other self-supervised learning methods in • Adaptation to Self-Supervised Learning with JEPA: tactile classification tasks. This initiative addresses the Modify TacNet II to predict tactile embeddings using significant challenge in robotics related to the costly and the JEPA framework, refining data sampling and time-consuming acquisition of labeled data [3]. masking techniques to enhance learning from contextual data. Detail the theoretical adjustments and The innovation lies in using self-supervised learning practical adaptations necessary for integrating TacNet strategies to enhance TacNet II by integrating it as an encoder II into a self-supervised learning framework. within the JEPA framework. We hypothesize this approach Incorporate the JEPA Framework’s encoder into an classification framework and fine tune it on labeled will predict tactile embeddings instead of direct sensory data. outputs, thus improving the system's ability to extract more generalizable features from materials and enhancing • Adaptation to Self-Supervised Learning with an performance in tactile material classification. Additionally, Autoencoder: Employ TacNet II as a base for this project will validate JEPA’s superiority over traditional constructing an autoencoder, training it on unlabeled autoencoders and test whether TacNet II, adapted to a self- data, then modifying it by removing the decoder and supervised learning framework, produces credible results. integrating a classification head on the encoder, which is then fine-tuned on labeled data. II. RELATED WORK: • Experimental Evaluation of Encoder Efficacy: "Baishya and Bäuml [4] along with Tulbure and Bäuml [1] Conduct experiments to compare the adapted TacNet have significantly contributed to tactile sensing for material II (as a JEPA encoder) with the autoencoder. Focus on classification with TacNet and its enhanced version, TacNet evaluating the quality and utility of the features each II, demonstrating high accuracy in interpreting complex model extracts through tactile data classification tasks, spatio-temporal signals from tactile sensors. Despite their providing insights into their practical applications and success, the reliance on extensively labeled datasets is a efficiency in classifying materials and directly testing major impediment in dynamic real-world settings [3]. each method’s adaptation to real-world applicability. B. Novel Contributions: To address the challenge of extensively labeled datasets, self- supervised learning using autoencoders is often employed • Adaptation of Self-Supervised methods to Tactile [5], allowing the network to extract features from unlabeled Data: We are developing two distinct models: a joint- data. Although autoencoders are effective for dimensionality embedding architecture (JEPA) and an autoencoder. Each model is tailored to predict tactile data features, reduction and image reconstruction, they frequently struggle contributing to the exploration of self-supervised to learn abstract, meaningful features essential for methodologies in the tactile domain. classification tasks [6]. • Benchmarking and Evaluation: Both models will Recent advances in self-supervised learning, such as Assran undergo rigorous evaluation on tactile material et al.'s I-JEPA [2], provide innovative methods for abstract classification tasks. Additionally, an investigation into representation learning from images without traditional the generalization of the learned embeddings will be labels or augmentations. While initially tailored for visual conducted. We will compare their performance against data, applying I-JEPA's principles could greatly enhance each other and the baseline TacNet II model, aiming to tactile sensing. This approach holds promise for improving highlight advancements in feature extraction, material classification using unlabeled data, potentially generalization, and classification accuracy. reducing the need for extensive labeled datasets. IV. REFERENCES: [1] A. Tulbure and B. Bäuml, "Superhuman Performance in Tactile Material Classification and Differentiation with a Flexible Pressure- Sensitive Skin," 2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids), Beijing, China, 2018, pp. 1-9 [2] M. Assran et al., "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture," 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 2023, pp. 15619-15629 [3] T. Fredriksson, D. I. Mattos, J. Bosch, and H. H. Olsson, "Data Labeling: An Empirical Investigation into Industrial Challenges and Mitigation Strategies," in Product-Focused Software Process Improvement, PROFES 2020, Lecture Notes in Computer Science, vol. 12562, M. Morisio, M. Torchiano, and A. Jedlitschka, Eds. Cham: Springer, 2020 [4] S. S. Baishya and B. Bäuml, "Robust material classification with a tactile skin using deep learning," 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Korea (South), 2016, pp. 8-15 [5] K. Berahmand, F. Daneshfar, E.S. Salehi et al. "Autoencoders and their applications in machine learning: a survey," Artif Intell Rev 57, 2024 [6] W. Guo, J. Cai and S. Wang, "Unsupervised discriminative feature representation via adversarial auto-encoder, " Appl Intell 50, 2020, pp. 1155–1171
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB