ClusterSLAM: A SLAM backend for simultaneous rigid body clustering and motion estimation

Huang, Jiahui; Yang, Sheng; Zhao, Zishuo; Lai, Yu-Kun; Hu, Shi-Min

doi:10.1007/s41095-020-0195-3

Research Article
Open access
Published: 07 January 2021

Volume 7, pages 87–101, (2021)

Computational Visual Media

Jiahui Huang¹, Sheng Yang², Zishuo Zhao¹, Yu-Kun Lai³ & Shi-Min Hu¹

Abstract

We present a practical backend for stereo visual SLAM which can simultaneously discover individual rigid bodies and compute their motions in dynamic environments. While recent factor-graph-based state optimization algorithms have shown their ability to robustly solve SLAM problems by treating dynamic objects as outliers, the motions of those dynamic objects are rarely considered. In this paper, we exploit the consensus of 3D motions among landmarks extracted from the same rigid body to cluster them, and to identify static and dynamic objects in a unified manner. Specifically, our algorithm builds a noise-aware motion affinity matrix from landmarks, and uses agglomerative clustering to distinguish rigid bodies. Using decoupled factor graph optimization to revise their shapes and trajectories, we obtain an iterative scheme that updates cluster assignments and motion estimates reciprocally. Evaluations on both synthetic scenes and KITTI demonstrate the capability of our approach, and further experiments considering online efficiency also show the effectiveness of our method for simultaneously tracking ego-motion and multiple objects.
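
The clustering idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes noise-free 3D landmark tracks, uses the spread of the distance between two landmarks over time as a stand-in for the noise-aware motion affinity, and omits the factor graph refinement entirely. The names `motion_distance`, `cluster_rigid_bodies`, and `threshold` are hypothetical.

```python
import math
from itertools import combinations


def motion_distance(track_a, track_b):
    """Spread of the inter-landmark distance over time.

    Two landmarks on the same rigid body keep a constant distance,
    so the spread is 0 for a perfectly rigid, noise-free pair.
    """
    dists = [math.dist(p, q) for p, q in zip(track_a, track_b)]
    return max(dists) - min(dists)


def cluster_rigid_bodies(tracks, threshold):
    """Complete-linkage agglomerative clustering on motion_distance.

    tracks: list of landmark tracks, each a list of (x, y, z) per frame.
    threshold: assumed tolerance above which two groups are not merged.
    """
    n = len(tracks)
    d = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        d[i][j] = d[j][i] = motion_distance(tracks[i], tracks[j])

    def linkage(a, b):
        # Complete linkage: the worst pairwise disagreement between groups.
        return max(d[i][j] for i in a for j in b)

    clusters = [[i] for i in range(n)]
    while len(clusters) > 1:
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[x], clusters[y]) > threshold:
            break  # the closest remaining groups move too differently
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]  # x < y, so index x is unaffected
    return [sorted(c) for c in clusters]


# Three static landmarks and two landmarks on a body translating along x.
frames = range(3)
tracks = [
    [(0.0, 0.0, 0.0) for _ in frames],
    [(1.0, 0.0, 0.0) for _ in frames],
    [(0.0, 1.0, 0.0) for _ in frames],
    [(5.0 + t, 0.0, 0.0) for t in frames],
    [(5.0 + t, 1.0, 0.0) for t in frames],
]
bodies = cluster_rigid_bodies(tracks, threshold=0.1)
print(bodies)  # [[0, 1, 2], [3, 4]]
```

In this toy scene the static landmarks and the translating landmarks separate into two groups because cross-group distances change by about 2 units over the three frames, while within-group distances do not change at all. With real stereo measurements the tolerance would have to account for depth-dependent noise, which is what the noise-aware affinity in the abstract refers to.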

Acknowledgements

This work was supported by the National Key Technology R&D Program (Project No. 2017YFB1002604), the Joint NSFC-DFG Research Program (Project No. 61761136018), and the National Natural Science Foundation of China (Project No. 61521002).

Author information

Authors and Affiliations

BNRist, Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Jiahui Huang, Zishuo Zhao & Shi-Min Hu
Alibaba A.I. Labs, Hangzhou, 311121, China
Sheng Yang
School of Computer Science and Informatics, Cardiff University, Cardiff, CF24 3AA, UK
Yu-Kun Lai

Corresponding author

Correspondence to Shi-Min Hu.

Additional information

Jiahui Huang received his B.S. degree in computer science and technology from Tsinghua University in 2018. He is currently a Ph.D. candidate in computer science at Tsinghua University. His research interests include computer vision, robotics, and computer graphics.

Sheng Yang received his Ph.D. degree in computer science from Tsinghua University in 2019. He is currently a software engineer at Alibaba. His research interests include SLAM, robotics, and computer graphics.

Zishuo Zhao is an undergraduate student in the Institute for Interdisciplinary Information Sciences at Tsinghua University. His research interests include computer graphics, computational geometry, operations research, and algorithm theory.

Yu-Kun Lai received his bachelor's and Ph.D. degrees in computer science from Tsinghua University in 2003 and 2008, respectively. He is currently a professor in the School of Computer Science & Informatics, Cardiff University. His research interests include computer graphics, geometry processing, image processing, and computer vision. He is on the editorial boards of Computer Graphics Forum and The Visual Computer.

Shi-Min Hu is currently a professor in the Department of Computer Science and Technology, Tsinghua University, Beijing. He received his Ph.D. degree from Zhejiang University in 1996. His research interests include digital geometry processing, video processing, rendering, computer animation, and computer-aided geometric design. He has published more than 100 papers in journals and refereed conferences. He is the Editor-in-Chief of Computational Visual Media (Springer), and on the editorial boards of several journals, including Computer Aided Design (Elsevier) and Computers & Graphics (Elsevier).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.

About this article

Cite this article

Huang, J., Yang, S., Zhao, Z. et al. ClusterSLAM: A SLAM backend for simultaneous rigid body clustering and motion estimation. Comp. Visual Media 7, 87–101 (2021). https://doi.org/10.1007/s41095-020-0195-3

Received: 09 April 2020
Accepted: 04 September 2020
Published: 07 January 2021
Issue Date: March 2021
DOI: https://doi.org/10.1007/s41095-020-0195-3
