2D to 3D Image Conversion Algorithms
2D to 3D Image Conversion Algorithms
1051/itmconf/20246401010
ICACS24
1 Introduction
Nowadays, one of the core challenges of image-based modeling and computer vision is
producing a realistic 3D model from 2D photos. There are numerous algorithms available
today for reconstructing 3D from 2D photos, techniques use only a single image as input
(Binocular disparity, Motion Parallax, Image blur, Silhouette, and Structure from motion),
and techniques use Two or more than two images (Linear perspective, Atmosphere scattering
and Shape from shading) each technique has unique execution requirements, advantages, and
disadvantages. Particularly image content is widely used in different applications of
computer vision and image processing. Images are widely used in real-time applications. For
instance, applications about human identity use photos (faces) to identify humans. In some
applications, images are used to preserve historical activities or happenings for future
revision and retrieval. Digital image processing is the approach in which digital computers
are used to deal with image processing. In image processing, there is some process involved
based on the problem at hand. For each problem associated with image processing specific
algorithm is used.
For instance, image segmentation is done by using segmentation-based algorithms. Image
processing therefore is based on several algorithms. Images can be in either 2D or 3D form.
In many real-life applications 2D and 3D images are being used. In the process of using them,
sometimes, it is important to convert from 2D to 3D and vice versa [1].
*
Corresponding author: [email protected]
© The Authors, published by EDP Sciences. This is an open access article distributed under the terms of the Creative Commons
Attribution License 4.0 (https://creativecommons.org/licenses/by/4.0/).
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
There are many techniques available for image conversion. Traditional image processing
approaches are used to convert from 2D to 3D. These approaches are based on image
processing algorithms. The conversion process involves certain heuristics and methods.
Another category of conversion techniques is learning-based approaches. Learning-based
approaches such as RANSAC (RAndom SAmple Consensus) and (3D Lider) are very
dynamic and they learn from available data rather than following certain common heuristics
[13].
2 Methodology
Artificial Intelligence has been used widely in the medical field. It ranges from medical
diagnosis to treatment. It even helped sometimes in replacing the traditional medical
procedures because it avoids the basic errors that can be made by humans. The following
sections are the summery offered by this research in the major medical areas that can benefit
from the application of artificial intelligence.
Table 1: Shows final list of article references and their publisher
Publisher References
IEEE 4
Elsevier 9
Springer 2
Google Scholar 1
3 Literature Review
This section reviews the literature on different methods associated with 2D to 3D image
conversion. It provides insights on various aspects of conversion methods including machine
learning and deep learning.
2
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
3
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
accurate classification and retrieval, emphasizing the advantages and challenges posed by the
three-dimensional spatial structure. The proposed algorithm showcases effectiveness, and
promising applications in various fields, and future research will focus on computational
complexity and cost considerations to enhance deployment ability [4].
4
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
5
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
6
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
correlation. The method's applicability extends to square and scalable color images, offering
potential use in key generation, watermarking, and steganography techniques [3].
[4] Deep learning CNN and RANSAC KITTI A more reliable and
sliding object accurate approach is yet
window detection desired.
technique benchmark
dataset
[6] Deep learning Laser Flow- Custom Voxel and point cloud-
scanning and constrained dataset based approaches are to
stereo vision clustering be investigated in the
techniques algorithm future.
7
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
As presented in Table 2, different methods are used for 2D to 3D image conversion and vice
versa particularly based on deep learning.
As presented in Table 3, there are different kinds of image processing methods used for 2D
to 3D image conversion besides providing their merits and demerits.
8
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
5 Conclusion
A systematic review is made in this paper on existing methods of 2D to 3D image conversion and vice
versa. 3D object reconstruction from 2D images is a very active and useful field that has many practical
applications such as medical, military, security, logistics, and virtual tourism.
There are many more application fields for 3D object reconstruction such as Free-viewpoint video
reconstruction, robotic mapping, city planning, Gaming, virtual environments, robot navigation,
archaeology, augmented reality, reverse engineering, motion capture, Gesture recognition, and hand
tracking. Computer vision applications need image processing and learning-based techniques for
dealing with image content. In many computer vision applications, it is essential to deal with 2D and
3D images. It is also important to convert 2D to 3D and vice versa for different applications. This
systematic review throws light on the current level of innovation in image conversion techniques. Many
image processing techniques and deep learning models are found in the literature for image conversion.
CNN and CNN variants are widely used for image conversion. Transfer learning is used to enhance the
reuse of trained models. GAN-based approaches are also found useful for image conversion. Moreover,
different computer vision applications are needed to convert from either 2D to 3D or vice versa. Many
image processing techniques are also used for image conversion. From the insights of this paper, it is
observed that image conversion with learning-based approaches could leverage quality in conversion
besides enabling underlying applications for better performance.
References
9
ITM Web of Conferences 64, 01010 (2024) https://doi.org/10.1051/itmconf/20246401010
ICACS24
10. Li, M., Lin, Zh., Mech, R., Yumer, E. and Ramanan, D. Photo-Sketching: Inferring
Contour Drawings From Images. IEEE., pp.1-10.
http://DOI:10.1109/WACV.2019.00154, (2019).
11. Shamsolmoali, P., Zareapoor, M., Granger, E., Zhou, H., Wang, R., Celebi M. E. and
Yang, J. Image synthesis with adversarial networks: A comprehensive survey and case
studies. Elsevier. http://doi:10.1016/j.inffus.2021.02.014, (2021).
12. Tu, X., Zhao, J., Xie, M., Jiang, Z. , Balamurugan, K., Luo, Y., Zhao, Y., He, L. , Ma,
Zh. and Feng, J. 3D Face Reconstruction From A Single Image Assisted by 2D Face
Images in the Wild. IEEE, 23, pp. 1160- 1172. http://doi:10.1109/tmm.2020.2993962,
(2021).
13. Wu, O., Li, X., Wang, K. and Bilal, H. Regional feature fusion for on-road detection of
objects using camera and 3D-LiDAR in high-speed autonomous vehicles. Springer,
p.18195–18213. https://doi.org/10.1007/s00500-023-09278-3, (2023).
14. Wu, Zh., Zhang, J., Fang, Q., Yu, H. and Haiyan, M. Mesoscopic modelling of concrete
material under static and dynamic loadings: A review. Elsevier.
http://doi:10.1016/j.conbuildmat.2021.122419 , (2021).
15. Xu, Y., Tong, X. and Stilla, U. Voxel-based representation of 3D point clouds: Methods,
applications, and its potential use in the construction industry. Elsevier.
http://doi:10.1016/j.autcon.2021.103675, (2021).
16. Yang, X., Li, Y. and Lyu, S.. EXPOSING DEEP FAKES USING INCONSISTENT
HEAD POSES. IEEE., pp.8261-8265, (2019).
10