Showing 1–50 of 57 results for author: Civera, J

  1. Addressing the challenges of loop detection in agricultural environments

    Authors: Nicolás Soncini, Javier Civera, Taihú Pire

    Abstract: While visual SLAM systems are well studied and achieve impressive results in indoor and urban settings, natural, outdoor and open-field environments are much less explored and still present relevant research challenges. Visual navigation and local mapping have shown a relatively good performance in open-field environments. However, globally consistent mapping and long-term localization still depen…

    Submitted 28 August, 2024; originally announced August 2024.

    Journal ref: Journal of Field Robotics (2024), 1-10

  2. arXiv:2408.01737

    cs.RO

    Real-time Localization and Mapping in Architectural Plans with Deviations

    Authors: Muhammad Shaheer, Jose Andres Millan-Romera, Hriday Bavle, Marco Giberna, Jose Luis Sanchez-Lopez, Javier Civera, Holger Voos

    Abstract: Having prior knowledge of an environment boosts the localization and mapping accuracy of robots. Several approaches in the literature have utilized architectural plans in this regard. However, almost all of them overlook the deviations between actual as-built environments and as-planned architectural designs, introducing bias in the estimations. To address this issue, we present a novel localizati…

    Submitted 3 August, 2024; originally announced August 2024.

  3. arXiv:2407.20391

    cs.CV cs.RO

    Alignment Scores: Robust Metrics for Multiview Pose Accuracy Evaluation

    Authors: Seong Hun Lee, Javier Civera

    Abstract: We propose three novel metrics for evaluating the accuracy of a set of estimated camera poses given the ground truth: Translation Alignment Score (TAS), Rotation Alignment Score (RAS), and Pose Alignment Score (PAS). The TAS evaluates the translation accuracy independently of the rotations, and the RAS evaluates the rotation accuracy independently of the translations. The PAS is the average of the…

    Submitted 2 August, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

  4. arXiv:2407.02422

    cs.CV

    Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

    Authors: Sergio Izquierdo, Javier Civera

    Abstract: Visual Place Recognition (VPR) plays a critical role in many localization and mapping pipelines. It consists of retrieving the closest sample to a query image, in a certain embedding space, from a database of geotagged references. The image embedding is learned to effectively describe a place despite variations in visual appearance, viewpoint, and geometric changes. In this work, we formulate how…

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2405.15518

    cs.CV

    Feature Splatting for Better Novel View Synthesis with Low Overlap

    Authors: T. Berriel Martins, Javier Civera

    Abstract: 3D Gaussian Splatting has emerged as a very promising scene representation, achieving state-of-the-art quality in novel view synthesis significantly faster than competing alternatives. However, its use of spherical harmonics to represent scene colors limits the expressivity of 3D Gaussians and, as a consequence, the capability of the representation to generalize as we move away from the training v…

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2404.17251

    cs.CV

    Camera Motion Estimation from RGB-D-Inertial Scene Flow

    Authors: Samuel Cerezo, Javier Civera

    Abstract: In this paper, we introduce a novel formulation for camera motion estimation that integrates RGB-D images and inertial data through scene flow. Our goal is to accurately estimate the camera motion in a rigid 3D environment, along with the state of the inertial measurement unit (IMU). Our proposed method offers the flexibility to operate as a multi-frame optimization or to marginalize older data, t…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024 Workshop on Visual Odometry and Computer Vision Applications

  7. arXiv:2403.13395

    cs.CV cs.RO

    Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments

    Authors: Alberto García-Hernández, Riccardo Giubilato, Klaus H. Strobl, Javier Civera, Rudolph Triebel

    Abstract: Perceptual aliasing and weak textures pose significant challenges to the task of place recognition, hindering the performance of Simultaneous Localization and Mapping (SLAM) systems. This paper presents a novel model, called UMF (standing for Unifying Local and Global Multimodal Features) that 1) leverages multi-modality by cross-attention blocks between vision and LiDAR features, and 2) includes…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted submission to International Conference on Robotics and Automation (ICRA), 2024

  8. arXiv:2402.16598

    cs.CV cs.RO

    PCR-99: A Practical Method for Point Cloud Registration with 99 Percent Outliers

    Authors: Seong Hun Lee, Javier Civera, Patrick Vandewalle

    Abstract: We propose a robust method for point cloud registration that can handle both unknown scales and extreme outlier ratios. Our method, dubbed PCR-99, uses a deterministic 3-point sampling approach with two novel mechanisms that significantly boost the speed: (1) an improved ordering of the samples based on pairwise scale consistency, prioritizing the point correspondences that are more likely to be i…

    Submitted 2 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  9. arXiv:2312.05995

    cs.CV

    From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation

    Authors: Javier Tirado-Garín, Javier Civera

    Abstract: Estimating the relative camera pose from $n \geq 5$ correspondences between two calibrated views is a fundamental task in computer vision. This process typically involves two stages: 1) estimating the essential matrix between the views, and 2) disambiguating among the four candidate relative poses that satisfy the epipolar geometry. In this paper, we demonstrate a novel approach that, for the firs…

    Submitted 27 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted to CVPR 2024

  10. arXiv:2311.15937

    cs.CV

    Optimal Transport Aggregation for Visual Place Recognition

    Authors: Sergio Izquierdo, Javier Civera

    Abstract: The task of Visual Place Recognition (VPR) aims to match a query image against references from an extensive database of images from different places, relying solely on visual cues. State-of-the-art pipelines focus on the aggregation of features extracted from a deep backbone, in order to form a global descriptor for each image. In this context, we introduce SALAD (Sinkhorn Algorithm for Locally Ag…

    Submitted 27 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  11. arXiv:2309.14823

    cs.CL

    Segmentation-Free Streaming Machine Translation

    Authors: Javier Iranzo-Sánchez, Jorge Iranzo-Sánchez, Adrià Giménez, Jorge Civera, Alfons Juan

    Abstract: Streaming Machine Translation (MT) is the task of translating an unbounded input text stream in real-time. The traditional cascade approach, which combines an Automatic Speech Recognition (ASR) and an MT system, relies on an intermediate segmentation step which splits the transcription stream into sentence-like units. However, the incorporation of a hard segmentation constrains the MT system and i…

    Submitted 25 May, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Pre-MIT Press publication version. 18 pages, 13 figures

  12. arXiv:2309.06792

    cs.CV

    Motion-Bias-Free Feature-Based SLAM

    Authors: Alejandro Fontan, Javier Civera, Michael Milford

    Abstract: For SLAM to be safely deployed in unstructured real world environments, it must possess several key properties that are not encompassed by conventional benchmarks. In this paper we show that SLAM commutativity, that is, consistency in trajectory estimates on forward and reverse traverses of the same route, is a significant issue for the state of the art. Current pipelines show a significant bias b…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: BMVC 2023

  13. arXiv:2309.05388

    cs.CV cs.RO

    Robust Single Rotation Averaging Revisited

    Authors: Seong Hun Lee, Javier Civera

    Abstract: In this work, we propose a novel method for robust single rotation averaging that can efficiently handle an extremely large fraction of outliers. Our approach is to minimize the total truncated least unsquared deviations (TLUD) cost of geodesic distances. The proposed algorithm consists of three steps: First, we consider each input rotation as a potential initial solution and choose the one that y…

    Submitted 28 February, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  14. arXiv:2308.11242

    cs.RO cs.AI

    Faster Optimization in S-Graphs Exploiting Hierarchy

    Authors: Hriday Bavle, Jose Luis Sanchez-Lopez, Javier Civera, Holger Voos

    Abstract: 3D scene graphs hierarchically represent the environment appropriately organizing different environmental entities in various layers. Our previous work on situational graphs extends the concept of 3D scene graph to SLAM by tightly coupling the robot poses with the scene graph entities, achieving state-of-the-art results. Though, one of the limitations of S-Graphs is scalability in really large env…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 4 pages, 3 figures, IROS 2023 Workshop Paper

  15. arXiv:2308.10525

    cs.CV

    LightDepth: Single-View Depth Self-Supervision from Illumination Decline

    Authors: Javier Rodríguez-Puigvert, Víctor M. Batlle, J. M. M. Montiel, Ruben Martinez-Cantin, Pascal Fua, Juan D. Tardós, Javier Civera

    Abstract: Single-view depth estimation can be remarkably effective if there is enough ground-truth depth data for supervised training. However, there are scenarios, especially in medicine in the case of endoscopies, where such data cannot be obtained. In such cases, multi-view self-supervision and synthetic-to-real transfer serve as alternative approaches, however, with a considerable performance reduction…

    Submitted 19 September, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  16. GNSS-stereo-inertial SLAM for arable farming

    Authors: Javier Cremona, Javier Civera, Ernesto Kofman, Taihú Pire

    Abstract: The accelerating pace in the automation of agricultural tasks demands highly accurate and robust localization systems for field robots. Simultaneous Localization and Mapping (SLAM) methods inevitably accumulate drift on exploratory trajectories and primarily rely on place revisiting and loop closing to keep a bounded global localization error. Loop closure techniques are significantly challenging…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted for publication in Journal of Field Robotics, 2023

  17. arXiv:2306.16917

    cs.CV cs.LG cs.RO

    The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes

    Authors: David Recasens, Martin R. Oswald, Marc Pollefeys, Javier Civera

    Abstract: Estimating camera motion in deformable scenes poses a complex and open research challenge. Most existing non-rigid structure from motion techniques assume to observe also static scene parts besides deforming scene parts in order to establish an anchoring reference. However, this assumption does not hold true in certain relevant application cases such as endoscopies. Deformable odometry and SLAM pi…

    Submitted 29 June, 2023; originally announced June 2023.

  18. arXiv:2305.12250

    cs.CV

    DAC: Detector-Agnostic Spatial Covariances for Deep Local Features

    Authors: Javier Tirado-Garín, Frederik Warburg, Javier Civera

    Abstract: Current deep visual local feature detectors do not model the spatial uncertainty of detected features, producing suboptimal results in downstream applications. In this work, we propose two post-hoc covariance estimates that can be plugged into any pretrained deep feature detector: a simple, isotropic covariance estimate that uses the predicted score at a given pixel location, and a full covariance…

    Submitted 15 August, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  19. arXiv:2305.09566

    cs.CV

    Ray-Patch: An Efficient Querying for Light Field Transformers

    Authors: T. Berriel Martins, Javier Civera

    Abstract: In this paper we propose the Ray-Patch querying, a novel model to efficiently query transformers to decode implicit representations into target views. Our Ray-Patch decoding reduces the computational footprint and increases inference speed up to one order of magnitude compared to previous models, without losing global attention, and hence maintaining specific task metrics. The key idea of our nove…

    Submitted 17 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  20. arXiv:2305.09295

    cs.RO

    Graph-based Global Robot Simultaneous Localization and Mapping using Architectural Plans

    Authors: Muhammad Shaheer, Jose Andres Millan-Romera, Hriday Bavle, Jose Luis Sanchez-Lopez, Javier Civera, Holger Voos

    Abstract: In this paper, we propose a solution for graph-based global robot simultaneous localization and mapping (SLAM) using architectural plans. Before the start of the robot operation, the previously available architectural plan of the building is converted into our proposed architectural graph (A-Graph). When the robot starts its operation, it uses its onboard LIDAR and odometry to carry out an online…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 4 Page Workshop paper for ICRA 2023. arXiv admin note: substantial text overlap with arXiv:2303.02076

  21. arXiv:2303.02076

    cs.RO cs.AI

    Graph-based Global Robot Localization Informing Situational Graphs with Architectural Graphs

    Authors: Muhammad Shaheer, Jose Andres Millan-Romera, Hriday Bavle, Jose Luis Sanchez-Lopez, Javier Civera, Holger Voos

    Abstract: In this paper, we propose a solution for legged robot localization using architectural plans. Our specific contributions towards this goal are several. Firstly, we develop a method for converting the plan of a building into what we denote as an architectural graph (A-Graph). When the robot starts moving in an environment, we assume it has no knowledge about it, and it estimates an online situation…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 8 pages, 5 Figures, IROS 2023 conference

  22. arXiv:2212.11770

    cs.RO cs.AI

    S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations

    Authors: Hriday Bavle, Jose Luis Sanchez-Lopez, Muhammad Shaheer, Javier Civera, Holger Voos

    Abstract: In this paper, we present an evolved version of Situational Graphs, which jointly models in a single optimizable factor graph (1) a pose graph, as a set of robot keyframes comprising associated measurements and robot poses, and (2) a 3D scene graph, as a high-level representation of the environment that encodes its different geometric elements with semantic attributes and the relational informatio…

    Submitted 26 May, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 8 Pages, 7 Figures, 10 Tables

  23. arXiv:2212.05376

    cs.RO cs.CV

    What's Wrong with the Absolute Trajectory Error?

    Authors: Seong Hun Lee, Javier Civera

    Abstract: One of the limitations of the commonly used Absolute Trajectory Error (ATE) is that it is highly sensitive to outliers. As a result, in the presence of just a few outliers, it often fails to reflect the varying accuracy as the inlier trajectory error or the number of outliers varies. In this work, we propose an alternative error metric for evaluating the accuracy of the reconstructed camera trajec…

    Submitted 9 July, 2024; v1 submitted 10 December, 2022; originally announced December 2022.

  24. arXiv:2211.13551

    cs.CV

    SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks

    Authors: Sergio Izquierdo, Javier Civera

    Abstract: Estimating a dense depth map from a single view is geometrically ill-posed, and state-of-the-art methods rely on learning depth's relation with visual appearance using deep neural networks. On the other hand, Structure from Motion (SfM) leverages multi-view constraints to produce very accurate but sparse maps, as matching across images is typically limited by locally discriminative texture. In thi…

    Submitted 31 March, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

  25. arXiv:2211.08754

    cs.RO cs.AI

    Advanced Situational Graphs for Robot Navigation in Structured Indoor Environments

    Authors: Hriday Bavle, Jose Luis Sanchez-Lopez, Muhammad Shaheer, Javier Civera, Holger Voos

    Abstract: Mobile robots extract information from its environment to understand their current situation to enable intelligent decision making and autonomous task execution. In our previous work, we introduced the concept of Situation Graphs (S-Graphs) which combines in a single optimizable graph, the robot keyframes and the representation of the environment with geometric, semantic and topological abstractio…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 4 pages, IROS 2022 Workshop Paper. arXiv admin note: text overlap with arXiv:2202.12197

  26. arXiv:2206.02570

    stat.ME cs.DS math.ST

    RODIAN: Robustified Median

    Authors: Seong Hun Lee, Javier Civera

    Abstract: We propose a robust method for averaging numbers contaminated by a large proportion of outliers. Our method, dubbed RODIAN, is inspired by the key idea of MINPRAN [1]: We assume that the outliers are uniformly distributed within the range of the data and we search for the region that is least likely to contain outliers only. The median of the data within this region is then taken as RODIAN. Our ap…

    Submitted 18 November, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

  27. EndoMapper dataset of complete calibrated endoscopy procedures

    Authors: Pablo Azagra, Carlos Sostres, Ángel Ferrandez, Luis Riazuelo, Clara Tomasini, Oscar León Barbed, Javier Morlana, David Recasens, Victor M. Batlle, Juan J. Gómez-Rodríguez, Richard Elvira, Julia López, Cristina Oriol, Javier Civera, Juan D. Tardós, Ana Cristina Murillo, Angel Lanas, José M. M. Montiel

    Abstract: Computer-assisted systems are becoming broadly used in medicine. In endoscopy, most research focuses on the automatic detection of polyps or other pathologies, but localization and navigation of the endoscope are completely performed manually by physicians. To broaden this research and bring spatial Artificial Intelligence to endoscopies, data from complete procedures is needed. This paper introdu…

    Submitted 10 October, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 17 pages, 14 figures, 8 tables

    Journal ref: Sci Data 10, 671 (2023)

  28. arXiv:2203.02459

    cs.CL

    From Simultaneous to Streaming Machine Translation by Leveraging Streaming History

    Authors: Javier Iranzo-Sánchez, Jorge Civera, Alfons Juan

    Abstract: Simultaneous Machine Translation is the task of incrementally translating an input sentence before it is fully available. Currently, simultaneous translation is carried out by translating each sentence independently of the previously translated text. More generally, Streaming MT can be understood as an extension of Simultaneous MT to the incremental translation of a continuous input text stream. I…

    Submitted 31 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: ACL 2022 - Camera ready; v3: expanded data pre-processing

  29. arXiv:2202.12197

    cs.RO cs.AI

    Situational Graphs for Robot Navigation in Structured Indoor Environments

    Authors: Hriday Bavle, Jose Luis Sanchez-Lopez, Muhammad Shaheer, Javier Civera, Holger Voos

    Abstract: Mobile robots should be aware of their situation, comprising the deep understanding of their surrounding environment along with the estimation of its own state, to successfully make intelligent decisions and execute tasks autonomously in real environments. 3D scene graphs are an emerging field of research that propose to represent the environment in a joint model comprising geometric, semantic and…

    Submitted 1 July, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 8 pages, 6 figures, RAL/IROS 2022

  30. arXiv:2202.01821

    cs.CV cs.RO

    Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization

    Authors: Andrea Vallone, Frederik Warburg, Hans Hansen, Søren Hauberg, Javier Civera

    Abstract: Place recognition and visual localization are particularly challenging in wide baseline configurations. In this paper, we contribute with the Danish Airs and Grounds (DAG) dataset, a large collection of street-level and aerial images targeting such cases. Its main challenge lies in the extreme viewing-angle difference between query and reference images with consequent changes in illuminatio…

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: Submitted to RA-L (IROS)

  31. A Model for Multi-View Residual Covariances based on Perspective Deformation

    Authors: Alejandro Fontan, Laura Oliva, Javier Civera, Rudolph Triebel

    Abstract: In this work, we derive a model for the covariance of the visual residuals in multi-view SfM, odometry and SLAM setups. The core of our approach is the formulation of the residual covariances as a combination of geometric and photometric noise sources. And our key novel contribution is the derivation of a term modelling how local 2D patches suffer from perspective deformation when imaging 3D surfa…

    Submitted 1 February, 2022; originally announced February 2022.

  32. arXiv:2201.10602

    cs.CV cs.RO

    Jacobian Computation for Cumulative B-Splines on SE(3) and Application to Continuous-Time Object Tracking

    Authors: Javier Tirado, Javier Civera

    Abstract: In this paper we propose a method that estimates the $SE(3)$ continuous trajectories (orientation and translation) of the dynamic rigid objects present in a scene, from multiple RGB-D views. Specifically, we fit the object trajectories to cumulative B-Splines curves, which allow us to interpolate, at any intermediate time stamp, not only their poses but also their linear and angular velocities and…

    Submitted 24 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted at IEEE Robotics and Automation Letters

  33. arXiv:2112.08906

    cs.CV

    On the Uncertain Single-View Depths in Colonoscopies

    Authors: Javier Rodríguez-Puigvert, David Recasens, Javier Civera, Rubén Martínez-Cantín

    Abstract: Estimating depth information from endoscopic images is a prerequisite for a wide set of AI-assisted technologies, such as accurate localization and measurement of tumors, or identification of non-inspected areas. As the domain specificity of colonoscopies -- deformable low-texture environments with fluids, poor lighting conditions and abrupt sensor motions -- pose challenges to multi-view 3D recon…

    Submitted 20 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: 11 pages

  34. arXiv:2111.08831

    cs.CV

    HARA: A Hierarchical Approach for Robust Rotation Averaging

    Authors: Seong Hun Lee, Javier Civera

    Abstract: We propose a novel hierarchical approach for multiple rotation averaging, dubbed HARA. Our method incrementally initializes the rotation graph based on a hierarchy of triplet support. The key idea is to build a spanning tree by prioritizing the edges with many strong triplet supports and gradually adding those with weaker and fewer supports. This reduces the risk of adding outliers in the spanning…

    Submitted 29 March, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR2022

  35. arXiv:2104.14202

    cs.CV cs.RO

    Bayesian Deep Neural Networks for Supervised Learning of Single-View Depth

    Authors: Javier Rodríguez-Puigvert, Rubén Martínez-Cantín, Javier Civera

    Abstract: Uncertainty quantification is essential for robotic perception, as overconfident or point estimators can lead to collisions and damages to the environment and the robot. In this paper, we evaluate scalable approaches to uncertainty quantification in single-view supervised depth learning, specifically MC dropout and deep ensembles. For MC dropout, in particular, we explore the effect of the dropout…

    Submitted 15 December, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

  36. arXiv:2104.08817

    cs.CL

    Stream-level Latency Evaluation for Simultaneous Machine Translation

    Authors: Javier Iranzo-Sánchez, Jorge Civera, Alfons Juan

    Abstract: Simultaneous machine translation has recently gained traction thanks to significant quality improvements and the advent of streaming applications. Simultaneous translation systems need to find a trade-off between translation quality and response time, and with this purpose multiple latency measures have been proposed. However, latency evaluations for simultaneous translation are estimated at the s…

    Submitted 8 September, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 Camera Ready

  37. arXiv:2103.16525

    cs.CV cs.LG cs.RO

    Endo-Depth-and-Motion: Reconstruction and Tracking in Endoscopic Videos using Depth Networks and Photometric Constraints

    Authors: David Recasens, José Lamarca, José M. Fácil, J. M. M. Montiel, Javier Civera

    Abstract: Estimating a scene reconstruction and the camera motion from in-body videos is challenging due to several factors, e.g. the deformation of in-body cavities or the lack of texture. In this paper we present Endo-Depth-and-Motion, a pipeline that estimates the 6-degrees-of-freedom camera pose and dense 3D scene models from monocular endoscopic videos. Our approach leverages recent advances in self-su…

    Submitted 3 July, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

  38. arXiv:2011.12663

    cs.CV

    Bayesian Triplet Loss: Uncertainty Quantification in Image Retrieval

    Authors: Frederik Warburg, Martin Jørgensen, Javier Civera, Søren Hauberg

    Abstract: Uncertainty quantification in image retrieval is crucial for downstream decisions, yet it remains a challenging and largely unexplored problem. Current methods for estimating uncertainties are poorly calibrated, computationally expensive, or based on heuristics. We present a new method that views image embeddings as stochastic features rather than deterministic features. Our two main contributions…

    Submitted 17 September, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Journal ref: 2021 ICCV

  39. arXiv:2011.11724

    cs.CV

    Rotation-Only Bundle Adjustment

    Authors: Seong Hun Lee, Javier Civera

    Abstract: We propose a novel method for estimating the global rotations of the cameras independently of their positions and the scene structure. When two calibrated cameras observe five or more of the same points, their relative rotation can be recovered independently of the translation. We extend this idea to multiple views, thereby decoupling the rotation estimation from the translation and structure esti…

    Submitted 27 March, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Accepted to CVPR 2021

  40. arXiv:2010.00052

    cs.CV

    DOT: Dynamic Object Tracking for Visual SLAM

    Authors: Irene Ballester, Alejandro Fontan, Javier Civera, Klaus H. Strobl, Rudolph Triebel

    Abstract: In this paper we present DOT (Dynamic Object Tracking), a front-end that added to existing SLAM systems can significantly improve their robustness and accuracy in highly dynamic environments. DOT combines instance segmentation and multi-view geometry to generate masks for dynamic objects in order to allow SLAM systems based on rigid scene models to avoid such image areas in their optimizations.…

    Submitted 30 September, 2020; originally announced October 2020.

  41. arXiv:2008.01258

    cs.CV

    Robust Uncertainty-Aware Multiview Triangulation

    Authors: Seong Hun Lee, Javier Civera

    Abstract: We propose a robust and efficient method for multiview triangulation and uncertainty estimation. Our contribution is threefold: First, we propose an outlier rejection scheme using two-view RANSAC with the midpoint method. By prescreening the two-view samples prior to triangulation, we achieve the state-of-the-art efficiency. Second, we compare different local optimization methods for refining the…

    Submitted 5 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  42. arXiv:2008.01254

    cs.CV

    Geometric Interpretations of the Normalized Epipolar Error

    Authors: Seong Hun Lee, Javier Civera

    Abstract: In this work, we provide geometric interpretations of the normalized epipolar error. Most notably, we show that it is directly related to the following quantities: (1) the shortest distance between the two backprojected rays, (2) the dihedral angle between the two bounding epipolar planes, and (3) the $L_1$-optimal angular reprojection error.

    Submitted 30 December, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  43. arXiv:2004.00732

    cs.CV

    Robust Single Rotation Averaging

    Authors: Seong Hun Lee, Javier Civera

    Abstract: We propose a novel method for single rotation averaging using the Weiszfeld algorithm. Our contribution is threefold: First, we propose a robust initialization based on the elementwise median of the input rotation matrices. Our initial solution is more accurate and robust than the commonly used chordal $L_2$-mean. Second, we propose an outlier rejection scheme that can be incorporated in the Weisz…

    Submitted 4 November, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

  44. arXiv:2003.09025

    cs.RO

    LoCoQuad: A Low-Cost Arachnoid Quadruped Robot for Research and Education

    Authors: Manuel Bernal, Javier Civera

    Abstract: Developing real robotic systems requires a tight integration of mechanics, electronics and software. Most of the times, existing robotic platforms are either closed or expensive or both, and in-house solutions are costly to develop and maintain. Open-source and low-cost designs are essential to facilitate the access to real robotic platforms and enable further progress in the field. LoCoQuad is…

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: 8 pages, 12 figures

  45. RGB-D Odometry and SLAM

    Authors: Javier Civera, Seong Hun Lee

    Abstract: The emergence of modern RGB-D sensors had a significant impact on many application fields, including robotics, augmented reality (AR) and 3D scanning. They are low-cost, low-power and low-size alternatives to traditional range sensors such as LiDAR. Moreover, unlike RGB cameras, RGB-D sensors provide the additional depth information that removes the need for frame-by-frame triangulation for 3D scen…

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: This is the pre-submission version of the manuscript that was later edited and published as a chapter in RGB-D Image Analysis and Processing

  46. arXiv:1911.03167  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates

    Authors: Javier Iranzo-Sánchez, Joan Albert Silvestre-Cerdà, Javier Jorge, Nahuel Roselló, Adrià Giménez, Albert Sanchis, Jorge Civera, Alfons Juan

    Abstract: Current research into spoken language translation (SLT), or speech-to-text translation, is often hampered by the lack of specific data resources for this task, as currently available SLT datasets are restricted to a limited set of language pairs. In this paper we present Europarl-ST, a novel multilingual SLT corpus containing paired audio-text samples for SLT from and into 6 European languages, for…

    Submitted 12 February, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Accepted by ICASSP2020. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  47. arXiv:1907.11917  [pdf, other]

    cs.CV

    Triangulation: Why Optimize?

    Authors: Seong Hun Lee, Javier Civera

    Abstract: For decades, it has been widely accepted that the gold standard for two-view triangulation is to minimize the cost based on reprojection errors. In this work, we challenge this idea. We propose a novel alternative to the classic midpoint method that leads to significantly lower 2D errors and parallax errors. It provides a numerically stable closed-form solution based solely on a pair of backprojec…

    Submitted 23 August, 2019; v1 submitted 27 July, 2019; originally announced July 2019.

    Comments: Accepted to BMVC2019 (oral presentation)

  48. arXiv:1904.02028  [pdf, other]

    cs.CV

    CAM-Convs: Camera-Aware Multi-Scale Convolutions for Single-View Depth

    Authors: Jose M. Facil, Benjamin Ummenhofer, Huizhong Zhou, Luis Montesano, Thomas Brox, Javier Civera

    Abstract: Single-view depth estimation suffers from the problem that a network trained on images from one camera does not generalize to images taken with a different camera model. Thus, changing the camera model requires collecting an entirely new training dataset. In this work, we propose a new type of convolution that can take the camera parameters into account, thus allowing neural networks to learn cali…

    Submitted 3 April, 2019; originally announced April 2019.

    Comments: Camera ready version for CVPR 2019. Project page: http://webdiis.unizar.es/~jmfacil/camconvs/

  49. arXiv:1903.09115  [pdf, other]

    cs.CV

    Closed-Form Optimal Two-View Triangulation Based on Angular Errors

    Authors: Seong Hun Lee, Javier Civera

    Abstract: In this paper, we study closed-form optimal solutions to two-view triangulation with known internal calibration and pose. By formulating the triangulation problem as $L_1$ and $L_\infty$ minimization of angular reprojection errors, we derive the exact closed-form solutions that guarantee global optimality under respective cost functions. To the best of our knowledge, we are the first to present su…

    Submitted 29 July, 2019; v1 submitted 21 March, 2019; originally announced March 2019.

    Comments: Accepted to ICCV2019

  50. arXiv:1903.08094  [pdf, other]

    cs.CV

    Corners for Layout: End-to-End Layout Recovery from 360 Images

    Authors: Clara Fernandez-Labrador, Jose M. Facil, Alejandro Perez-Yus, Cédric Demonceaux, Javier Civera, Jose J. Guerrero

    Abstract: The problem of 3D layout recovery in indoor scenes has been a core research topic for over a decade. However, there are still several major challenges that remain unsolved. Among the most relevant ones, a major part of the state-of-the-art methods make implicit or explicit assumptions on the scenes -- e.g. box-shaped or Manhattan layouts. Also, current methods are computationally expensive and not…

    Submitted 25 March, 2019; v1 submitted 19 March, 2019; originally announced March 2019.