Fast and automatic periacetabular osteotomy fragment pose estimation using intraoperatively implanted fiducials and single-view fluoroscopy

R B Grupp; R J Murphy; R A Hegeman; C P Alexander; M Unberath; Y Otake; B A McArthur; M Armand; R H Taylor

doi:10.1088/1361-6560/aba089

1. Introduction

Patients suffering from developmental dysplasia of the hip (DDH) typically have severe pain and reduced coverage of the femoral head, which can lead to joint osteoarthritis and subluxation of the femur (Gala et al 2016). Joint-preserving pelvic osteotomies, such as the Periacetabular osteotomy (PAO), treat DDH by reorienting the hip joint for increased femoral head coverage (Ganz et al 1988). Specifically for PAO, four osteotomies are performed about the acetabulum, fracturing it from the pelvis and allowing it to be adjusted to the desired pose (Ganz et al 1988). In the conventional approach, PAO surgeons rely on 2D x-ray images, tactile feedback, experience and acumen to navigate the surgery (Ganz et al 1988). Clinicians typically assess femoral head coverage intraoperatively using specific radiographic measurements, such as the lateral center edge (LCE) angle (Wiberg 1939), derived from fluoroscopy (Troelsen 2009). However, this approach does not indicate the full 3D alignment of the acetabular fragment, nor does it describe additional biomechanical parameters, which have the potential to improve surgical outcomes (Hipp et al 1999, Armand et al 2005, Armiger et al 2009, Niknafs et al 2013, Liu et al 2016a). A 3D example of a relocated fragment and a corresponding 2D fluoroscopic view is shown in figure 1.

Figure 1. Refer to the following caption and surrounding text. — **Figure 1.** Examples of an adjusted acetabular fragment visualized in 3D (a) and in a corresponding 2D fluoroscopic image (b). The fragment pose shown in (a) was estimated using the view shown in (b). A precise model of the acetabular fragment is not required by the proposed method; the 3D bone surfaces in (a) were constructed using a preoperative plan of the osteotomies. The anatomical axes of the anterior pelvic plane are also shown in (a); left/right (LR) as X-axis, inferior/superior (IS) as Y-axis, and anterior/posterior (AP) as Z-axis.
Download figure:
Standard image High-resolution image

In this paper, we propose a processing pipeline that is capable of automatically reporting fragment poses from a single fluoroscopic view with mean runtimes below one second. The pipeline is inspired by Roentgen stereometric analysis (RSA) techniques, which use metallic ball-bearings (BBs) to track the movement of bones or surgical implants over time (Selvik 1990).

Two constellations of BBs are injected into the patient’s pelvis prior to osteotomy: one co-located on the expected acetabular bone fragment and the other on the larger pelvis portion. The 3D locations of the BBs are reconstructed using three fluoroscopic views of the constellations. Once the acetabulum is relocated, the 3D orientation and position of the fragment is automatically calculated using a single fluoroscopic view.

Existing navigation systems for PAO have traditionally relied on optical tracking devices for tool and bone fragment tracking (Langlotz et al 1997, Akiyama et al 2010, Murphy et al 2015, Liu et al 2016b, Takao et al 2017). Due to their limited fields of view and susceptibility to line-of-sight obstructions, optical trackers introduce significant positioning constraints which are challenging to satisfy in already crowded operating rooms. Since our method only requires a small BB injection device and fluoroscopy, which is already common throughout orthopaedic operating rooms for joint surgery, we believe the proposed fluoroscopic method is more easily deployable than approaches relying on optical tracking technology. Furthermore, the registration process with an optical tracker requires a certain amount of bone exposure and may become more challenging when using minimally invasive incisions (Troelsen et al 2008). Since hip surgeries incorporating RSA do not demonstrate significantly different operating times than corresponding surgeries without RSA (Shah et al 2018), the process of injecting BBs into the pelvis should not substantially interfere with existing surgical workflows. Compared to existing approaches which leverage fluoroscopy (Grupp et al 2020a), our method only requires a single fluoroscopic image per pose estimate, does not rely on any knowledge of the 3D fragment shape, and runs without user initialization in a fraction of the time.

After BB injection, the proposed method does not require any specialized equipment or additional workflow. Moreover, the pose estimation executes quickly and automatically between fluoroscopic captures. The primary clinical contribution of this paper is the ability to report 3D orientation and position of the acetabular fragment, while requiring minimal modification to an existing surgical workflow. In terms of technical contribution, this paper is the first method leveraging intraoperatively constructed fiducial constellations to automatically recover point correspondences and poses of multiple objects moving non-coherently in uncalibrated single-view fluoroscopy.

1.1. Related work

Early navigation systems for PAO, and other pelvic osteotomies, relied on optical trackers (Langlotz et al 1997, Langlotz et al 1998, Mayman et al 2002, Akiyama et al 2010), or custom cutting guides (Radermacher et al 1998, Otsuki et al 2013). These systems only provided intraoperative assistance during performance of the acetabular osteotomies; pose estimates of the relocated fragment were not produced.

More recent systems have focused on reporting the pose of a relocated fragment (Murphy et al 2015, Liu et al 2016b, Murphy et al 2016, Takao et al 2017, De Raedt et al 2018, Grupp et al 2020a). Fragment pose updates may be provided in real-time by directly attaching an optically tracked rigid body to the fragment as demonstrated in (Liu et al 2016b). However, attaching a large rigid body to the acetabular region is challenging, especially when using a minimally invasive technique specialized for PAO (Troelsen et al 2008). In order to estimate fragment poses and avoid the attachment of an extra rigid body, (Murphy et al 2015) and (Takao et al 2017) digitize specific points on the fragment with an optically tracked pointer tool after each adjustment of the fragment. This digitization adds minor overhead to the operative time in (Murphy et al 2015) and causes some ambiguity between rotation and translation in (Takao et al 2017). In (Murphy et al 2015), fragment pose errors ranged from 1.4^° to 1.8^° in rotation and 1.0–2.2 mm in translation.

Eliminating the need for optical tracking systems, (Grupp et al 2020a) used multiple fluoroscopic views to track the acetabular fragment, ipsilateral femur, and pelvis. A multiple-component intensity-based 2D/3D registration of patient anatomy was used, requiring no external objects and maintaining compatibility with any PAO approach. However, the method suffers from several limitations and constraints that interfere with a typical surgical workflow:

An approximate AP fluoroscopic view, two additional views, and manual annotation of a single anatomical landmark are required to initialize the method.
Accuracy of the approach degrades as intraoperative fragment shapes differ from preoperatively planned shapes.
The computation time on state-of-the-art hardware is not real-time, approximately 25 s.

To overcome these limitations, the methods described in this paper leverage implanted BBs and extend RSA-related techniques to automatically track the migration of the acetabular fragment using a single view per adjustment.

Since its introduction in the 1970s, RSA has been used for a variety of applications (Selvik 1990), including the longitudinal analysis of orthopaedic implant migration (Valstar et al 2002), bone growth (Karrholm et al 1984), and even PAO stability (Mechlenburg et al 2007). Recent work in the RSA community has incorporated 2D/3D registration technology to track the movement of bones and implants without relying on inserted BBs (De Bruin et al 2008, Seehaus et al 2012). Similar to (Grupp et al 2020a), these methods require manual input and multiple x-ray views.

The implantation of BBs for intraoperative fragment tracking during PAO was first introduced in (Murphy et al 2013) and (Armand et al 2018). However, this approach is not easily incorporated into a surgical workflow, since it requires: the manual identification of BB correspondences, multiple post-osteotomy views, and a calibrated CBCT C-arm.

Several methods for automatic 3D BB reconstruction have been developed by the CBCT community (Hamming et al 2009, Yaniv 2009, Dang et al 2012, Choi et al 2014). These methods require a calibrated C-arm, leverage more than three projections, or rely on an orbital motion constraint to help establish correspondences. For intraoperative coronary artery reconstruction, epipolar constraints have been applied to automatically prune invalid point correspondences between two (Yang et al 2009), and three (Blondel et al 2006), fluoroscopic views. Structure-from-Motion pipelines operate in a similar fashion and use the dense correspondences found in large photographic collections to reconstruct rigid structures in 3D (Snavely et al 2008, Westoby et al 2012, Schonberger and Frahm 2016).

Methods using a known 3D marker constellation, two 2D x-ray views, and varying levels of manual interaction have been developed for patient positioning and motion compensation in radiation therapy (Schweikard et al 2000, Litzenberg et al 2002, Aubry et al 2004).

Single plane RSA was proposed in order to avoid the less-common, bi-planar, imaging devices used in RSA (Yuan et al 2002). However, the method requires the use of a calibration cage, does not address the establishment of 2D/3D correspondences, and was only evaluated for single object pose recovery.

Automatic pose and correspondence estimation between a single rigid collection of BBs and one fluoroscopic view was described in (Tang et al 2000) and (Kang et al 2013). The poses of custom tailored fiducial objects using lines and ellipses may also be computed automatically in a single view (Jain et al 2005a, Steger et al 2013). These poses are generally restricted to tracking the relative motion of the C-arm, since the relationship between the patient’s anatomy and the intraoperatively inserted fiducial is typically unknown.

The method of (Tang et al 2000) was incorporated into fluoroscopic systems for estimating the poses of multiple BB constellations required for knee kinematics (Tang et al 2004, Ioppolo et al 2007). However, the mechanism used to identify constellation membership and establish 2D/3D correspondences was not described.

The pipeline proposed in this paper is able to accurately, quickly, and automatically provide pose estimates of a relocated bone fragment during PAO. No reliance on external tracking devices is required. Furthermore, the pose estimation method does not require: a calibrated C-arm, multiple-views, a specific constellation pattern, accurate knowledge of the fragment shape, or any manual establishment of correspondence.

2. Materials and methods

The method introduced in this paper requires some preoperative processing and two distinct phases during the surgery. CT scanning, segmentation of the anatomy, and anatomical landmark digitization make up the preoperative processing. The first intraoperative phase is performed only once and consists of BB injection and reconstruction. Pose estimation of the acetabular fragment from a single fluoroscopic view represents the second intraoperative addition. In order to achieve the desired amount of femoral head coverage, it is typical for a surgeon to iterate between collecting fluoroscopy and adjusting the fragment. Therefore, our processing combines intelligent pruning and GPU acceleration to avoid any significant delay to the workflow. Figure 2 shows the workflow of the proposed method at a high level. Detailed flow charts of the reconstruction and pose estimation sub-components are provided in supplementary section S-1 (stacks.iop.org/PMB/65/245019/mmedia). Full details of the preoperative processing, intraoperative BB reconstruction, and intraoperative fragment pose estimation are now provided.

Figure 2. Refer to the following caption and surrounding text. — **Figure 2.** A summary of the surgical workflow proposed for this method, including the data required for each step. The key contributions of this work are the BB reconstruction and single-view pose estimation components, which are highlighted in gray. Detailed workflows for the reconstruction and pose estimation components are shown in supplementary figures S-1 and S-2, respectively.
Download figure:
Standard image High-resolution image

2.1. Preoperative processing

Preoperative processing proceeds identically to that in (Grupp et al 2020a), which we briefly describe here. A lower torso CT scan is obtained and resampled to have 1 mm isotropic voxel spacing. An automated method (Krcah et al 2011) is used for an initial segmentation of the pelvis and femurs; any inconsistencies around the femoral head and acetabulum are cleaned up manually. Anatomical landmarks are manually annotated to define the anterior pelvic plane (APP) coordinate system (Nikou et al 2000), and also for later use as initialization of pre-osteotomy pelvis registrations. The origin of the APP is set at the center of the ipsilateral femoral head, and the mapping from APP coordinates to the CT volume coordinates is denoted by $T_{APP}^{V}$ . Six additional landmarks are manually annotated in order to create a planned fragment shape, which is only used to visualize the intraoperative movement of the fragment. Examples of the APP axes orientation and a planned fragment shape are shown in figure 1.

2.2. Intraoperative BB reconstruction

A Halifax Biomedical Inc. injection device is used to implant two, four-BB constellations onto the ipsilateral side of the patient’s pelvis, with one constellation lying on the area expected to lie on the acetabular fragment and the other on the larger pelvis fragment. BB injection is conducted after performing soft-tissue dissection, but prior to osteotomy. The injection device may be maneuvered either with direct sight or fluoroscopic guidance to ensure that the BB constellations will lie on on separate bone fragments after performance of the osteotomies. For the experiments conducted in this paper, fluoroscopy was not used during the BB insertion process. Although at least three BBs must not be colinear within each constellation in order to later track their rigid movement (West et al 2001), this constraint is not difficult to satisfy in practice due to the 3D curvature of the bone.

Three distinct fluoroscopic views are collected while the patient anatomy remains stationary. For each view, a variation of the radial symmetry algorithm (Loy and Zelinsky 2003) is used to automatically locate each BB in 2D. Parameter values and details of this approach are listed in supplementary section S-1.1. The 3D locations of each BB are constructed by recovering the relative pose information of each view, establishing inter-view BB correspondences, and performing triangulation (Hartley and Zisserman 2003).

Using the strategy laid out in (Grupp et al 2020a), relative poses between the three views are recovered by performing 2D/3D rigid registrations of the patient’s preoperative pelvis to each view. Since some of our pre-osteotomy views have excessive pelvic tilt and violate the approximate AP view assumption, we select more than the single landmark described in (Grupp et al 2020a) for initialization of the pipeline. Supplementary section S-1.2 describes the parameters used for the intensity-based registrations.

BB correspondences are automatically established using a combination of anatomical information and the multiple-view geometry between the three C-arm poses. Two of the views are selected to create a candidate set of two-view, single-BB, correspondences and triangulated 3D points. Although we have made no assumptions about the geometry of these views, one of the views was always an approximate AP orientation with a variable amount of pelvic tilt. The candidate correspondences are created by first considering all possible combinations of single-BB correspondences between the two views, and pruning candidates that result in a triangulated point located more than 10 mm away from the pelvis surface. The red sphere shown in figure 4 is an example of a correspondence pruned in this way. Candidate three-view correspondences are constructed by pairing each of the remaining two-view correspondences with every 2D BB detection in the third view. For each candidate three-view correspondence, the two-view 3D triangulation is re-projected into the third view and the distance to the hypothesized 2D match is recorded. Intuitively, re-projection distances for valid correspondences should be smaller than distances from invalid matches, as shown with the green and yellow re-projections in figure 4. Correct three-view correspondences are established by greedily selecting the candidate correspondences with minimum re-projection distances in the third view. The final 3D reconstructions are triangulated using the correct three-view correspondences. In this way, the third view is used to enforce consistency and refine the 3D triangulation. A visual example is shown in figure 4 and a more formal description is located in supplementary section S-1.3.

Figure 3. Refer to the following caption and surrounding text. — **Figure 3.** The Halifax bead injection device used in four of the cadaver surgeries is shown on the left. On the right, a pre-osteotomy fluoroscopic image is shown with automatic detections of injected beads highlighted by yellow circles; every injected BB was detected. The larger BBs were used to help establish the ground truth pose of the fragment and as such, are not used and not detected during intraoperative pose estimation.
Download figure:
Standard image High-resolution image

Figure 4. Refer to the following caption and surrounding text. — **Figure 4.** A visual example of the pre-osteotomy reconstruction process for a *single* BB. Three fluoroscopic views used for reconstruction are shown in (a), (b), and (c). The initial two-view triangulations are derived from (a) and (b), while (c) is used for re-projections of initial triangulations. Regions pertinent to this example are indicated by yellow boxes, and are magnified in the bottom row. 3D renderings of the patient’s ipsilateral hemi-pelvis and the relative location of the C-arm detector for the first two views are shown in (d). The green circle in (a) indicates the location of a detected BB, whose 3D location is to be reconstructed. In (b), the green circle shows the detected BB location with true correspondence to BB in (a); the red square and yellow diamond show detected locations with incorrect correspondence. In (d), the three colored spheres are initial triangulations of the BB from (a) when matched with the BBs of varying colors in (b). The red sphere is not located on the pelvis and its candidate correspondence is pruned. However, the green and yellow spheres are located on the pelvis and must be checked using (c). Lines between the x-ray source and BB locations on the detector are colored consistently with (a), (b), and (c); note the intersection between the green lines. The green circle in (c) indicates the detected location of the BB in true correspondence with the green circles in (a) and (b). The green ‘X’ is the re-projection of the green sphere from (d) and the yellow asterisk is the re-projection of the yellow sphere. Since the green sphere was triangulated using a correct correspondence, its re-projected distance to the BB detection in (c) is very small compared to the re-projected distance of the yellow sphere, which was triangulated using an incorrect correspondence.
Download figure:
Standard image High-resolution image

After pruning reconstructed BBs on the contralateral side, the process is completed by classifying the remaining BBs as fragment/non-fragment using a K-Means clustering of the BB positions (K = 2).

Annotation speed and computation time is not a critical factor at this point in the procedure, since the BB constellations are not required until the fragment has been relocated; osteotomies may be performed immediately after the three fluoroscopic views are obtained.

2.3. Intraoperative pose estimation

After osteotomies have been performed and the acetabular fragment has been relocated, a single fluoroscopic image may be used to recover the fragment’s pose with respect to the APP, Δ_APP. Once the poses of the ilium and fragment BB constellations with respect to the C-arm, $T_C^{IL}$ and $T_C^{FR}$ , are recovered, Δ_APP is computed as in (1).

$\begin{equation} \Delta_{APP} = T_V^{APP} T_C^{FR} T_{IL}^C T_{APP}^{V} \end{equation} \tag{ 1 }$

Since the BB constellations are constructed in the original pelvis volume coordinate frame, the composition of $T_C^{FR} T_{IL}^C$ is valid and maps points on the preoperative fragment region to their adjusted locations. Using Δ_APP, the current pose of the fragment may be visualized (figure 1), and pose parameters or biomechanical (e.g. LCE) angles may also be displayed.

As was the case for each fluoroscopic view used for pre-osteotomy BB reconstruction, the radial symmetry algorithm is used to detect each BB in the 2D fluoroscopic image automatically. Since the 3D/2D BB correspondences are not yet established, it is not feasible to directly apply classic PnP approaches (Hartley and Zisserman 2003) for calculation of $T_C^{IL}$ or $T_C^{FR}$ . Since manual identification is tedious and error-prone, an automatic method to establish correspondences is the appropriate intraoperative strategy. One possible, although naïve, approach would be to enumerate over all possible correspondences and their poses. Digitally Reconstructed Radiographs (DRRs) and similarities with the fluoroscopic image would be computed for each candidate pose, with the actual pose implied by the best similarity score. When exactly all 8 BBs are detected in the view and 4 correspondences are used to establish a candidate pose, state-of-the-art GPUs may be capable of efficiently computing the 1,680 similarity scores of all possible ilium BB correspondences. However, the tractability of performing a brute-force search over similarity scores quickly diminishes as additional BBs are detected. Additional detections may result from false detections on the screws and K-wires used to fix the fragment, and are possibly exacerbated during bilateral cases due to detections on the contralateral side. Examples of 2D BB detections in fluoroscopy images are shown in supplementary Figures S-10, S-11, and S-12, with the number of BB detections per image varying between 7 and 21. For specimen 1, there were 13, 12, and 21 total detections in projections 1, 2, and 3, yielding 17,160, 11,880, and 143,640 possible ilium BB correspondences, respectively. Even with state-of-the-art GPUs, the sheer number of possible poses precludes the brute-force strategy from working in an intraoperatively compatible timeframe. However, we shall describe a procedure for pruning the number of candidate poses by several orders of magnitude, enabling the required similarity scores to be intraoperatively computed through GPU acceleration.

2.3.1. General pose pruning strategy

For a given 4-BB constellation, the general pruning strategy enumerates over each 3-BB sub-constellation. Furthermore, the full set of possible 3-BB 3D/2D correspondences for each sub-constellation is examined. Potential solutions to the P3P problem (Fischler and Bolles 1981) are considered for each set of correspondences. Since we are concerned with pose estimation using fluoroscopic imagery, our approach to the P3P problem assumes that the BB constellation lies between the x-ray source and detector. This assumption enables solutions to be ignored which: are impossible given the rigid structure of the constellation, or which place the BB constellation too close to the x-ray source. Many poses that would be produced from incorrect 3-BB correspondences are discarded in this way. In addition to the point sets and hypothesized correspondences, a set of source-to-detector ratios is also required as input to the P3P solver. The source-to-detector ratios are used to back-project one of the 2D BB detections to possible 3D locations, simplifying the pruning problem. Full details of this approach are described in supplementary section S-1.4. Solutions reported by the P3P solver are further pruned according to anatomical constraints. The candidate source-to-detector ratios and anatomical constraints differ for the ilium and fragment BB constellations.

2.3.2. Ilium BB pose estimation

The pose of the ilium BB constellation is recovered first and is then used to assist with establishing the pose of the fragment BB constellation. A set of 129 uniformly spaced source-to-detector ratios is used for each ilium P3P invocation: $\left \{0.6 + 0.003\,125k | k = 0, 1, \dots, 128 \right \}$ . Using the APP coordinate frame, a reference AP orientation of the pre-osteotomy pelvis, with respect to the C-arm, is constructed and used for pruning anatomically implausible ilium poses. The AP orientation has the following properties: the patient is supine with the x-ray detector placed anteriorly, the AP axis is parallel to the C-arm depth axis, the IS axis is parallel to the 2D image row axis with the top of the image more superior than the bottom, and the LR axis is parallel to the 2D image column axis. Each candidate P3P pose is examined to obtain the difference in orientation from the reference AP pose and an Euler decomposition is used to obtain rotation angles about each anatomical axis. Poses are pruned when the magnitude of any Euler angle is greater than 60^°. Using such a large range of allowable angles permits all reasonable C-arm orientations while eliminating highly unlikely poses, such as those that place the detector beneath, or nearly orthogonal with, the surface of the operating table. An example of a pose pruned using this logic is shown in the top row of figure 5. Using each remaining candidate ilium pose, the original fragment BB constellation is projected into the view; e.g. where the fragment BBs would be located in 2D had the fragment not been moved. Since the majority of fragment movement consists of rotation, the re-projected fragment BBs should lie nearby to 2D BB detections. Poses are pruned when less than 3 of the fragment BBs are projected inside the bounds of the 2D image. For each projected fragment BB, the distance to the nearest 2D detection is calculated, and the three BBs with the smallest nearest distances are recorded. When the mean distance associated with these three BBs is greater than 200 pixels the candidate ilium pose is pruned.

Figure 5. Refer to the following caption and surrounding text. — **Figure 5.** The top row shows an ilium pose pruned for excessive difference from the reference AP orientation (137^° about the AP axis). The green sphere indicates the x-ray source with a green line connecting to the principal point on the x-ray detector. For this example, the candidate correspondences were able to satisfy the constraints of the P3P solver. However, the implausibility of the pose reveals the incorrectness of the correspondences. The bottom row depicts several examples of ilium poses and correspondences used for initialization of the full-pelvis intensity-based, 2D/3D, registration. Green edges, derived from a specific pelvis pose, are overlaid over the intraoperative fluoroscopic image. Agreement between the overlaid edges and base image indicates agreement between the hypothesized pose and true pose. Image similarity scores are listed in the bottom right of each overlay. The scores are computed from DRRs, computed at each candidate pose, and the intraoperative fluoroscopic image. Lower scores indicate better similarity, with the bottom right example representing the most likely pose of the three.
Download figure:
Standard image High-resolution image

Once the general pruning strategy has been completed for the ilium BB constellation, the naïve brute-force, intensity-based, approach is used to select the best ilium pose from the remaining candidates. The bottom row of figure 5 shows several examples of image similarity calculated from poses derived from different correspondences. This pose is used as initialization for an intensity-based 2D/3D registration of the pre-osteotomy pelvis to the fluoroscopic image. Details of the intensity-based registration parameters are listed in supplementary section S-1.2. Using the pose estimate computed during the intensity-based registration, final ilium BB correspondences are established by re-projecting the 3D ilium BBs into 2D. Correspondences are greedily assigned based on the minimum 2D distances between projected BB locations and detected 2D BB locations. However, no correspondence is established for projected BBs with minimum distances greater than 10.5 pixels. When less than two correspondences are established we consider the algorithm to have failed in establishing the ilium pose and no further processing is performed. The ilium pose is set to the intensity-based registration pose when exactly two correspondences are established. When three or four correspondences are established, the ilium pose is refined by optimizing over the corresponding ilium BB re-projection distances starting from the intensity-based pose as the initial guess.

The set of 2D BB detections is pruned down to exclude: BBs already matched to the ilium, and any BBs that are distant from the expected location of the fragment. A BB is considered distant if the closest, re-projected, fragment BB is greater than 200 pixels away. This is a variation of the process previously used for pruning ilium poses by re-projection of 3D fragment BBs.

2.3.3. Fragment BB pose estimation

The fragment pose recovery is started by conducting the general pruning strategy over candidate fragment BB correspondences and poses. Since approximate depth of the BBs is known from the ilium pose recovery process, only 33 source-to-detector ratios are passed to the P3P solver. A reference source-to-detector ratio, $\hat{r}$ , is computed by mapping the centroid of the fragment 3D BB constellation into the C-arm coordinate frame using the ilium pose. The source-to-detector ratios are then uniformly sampled about this reference: $\left \{\hat{r} \pm 0.003\,125k | k = 0, 1, \dots, 16 \right \}$ . Using each solution produced by the P3P solver, the relative pose of the fragment is computed using (1). Any relative pose with rotation magnitude greater than 60^° or translation magnitude greater than 30 mm is pruned. Figure 6 shows an example fragment pose that was pruned in this fashion.

Figure 6. Refer to the following caption and surrounding text. — **Figure 6.** An example of an implausible fragment pose which was pruned due to a large rotation of 142^°. Despite their incorrectness, the candidate correspondences used to compute this candidate fragment pose were able to satisfy the P3P solver constraints.
Download figure:
Standard image High-resolution image

Due to the difficult nature of the chiseling process, the true shape of the acetabular fragment usually differs from the preoperatively planned shape. For this reason, image similarities are not used to select the best candidate returned from the general pruning process. Instead, the best candidate is selected by choosing the pose yielding the largest number of matching BBs and the smallest mean re-projection distance. The match criterion used for ilium matches is reused here. When less than 3 BBs are matched, the method reports failure. However, the fragment pose is refined by an optimization over re-projection distances if at least 3 BBs are matched. The optimization is regularized by the translation magnitude of the fragment pose relative to the APP. This regularization is reasonable, since the fragment movement is believed to consist primarily of rotation and the approximate depth is known from the ilium pose.

It is important to note that the approach described here only requires correspondences to be established for three ilium BBs and three fragment BBs. Therefore, the proposed method provides some robustness to occlusion, since it is unlikely that more than one BB from a single constellation will be occluded for any given view. Likewise, it is still feasible to obtain fragment pose estimates when a single BB (per constellation) becomes dislodged from the bone.

2.4. Cadaver experiments

Surgeries performed on three, non-dysplastic, cadaveric specimens were used to evaluate the proposed method. Specimens 1, 2, and 3 were aged 89, 87, 94 and were male, female, and male, respectively. Preoperative processing and planning was performed bilaterally for each specimen and six PAOs were performed by our surgeon co-author, B.A.M. The Halifax injector, using BBs of 1 mm diameter, was only used during surgeries for specimens 2 and 3. For specimen 1, bone burs were created on the surface of the pelvis, and 1.5 mm diameter BBs were affixed with cyanoacrylate. The larger BBs were also inserted into specimens 2 and 3, but were only used for ground truth calculations. For both BB insertion approaches, direct visual guidance was used to ensure a sufficient separation of the two BB constellations. A comprehensive discussion of BB insertion and the fragment pose ground truth protocol is found in (Grupp et al 2020a). Ground truth poses for specimen 1 were calculated using the 2D/3D known BB constellation approach, whereas specimens 2 and 3 use the 3D/3D method. For specimen 1, the 3D BB positions were manually digitized from a postoperative CT scan and mapped into the preoperative CT frame using a rigid registration between the two images. Pre-osteotomy and post-osteotomy poses of the BB constellations were recovered using a series of 2D/3D, paired point, registrations between the 3D BBs and manually annotated 2D BB locations in the fluoroscopic views. The ground truth fragment pose was defined as the rigid movement of the fragment BBs between the pre-ostetomy and post-osteotomy poses. In addition to the pre and postoperative CT scans for specimens 2 and 3, a CT scan was also collected after BB implantation, but prior to any osteotomies. The positions of the pre and post-osteotomy BBs were manually identified in the corresponding CT scans. A paired point 3D/3D rigid registration between the pre and post-osteotomy BBs was used to calculate the ground truth pose of the fragment.

Three fluoroscopic views were used to reconstruct the pre-osteotomy 3D BB constellations for each surgery. Pose estimation of the relocated fragment was conducted on 3 separate fluoroscopic images, each with different viewing geometries. All fluoroscopy was obtained using a Siemens CIOS Fusion C-arm with 30 inch flat panel detector. The intrinsic parameters of the C-arm were naïvely approximated using metadata present in DICOM images produced by the system. A source-to-detector distance of 1020 mm, isotropic pixel spacings of 0.194 mm, no skew or distortion, and a principal point at the center of the projection were used. For C-arms with minimal distortion, previous work has demonstrated that approximate intrinsics yield accurate reconstructions of small objects and accurate relative pose estimates (Jain et al 2005b). The approximate intrinsics used in this paper are therefore reasonable, as the reconstructions are of small BBs and the final relative fragment pose estimates are not with respect to the C-arm coordinate frame.

A video example of the full pipeline, from preoperative planning to intraoperative pose estimation is available online at: https://youtu.be/0E0U9G81q8g.

3. Results

3.1. Intraoperative BB reconstruction

For pre-osteotomy BB reconstruction, there were no missed detections in the 2D images and a single false detection in one image. Table 1 summarizes the reconstruction errors. The mean reconstruction error of the larger BBs implanted into specimen 1 was 2.6 mm. For specimens 2 and 3, the mean reconstruction error of the smaller, injected, BBs was 1.4 mm. The mean computation time for the entire reconstruction pipeline was 8.3 ± 0.4 seconds. When excluding the 2D/3D full pelvis registration time required for relative pose recovery of the C-arm, the BB detection, correspondence establishment and reconstruction took 0.7 ± 0.1 s. Timing measurements were conducted using a single NVIDIA P100 (PCIe) GPU and seven cores of an Intel Xeon E5-2680 v4 CPU. A detailed breakdown of these runtimes is found in supplementary table S-2.

Table 1. A summary of BB reconstruction errors for each surgery, specified by the cadaver specimen number and operative side. The means and standard deviations of reconstruction errors are given for the separate ilium and fragment BB constellations and also the entire set of BBs. For each surgery, four BBs were reconstructed for each of the ilium and fragment constellations.

	Reconstruction errors (mm)
Surgery	Ilium BBs	Fragment BBs	All BBs
1 Left	2.5 ± 0.4	3.2 ± 0.2	2.9 ± 0.5
1 Right	2.1 ± 0.2	2.7 ± 0.5	2.4 ± 0.5
2 Left	1.6 ± 0.3	1.4 ± 0.2	1.5 ± 0.3
2 Right	1.3 ± 0.5	1.1 ± 0.1	1.2 ± 0.3
3 Left	1.3 ± 0.3	1.1 ± 0.4	1.2 ± 0.3
3 Right	1.4 ± 0.2	1.6 ± 0.2	1.5 ± 0.2

Thumbnails of each fluoroscopy image used for BB reconstruction during the cadaver experiments are found in supplementary Figures S-7, S-8, and S-9.

3.2. Intraoperative pose estimation

On average, 99.6% of the maximum number of ilium poses are pruned by the P3P solver step. Using anatomical constraints, the remaining poses are pruned by an average of 95.9%. After this pruning, an average of 57 candidate ilium poses were used for the exhaustive image similarity searches. The maximum number of fragment poses were pruned by an average of 97.3% using the P3P solver, and anatomical constraints pruned 81.2% of the remaining poses on average. In order to determine the final fragment BB correspondences, an average of 20 fragment poses remained to be searched after pruning. A summary of the maximum number of ilium and fragment poses considered, and the actual poses considered due to pruning, is shown in supplementary table S-3.

Pose estimation was successfully performed on 18 total views (3 per surgery). The distributions of fragment rotation, LCE angle, and translation errors is shown in figure 7. Errors in rotation were below 3^° for 12 of the 18 cases, with a mean of 2.4^°. When the rotation errors were decomposed about anatomical axes, only rotation about the IS axis had errors greater than 3^°. In terms of both mean and standard deviation, rotation measurements about the AP axis were the most accurate, followed by LR, and then IS. The maximum 3D LCE angle error was 1.8^° and the mean was 1.0^°. The mean translation error was 2.1 mm, and was less than 3 mm for 15 of the 18 estimates. Mean translation errors about the anatomical axes were all within 0.2 mm of each other, and the maximum difference between standard deviations was 0.3 mm. An entire listing of errors for each pose estimate is shown in supplementary table S-4.

Figure 7. Refer to the following caption and surrounding text. — **Figure 7.** A visualization of the distributions of errors associated with each fragment pose estimate. Rotation and lateral center edge (LCE) angle errors are shown in (a), and translation errors are shown in (b). For rotation and translation errors, the magnitude is listed along with the errors about the left-right (LR), inferior-superior (IS), and anterior-posterior (AP) anatomical axes. Each circle represents the error of a pose estimate, the white circle indicates the median error, the vertical bar indicates the [25, 75] percentile range, and a kernel density function is fit around the samples.
Download figure:
Standard image High-resolution image

All four ilium BBs were matched in 6 of the 18 cases and all four fragment BBs were matched in 16 of the 18 cases. The mean rotation, translation, and LCE angle errors for estimates with 4 ilium BBs matched were 1.7^°, 2.1 mm, and 1.0^°, respectively. With less than 4 ilium BBs matched, the mean errors were 2.8^°, 2.0 mm, and 1.1^°, respectively. With 4 fragment BBs matched, the mean rotation, translation, and LCE angle errors were 2.3^°, 2.1 mm, and 1.0^°, respectively. The mean errors were 3.1^°, 1.5 mm, and 1.6^°, when less than 4 fragment BBs matched. Supplementary table S-5 includes a full summary of the number of BB detections and matches in each image.

In the third view for the left side of specimen 1, one ilium BB was outside the image bounds and not detected. On the left side of specimen 2, one of the ilium BBs was occluded by K-wire in each view and therefore not detected. Analysis of the postoperative CT revealed that this BB was actually dislodged by either: performance of the ilium osteotomy or insertion of the K-wire. The missed ilium detections in views 1 and 2 on the right side of specimen 2, were occluded by screws. Occlusion by K-wire also caused the missed ilium detection in view 2 on the right side of specimen 3. However, according to the postoperative CT this ilium BB was also displaced from the bone. The missed fragment BB detections were caused by K-wire occlusion. There were no missed detections of ipsilateral BBs that were present in the scene and not occluded by screws or K-wires.

False detections corresponding to contralateral BBs were observed in 8 of the projections, with an average of 5 detections over these projections. An average of 4 false BB detections were caused by the screws visible in the 6 projections used for specimen 1. No other false BB detections were reported.

The mean computation time for the single-view pose estimation was 0.7 ± 0.2 seconds, and was measured using the same hardware used to record reconstruction times. Supplementary table S-6 provides a detailed breakdown of these runtimes.

Thumbnails of each fluoroscopy image used for fragment pose estimation during the cadaver experiments are found in supplementary figures S-10, S-11, and S-12.

4. Discussion

Although one third of fragment pose estimates had rotation errors larger than 3^°, LCE angle errors were well below the 3^° success criteria identified in (Grupp et al 2020a). This indicates that the proposed method is able to quantify the amount of lateral femoral head coverage, resulting from an intraoperatively relocated acetabulum, within clinically acceptable error thresholds. The 3^° LCE angle error threshold was chosen so that clinicians can target a correction with a 35^° LCE angle and have confidence that the actual LCE lies in a 30^°–40^° range, which is indicative of dysplasia correction and associated with lower incidents of failure (Hartig-Andreasen et al 2012).

Given the automatic nature of the method and the relatively quick runtime, it should be feasible for clinicians to smoothly move between making pose adjustments to the fragment, taking fluoroscopic shots, and receiving feedback regarding the current pose estimate.

The mean rotation error when less than the full number of BBs were matched in either constellation, was 1.1^° greater than the mean rotation error over the cases matching full BB constellations. However, mean translation and mean LCE angle error were less effected by unmatched ilium BBs. When only 3 fragment BBs were matched, the mean LCE angle error was 0.6^° larger than the mean LCE angle error associated with all fragment BBs matched. Therefore, the number of matched BBs in each constellation may be used to convey confidences in the estimated poses. When less than 4 fragment BBs are matched, confidence in any rotation and LCE angle would be lowered. For cases when all 4 fragment BBs were matched, but less than 4 ilium BBs were matched, confidence in LCE angle would remain unaffected, however confidence in general rotation would be reduced.

Highlighting the robustness of the method, all LCE errors remained below the 3^° error threshold, even when BBs were missing from the view or occluded. The method was also robust against BBs which were displaced from the pelvis, but remained in the field of view and detected. This was demonstrated on the right side of specimen 3, where an ilium BB had been dislodged from the bone and was detected in views 1 and 2. Since the P3P solver does not return solutions for non-rigid transformations of the constellations, the displaced BB was not matched, despite the detection of all four ilium BBs in these views.

At least three BBs from each constellation must be detected in the fluoroscopy images for pose estimation to succeed. If fewer than three BBs were automatically detected, the system could be operated using locations manually annotated by a human operator. Although this would eliminate the possibility of real-time pose estimation, the system could still be used to verify fragment poses after a sequence of manually guided adjustments. Since complexity of pose estimation processing is dependent on the number of BBs detected, a large number of false-positive detections may also preclude real-time application of the proposed approach. These exceptional cases were not encountered during the experiments of this paper.

Average BB reconstruction errors for specimen 1 were greater than those of specimens 2 and 3. This was most likely caused by larger 2D BB localization errors for specimen 1. This is to be expected, since the BBs used for specimen 1 were larger than those used for specimens 2 and 3. Only one of the six LCE errors for specimen 1 was greater than 1^°, indicating that the proposed method is not dependent on a single size of BBs.

As part of future work, we plan to also incorporate additional biomechanical indicators such as the anterior center edge angle and the acetabular index. These measurements will enable a more complete intraoperative indication of femoral head coverage, which could potentially improve surgical outcomes.

The performance of the method compares favorably to the fiducial-free method (FFM) proposed in (Grupp et al 2020a). When a postoperatively segmented fragment shape was retrospectively used for pose estimation, the FFM was reported to have mean rotation error of 2.2^°, mean translation error of 2.2 mm, and mean LCE error of 1.1^°. The mean rotation error of the proposed method is only slightly larger than the FFM mean rotation error, while translation and LCE angle errors of the proposed method are slightly smaller. When the FFM uses an intraoperatively refined version of a preoperatively planned fragment shape, the mean rotation, translation, and LCE angle errors increase to 3.5^°, 2.5 mm, and 1.8^°, respectively. Considering that an accurate segmentation of the fragment shape is not available intraoperatively, the method proposed in this paper provides a more accurate assessment of fragment pose and femoral head coverage than the FFM.

In the current state of practice, preoperative CT data cannot be effectively used for intraoperative assessment of anatomical angles. As a result, contemporary preoperative imaging usually consists solely of standing radiographs. Although the proposed method requires a preoperative CT of the patient to be collected, the patient specific CT may be replaced with a statistical atlas of pelvis anatomy (Otake et al 2015) in the future. In this approach, the patient’s anatomy would be reconstructed using a deformable 2D/3D registration between patient-specific 2D x-ray images and the atlas (Sadowsky et al 2007, Hurvitz and Joskowicz 2008, Zheng 2010, Kang et al 2016). A precise cartilage model is required for a comprehensive biomechanical analysis, including estimates of the joint contact pressure (Armiger et al 2009). Since a statistical atlas may not be capable of satisfactorily reconstructing the cartilage model, partial CT of the patient’s acetabulum may be used to augment the statistical model (Chintalapani et al 2010, Grupp et al 2016). Furthermore, the process of collecting standing radiographs and a partial CT scan of the pelvis could potentially deliver a much smaller effective radiation dose than a complete CT scan of the pelvis.

The only manual portion of the intraoperative pipeline is the annotation of anatomical landmarks during BB reconstruction. Recent advances in fluoroscopic deep learning enable these landmarks to be localized automatically (Bier et al 2019), and incorporated into a completely automatic intraoperative pipeline, further reducing the impact on existing surgical workflows (Grupp et al 2020b). It is possible that an extension of these approaches for 2D BB localization could be faster, and more robust to other metallic objects, than the current radial symmetry approach. The reconstruction and pose estimation processing pipelines do not rely on specific details of the 2D BB detection method, and improved BB detection algorithms may be substituted as necessary.

Although the registration framework leveraged from (Grupp et al 2020a) is a highly optimized C++ library with OpenCL GPU acceleration, the pruning algorithms described in this paper were implemented as serial C++ routines. We believe faster fragment pose estimation times should be possible, since the candidate poses and correspondences evaluated in each pruning phase are not dependent on one another, and the computations may be done in parallel. For the current number of BBs proposed in this paper (8), pre-osteotomy BB reconstruction runtimes are dominated by pelvis registration times. Any efforts to speed up reconstructions of a similar number of BBs should therefore focus on improving the 2D/3D registration pipeline. However, the ‘brute force’ approach used to construct all possible inter-view correspondences should also be improved if the number of BBs or point fiducials were to substantially increase. Some efficiency could be gained by using epipolar constraints to prune implausible correspondences prior to the initial triangulation step.

Although optical tracking systems theoretically enable the continuous and real-time tracking of objects, this is not actually achieved by the optical tracker-based systems which require the repeated digitization of points lying on the fragment (Murphy et al 2015, Murphy et al 2016, Takao et al 2017, De Raedt et al 2018). This digitization process is not real-time in nature and potentially slower than collecting another fluoroscopic image followed by the pose estimation computation proposed in this paper.

Since surgical equipment could potentially dislodge implanted fiducial objects during the acts of chiseling or drilling, we believe that intraoperative tool tracking is an essential component of future work. A navigational capability for the osteotomes and drill should allow surgeons to avoid collisions with fiducial objects. Real-time pose estimates of tools could be obtained by augmenting patterns of BBs to each object, collecting fluoroscopic views containing both tool and pelvis BBs, and applying methods similar to those proposed in this paper.

The risks associated with fiducial dislodgment may be further reduced by not using BBs and instead affixing temporary structures to the bone during surgery and removing the structures once the fragment is satisfactorily fixed in place. Sufficiently large structures should be chosen so that the object may be easily removed if it were to become dislodged.

For example, two deformable metallic grids of wires could be impressed on the bone surfaces of the fragment and ilium. Wire intersection points would be treated as point fiducials and sharp feet on the underside would facilitate insertion into bone. Each grid could be pressed against the surface of the pelvis, temporarily attached, and removed at the conclusion of the surgery. If an osteotome or K-wire were to come into contact with the grid during chiseling or drilling, the grid would most likely deform and become partially detached from the pelvis. However, removal of this larger grid should be significantly easier and lower risk, compared to the removal of a small, loose, BB.

Sets of percutaneously inserted wires with radiographic markers located at the tips of each wire could also serve as temporarily affixed fiducial objects. Prior to osteotomy, sets of three or four wires would be percutaneously inserted into both the ilium and fragment bone regions. As each wire tip is expected to remain fixed throughout the surgery, the radiographic markers could be used as point fiducials.

For either of these proposed fiducial objects, future work will include the development of a new 2D point fiducial detection method, which would replace the radial symmetry algorithm used in this paper. However, all other components of the reconstruction and pose estimation pipelines would remain unchanged, as they only rely on the list of point fiducial locations produced by the radial symmetry implementation.

Assuming a sufficient navigational capability to avoid intraoperative BB dislodgment, long-term BB implantation should be clinically viable. This is indicated by several long-term studies which rely on RSA beads implanted into the pelvis in order to measure the stability of acetabular THA implants (Nieuwenhuijse et al 2012, Klerken et al 2015, Otten et al 2016). To the best of our knowledge no complications have arisen when RSA beads have become detached from the bone (Lawrie et al 2003) and are not at risk of moving into the joint space (Shah et al 2018). Since the methods described in this paper are not currently intended for long-term analysis of the joint, it may also be feasible to use biodegradable implanted fiducial markers in place of standard tantalum beads (Stallmann et al 2005).

When the proposed method is used alongside workflows which already rely on fluoroscopy for validating the fragment’s pose, no substantial additional radiation dose is expected to be delivered to patients. The three additional views required for 3D reconstruction of the implanted BBs do not represent a particular risk and will certainly impart a smaller dose than a CBCT collection. Since the proposed method is capable of reporting fragment pose estimates for a wide range of view geometries, the traditional process of collecting many fluoroscopic views needed to obtain an appropriate view for human interpretation should be eliminated. Therefore, we anticipate that the system could actually reduce the number of fluoroscopic views used for checking the fragment pose. The radiation dose delivered to surgeons and operating team members when using fluoroscopy during PAO has been found to be safe so long as proper protective equipment was used (Mechlenburg et al 2009). Further studies, under realistic clinical conditions, need to be performed in order to accurately determine the expected radiation doses imparted by the proposed system.

The proposed pose estimation workflow may also be used for cadaveric PAO training. Pose estimates provided by the system could act as feedback for the mental estimates of the surgeon. In this way, the system may improve surgeons’ association of tactile sensing and fluoroscopic interpretation with a fragment’s true pose.

As none of the cadaveric specimens exhibited hip dysplasia, no postoperative assessment of clinical adequacy was conducted. The primary focus of these experiments was to determine the feasibility, accuracy and runtime of the proposed pose estimation algorithm and workflow. Cadaveric specimens with uncorrected hip dysplasia are not readily available as a result of the advanced ages typically associated with the specimens. Therefore, we believe that further studies on dysplastic hips could potentially be the focus of a clinical trial.

5. Conclusion

This paper has proposed a new method for pose estimation of acetabular fragments using fluoroscopy and two constellations of intraoperatively implanted BBs. Cadaveric studies have shown that the method is able to provide clinically accurate estimates of the LCE angle, a well-established indicator of femoral head coverage. Once the BB constellations have been reconstructed in 3D, all fragment poses are calculated automatically using a single-view, and in sub-second runtime. No other surgical equipment beyond a flat panel C-arm and BB injector is required. The C-arm does not need to be calibrated, encoded, or motorized. Unlike other fluoroscopic approaches, accurate knowledge of the bone fragment’s shape is not necessary. For these reasons, the proposed method provides minimal deviation from the standard surgical workflow, and should be easily mastered by clinicians already performing RSA.

Acknowledgment

We are grateful for the assistance of Mr Demetries Boston during the cadaveric surgeries. This research was supported by NIH/NIBIB grants R01EB006 839, R21EB020 113, MEXT/JSPS KAKENHI 26 108 004, Johns Hopkins University Internal Funds, and a Johns Hopkins University Applied Physics Laboratory Graduate Student Fellowship. Part of this research project was conducted using computational resources at the Maryland Advanced Research Computing Center (MARCC). Finally, we would also like to thank the anonymous reviewers for their insightful comments which helped to improve the quality of this manuscript.

Supplementary material

This submission includes a supplementary document. An additional video with detailed descriptions of the methods and example results is available online at: https://youtu.be/0E0U9G81q8g .

Fast and automatic periacetabular osteotomy fragment pose estimation using intraoperatively implanted fiducials and single-view fluoroscopy

Article metrics

Permissions

Share this article

Dates

Abstract

Supplementary data