-
Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings
Authors:
Rajeev V. Rikhye,
Aaron Loh,
Grace Eunhae Hong,
Preeti Singh,
Margaret Ann Smith,
Vijaytha Muralidharan,
Doris Wong,
Rory Sayres,
Michelle Phung,
Nicolas Betancourt,
Bradley Fong,
Rachna Sahasrabudhe,
Khoban Nasim,
Alec Eschholz,
Basil Mustafa,
Jan Freyberg,
Terry Spitz,
Yossi Matias,
Greg S. Corrado,
Katherine Chou,
Dale R. Webster,
Peggy Bui,
Yuan Liu,
Yun Liu,
Justin Ko,
et al. (1 additional author not shown)
Abstract:
Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generalizable AI that can aid in the diagnosis of skin conditions across a variety of clinical settings. In this retrospective study, we demonstrate that differences in skin condition distribution, rather than in demographics or image capture mode, are the main source of errors when an AI algorithm is evaluated on data from a previously unseen source. We demonstrate a series of steps to close this generalization gap, requiring progressively more information about the new source, ranging from the condition distribution to training data enriched for data less frequently seen during training. Our results also suggest comparable performance from end-to-end fine-tuning versus fine-tuning solely the classification layer on top of a frozen embedding model. Our approach can inform the adaptation of AI algorithms to new settings, based on the information and resources available.
Submitted 23 February, 2024;
originally announced February 2024.
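The abstract above does not spell out how the condition distribution of a new source is used. One standard way to adjust a classifier for a known shift in class priors is to re-weight each predicted probability by the ratio of the new prior to the training prior and then renormalize. The sketch below illustrates that general technique only; it is not claimed to be the authors' procedure, and the function name and all numbers are hypothetical.

```python
def adjust_for_prior_shift(probs, train_prior, target_prior):
    """Re-weight class probabilities for a new condition distribution.

    probs: per-class probabilities from a model trained under train_prior.
    Returns values proportional to probs[k] * target_prior[k] / train_prior[k],
    renormalized to sum to 1. Assumes every training prior is non-zero.
    """
    if not (len(probs) == len(train_prior) == len(target_prior)):
        raise ValueError("all inputs must have one entry per class")
    if any(s <= 0 for s in train_prior):
        raise ValueError("training priors must be strictly positive")
    weighted = [p * t / s for p, s, t in zip(probs, train_prior, target_prior)]
    total = sum(weighted)
    if total == 0:
        raise ValueError("adjusted probabilities sum to zero")
    return [w / total for w in weighted]


# Hypothetical three-condition example (illustrative numbers, not from the paper).
probs = [0.5, 0.3, 0.2]          # model output for one case
train_prior = [0.6, 0.3, 0.1]    # condition frequencies in the training source
target_prior = [0.2, 0.3, 0.5]   # condition frequencies in the new source
adjusted = adjust_for_prior_shift(probs, train_prior, target_prior)
print(adjusted)
```

In this example the third condition is rare in training but common in the new source, so its adjusted probability becomes the largest. Passing the training prior as the target prior leaves the probabilities unchanged.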
-
UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering
Authors:
Triet M. Thai,
Anh T. Vo,
Hao K. Tieu,
Linh N. P. Bui,
Thien T. B. Nguyen
Abstract:
In recent years, artificial intelligence has played an important role in medicine and disease diagnosis, with many applications, one of which is Medical Visual Question Answering (MedVQA). By combining computer vision and natural language processing, MedVQA systems can assist experts in extracting relevant information from medical images based on a given question and providing precise diagnostic answers. The ImageCLEFmed-MEDVQA-GI-2023 challenge carried out a visual question answering task in the gastrointestinal domain, which includes gastroscopy and colonoscopy images. Our team approached Task 1 of the challenge by proposing a multimodal learning method with image enhancement to improve the VQA performance on gastrointestinal images. The multimodal architecture is set up with a BERT encoder and different pre-trained vision models based on convolutional neural network (CNN) and Transformer architectures for feature extraction from the question and the endoscopy image. The results of this study highlight the dominance of Transformer-based vision models over the CNNs and demonstrate the effectiveness of the image enhancement process, with six out of the eight vision models achieving a better F1-Score. Our best method, which takes advantage of BERT+BEiT fusion and image enhancement, achieves up to 87.25% accuracy and 91.85% F1-Score on the development test set, while also producing good results on the private test set with an accuracy of 82.01%.
Submitted 19 November, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Goal-Oriented Communications in Federated Learning via Feedback on Risk-Averse Participation
Authors:
Shashi Raj Pandey,
Van Phuc Bui,
Petar Popovski
Abstract:
We treat the problem of client selection in a Federated Learning (FL) setup, where the learning objective and the local incentives of the participants are used to formulate a goal-oriented communication problem. Specifically, we incorporate the risk-averse nature of participants and obtain a communication-efficient on-device performance, while relying on feedback from the Parameter Server (PS). A client has to decide its transmission plan on when not to participate in FL. This is based on its intrinsic incentive, which is the value of the trained global model upon participation by this client. Poor updates not only plunge the performance of the global model with added communication cost but also propagate the loss in performance on other participating devices. We cast the relevance of local updates as semantic information for developing local transmission strategies, i.e., making a decision on when to "not transmit". The devices use feedback about the state of the PS and evaluate their contributions in training the learning model in each aggregation period, which eventually lowers the number of occupied connections. Simulation results validate the efficacy of our proposed approach, with up to 1.4× gain in communication links utilization as compared with the baselines.
Submitted 19 May, 2023;
originally announced May 2023.
-
Joint Beam Placement and Load Balancing Optimization for Non-Geostationary Satellite Systems
Authors:
Van Phuc Bui,
Trinh Van Chien,
Eva Lagunas,
Joël Grotz,
Symeon Chatzinotas,
Björn Ottersten
Abstract:
Non-geostationary (Non-GSO) satellite constellations have emerged as a promising solution to enable ubiquitous high-speed low-latency broadband services by generating multiple spot-beams placed on the ground according to the user locations. However, there is an inherent trade-off between the number of active beams and the complexity of generating a large number of beams. This paper formulates and solves a joint beam placement and load balancing problem to carefully optimize the satellite beam and enhance the link budgets with a minimal number of active beams. We propose a two-stage algorithm design to overcome the combinatorial structure of the considered optimization problem providing a solution in polynomial time. The first stage minimizes the number of active beams, while the second stage performs a load balancing to distribute users in the coverage area of the active beams. Numerical results confirm the benefits of the proposed methodology both in carrier-to-noise ratio and multiplexed users per beam over other benchmarks.
Submitted 29 July, 2022;
originally announced July 2022.
-
Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions
Authors:
Abhijit Guha Roy,
Jie Ren,
Shekoofeh Azizi,
Aaron Loh,
Vivek Natarajan,
Basil Mustafa,
Nick Pawlowski,
Jan Freyberg,
Yuan Liu,
Zach Beaver,
Nam Vo,
Peggy Bui,
Samantha Winter,
Patricia MacWilliams,
Greg S. Corrado,
Umesh Telang,
Yun Liu,
Taylan Cemgil,
Alan Karthikesalingam,
Balaji Lakshminarayanan,
Jim Winkens
Abstract:
We develop and rigorously evaluate a deep learning based system that can accurately classify skin conditions while detecting rare conditions for which there is not enough data available for training a confident classifier. We frame this task as an out-of-distribution (OOD) detection problem. Our novel approach, hierarchical outlier detection (HOD) assigns multiple abstention classes for each training outlier class and jointly performs a coarse classification of inliers vs. outliers, along with fine-grained classification of the individual classes. We demonstrate the effectiveness of the HOD loss in conjunction with modern representation learning approaches (BiT, SimCLR, MICLe) and explore different ensembling strategies for further improving the results. We perform an extensive subgroup analysis over conditions of varying risk levels and different skin types to investigate how the OOD detection performance changes over each subgroup and demonstrate the gains of our framework in comparison to baselines. Finally, we introduce a cost metric to approximate downstream clinical impact. We use this cost metric to compare the proposed method against a baseline system, thereby making a stronger case for the overall system effectiveness in a real-world deployment scenario.
Submitted 8 April, 2021;
originally announced April 2021.
-
Supervised Transfer Learning at Scale for Medical Imaging
Authors:
Basil Mustafa,
Aaron Loh,
Jan Freyberg,
Patricia MacWilliams,
Megan Wilson,
Scott Mayer McKinney,
Marcin Sieniek,
Jim Winkens,
Yuan Liu,
Peggy Bui,
Shruthi Prabhakara,
Umesh Telang,
Alan Karthikesalingam,
Neil Houlsby,
Vivek Natarajan
Abstract:
Transfer learning is a standard technique to improve performance on tasks with limited data. However, for medical imaging, the value of transfer learning is less clear. This is likely due to the large domain mismatch between the usual natural-image pre-training (e.g. ImageNet) and medical images. However, recent advances in transfer learning have shown substantial improvements from scale. We investigate whether modern methods can change the fortune of transfer learning for medical imaging. For this, we study the class of large-scale pre-trained networks presented by Kolesnikov et al. on three diverse imaging tasks: chest radiography, mammography, and dermatology. We study both transfer performance and critical properties for the deployment in the medical domain, including: out-of-distribution generalization, data-efficiency, sub-group fairness, and uncertainty estimation. Interestingly, we find that for some of these properties transfer from natural to medical images is indeed extremely effective, but only when performed at sufficient scale.
Submitted 21 January, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
A deep learning system for differential diagnosis of skin diseases
Authors:
Yuan Liu,
Ayush Jain,
Clara Eng,
David H. Way,
Kang Lee,
Peggy Bui,
Kimberly Kanada,
Guilherme de Oliveira Marinho,
Jessica Gallegos,
Sara Gabriele,
Vishakha Gupta,
Nalini Singh,
Vivek Natarajan,
Rainer Hofmann-Wellenhof,
Greg S. Corrado,
Lily H. Peng,
Dale R. Webster,
Dennis Ai,
Susan Huang,
Yun Liu,
R. Carter Dunn,
David Coz
Abstract:
Skin conditions affect an estimated 1.9 billion people worldwide. A shortage of dermatologists causes long wait times and leads patients to seek dermatologic care from general practitioners. However, the diagnostic accuracy of general practitioners has been reported to be only 0.24-0.70 (compared to 0.77-0.96 for dermatologists), resulting in referral errors, delays in care, and errors in diagnosis and treatment. In this paper, we developed a deep learning system (DLS) to provide a differential diagnosis of skin conditions for clinical cases (skin photographs and associated medical histories). The DLS distinguishes between 26 skin conditions that represent roughly 80% of the volume of skin conditions seen in primary care. The DLS was developed and validated using de-identified cases from a teledermatology practice serving 17 clinical sites via a temporal split: the first 14,021 cases for development and the last 3,756 cases for validation. On the validation set, where a panel of three board-certified dermatologists defined the reference standard for every case, the DLS achieved 0.71 and 0.93 top-1 and top-3 accuracies respectively. For a random subset of the validation set (n=963 cases), 18 clinicians reviewed the cases for comparison. On this subset, the DLS achieved a 0.67 top-1 accuracy, non-inferior to board-certified dermatologists (0.63, p<0.001), and higher than primary care physicians (PCPs, 0.45) and nurse practitioners (NPs, 0.41). The top-3 accuracy showed a similar trend: 0.90 DLS, 0.75 dermatologists, 0.60 PCPs, and 0.55 NPs. These results highlight the potential of the DLS to augment general practitioners to accurately diagnose skin conditions by suggesting differential diagnoses that may not have been considered. Future work will be needed to prospectively assess the clinical impact of using this tool in actual clinical workflows.
Submitted 11 September, 2019;
originally announced September 2019.
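The top-1 and top-3 accuracies quoted in the abstract above are instances of top-k accuracy: the fraction of cases whose reference diagnosis appears among the model's k highest-ranked predictions. The sketch below shows that calculation under a simplifying assumption of one reference label per case; how the study scores cases against its dermatologist-panel reference standard is not described in the abstract. The function name, condition names, and cases are hypothetical.

```python
def top_k_accuracy(ranked_predictions, references, k):
    """Fraction of cases whose reference label is among the first k predictions.

    ranked_predictions: one list per case, ordered from most to least likely.
    references: one reference label per case (a simplifying assumption).
    """
    if len(ranked_predictions) != len(references):
        raise ValueError("need exactly one reference label per case")
    if not references:
        raise ValueError("at least one case is required")
    hits = sum(
        1
        for ranked, reference in zip(ranked_predictions, references)
        if reference in ranked[:k]
    )
    return hits / len(references)


# Four hypothetical cases with made-up rankings and reference labels.
ranked = [
    ["eczema", "psoriasis", "tinea"],
    ["acne", "rosacea", "eczema"],
    ["psoriasis", "eczema", "tinea"],
    ["tinea", "acne", "eczema"],
]
references = ["eczema", "rosacea", "tinea", "psoriasis"]

print(top_k_accuracy(ranked, references, 1))  # 0.25
print(top_k_accuracy(ranked, references, 3))  # 0.75
```

Top-3 accuracy can never be lower than top-1 accuracy on the same cases, which matches the pattern of the figures reported for the DLS and for each clinician group.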
-
Quantifying discretization errors for soft-tissue simulation in computer assisted surgery: a preliminary study
Authors:
Michel Duprez,
Stéphane P. A. Bordas,
Marek Bucki,
Huu Phuoc Bui,
Franz Chouly,
Vanessa Lleras,
Claudio Lobos,
Alexei Lozinski,
Pierre-Yves Rohan,
Satyendra Tomar
Abstract:
Errors in biomechanics simulations arise from modeling and discretization. Modeling errors are due to the choice of the mathematical model whilst discretization errors measure the impact of the choice of the numerical method on the accuracy of the approximated solution to this specific mathematical model. A major source of discretization errors is mesh generation from medical images, which remains one of the major bottlenecks in the development of reliable, accurate, automatic and efficient personalized, clinically-relevant Finite Element (FE) models in biomechanics. The impact of mesh quality and density on the accuracy of the FE solution can be quantified with a posteriori error estimates. Yet, to our knowledge, the relevance of such error estimates for practical biomechanics problems has seldom been addressed, see [25]. In this contribution, we propose an implementation of some a posteriori error estimates to quantify the discretization errors and to optimize the mesh. More precisely, we focus on error estimation for a user-defined quantity of interest with the Dual Weighted Residual (DWR) technique. We test its applicability and relevance in two situations, corresponding to computations for a tongue and an artery, using a simplified setting, i.e., plane linearized elasticity with contractility of the soft-tissue modeled as a pre-stress. Our results demonstrate the feasibility of such methodology to estimate the actual solution errors and to reduce them economically through mesh refinement.
Submitted 18 June, 2018;
originally announced June 2018.
-
Corotational Cut Finite Element Method for real-time surgical simulation: application to needle insertion simulation
Authors:
Huu Phuoc Bui,
Satyendra Tomar,
Stéphane P. A. Bordas
Abstract:
This paper describes the use of the corotational cut Finite Element Method (FEM) for real-time surgical simulation. Users only need to provide a background mesh, which does not necessarily conform to the boundaries/interfaces of the simulated object. The details of the surface, which can be directly obtained from binary images, are taken into account by a multilevel embedding algorithm applied to elements of the background mesh that are cut by the surface. Boundary conditions can be implicitly imposed on the surface using Lagrange multipliers. The implementation is verified by convergence studies with optimal rates. The algorithm is applied to various needle insertion simulations (e.g. for biopsy or brachytherapy) into brain and liver to verify the reliability of the method, and numerical results show that the present method can make the discretisation independent of the geometric description, and can avoid the complexity of mesh generation for complex geometries while retaining the accuracy of the standard FEM. The proposed approach is well suited to real-time and patient-specific simulations, as it improves the simulation accuracy by automatically and properly taking into account the simulated geometry.
Submitted 8 December, 2017;
originally announced December 2017.
-
Controlling the Error on Target Motion through Real-time Mesh Adaptation: Applications to Deep Brain Stimulation
Authors:
Huu Phuoc Bui,
Satyendra Tomar,
Hadrien Courtecuisse,
Michel Audette,
Stéphane Cotin,
Stéphane P. A. Bordas
Abstract:
We present an error-controlled mesh refinement procedure for needle insertion simulation and apply it to the simulation of electrode implantation for deep brain stimulation, including brain shift. Our approach makes it possible to control the error in the computation of the displacement and stress fields around the needle tip and needle shaft by suitably refining the mesh, whilst maintaining a coarser mesh in other parts of the domain. We demonstrate through academic and practical examples that our approach increases the accuracy of the displacement and stress fields around the needle without increasing the computational expense. This enables real-time simulations. The proposed methodology has direct implications for increasing the accuracy and controlling the computational expense of simulating percutaneous procedures such as biopsy, brachytherapy, regional anesthesia, or cryotherapy, and can be essential to the development of robotic guidance.
Submitted 30 September, 2017; v1 submitted 25 April, 2017;
originally announced April 2017.
-
Studying the influence of inclusion characteristics on the characteristic length involved in quasi-brittle materials using the lattice element method
Authors:
Huu Phuoc Bui,
Vincent Richefeu,
Frédéric Dufour
Abstract:
Unlike nonlocal models, there is no need to introduce an internal length in the constitutive law for a lattice model at the mesoscopic scale. Actually, the internal length is not explicitly introduced but rather governed by the mesostructure characteristics themselves. The influence of the mesostructure on the width of the fracture process zone, which is assumed to be correlated to the characteristic length of the homogenized quasi-brittle material, is studied. The influence of the ligament size (a structural parameter) is also investigated. This analysis provides recommendations/warnings when extracting an internal length required for nonlocal damage models from the material mesostructure.
Submitted 19 November, 2016;
originally announced November 2016.
-
Real-time Error Control for Surgical Simulation
Authors:
Huu Phuoc Bui,
Satyendra Tomar,
Hadrien Courtecuisse,
Stéphane Cotin,
Stéphane Bordas
Abstract:
Objective: To present the first real-time a posteriori error-driven adaptive finite element approach for real-time simulation and to demonstrate the method on a needle insertion problem. Methods: We use corotational elasticity and a frictional needle/tissue interaction model. The problem is solved using finite elements within SOFA. The refinement strategy relies upon a hexahedron-based finite element method, combined with a posteriori error estimation driven local h-refinement, for simulating soft tissue deformation. Results: We control the local and global error level in the mechanical fields (e.g. displacement or stresses) during the simulation. We show the convergence of the algorithm on academic examples, and demonstrate its practical usability on a percutaneous procedure involving needle insertion in a liver. For the latter case, we compare the force-displacement curves obtained from the proposed adaptive algorithm with those obtained from a uniform refinement approach. Conclusions: Error control guarantees that a tolerable error level is not exceeded during the simulations. Local mesh refinement accelerates simulations. Significance: Our work provides a first step to discriminate between discretization error and modeling error by providing a robust quantification of discretization error during simulations.
Submitted 14 February, 2017; v1 submitted 8 October, 2016;
originally announced October 2016.