Auto-Detection of Safety Issues in Baby Products
Authors:
Graham Bleaney,
Matthew Kuzyk,
Julian Man,
Hossein Mayanloo,
H. R. Tizhoosh
Abstract:
Every year, thousands of people receive consumer product related injuries. Research indicates that online customer reviews can be processed to autonomously identify product safety issues. Early identification of safety issues can lead to earlier recalls, and thus fewer injuries and deaths. A dataset of product reviews from Amazon.com was compiled, along with \emph{SaferProducts.gov} complaints and…
▽ More
Every year, thousands of people receive consumer product related injuries. Research indicates that online customer reviews can be processed to autonomously identify product safety issues. Early identification of safety issues can lead to earlier recalls, and thus fewer injuries and deaths. A dataset of product reviews from Amazon.com was compiled, along with \emph{SaferProducts.gov} complaints and recall descriptions from the Consumer Product Safety Commission (CPSC) and European Commission Rapid Alert system. A system was built to clean the collected text and to extract relevant features. Dimensionality reduction was performed by computing feature relevance through a Random Forest and discarding features with low information gain. Various classifiers were analyzed, including Logistic Regression, SVMs, Na{ï}ve-Bayes, Random Forests, and an Ensemble classifier. Experimentation with various features and classifier combinations resulted in a logistic regression model with 66\% precision in the top 50 reviews surfaced. This classifier outperforms all benchmarks set by related literature and consumer product safety professionals.
△ Less
Submitted 21 July, 2018; v1 submitted 27 April, 2018;
originally announced May 2018.
Inverse modeling of hydrologic systems with adaptive multi-fidelity Markov chain Monte Carlo simulations
Authors:
Jiangjiang Zhang,
Jun Man,
Guang Lin,
Laosheng Wu,
Lingzao Zeng
Abstract:
Markov chain Monte Carlo (MCMC) simulation methods are widely used to assess parametric uncertainties of hydrologic models conditioned on measurements of observable state variables. However, when the model is CPU-intensive and high-dimensional, the computational cost of MCMC simulation will be prohibitive. In this situation, a CPU-efficient while less accurate low-fidelity model (e.g., a numerical…
▽ More
Markov chain Monte Carlo (MCMC) simulation methods are widely used to assess parametric uncertainties of hydrologic models conditioned on measurements of observable state variables. However, when the model is CPU-intensive and high-dimensional, the computational cost of MCMC simulation will be prohibitive. In this situation, a CPU-efficient while less accurate low-fidelity model (e.g., a numerical model with a coarser discretization, or a data-driven surrogate) is usually adopted. Nowadays, multi-fidelity simulation methods that can take advantage of both the efficiency of the low-fidelity model and the accuracy of the high-fidelity model are gaining popularity. In the MCMC simulation, as the posterior distribution of the unknown model parameters is the region of interest, it is wise to distribute most of the computational budget (i.e., the high-fidelity model evaluations) therein. Based on this idea, in this paper we propose an adaptive multi-fidelity MCMC algorithm for efficient inverse modeling of hydrologic systems. In this method, we evaluate the high-fidelity model mainly in the posterior region through iteratively running MCMC based on a Gaussian process (GP) system that is adaptively constructed with multi-fidelity simulation. The error of the GP system is rigorously considered in the MCMC simulation and gradually reduced to a negligible level in the posterior region. Thus, the proposed method can obtain an accurate estimate of the posterior distribution with a small number of the high-fidelity model evaluations. The performance of the proposed method is demonstrated by three numerical case studies in inverse modeling of hydrologic systems.
△ Less
Submitted 14 June, 2018; v1 submitted 6 December, 2017;
originally announced December 2017.