Showing 1–31 of 31 results for author: Stewart, M

  1. arXiv:2407.19110  [pdf, other]

    cs.AI

    GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves

    Authors: Denis Peskoff, Adam Visokay, Sander Schulhoff, Benjamin Wachspress, Alan Blinder, Brandon M. Stewart

    Abstract: Markets and policymakers around the world hang on the consequential monetary policy decisions made by the Federal Open Market Committee (FOMC). Publicly available textual documentation of their meetings provides insight into members' attitudes about the economy. We use GPT-4 to quantify dissent among members on the topic of inflation. We find that transcripts and minutes reflect the diversity of m…

    Submitted 26 July, 2024; originally announced July 2024.

  2. arXiv:2406.04643  [pdf, other]

    cs.CL

    More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play

    Authors: Wichayaporn Wongkamjan, Feng Gu, Yanze Wang, Ulf Hermjakob, Jonathan May, Brandon M. Stewart, Jonathan K. Kummerfeld, Denis Peskoff, Jordan Lee Boyd-Graber

    Abstract: The boardgame Diplomacy is a challenging setting for communicative and cooperative artificial intelligence. The most prominent communicative Diplomacy AI, Cicero, has excellent strategic abilities, exceeding human players. However, the best Diplomacy players master communication, not just tactics, which is why the game has received attention as an AI challenge. This work seeks to understand the de…

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.03932  [pdf, other]

    cs.AI cs.CL

    CleanGraph: Human-in-the-loop Knowledge Graph Refinement and Completion

    Authors: Tyler Bikaun, Michael Stewart, Wei Liu

    Abstract: This paper presents CleanGraph, an interactive web-based tool designed to facilitate the refinement and completion of knowledge graphs. Maintaining the reliability of knowledge graphs, which are grounded in high-quality and error-free facts, is crucial for real-world applications such as question-answering and information retrieval systems. These graphs are often automatically assembled from textu…

    Submitted 7 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  4. arXiv:2405.00892  [pdf, other]

    cs.CV cs.AI

    Wake Vision: A Large-scale, Diverse Dataset and Benchmark Suite for TinyML Person Detection

    Authors: Colby Banbury, Emil Njor, Matthew Stewart, Pete Warden, Manjunath Kudlur, Nat Jeffries, Xenofon Fafoutis, Vijay Janapa Reddi

    Abstract: Tiny machine learning (TinyML), which enables machine learning applications on extremely low-power devices, suffers from limited size and quality of relevant datasets. To address this issue, we introduce Wake Vision, a large-scale, diverse dataset tailored for person detection, the canonical task for TinyML visual sensing. Wake Vision comprises over 6 million images, representing a hundredfold inc…

    Submitted 6 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  5. arXiv:2402.11183  [pdf, other]

    cs.CY cs.HC

    Materiality and Risk in the Age of Pervasive AI Sensors

    Authors: Matthew Stewart, Emanuel Moss, Pete Warden, Brian Plancher, Susan Kennedy, Mona Sloane, Vijay Janapa Reddi

    Abstract: Artificial intelligence systems connected to sensor-laden devices are becoming pervasive, which has significant implications for a range of AI risks, including to privacy, the environment, autonomy, and more. There is therefore a growing need for increased accountability around the responsible development and deployment of these technologies. In this paper, we provide a comprehensive analysis of t…

    Submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2310.01080  [pdf, other]

    cs.DB

    Rel2Graph: Automated Mapping From Relational Databases to a Unified Property Knowledge Graph

    Authors: Ziyu Zhao, Wei Liu, Tim French, Michael Stewart

    Abstract: Although a few approaches are proposed to convert relational databases to graphs, there is a genuine lack of systematic evaluation across a wider spectrum of databases. Recognising the important issue of query mapping, this paper proposes an approach Rel2Graph, an automatic knowledge graph construction (KGC) approach from an arbitrary number of relational databases. Our approach also supports the…

    Submitted 26 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  7. arXiv:2309.09212  [pdf, other]

    cs.RO

    RobotPerf: An Open-Source, Vendor-Agnostic, Benchmarking Suite for Evaluating Robotics Computing System Performance

    Authors: Víctor Mayoral-Vilches, Jason Jabbour, Yu-Shun Hsiao, Zishen Wan, Martiño Crespo-Álvarez, Matthew Stewart, Juan Manuel Reina-Muñoz, Prateek Nagras, Gaurav Vikhe, Mohammad Bakhshalipour, Martin Pinzger, Stefan Rass, Smruti Panigrahi, Giulio Corradi, Niladri Roy, Phillip B. Gibbons, Sabrina M. Neuman, Brian Plancher, Vijay Janapa Reddi

    Abstract: We introduce RobotPerf, a vendor-agnostic benchmarking suite designed to evaluate robotics computing performance across a diverse range of hardware platforms using ROS 2 as its common baseline. The suite encompasses ROS 2 packages covering the full robotics pipeline and integrates two distinct benchmarking approaches: black-box testing, which measures performance by eliminating upper layers and re…

    Submitted 29 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  8. arXiv:2309.08181  [pdf, other]

    cs.CL

    Large Language Models for Failure Mode Classification: An Investigation

    Authors: Michael Stewart, Melinda Hodkiewicz, Sirui Li

    Abstract: In this paper we present the first investigation into the effectiveness of Large Language Models (LLMs) for Failure Mode Classification (FMC). FMC, the task of automatically labelling an observation with a corresponding failure mode code, is a critical task in the maintenance domain as it reduces the need for reliability engineers to spend their time manually analysing work orders. We detail our a…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 8 pages, 3 tables

  9. arXiv:2308.07832  [pdf, ps, other]

    cs.LG cs.AI stat.ME

    REFORMS: Reporting Standards for Machine Learning Based Science

    Authors: Sayash Kapoor, Emily Cantrell, Kenny Peng, Thanh Hien Pham, Christopher A. Bail, Odd Erik Gundersen, Jake M. Hofman, Jessica Hullman, Michael A. Lones, Momin M. Malik, Priyanka Nanayakkara, Russell A. Poldrack, Inioluwa Deborah Raji, Michael Roberts, Matthew J. Salganik, Marta Serra-Garcia, Brandon M. Stewart, Gilles Vandewiele, Arvind Narayanan

    Abstract: Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways acros…

    Submitted 19 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  10. arXiv:2306.08848  [pdf, other]

    cs.LG cs.CY cs.HC

    Datasheets for Machine Learning Sensors: Towards Transparency, Auditability, and Responsibility for Intelligent Sensing

    Authors: Matthew Stewart, Pete Warden, Yasmine Omri, Shvetank Prakash, Joao Santos, Shawn Hymel, Benjamin Brown, Jim MacArthur, Nat Jeffries, Sachin Katti, Brian Plancher, Vijay Janapa Reddi

    Abstract: Machine learning (ML) sensors are enabling intelligence at the edge by empowering end-users with greater control over their data. ML sensors offer a new paradigm for sensing that moves the processing and analysis to the device itself rather than relying on the cloud, bringing benefits like lower latency and greater data privacy. The rise of these intelligent edge devices, while revolutionizing are…

    Submitted 16 February, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  11. arXiv:2306.04746  [pdf, other]

    stat.ME cs.CL cs.LG stat.ML

    Using Imperfect Surrogates for Downstream Inference: Design-based Supervised Learning for Social Science Applications of Large Language Models

    Authors: Naoki Egami, Musashi Hinck, Brandon M. Stewart, Hanying Wei

    Abstract: In computational social science (CSS), researchers analyze documents to explain social and political phenomena. In most scenarios, CSS researchers first obtain labels for documents and then explain labels using interpretable regression analyses in the second step. One increasingly common way to annotate documents cheaply at scale is through large language models (LLMs). However, like other scalabl…

    Submitted 14 January, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  12. arXiv:2304.04640  [pdf, other]

    cs.AI

    NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

    Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Denis Kleyko, Noah Pacik-Nelson, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl, Brian Anderson, Andreas G. Andreou, Chiara Bartolozzi, Arindam Basu , et al. (73 additional authors not shown)

    Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu…

    Submitted 17 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Updated from whitepaper to full perspective article preprint

  13. arXiv:2301.11899  [pdf]

    cs.LG cs.AR cs.CY

    Is TinyML Sustainable? Assessing the Environmental Impacts of Machine Learning on Microcontrollers

    Authors: Shvetank Prakash, Matthew Stewart, Colby Banbury, Mark Mazumder, Pete Warden, Brian Plancher, Vijay Janapa Reddi

    Abstract: The sustained growth of carbon emissions and global waste elicits significant sustainability concerns for our environment's future. The growing Internet of Things (IoT) has the potential to exacerbate this issue. However, an emerging area known as Tiny Machine Learning (TinyML) has the opportunity to help address these environmental challenges through sustainable computing practices. TinyML, the d…

    Submitted 21 November, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Communications of the ACM (CACM) November 2023 Issue

  14. arXiv:2208.09082  [pdf, other]

    cs.HC

    Standing Balance Improvement Using Vibrotactile Feedback in Virtual Reality

    Authors: M. Rasel Mahmud, Michael Stewart, Alberto Cordova, John Quarles

    Abstract: Virtual Reality (VR) users often encounter postural instability, i.e., balance issues, which can be a significant impediment to universal usability and accessibility, particularly for those with balance impairments. Prior research has validated imbalance issues, but little effort has been made to mitigate them. We recruited 39 participants (with balance impairments: 18, without balance impairments…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: 10 pages, 7 figures. arXiv admin note: text overlap with arXiv:2202.04743

  15. arXiv:2208.08390  [pdf, other]

    cs.HC

    Auditory Feedback to Make Walking in Virtual Reality More Accessible

    Authors: M. Rasel Mahmud, Michael Stewart, Alberto Cordova, John Quarles

    Abstract: The objective of this study is to investigate the impact of several auditory feedback modalities on gait (i.e., walking patterns) in virtual reality (VR). Prior research has substantiated gait disturbances in VR users as one of the primary obstacles to VR usability. However, minimal research has been done to mitigate this issue. We recruited 39 participants (with mobility impairments: 18, without…

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: 10 pages, 6 figures

  16. arXiv:2208.02403  [pdf, other]

    cs.HC

    Vibrotactile Feedback to Make Real Walking in Virtual Reality More Accessible

    Authors: M. Rasel Mahmud, Michael Stewart, Alberto Cordova, John Quarles

    Abstract: This research aims to examine the effects of various vibrotactile feedback techniques on gait (i.e., walking patterns) in virtual reality (VR). Prior studies have demonstrated that gait disturbances in VR users are significant usability barriers. However, adequate research has not been performed to address this problem. In our study, 39 participants (with mobility impairments: 18, without mobility…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 13 pages, 7 figures

  17. arXiv:2207.11243  [pdf, other]

    cs.CV cs.GR

    Multiface: A Dataset for Neural Face Rendering

    Authors: Cheng-hsin Wuu, Ningyuan Zheng, Scott Ardisson, Rohan Bali, Danielle Belko, Eric Brockmeyer, Lucas Evans, Timothy Godisart, Hyowon Ha, Xuhua Huang, Alexander Hypes, Taylor Koska, Steven Krenn, Stephen Lombardi, Xiaomin Luo, Kevyn McPhail, Laura Millerschoen, Michal Perdoch, Mark Pitts, Alexander Richard, Jason Saragih, Junko Saragih, Takaaki Shiratori, Tomas Simon, Matt Stewart , et al. (6 additional authors not shown)

    Abstract: Photorealistic avatars of human faces have come a long way in recent years, yet research in this area is limited by a lack of publicly available, high-quality datasets covering both dense multi-view camera captures and rich facial expressions of the captured subjects. In this work, we present Multiface, a new multi-view, high-resolution human face dataset collected from 13 identities at Reali…

    Submitted 26 June, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

  18. arXiv:2206.03266  [pdf, other]

    cs.LG cs.AR eess.SP

    Machine Learning Sensors

    Authors: Pete Warden, Matthew Stewart, Brian Plancher, Colby Banbury, Shvetank Prakash, Emma Chen, Zain Asgar, Sachin Katti, Vijay Janapa Reddi

    Abstract: Machine learning sensors represent a paradigm shift for the future of embedded machine learning applications. Current instantiations of embedded machine learning (ML) suffer from complex integration, lack of modularity, and privacy and security concerns from data movement. This article proposes a more data-centric paradigm for embedding sensor intelligence on edge devices to combat these challenge…

    Submitted 7 June, 2022; originally announced June 2022.

  19. arXiv:2202.04743  [pdf, other]

    cs.HC

    Auditory Feedback for Standing Balance Improvement in Virtual Reality

    Authors: M. Rasel Mahmud, Michael Stewart, Alberto Cordova, John Quarles

    Abstract: Virtual Reality (VR) users often experience postural instability, i.e., balance problems, which could be a major barrier to universal usability and accessibility for all, especially for persons with balance impairments. Prior research has confirmed the imbalance effect, but minimal research has been conducted to reduce this effect. We recruited 42 participants (with balance impairments: 21, withou…

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: 10 pages

  20. arXiv:2109.00725  [pdf, other]

    cs.CL cs.LG

    Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

    Authors: Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, Diyi Yang

    Abstract: A fundamental goal of scientific research is to learn about causal relationships. However, despite its critical role in the life and social sciences, causality has not had the same importance in Natural Language Processing (NLP), which has traditionally placed more emphasis on predictive tasks. This distinction is beginning to fade, with an emerging area of interdisciplinary research at the conver…

    Submitted 30 July, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted to Transactions of the Association for Computational Linguistics (TACL)

  21. arXiv:2106.04008  [pdf, other]

    cs.LG

    Widening Access to Applied Machine Learning with TinyML

    Authors: Vijay Janapa Reddi, Brian Plancher, Susan Kennedy, Laurence Moroney, Pete Warden, Anant Agarwal, Colby Banbury, Massimo Banzi, Matthew Bennett, Benjamin Brown, Sharad Chitlangia, Radhika Ghosal, Sarah Grafman, Rupert Jaeger, Srivatsan Krishnan, Maximilian Lam, Daniel Leiker, Cara Mann, Mark Mazumder, Dominic Pajak, Dhilan Ramaprasad, J. Evan Smith, Matthew Stewart, Dustin Tingley

    Abstract: Broadening access to both computational and educational resources is critical to diffusing machine-learning (ML) innovation. However, today, most ML resources and experts are siloed in a few countries and organizations. In this paper, we describe our pedagogical approach to increasing access to applied ML through a massive open online course (MOOC) on Tiny Machine Learning (TinyML). We suggest tha…

    Submitted 9 June, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Understanding the underpinnings of the TinyML edX course series: https://www.edx.org/professional-certificate/harvardx-tiny-machine-learning

  22. arXiv:2103.10585  [pdf, other]

    cs.CY cs.CR cs.HC

    The evolving ecosystem of COVID-19 contact tracing applications

    Authors: Benjamin Levy, Matthew Stewart

    Abstract: Since the outbreak of the novel coronavirus, COVID-19, there has been increased interest in the use of digital contact tracing as a means of stopping chains of viral transmission, provoking alarm from privacy advocates. Concerning the ethics of this technology, recent studies have predominantly focused on (1) the formation of guidelines for ethical contact tracing, (2) the analysis of specific imp…

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: 15 pages

  23. arXiv:2010.08775  [pdf, other]

    cs.LG physics.geo-ph

    Using machine learning to reduce ensembles of geological models for oil and gas exploration

    Authors: Anna Roubícková, Lucy MacGregor, Nick Brown, Oliver Thomson Brown, Mike Stewart

    Abstract: Exploration using borehole drilling is a key activity in determining the most appropriate locations for the petroleum industry to develop oil fields. However, estimating the amount of Oil In Place (OIP) relies on computing with a very significant number of geological models, which, due to the ever increasing capability to capture and refine data, is becoming infeasible. As such, data reduction tec…

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: Pre-print in 2019 IEEE/ACM 5th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-5) (pp. 42-49). IEEE

    Journal ref: In 2019 IEEE/ACM 5th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-5) (pp. 42-49). IEEE

  24. arXiv:2007.12702  [pdf, other]

    stat.ME cs.LG stat.ML

    Naïve regression requires weaker assumptions than factor models to adjust for multiple cause confounding

    Authors: Justin Grimmer, Dean Knox, Brandon M. Stewart

    Abstract: The empirical practice of using factor models to adjust for shared, unobserved confounders, $\mathbf{Z}$, in observational settings with multiple treatments, $\mathbf{A}$, is widespread in fields including genetics, networks, medicine, and politics. Wang and Blei (2019, WB) formalizes these procedures and develops the "deconfounder," a causal inference method using factor models of $\mathbf{A}$ to…

    Submitted 24 July, 2020; originally announced July 2020.

  25. arXiv:2003.10097  [pdf, other]

    cs.CL

    E2EET: From Pipeline to End-to-end Entity Typing via Transformer-Based Embeddings

    Authors: Michael Stewart, Wei Liu

    Abstract: Entity Typing (ET) is the process of identifying the semantic types of every entity within a corpus. In contrast to Named Entity Recognition, where each token in a sentence is labelled with zero or one class label, ET involves labelling each entity mention with one or more class labels. Existing entity typing models, which operate at the mention level, are limited by two key factors: they do not m…

    Submitted 23 March, 2020; originally announced March 2020.

  26. arXiv:1911.06172  [pdf, other]

    cs.CL

    Word-level Lexical Normalisation using Context-Dependent Embeddings

    Authors: Michael Stewart, Wei Liu, Rachel Cardell-Oliver

    Abstract: Lexical normalisation (LN) is the process of correcting each word in a dataset to its canonical form so that it may be more easily and more accurately analysed. Most lexical normalisation systems operate at the character-level, while word-level models are seldom used. Recent language models offer solutions to the drawbacks of word-level LN models, yet, to the best of our knowledge, no research has…

    Submitted 13 November, 2019; originally announced November 2019.

  27. arXiv:1909.01807  [pdf, other]

    cs.CL

    ICDM 2019 Knowledge Graph Contest: Team UWA

    Authors: Michael Stewart, Majigsuren Enkhsaikhan, Wei Liu

    Abstract: We present an overview of our triple extraction system for the ICDM 2019 Knowledge Graph Contest. Our system uses a pipeline-based approach to extract a set of triples from a given document. It offers a simple and effective solution to the challenge of knowledge graph construction from domain-specific text. It also provides the facility to visualise useful information about each triple such as the…

    Submitted 4 September, 2019; originally announced September 2019.

  28. arXiv:1802.02163  [pdf, other]

    stat.ML cs.CL stat.ME

    How to Make Causal Inferences Using Texts

    Authors: Naoki Egami, Christian J. Fong, Justin Grimmer, Margaret E. Roberts, Brandon M. Stewart

    Abstract: New text as data techniques offer a great promise: the ability to inductively discover measures that are useful for testing social science theories of interest from large collections of text. We introduce a conceptual framework for making causal inferences with discovered measures as a treatment or outcome. Our framework enables researchers to discover high-dimensional textual interventions and es…

    Submitted 6 February, 2018; originally announced February 2018.

    Comments: 47 pages

  29. arXiv:1710.11214  [pdf, other]

    cs.CY cs.LG stat.ML

    How Algorithmic Confounding in Recommendation Systems Increases Homogeneity and Decreases Utility

    Authors: Allison J. B. Chaney, Brandon M. Stewart, Barbara E. Engelhardt

    Abstract: Recommendation systems are ubiquitous and impact many domains; they have the potential to influence product consumption, individuals' perceptions of the world, and life-altering decisions. These systems are often evaluated or trained with data from users already exposed to algorithmic recommendations; this creates a pernicious feedback loop. Using simulations, we demonstrate how using data confoun…

    Submitted 26 November, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

  30. arXiv:1403.2004  [pdf, other]

    cs.CL

    Natural Language Feature Selection via Cooccurrence

    Authors: Michael Stewart

    Abstract: Specificity is important for extracting collocations, keyphrases, multi-word and index terms [Newman et al. 2012]. It is also useful for tagging, ontology construction [Ryu and Choi 2006], and automatic summarization of documents [Louis and Nenkova 2011, Chali and Hassan 2012]. Term frequency and inverse-document frequency (TF-IDF) are typically used to do this, but fail to take advantage of the s…

    Submitted 8 March, 2014; originally announced March 2014.

  31. arXiv:1312.3891  [pdf, ps, other]

    cs.PL cs.CR

    Algorithmic Diversity for Software Security

    Authors: Michael Stewart

    Abstract: Software diversity protects against modern-day exploits such as code-reuse attacks. When an attacker designs a code-reuse attack on an example executable, it relies on replicating the target environment. With software diversity, the attacker cannot reliably replicate their target. This is a security benefit which can be applied to massive-scale software distribution. When applied to large-scale…

    Submitted 13 December, 2013; originally announced December 2013.