Skip to main content

Showing 1–20 of 20 results for author: Koch, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18108  [pdf, other

    cs.LG cs.CY cs.SI physics.soc-ph

    Graph Neural Ordinary Differential Equations for Coarse-Grained Socioeconomic Dynamics

    Authors: James Koch, Pranab Roy Chowdhury, Heng Wan, Parin Bhaduri, Jim Yoon, Vivek Srikrishnan, W. Brent Daniel

    Abstract: We present a data-driven machine-learning approach for modeling space-time socioeconomic dynamics. Through coarse-graining fine-scale observations, our modeling framework simplifies these complex systems to a set of tractable mechanistic relationships -- in the form of ordinary differential equations -- while preserving critical system behaviors. This approach allows for expedited 'what if' studie… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  2. arXiv:2406.09637  [pdf, other

    cs.CV

    Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings

    Authors: Keno Moenck, Duc Trung Thieu, Julian Koch, Thorsten Schüppstuhl

    Abstract: In recent years, the upstream of Large Language Models (LLM) has also encouraged the computer vision community to work on substantial multimodal datasets and train models on a scale in a self-/semi-supervised manner, resulting in Vision Foundation Models (VFM), as, e.g., Contrastive Language-Image Pre-training (CLIP). The models generalize well and perform outstandingly on everyday objects or scen… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Dataset at https://github.com/kenomo/ilid training- and evaluation-related code at https://github.com/kenomo/industrial-clip

  3. arXiv:2404.19605  [pdf, other

    cs.LG cs.CV physics.ao-ph

    Data-Driven Invertible Neural Surrogates of Atmospheric Transmission

    Authors: James Koch, Brenda Forland, Bruce Bernacki, Timothy Doster, Tegan Emerson

    Abstract: We present a framework for inferring an atmospheric transmission profile from a spectral scene. This framework leverages a lightweight, physics-based simulator that is automatically tuned - by virtue of autodifferentiation and differentiable programming - to construct a surrogate atmospheric profile to model the observed data. We demonstrate utility of the methodology by (i) performing atmospheric… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Manuscript accepted for presentation and publication at the 2024 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)

  4. arXiv:2404.06647  [pdf, other

    cs.CY cs.AI cs.LG

    From Protoscience to Epistemic Monoculture: How Benchmarking Set the Stage for the Deep Learning Revolution

    Authors: Bernard J. Koch, David Peterson

    Abstract: Over the past decade, AI research has focused heavily on building ever-larger deep learning models. This approach has simultaneously unlocked incredible achievements in science and technology, and hindered AI from overcoming long-standing limitations with respect to explainability, ethical harms, and environmental efficiency. Drawing on qualitative interviews and computational analyses, our three-… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  5. arXiv:2403.12938  [pdf, other

    cs.LG

    Neural Differential Algebraic Equations

    Authors: James Koch, Madelyn Shapiro, Himanshu Sharma, Draguna Vrabie, Jan Drgona

    Abstract: Differential-Algebraic Equations (DAEs) describe the temporal evolution of systems that obey both differential and algebraic constraints. Of particular interest are systems that contain implicit relationships between their components, such as conservation relationships. Here, we present Neural Differential-Algebraic Equations (NDAEs) suitable for data-driven modeling of DAEs. This methodology is b… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  6. arXiv:2312.14211  [pdf, ps, other

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33th annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  7. arXiv:2310.17499  [pdf, other

    cs.CL cs.LG eess.AS

    The IMS Toucan System for the Blizzard Challenge 2023

    Authors: Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu

    Abstract: For our contribution to the Blizzard Challenge 2023, we improved on the system we submitted to the Blizzard Challenge 2021. Our approach entails a rule-based text-to-phoneme processing system that includes rule-based disambiguation of homographs in the French language. It then transforms the phonemes to spectrograms as intermediate representations using a fast and efficient non-autoregressive synt… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Published at the Blizzard Challenge Workshop 2023, colocated with the Speech Synthesis Workshop 2023, a sattelite event of the Interspeech 2023

  8. arXiv:2307.12674  [pdf, other

    cs.CV

    Industrial Segment Anything -- a Case Study in Aircraft Manufacturing, Intralogistics, Maintenance, Repair, and Overhaul

    Authors: Keno Moenck, Arne Wendt, Philipp Prünte, Julian Koch, Arne Sahrhage, Johann Gierecker, Ole Schmedemann, Falko Kähler, Dirk Holst, Martin Gomse, Thorsten Schüppstuhl, Daniel Schoepflin

    Abstract: Deploying deep learning-based applications in specialized domains like the aircraft production industry typically suffers from the training data availability problem. Only a few datasets represent non-everyday objects, situations, and tasks. Recent advantages in research around Vision Foundation Models (VFM) opened a new area of tasks and models with high generalization capabilities in non-semanti… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  9. arXiv:2304.09047  [pdf, other

    cs.LG cs.CE

    Neural Lumped Parameter Differential Equations with Application in Friction-Stir Processing

    Authors: James Koch, WoongJo Choi, Ethan King, David Garcia, Hrishikesh Das, Tianhao Wang, Ken Ross, Keerti Kappagantula

    Abstract: Lumped parameter methods aim to simplify the evolution of spatially-extended or continuous physical systems to that of a "lumped" element representative of the physical scales of the modeled system. For systems where the definition of a lumped element or its associated physics may be unknown, modeling tasks may be restricted to full-fidelity simulations of the physics of a system. In this work, we… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  10. arXiv:2212.00744  [pdf, ps, other

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first… ▽ More

    Submitted 29 November, 2022; originally announced December 2022.

  11. arXiv:2210.12223  [pdf, other

    cs.CL cs.SD eess.AS

    Low-Resource Multilingual and Zero-Shot Multispeaker TTS

    Authors: Florian Lux, Julia Koch, Ngoc Thang Vu

    Abstract: While neural methods for text-to-speech (TTS) have shown great advances in modeling multiple speakers, even in zero-shot settings, the amount of data needed for those approaches is generally not feasible for the vast majority of the world's over 6,000 spoken languages. In this work, we bring together the tasks of zero-shot voice cloning and multilingual low-resource TTS. Using the language agnosti… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted to AACL 2022

  12. arXiv:2210.07002  [pdf, other

    cs.SD cs.CL eess.AS

    Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

    Authors: Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu

    Abstract: In order to protect the privacy of speech data, speaker anonymization aims for hiding the identity of a speaker by changing the voice in speech recordings. This typically comes with a privacy-utility trade-off between protection of individuals and usability of the data for downstream applications. One of the challenges in this context is to create non-existent voices that sound as natural as possi… ▽ More

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: IEEE Spoken Language Technology Workshop 2022

  13. arXiv:2208.00880  [pdf, other

    cs.LG

    Physics-informed Machine Learning of Parameterized Fundamental Diagrams

    Authors: James Koch, Thomas Maxner, Vinay Amatya, Andisheh Ranjbari, Chase Dowling

    Abstract: Fundamental diagrams describe the relationship between speed, flow, and density for some roadway (or set of roadway) configuration(s). These diagrams typically do not reflect, however, information on how speed-flow relationships change as a function of exogenous variables such as curb configuration, weather or other exogenous, contextual information. In this paper we present a machine learning met… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  14. arXiv:2207.05549  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    PoeticTTS -- Controllable Poetry Reading for Literary Studies

    Authors: Julia Koch, Florian Lux, Nadja Schauffler, Toni Bernhart, Felix Dieterle, Jonas Kuhn, Sandra Richter, Gabriel Viehhauser, Ngoc Thang Vu

    Abstract: Speech synthesis for poetry is challenging due to specific intonation patterns inherent to poetic speech. In this work, we propose an approach to synthesise poems with almost human like naturalness in order to enable literary scholars to systematically examine hypotheses on the interplay between text, spoken realisation, and the listener's perception of poems. To meet these special requirements fo… ▽ More

    Submitted 18 October, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: Presented at Interspeech 2022

  15. arXiv:2207.04962  [pdf, other

    math.DS cs.LG

    Structural Inference of Networked Dynamical Systems with Universal Differential Equations

    Authors: James Koch, Zhao Chen, Aaron Tuor, Jan Drgona, Draguna Vrabie

    Abstract: Networked dynamical systems are common throughout science in engineering; e.g., biological networks, reaction networks, power systems, and the like. For many such systems, nonlinearity drives populations of identical (or near-identical) units to exhibit a wide range of nontrivial behaviors, such as the emergence of coherent structures (e.g., waves and patterns) or otherwise notable dynamics (e.g.,… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  16. arXiv:2207.04834  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Speaker Anonymization with Phonetic Intermediate Representations

    Authors: Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu

    Abstract: In this work, we propose a speaker anonymization pipeline that leverages high quality automatic speech recognition and synthesis systems to generate speech conditioned on phonetic transcriptions and anonymized speaker embeddings. Using phones as the intermediate representation ensures near complete elimination of speaker identity information from the input while preserving the original phonetic co… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted at Interspeech 2022

  17. arXiv:2206.12229  [pdf, other

    cs.SD cs.CL eess.AS

    Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech

    Authors: Florian Lux, Julia Koch, Ngoc Thang Vu

    Abstract: The cloning of a speaker's voice using an untranscribed reference sample is one of the great advances of modern neural text-to-speech (TTS) methods. Approaches for mimicking the prosody of a transcribed reference audio have also been proposed recently. In this work, we bring these two tasks together for the first time through utterance level normalization in conjunction with an utterance level spe… ▽ More

    Submitted 21 October, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE SLT 2022

  18. arXiv:2105.14111  [pdf, other

    cs.LG cs.AI

    Goal Misgeneralization in Deep Reinforcement Learning

    Authors: Lauro Langosco, Jack Koch, Lee Sharkey, Jacob Pfau, Laurent Orseau, David Krueger

    Abstract: We study goal misgeneralization, a type of out-of-distribution generalization failure in reinforcement learning (RL). Goal misgeneralization failures occur when an RL agent retains its capabilities out-of-distribution yet pursues the wrong goal. For instance, an agent might continue to competently avoid obstacles, but navigate to the wrong place. In contrast, previous works have typically focused… ▽ More

    Submitted 9 January, 2023; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Published in ICML 2022. 9 Pages

  19. arXiv:1806.11476  [pdf, ps, other

    cs.CR

    A Predictable Incentive Mechanism for TrueBit

    Authors: Julia Koch, Christian Reitwiessner

    Abstract: TrueBit is a protocol that uses interactive verification to allow a resource-constrained computation environment like a blockchain to perform much larger computations than usual in a trusted way. As long as a single honest participant is present to verify the computation, an invalid computation cannot get accepted. In TrueBit, the presence of such a verifier is incentivised by randomly injected… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

  20. arXiv:1601.00289  [pdf, other

    cs.DC cs.SI

    An Empirical Comparison of Big Graph Frameworks in the Context of Network Analysis

    Authors: Jannis Koch, Christian L. Staudt, Maximilian Vogel, Henning Meyerhenke

    Abstract: Complex networks are relational data sets commonly represented as graphs. The analysis of their intricate structure is relevant to many areas of science and commerce, and data sets may reach sizes that require distributed storage and processing. We describe and compare programming models for distributed computing with a focus on graph algorithms for large-scale complex network analysis. Four frame… ▽ More

    Submitted 3 January, 2016; originally announced January 2016.