Showing 1–25 of 25 results for author: Johansson, R

Searching in archive cs.

  1. arXiv:2405.19498  [pdf, other]

    cs.AI

    Machine Psychology: Integrating Operant Conditioning with the Non-Axiomatic Reasoning System for Advancing Artificial General Intelligence Research

    Authors: Robert Johansson

    Abstract: This paper introduces an interdisciplinary framework called Machine Psychology, which merges principles from operant learning psychology with a specific Artificial Intelligence model, the Non-Axiomatic Reasoning System (NARS), to enhance Artificial General Intelligence (AGI) research. The core premise of this framework is that adaptation is crucial to both biological and artificial intelligence an…

    Submitted 29 May, 2024; originally announced May 2024.

  2. arXiv:2405.03340  [pdf, other]

    cs.AI

    Functional Equivalence with NARS

    Authors: Robert Johansson, Patrick Hammer, Tony Lofthouse

    Abstract: This study explores the concept of functional equivalence within the framework of the Non-Axiomatic Reasoning System (NARS), specifically through OpenNARS for Applications (ONA). Functional equivalence allows organisms to categorize and respond to varied stimuli based on their utility rather than perceptual similarity, thus enhancing cognitive efficiency and adaptability. In this study, ONA was mo…

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2403.16584  [pdf, other]

    cs.CL

    Can Large Language Models (or Humans) Disentangle Text?

    Authors: Nicolas Audinet de Pieuchon, Adel Daoud, Connor Thomas Jerzak, Moa Johansson, Richard Johansson

    Abstract: We investigate the potential of large language models (LLMs) to disentangle text variables--to remove the textual traces of an undesired forbidden variable in a task sometimes known as text distillation and closely related to the fairness in AI and causal inference literature. We employ a range of various LLM approaches in an attempt to disentangle text by identifying and removing information abou…

    Submitted 3 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: To appear as: Nicolas Audinet de Pieuchon, Adel Daoud, Connor T. Jerzak, Moa Johansson, Richard Johansson. Can Large Language Models (or Humans) Disentangle Text? In: Sixth Workshop on NLP and Computational Social Science at NAACL, 2024

    MSC Class: 68T50 ACM Class: I.2.7; H.1.2

  4. arXiv:2403.16142  [pdf, other]

    cs.CL cs.AI

    What Happens to a Dataset Transformed by a Projection-based Concept Removal Method?

    Authors: Richard Johansson

    Abstract: We investigate the behavior of methods that use linear projections to remove information about a concept from a language representation, and we consider the question of what happens to a dataset transformed by such a method. A theoretical analysis and experiments on real-world and synthetic data show that these methods inject strong statistical dependencies into the transformed datasets. After app…

    Submitted 24 March, 2024; originally announced March 2024.

  5. arXiv:2402.06963  [pdf, other]

    cs.LG cs.AI stat.ML

    Tree Ensembles for Contextual Bandits

    Authors: Hannes Nilsson, Rikard Johansson, Niklas Åkerblom, Morteza Haghir Chehreghani

    Abstract: We propose a novel framework for contextual multi-armed bandits based on tree ensembles. Our framework integrates two widely used bandit methods, Upper Confidence Bound and Thompson Sampling, for both standard and combinatorial settings. We demonstrate the effectiveness of our framework via several experimental studies, employing both XGBoost and random forest, two popular tree ensemble methods. C…

    Submitted 12 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: The first two authors contributed equally to this work

  6. arXiv:2311.01307  [pdf, other]

    cs.CL

    The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models

    Authors: Lovisa Hagström, Denitsa Saynova, Tobias Norlund, Moa Johansson, Richard Johansson

    Abstract: Large Language Models (LLMs) make natural interfaces to factual knowledge, but their usefulness is limited by their tendency to deliver inconsistent answers to semantically equivalent questions. For example, a model might predict both "Anne Redpath passed away in Edinburgh." and "Anne Redpath's life ended in London." In this work, we identify potential causes of inconsistency and evaluate the effe…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023

  7. arXiv:2305.16243  [pdf, other]

    cs.CL

    Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

    Authors: Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann, Richard Johansson

    Abstract: Augmenting language models with a retrieval mechanism has been shown to significantly improve their performance while keeping the number of parameters low. Retrieval-augmented models commonly rely on a semantic retrieval mechanism based on the similarity between dense representations of the query chunk and potential neighbors. In this paper, we study the state-of-the-art Retro model and observe th…

    Submitted 4 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  8. arXiv:2304.08115  [pdf, other]

    cs.CL

    An Empirical Study of Multitask Learning to Improve Open Domain Dialogue Systems

    Authors: Mehrdad Farahani, Richard Johansson

    Abstract: Autoregressive models used to generate responses in open-domain dialogue systems often struggle to take long-term context into account and to maintain consistency over a dialogue. Previous research in open-domain dialogue generation has shown that the use of auxiliary tasks can introduce inductive biases that encourage the model to improve these qualities. However, most previous research ha…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 11 pages, 1 figure, 4 tables, 2 appendices, NoDaLiDa2023

  9. arXiv:2302.12128  [pdf, other]

    cs.CL

    On the Generalization Ability of Retrieval-Enhanced Transformers

    Authors: Tobias Norlund, Ehsan Doostmohammadi, Richard Johansson, Marco Kuhlmann

    Abstract: Recent work on the Retrieval-Enhanced Transformer (RETRO) model has shown that off-loading memory from trainable weights to a retrieval database can significantly improve language modeling and match the performance of non-retrieval models that are an order of magnitude larger in size. It has been suggested that at least some of this performance gain is due to non-trivial generalization based on bo…

    Submitted 23 February, 2023; originally announced February 2023.

  10. arXiv:2302.01582  [pdf, other]

    cs.CL

    Controlling for Stereotypes in Multimodal Language Model Evaluation

    Authors: Manuj Malik, Richard Johansson

    Abstract: We propose a methodology and design two benchmark sets for measuring to what extent language-and-vision language models use the visual signal in the presence or absence of stereotypes. The first benchmark is designed to test for stereotypical colors of common objects, while the second benchmark considers gender stereotypes. The key idea is to compare predictions when the image conforms to the ster…

    Submitted 3 February, 2023; originally announced February 2023.

  11. arXiv:2209.08982  [pdf, other]

    cs.CL

    How to Adapt Pre-trained Vision-and-Language Models to a Text-only Input?

    Authors: Lovisa Hagström, Richard Johansson

    Abstract: Current language models have been criticised for learning language from text alone without connection between words and their meaning. Consequently, multimodal training has been proposed as a way for creating models with better language understanding by providing the lacking connection. We focus on pre-trained multimodal vision-and-language (VL) models for which there already are some results on t…

    Submitted 19 September, 2022; originally announced September 2022.

  12. arXiv:2205.07065  [pdf, other]

    cs.CL

    What do Models Learn From Training on More Than Text? Measuring Visual Commonsense Knowledge

    Authors: Lovisa Hagström, Richard Johansson

    Abstract: There are limitations in learning language from text alone. Therefore, recent focus has been on developing multimodal models. However, few benchmarks exist that can measure what language models learn about language from multimodal training. We hypothesize that training on a visual modality should improve on the visual commonsense knowledge in language models. Therefore, we introduce two evaluation…

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted to the ACL Student Research Workshop 2022

  13. arXiv:2205.00465  [pdf, other]

    cs.CL

    Conceptualizing Treatment Leakage in Text-based Causal Inference

    Authors: Adel Daoud, Connor T. Jerzak, Richard Johansson

    Abstract: Causal inference methods that control for text-based confounders are becoming increasingly important in the social sciences and other disciplines where text is readily available. However, these methods rely on a critical assumption that there is no treatment leakage: that is, the text only contains information about the confounder and no information about treatment assignment. When this assumption…

    Submitted 1 May, 2022; originally announced May 2022.

  14. arXiv:2201.06399  [pdf, other]

    eess.SY cs.MA cs.RO math.DS math.OC

    Cooperative constrained motion coordination of networked heterogeneous vehicles

    Authors: Zhiyong Sun, Marcus Greiff, Anders Robertsson, Rolf Johansson, Brian D. O. Anderson

    Abstract: We consider the problem of cooperative motion coordination for multiple heterogeneous mobile vehicles subject to various constraints. These include nonholonomic motion constraints, constant speed constraints, holonomic coordination constraints, and equality/inequality geometric constraints. We develop a general framework involving differential-algebraic equations and viability theory to determine…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: 23 pages, 4 figures. Extended version of the paper at IEEE ICRA. Text overlap with arXiv:1809.05509. Submitted to an IEEE journal for publication

  15. arXiv:2109.11321  [pdf, other]

    cs.CL

    Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it?

    Authors: Tobias Norlund, Lovisa Hagström, Richard Johansson

    Abstract: Large language models are known to suffer from the hallucination problem in that they are prone to output statements that are false or inconsistent, indicating a lack of knowledge. A proposed solution to this is to provide the model with additional data modalities that complements the knowledge obtained through text. We investigate the use of visual data to complement the knowledge of large langua…

    Submitted 30 September, 2021; v1 submitted 23 September, 2021; originally announced September 2021.

  16. arXiv:1909.08289  [pdf, other]

    cs.RO

    Segmentation of Robot Movements using Position and Contact Forces

    Authors: Martin Karlsson, Anders Robertsson, Rolf Johansson

    Abstract: In this paper, a method for autonomous segmentation of demonstrated robot movements is proposed. Position data is clustered into Gaussian mixture models (GMMs), and an initial set of segments is identified from the Gaussian basis functions. A Kalman filter is used to detect sudden changes in the contact force/torque measurements, and this is used to update and verify the initial segmentation point…

    Submitted 18 September, 2019; originally announced September 2019.

  17. arXiv:1905.11176  [pdf, other]

    cs.RO eess.SY

    Temporally Coupled Dynamical Movement Primitives in Cartesian Space

    Authors: Martin Karlsson, Anders Robertsson, Rolf Johansson

    Abstract: Control of robot orientation in Cartesian space implicates some difficulties, because the rotation group SO(3) is not contractible, and only globally contractible state spaces support continuous and globally asymptotically stable feedback control systems. In this paper, unit quaternions are used to represent orientations, and it is first shown that the unit quaternion set minus one single point is…

    Submitted 27 May, 2019; originally announced May 2019.

  18. arXiv:1905.11130  [pdf, other]

    cs.RO eess.SY

    Autonomous Interpretation of Demonstrations for Modification of Dynamical Movement Primitives

    Authors: Martin Karlsson, Anders Robertsson, Rolf Johansson

    Abstract: The concept of dynamical movement primitives (DMPs) has become popular for modeling of motion, commonly applied to robots. This paper presents a framework that allows a robot operator to adjust DMPs in an intuitive way. Given a generated trajectory with a faulty last part, the operator can use lead-through programming to demonstrate a corrective trajectory. A modified DMP is formed, based on the f…

    Submitted 27 May, 2019; originally announced May 2019.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2017, Singapore

  19. arXiv:1811.06350  [pdf, ps, other]

    eess.SY cs.MA cs.RO cs.SC math.OC

    Temporal viability regulation for control affine systems with applications to mobile vehicle coordination under time-varying motion constraints

    Authors: Marcus Greiff, Zhiyong Sun, Anders Robertsson, Rolf Johansson

    Abstract: Controlled invariant set and viability regulation of dynamical control systems have played important roles in many control and coordination applications. In this paper we develop a temporal viability regulation theory for general dynamical control systems, and in particular for control affine systems. The time-varying viable set is parameterized by time-varying constraint functions, with the aim t…

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: 7 pages, 3 figures. Submitted to a conference for publication

  20. arXiv:1806.09919  [pdf, other]

    cs.LG eess.SY stat.ML

    Tangent-Space Regularization for Neural-Network Models of Dynamical Systems

    Authors: Fredrik Bagge Carlson, Rolf Johansson, Anders Robertsson

    Abstract: This work introduces the concept of tangent space regularization for neural-network models of dynamical systems. The tangent space to the dynamics function of many physical systems of interest in control applications exhibits useful properties, e.g., smoothness, motivating regularization of the model Jacobian along system trajectories using assumptions on the tangent space of the dynamics. Without…

    Submitted 26 June, 2018; originally announced June 2018.

  21. arXiv:1804.06586  [pdf, other]

    eess.SY cs.RO

    Composite Adaptive Control for Bilateral Teleoperation Systems without Persistency of Excitation

    Authors: Yuling Li, Yixin Yin, Sen Zhang, Jie Dong, Rolf Johansson

    Abstract: Composite adaptive control schemes, which use both the system tracking errors and the prediction error to drive the update laws, have become widespread in achieving an improvement of system performance. However, a strong persistent-excitation (PE) condition should be satisfied to guarantee the parameter convergence. This paper proposes a novel composite adaptive control to guarantee parameter conv…

    Submitted 18 April, 2018; originally announced April 2018.

    Comments: 21 pages, 9 figures, submitted to Journal of The Franklin Institute

  22. arXiv:1804.04290  [pdf, other]

    eess.SY cs.HC math.OC

    Bilateral Teleoperation of Multiple Robots under Scheduling Communication

    Authors: Yuling Li, Kun Liu, Wei He, Yixin Yin, Rolf Johansson, Kai Zhang

    Abstract: In this paper, bilateral teleoperation of multiple slaves coupled to a single master under scheduling communication is investigated. The sampled-data transmission between the master and the multiple slaves is fulfilled over a delayed communication network, and at each sampling instant, only one slave is allowed to transmit its current information to the master side according to some scheduling pro…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: 13 pages, 12 figures, 4 tables, submitted to IEEE Transactions on Control Systems Technology

  23. arXiv:1706.05913  [pdf]

    cs.CR cs.DB

    Fusing restricted information

    Authors: Magnus Jändel, Pontus Svenson, Ronnie Johansson

    Abstract: Information fusion deals with the integration and merging of data and information from multiple (heterogeneous) sources. In many cases, the information that needs to be fused has security classification. The result of the fusion process is then by necessity restricted with the strictest information security classification of the inputs. This has severe drawbacks and limits the possible disseminati…

    Submitted 18 May, 2017; originally announced June 2017.

    Comments: 9 pages

    ACM Class: H.2.8

    Journal ref: Proc 17th Int Conf on Information Fusion (2014)

  24. arXiv:1412.6045  [pdf, ps, other]

    cs.CL

    A Simple and Efficient Method To Generate Word Sense Representations

    Authors: Luis Nieto Piña, Richard Johansson

    Abstract: Distributed representations of words have boosted the performance of many Natural Language Processing tasks. However, usually only one representation per word is obtained, not acknowledging the fact that some words have multiple meanings. This has a negative effect on the individual word representations and the language model as a whole. In this paper we present a simple model that enables recen…

    Submitted 19 December, 2014; v1 submitted 18 December, 2014; originally announced December 2014.

    Comments: 5 pages, submission to ICLR 2015

  25. arXiv:0801.4417  [pdf, ps, other]

    quant-ph cond-mat.mes-hall cond-mat.supr-con cs.GT

    Controllable coherent population transfers in superconducting qubits for quantum computing

    Authors: L. F. Wei, J. R. Johansson, L. X. Cen, S. Ashhab, Franco Nori

    Abstract: We propose an approach to coherently transfer populations between selected quantum states in one- and two-qubit systems by using controllable Stark-chirped rapid adiabatic passages (SCRAPs). These evolution-time insensitive transfers, assisted by easily implementable single-qubit phase-shift operations, could serve as elementary logic gates for quantum computing. Specifically, this proposa…

    Submitted 28 January, 2008; originally announced January 2008.

    Comments: 4 pages, 6 figures. to appear in Physical Review Letters

    Journal ref: Phys. Rev. Lett. 100, 113601 (2008)