Skip to main content

Showing 1–50 of 83 results for author: Jones, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.15475  [pdf, other

    cs.RO cs.AI

    A Multi-Level Corroborative Approach for Verification and Validation of Autonomous Robotic Swarms

    Authors: Dhaminda B. Abeywickrama, Suet Lee, Chris Bennett, Razanne Abu-Aisheh, Tom Didiot-Cook, Simon Jones, Sabine Hauert, Kerstin Eder

    Abstract: Modelling and characterizing emergent behaviour within a swarm can pose significant challenges in terms of 'assurance'. Assurance tasks encompass adherence to standards, certification processes, and the execution of verification and validation (V&V) methods, such as model checking. In this study, we propose a holistic, multi-level modelling approach for formally verifying and validating autonomous… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 15 pages, 11 figures

    ACM Class: I.2.9; D.2; I.6

  2. arXiv:2405.12862  [pdf, other

    cs.AI

    Toward Constraint Compliant Goal Formulation and Planning

    Authors: Steven J. Jones, Robert E. Wray

    Abstract: One part of complying with norms, rules, and preferences is incorporating constraints (such as knowledge of ethics) into one's goal formulation and planning processing. We explore in a simple domain how the encoding of knowledge in different ethical frameworks influences an agent's goal formulation and planning processing and demonstrate ability of an agent to satisfy and satisfice when its collec… ▽ More

    Submitted 10 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 16 pages + refs. 5 figures, 2 tables. Minor revisions based on reviewer feedback. Accepted for presentation at Advances in Cognitive Systems (Jun 2024, Palermo)

    ACM Class: I.2.11; I.2.8

  3. arXiv:2404.09802  [pdf, other

    cs.CR cs.LG

    The Performance of Sequential Deep Learning Models in Detecting Phishing Websites Using Contextual Features of URLs

    Authors: Saroj Gopali, Akbar S. Namin, Faranak Abri, Keith S. Jones

    Abstract: Cyber attacks continue to pose significant threats to individuals and organizations, stealing sensitive data such as personally identifiable information, financial information, and login credentials. Hence, detecting malicious websites before they cause any harm is critical to preventing fraud and monetary loss. To address the increasing number of phishing attacks, protective mechanisms must be hi… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2403.02778  [pdf, other

    cs.PL

    Abstracting Denotational Interpreters

    Authors: Sebastian Graf, Simon Peyton Jones, Sven Keidel

    Abstract: We explore denotational interpreters: denotational semantics that produce coinductive traces of a corresponding small-step operational semantics. By parameterising our denotational interpreter over the semantic domain and then varying it, we recover dynamic semantics with different evaluation strategies as well as summary-based static analyses such as type analysis, all from the same generic inter… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Preprint; submitted to POPL'25

  5. arXiv:2402.18751  [pdf, other

    cs.LG cs.CV

    Multi-Sensor and Multi-temporal High-Throughput Phenotyping for Monitoring and Early Detection of Water-Limiting Stress in Soybean

    Authors: Sarah E. Jones, Timilehin Ayanlade, Benjamin Fallen, Talukder Z. Jubery, Arti Singh, Baskar Ganapathysubramanian, Soumik Sarkar, Asheesh K. Singh

    Abstract: Soybean production is susceptible to biotic and abiotic stresses, exacerbated by extreme weather events. Water limiting stress, i.e. drought, emerges as a significant risk for soybean production, underscoring the need for advancements in stress monitoring for crop breeding and production. This project combines multi-modal information to identify the most effective and efficient automated methods t… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 25 pages, 5 figures

  6. arXiv:2402.03116  [pdf, other

    cs.HC cs.LG

    Feature-Action Design Patterns for Storytelling Visualizations with Time Series Data

    Authors: Saiful Khan, Scott Jones, Benjamin Bach, Jaehoon Cha, Min Chen, Julie Meikle, Jonathan C Roberts, Jeyan Thiyagalingam, Jo Wood, Panagiotis D. Ritsos

    Abstract: We present a method to create storytelling visualization with time series data. Many personal decisions nowadays rely on access to dynamic data regularly, as we have seen during the COVID-19 pandemic. It is thus desirable to construct storytelling visualization for dynamic data that is selected by an individual for a specific context. Because of the need to tell data-dependent stories, predefined… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2401.13554  [pdf, other

    cs.CV

    PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition

    Authors: Otto Brookes, Majid Mirmehdi, Colleen Stephens, Samuel Angedakin, Katherine Corogenes, Dervla Dowd, Paula Dieguez, Thurston C. Hicks, Sorrel Jones, Kevin Lee, Vera Leinert, Juan Lapuente, Maureen S. McCarthy, Amelia Meier, Mizuki Murai, Emmanuelle Normand, Virginie Vergnes, Erin G. Wessling, Roman M. Wittig, Kevin Langergraber, Nuria Maldonado, Xinyu Yang, Klaus Zuberbuhler, Christophe Boesch, Mimi Arandjelovic , et al. (2 additional authors not shown)

    Abstract: We present the PanAf20K dataset, the largest and most diverse open-access annotated video dataset of great apes in their natural environment. It comprises more than 7 million frames across ~20,000 camera trap videos of chimpanzees and gorillas collected at 14 field sites in tropical Africa as part of the Pan African Programme: The Cultured Chimpanzee. The footage is accompanied by a rich set of an… ▽ More

    Submitted 31 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted at IJCV

  8. arXiv:2312.10041  [pdf

    cs.RO

    Digital Twin Technology Enabled Proactive Safety Application for Vulnerable Road Users: A Real-World Case Study

    Authors: Erik Rua, Kazi Hasan Shakib, Sagar Dasgupta, Mizanur Rahman, Steven Jones

    Abstract: While measures, such as traffic calming and advance driver assistance systems, can improve safety for Vulnerable Road Users (VRUs), their effectiveness ultimately relies on the responsible behavior of drivers and pedestrians who must adhere to traffic rules or take appropriate actions. However, these measures offer no solution in scenarios where a collision becomes imminent, leaving no time for wa… ▽ More

    Submitted 24 November, 2023; originally announced December 2023.

    Comments: 19 pages, 9 figures, submitted to the Transportation Research Board 2024 TRB Annual Meeting

  9. arXiv:2311.08706  [pdf, other

    cs.CY cs.AI

    Aligned: A Platform-based Process for Alignment

    Authors: Ethan Shaotran, Ido Pesok, Sam Jones, Emi Liu

    Abstract: We are introducing Aligned, a platform for global governance and alignment of frontier models, and eventually superintelligence. While previous efforts at the major AI labs have attempted to gather inputs for alignment, these are often conducted behind closed doors. We aim to set the foundation for a more trustworthy, public-facing approach to safety: a constitutional committee framework. Initial… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 11 pages, 7 figures. For associated public report, see https://energize.ai/openai

  10. DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding

    Authors: Kehinde Ajayi, Xin Wei, Martin Gryder, Winston Shields, Jian Wu, Shawn M. Jones, Michal Kucer, Diane Oyen

    Abstract: Recent advances in computer vision (CV) and natural language processing have been driven by exploiting big data on practical applications. However, these research fields are still limited by the sheer volume, versatility, and diversity of the available datasets. CV tasks, such as image captioning, which has primarily been carried out on natural images, still struggle to produce accurate and meanin… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  11. arXiv:2309.16673  [pdf

    cs.NI

    Harnessing Digital Twin Technology for Adaptive Traffic Signal Control: Improving Signalized Intersection Performance and User Satisfaction

    Authors: Sagar Dasgupta, Mizanur Rahman, Ph. D., Steven Jones, Ph. D

    Abstract: In this study, a digital twin (DT) technology based Adaptive Traffic Signal Control (ATSC) framework is presented for improving signalized intersection performance and user satisfaction. Specifically, real-time vehicle trajectory data, future traffic demand prediction and parallel simulation strategy are considered to develop two DT-based ATSC algorithms, namely DT1 (Digital Twin 1) and DT2 (Digit… ▽ More

    Submitted 1 July, 2023; originally announced September 2023.

  12. arXiv:2307.12451  [pdf, other

    q-bio.BM cs.LG stat.ML

    DiAMoNDBack: Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces

    Authors: Michael S. Jones, Kirill Shmilovich, Andrew L. Ferguson

    Abstract: Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and the simulation of processes that occur on long-time scales such as aggregation and folding. The reduced resolution realizes computational accelerations but an atomistic representation can be vital for a complete understanding of mechanistic details. Backmapping is the process of… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  13. "Customization is Key": Reconfigurable Content Tokens for Accessible Data Visualizations

    Authors: Shuli Jones, Isabella Pedraza Pineros, Daniel Hajas, Jonathan Zong, Arvind Satyanarayan

    Abstract: Customization is crucial for making visualizations accessible to blind and low-vision (BLV) people with widely-varying needs. But what makes for usable or useful customization? We identify four design goals for how BLV people should be able to customize screen-reader-accessible visualizations: presence, or what content is included; verbosity, or how concisely content is presented; ordering, or how… ▽ More

    Submitted 29 February, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 14 pages. 6 figures. 2 tables. ACM CHI Conference 2024

  14. arXiv:2307.06458  [pdf, other

    cs.SI cs.CV cs.DL

    Discovering Image Usage Online: A Case Study With "Flatten the Curve''

    Authors: Shawn M. Jones, Diane Oyen

    Abstract: Understanding the spread of images across the web helps us understand the reuse of scientific visualizations and their relationship with the public. The "Flatten the Curve" graphic was heavily used during the COVID-19 pandemic to convey a complex concept in a simple form. It displays two curves comparing the impact on case loads for medical facilities if the populace either adopts or fails to adop… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures, Presented as poster at JCDL 2023

    ACM Class: I.4.9; H.3.3; H.4.3; H.3.7

  15. TacMMs: Tactile Mobile Manipulators for Warehouse Automation

    Authors: Zhuochao He, Xuyang Zhang, Simon Jones, Sabine Hauert, Dandan Zhang, Nathan F. Lepora

    Abstract: Multi-robot platforms are playing an increasingly important role in warehouse automation for efficient goods transport. This paper proposes a novel customization of a multi-robot system, called Tactile Mobile Manipulators (TacMMs). Each TacMM integrates a soft optical tactile sensor and a mobile robot with a load-lifting mechanism, enabling cooperative transportation in tasks requiring coordinated… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 8 pages, accepted in IEEE Robotics and Automation Letters, 19 June 2023

  16. arXiv:2303.04352  [pdf, ps, other

    cs.AI

    Computational-level Analysis of Constraint Compliance for General Intelligence

    Authors: Robert E. Wray, Steven J. Jones, John E. Laird

    Abstract: Human behavior is conditioned by codes and norms that constrain action. Rules, ``manners,'' laws, and moral imperatives are examples of classes of constraints that govern human behavior. These systems of constraints are "messy:" individual constraints are often poorly defined, what constraints are relevant in a particular situation may be unknown or ambiguous, constraints interact and conflict wit… ▽ More

    Submitted 15 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: 10 pages, 2 figures. Accepted for presentation at AGI 2023. Corrected author list (segmented list) and abstract text artifacts

    ACM Class: I.2.0; I.2.8

  17. arXiv:2302.08775  [pdf, other

    cs.PL

    Triemaps that match

    Authors: Simon Peyton Jones, Sebastian Graf

    Abstract: The trie data structure is a good choice for finite maps whose keys are data structures (trees) rather than atomic values. But what if we want the keys to be patterns, each of which matches many lookup keys? Efficient matching of this kind is well studied in the theorem prover community, but much less so in the context of statically typed functional programming. Doing so yields an interesting new… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Rejected from ICFP 2022; lack of novelty, too heavy on code that is "tiresome boilerplate"

  18. arXiv:2212.10307  [pdf, other

    cs.PL cs.LG cs.MS

    Efficient and Sound Differentiable Programming in a Functional Array-Processing Language

    Authors: Amir Shaikhha, Mathieu Huot, Shabnam Ghasemirad, Andrew Fitzgibbon, Simon Peyton Jones, Dimitrios Vytiniotis

    Abstract: Automatic differentiation (AD) is a technique for computing the derivative of a function represented by a program. This technique is considered as the de-facto standard for computing the differentiation in many machine learning and optimisation software tools. Despite the practicality of this technique, the performance of the differentiated programs, especially for functional languages and in the… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:1806.02136

  19. arXiv:2211.02115  [pdf, other

    cs.CV cs.IR

    Abstract Images Have Different Levels of Retrievability Per Reverse Image Search Engine

    Authors: Shawn M. Jones, Diane Oyen

    Abstract: Much computer vision research has focused on natural images, but technical documents typically consist of abstract images, such as charts, drawings, diagrams, and schematics. How well do general web search engines discover abstract images? Recent advancements in computer vision and machine learning have led to the rise of reverse image search engines. Where conventional search engines accept a tex… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 20 pages; 7 figures; to be published in the proceedings of the Drawings and abstract Imagery: Representation and Analysis (DIRA) Workshop from ECCV 2022

    ACM Class: H.3.3; H.3.7; H.3.5; I.4.9

  20. arXiv:2209.08649  [pdf, other

    cs.DL

    Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists

    Authors: Himarsha R. Jayanetti, Shawn M. Jones, Martin Klein, Alex Osbourne, Paul Koerbin, Michael L. Nelson, Michele C. Weigle

    Abstract: As web archives' holdings grow, archivists subdivide them into collections so they are easier to understand and manage. In this work, we review the collection structures of eight web archive platforms: : Archive-It, Conifer, the Croatian Web Archive (HAW), the Internet Archive's user account web archives, Library of Congress (LC), PANDORA, Trove, and the UK Web Archive (UKWA). We note a plethora o… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 5 figures, 16 pages, accepted for publication at TPDL 2022

  21. arXiv:2209.04071  [pdf

    cs.AI cs.SD eess.AS

    Audio Analytics-based Human Trafficking Detection Framework for Autonomous Vehicles

    Authors: Sagar Dasgupta, Kazi Shakib, Mizanur Rahman, Silvana V Croope, Steven Jones

    Abstract: Human trafficking is a universal problem, persistent despite numerous efforts to combat it globally. Individuals of any age, race, ethnicity, sex, gender identity, sexual orientation, nationality, immigration status, cultural background, religion, socioeconomic class, and education can be a victim of human trafficking. With the advancements in technology and the introduction of autonomous vehicles… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  22. arXiv:2205.00806  [pdf, other

    cs.IR

    Biographical: A Semi-Supervised Relation Extraction Dataset

    Authors: Alistair Plum, Tharindu Ranasinghe, Spencer Jones, Constantin Orasan, Ruslan Mitkov

    Abstract: Extracting biographical information from online documents is a popular research topic among the information extraction (IE) community. Various natural language processing (NLP) techniques such as text classification, text summarisation and relation extraction are commonly used to achieve this. Among these techniques, RE is the most common since it can be directly used to build biographical knowled… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted to ACM SIGIR 2022

  23. arXiv:2203.13809  [pdf, other

    cs.RO

    DOTS: An Open Testbed for Industrial Swarm Robotic Solutions

    Authors: Simon Jones, Emma Milner, Mahesh Sooriyabandara, Sabine Hauert

    Abstract: We present DOTS, a new open access testbed for industrial swarm robotics experimentation. It consists of 20 fast agile robots with high sensing and computational performance, and real-world payload capability. They are housed in an arena equipped with private 5G, motion capture, multiple cameras, and openly accessible via an online portal. We reduce barriers to entry by providing a complete platfo… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 16 pages, 17 figures, for associated video, see https://drive.google.com/file/d/1EuA8PS1qpqK6LIfPwCNXtQ3hHNWPDvtN/view?usp=sharing

  24. arXiv:2202.02319  [pdf, other

    cs.CE cs.DC physics.data-an physics.flu-dyn

    An integrated heterogeneous computing framework for ensemble simulations of laser-induced ignition

    Authors: Kazuki Maeda, Thiago Teixeira, Jonathan M. Wang, Jeffrey M. Hokanson, Caetano Melone, Mario Di Renzo, Steve Jones, Javier Urzay, Gianluca Iaccarino

    Abstract: An integrated computational framework is introduced to study complex engineering systems through physics-based ensemble simulations on heterogeneous supercomputers. The framework is primarily designed for the quantitative assessment of laser-induced ignition in rocket engines. We develop and combine an implicit programming system, a compressible reacting flow solver, and a data generation/manageme… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: 28 pages, 12 figures

  25. arXiv:2111.02330  [pdf, other

    cs.SI physics.soc-ph stat.ML

    Geodesic statistics for random network families

    Authors: Sahil Loomba, Nick S. Jones

    Abstract: A key task in the study of networked systems is to derive local and global properties that impact connectivity, synchronizability, and robustness. Computing shortest paths or geodesics in the network yields measures of node centrality and network connectivity that can contribute to explain such phenomena. We derive an analytic distribution of shortest path lengths, on the giant component in the su… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 32 pages, 12 figures

  26. arXiv:2110.08223  [pdf, other

    cs.LG

    Simultaneous Missing Value Imputation and Structure Learning with Groups

    Authors: Pablo Morales-Alvarez, Wenbo Gong, Angus Lamb, Simon Woodhead, Simon Peyton Jones, Nick Pawlowski, Miltiadis Allamanis, Cheng Zhang

    Abstract: Learning structures between groups of variables from data with missing values is an important task in the real world, yet difficult to solve. One typical scenario is discovering the structure among topics in the education domain to identify learning pathways. Here, the observations are student performances for questions under each topic which contain missing values. However, most existing methods… ▽ More

    Submitted 24 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  27. arXiv:2110.04866  [pdf, other

    cs.LG

    CoRGi: Content-Rich Graph Neural Networks with Attention

    Authors: Jooyeon Kim, Angus Lamb, Simon Woodhead, Simon Peyton Jones, Cheng Zheng, Miltiadis Allamanis

    Abstract: Graph representations of a target domain often project it to a set of entities (nodes) and their relations (edges). However, such projections often miss important and rich information. For example, in graph representations used in missing value imputation, items - represented as nodes - may contain rich textual information. However, when processing graphs with graph neural networks (GNN), such inf… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

  28. arXiv:2106.01998  [pdf, other

    cs.HC cs.AI cs.CR

    Toward Explainable Users: Using NLP to Enable AI to Understand Users' Perceptions of Cyber Attacks

    Authors: Faranak Abri, Luis Felipe Gutierrez, Chaitra T. Kulkarni, Akbar Siami Namin, Keith S. Jones

    Abstract: To understand how end-users conceptualize consequences of cyber security attacks, we performed a card sorting study, a well-known technique in Cognitive Sciences, where participants were free to group the given consequences of chosen cyber attacks into as many categories as they wished using rationales they see fit. The results of the open card sorting study showed a large amount of inter-particip… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 20 pages, 3 figures, COMPSAC'21

  29. arXiv:2105.02856  [pdf, other

    cs.PL cs.DS

    Hashing Modulo Alpha-Equivalence

    Authors: Krzysztof Maziarz, Tom Ellis, Alan Lawrence, Andrew Fitzgibbon, Simon Peyton Jones

    Abstract: In many applications one wants to identify identical subtrees of a program syntax tree. This identification should ideally be robust to alpha-renaming of the program, but no existing technique has been shown to achieve this with good efficiency (better than $\mathcal{O}(n^2)$ in expression size). We present a new, asymptotically efficient way to hash modulo alpha-equivalence. A key insight of our… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI 2021)

  30. arXiv:2104.11612  [pdf, other

    cs.CL cs.SI

    Understanding who uses Reddit: Profiling individuals with a self-reported bipolar disorder diagnosis

    Authors: Glorianna Jagfeld, Fiona Lobban, Paul Rayson, Steven H. Jones

    Abstract: Recently, research on mental health conditions using public online data, including Reddit, has surged in NLP and health research but has not reported user characteristics, which are important to judge generalisability of findings. This paper shows how existing NLP methods can yield information on clinical, demographic, and identity characteristics of almost 20K Reddit users who self-report a bipol… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: The Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access @NAACL 2021; Visual abstract on p. 14

  31. It's All About The Cards: Sharing on Social Media Probably Encouraged HTML Metadata Growth

    Authors: Shawn M. Jones, Valentina Neblitt-Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson

    Abstract: In a perfect world, all articles consistently contain sufficient metadata to describe the resource. We know this is not the reality, so we are motivated to investigate the evolution of the metadata that is present when authors and publishers supply their own. Because applying metadata takes time, we recognize that each news article author has a limited metadata budget with which to spend their tim… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: 10 pages, 10 figures, 3 tables

  32. arXiv:2104.04034  [pdf, other

    cs.CY cs.HC

    Results and Insights from Diagnostic Questions: The NeurIPS 2020 Education Challenge

    Authors: Zichao Wang, Angus Lamb, Evgeny Saveliev, Pashmina Cameron, Yordan Zaykov, Jose Miguel Hernandez-Lobato, Richard E. Turner, Richard G. Baraniuk, Craig Barton, Simon Peyton Jones, Simon Woodhead, Cheng Zhang

    Abstract: This competition concerns educational diagnostic questions, which are pedagogically effective, multiple-choice questions (MCQs) whose distractors embody misconceptions. With a large and ever-increasing number of such questions, it becomes overwhelming for teachers to know which questions are the best ones to use for their students. We thus seek to answer the following question: how can we use data… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2007.12061

  33. Automatically Selecting Striking Images for Social Cards

    Authors: Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson

    Abstract: To allow previewing a web page, social media platforms have developed social cards: visualizations consisting of vital information about the underlying resource. At a minimum, social cards often include features such as the web resource's title, text summary, striking image, and domain name. News and scholarly articles on the web are frequently subject to social card creation when being shared on… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 10 pages, 5 figures, 10 tables

  34. arXiv:2101.00503  [pdf, other

    stat.CO cs.LG cs.SI nlin.AO physics.soc-ph

    Modularity maximisation for graphons

    Authors: Florian Klimm, Nick S. Jones, Michael T. Schaub

    Abstract: Networks are a widely-used tool to investigate the large-scale connectivity structure in complex systems and graphons have been proposed as an infinite size limit of dense networks. The detection of communities or other meso-scale structures is a prominent topic in network science as it allows the identification of functional building blocks in complex systems. When such building blocks may be pre… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

  35. arXiv:2012.14488  [pdf, other

    cs.CR cs.LG

    Phishing Detection through Email Embeddings

    Authors: Luis Felipe Gutiérrez, Faranak Abri, Miriam Armstrong, Akbar Siami Namin, Keith S. Jones

    Abstract: The problem of detecting phishing emails through machine learning techniques has been discussed extensively in the literature. Conventional and state-of-the-art machine learning algorithms have demonstrated the possibility of building classifiers with high accuracy. The existing research studies treat phishing and genuine emails through general indicators and thus it is not exactly clear what phis… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

  36. arXiv:2012.02643  [pdf, other

    cs.SD cs.CV eess.AS

    Predicting Emotions Perceived from Sounds

    Authors: Faranak Abri, Luis Felipe Gutiérrez, Akbar Siami Namin, David R. W. Sears, Keith S. Jones

    Abstract: Sonification is the science of communication of data and events to users through sounds. Auditory icons, earcons, and speech are the common auditory display schemes utilized in sonification, or more specifically in the use of audio to convey information. Once the captured data are perceived, their meanings, and more importantly, intentions can be interpreted more easily and thus can be employed as… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 10 pages

  37. arXiv:2012.00648  [pdf, other

    cs.CR cs.HC

    Cyber-Attack Consequence Prediction

    Authors: Prerit Datta, Natalie Lodinger, Akbar Siami Namin, Keith S. Jones

    Abstract: Cyber-physical systems posit a complex number of security challenges due to interconnection of heterogeneous devices having limited processing, communication, and power capabilities. Additionally, the conglomeration of both physical and cyber-space further makes it difficult to devise a single security plan spanning both these spaces. Cyber-security researchers are often overloaded with a variety… ▽ More

    Submitted 2 December, 2020; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: 9 pages. The pre-print of a paper to appear in the proceedings of the 3rd Workshop on Big Data Engineering and Analytics in Cyber-Physical Systems (BigEACPS'20), IEEE BigData Conference 2020

  38. arXiv:2011.05774  [pdf, other

    physics.soc-ph cond-mat.stat-mech cs.SI math.OC

    Influencing dynamics on social networks without knowledge of network microstructure

    Authors: Matthew Garrod, Nick S. Jones

    Abstract: Social network based information campaigns can be used for promoting beneficial health behaviours and mitigating polarisation (e.g. regarding climate change or vaccines). Network-based intervention strategies typically rely on full knowledge of network structure. It is largely not possible or desirable to obtain population-level social network data due to availability and privacy issues. It is eas… ▽ More

    Submitted 27 July, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

  39. arXiv:2010.04260  [pdf, other

    cs.CL cs.IR

    Fake Reviews Detection through Analysis of Linguistic Features

    Authors: Faranak Abri, Luis Felipe Gutierrez, Akbar Siami Namin, Keith S. Jones, David R. W. Sears

    Abstract: Online reviews play an integral part for success or failure of businesses. Prior to purchasing services or goods, customers first review the online comments submitted by previous customers. However, it is possible to superficially boost or hinder some businesses through posting counterfeit and fake reviews. This paper explores a natural language processing approach to identify fake reviews. We pre… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: The pre-print of a paper to appear in the proceedings of the IEEE International Conference on Machine Learning Applications (ICMLA 2020), 11 pages, 3 figures, 5 tables

  40. arXiv:2008.05337  [pdf, other

    cs.SI physics.soc-ph stat.ME

    Inference of a universal social scale and segregation measures using social connectivity kernels

    Authors: Till Hoffmann, Nick S. Jones

    Abstract: How people connect with one another is a fundamental question in the social sciences, and the resulting social networks can have a profound impact on our daily lives. Blau offered a powerful explanation: people connect with one another based on their positions in a social space. Yet a principled measure of social distance, allowing comparison within and between societies, remains elusive. We use t… ▽ More

    Submitted 28 October, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: Article: 23 pages, 3 figures. Supplementary material: 8 pages, 1 figure

    Journal ref: J. R. Soc. Interface. 17: 20200638 (2020)

  41. arXiv:2008.00139  [pdf, other

    cs.DL cs.HC cs.IR

    SHARI -- An Integration of Tools to Visualize the Story of the Day

    Authors: Shawn M. Jones, Alexander C. Nwala, Martin Klein, Michele C. Weigle, Michael L. Nelson

    Abstract: Tools such as Google News and Flipboard exist to convey daily news, but what about the past? In this paper, we describe how to combine several existing tools with web archive holdings to perform news analysis and visualization of the "biggest story" for a given date. StoryGraph clusters news articles together to identify a common news story. Hypercane leverages ArchiveNow to store URLs produced by… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: 19 pages, 16 figures, 1 Table

    ACM Class: H.3.7; H.3.6; H.3.4

    Journal ref: Presented at the Web Archiving and Digital Libraries 2020 Workshop

  42. arXiv:2008.00137  [pdf, other

    cs.DL cs.HC cs.IR

    MementoEmbed and Raintale for Web Archive Storytelling

    Authors: Shawn M. Jones, Martin Klein, Michele C. Weigle, Michael L. Nelson

    Abstract: For traditional library collections, archivists can select a representative sample from a collection and display it in a featured physical or digital library space. Web archive collections may consist of thousands of archived pages, or mementos. How should an archivist display this sample to drive visitors to their collection? Search engines and social media platforms often represent web pages as… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: 54 pages, 5 tables, 46 figures

    ACM Class: H.3.7; H.3.6; H.3.4

    Journal ref: Presented at the Web Archiving and Digital Libraries 2020 Workshop

  43. arXiv:2007.12061  [pdf, other

    cs.CY cs.HC cs.LG

    Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge

    Authors: Zichao Wang, Angus Lamb, Evgeny Saveliev, Pashmina Cameron, Yordan Zaykov, José Miguel Hernández-Lobato, Richard E. Turner, Richard G. Baraniuk, Craig Barton, Simon Peyton Jones, Simon Woodhead, Cheng Zhang

    Abstract: Digital technologies are becoming increasingly prevalent in education, enabling personalized, high quality education resources to be accessible by students across the world. Importantly, among these resources are diagnostic questions: the answers that the students give to these questions reveal key information about the specific nature of misconceptions that the students may hold. Analyzing the ma… ▽ More

    Submitted 12 April, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 28 pages, 6 figures, NeurIPS 2020 Competition Track

  44. arXiv:2006.07914  [pdf, other

    cs.CR cs.NI

    Cloud as an Attack Platform

    Authors: Moitrayee Chatterjee, Prerit Datta, Faranak Abri, Akbar Siami Namin, Keith S. Jones

    Abstract: We present an exploratory study of responses from $75$ security professionals and ethical hackers in order to understand how they abuse cloud platforms for attack purposes. The participants were recruited at the Black Hat and DEF CON conferences. We presented the participants' with various attack scenarios and asked them to explain the steps they would have carried out for launching the attack in… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  45. arXiv:2006.07912  [pdf, other

    cs.LG cs.SI

    Fake Reviews Detection through Ensemble Learning

    Authors: Luis Gutierrez-Espinoza, Faranak Abri, Akbar Siami Namin, Keith S. Jones, David R. W. Sears

    Abstract: Customers represent their satisfactions of consuming products by sharing their experiences through the utilization of online reviews. Several machine learning-based approaches can automatically detect deceptive and fake reviews. Recently, there have been studies reporting the performance of ensemble learning-based approaches in comparison to conventional machine learning techniques. Motivated by t… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  46. arXiv:2006.07908  [pdf, other

    cs.CR cs.NI

    Launching Stealth Attacks using Cloud

    Authors: Moitrayee Chatterjee, Prerit Datta, Faranak Abri, Akbar Siami Namin, Keith S. Jones

    Abstract: Cloud computing offers users scalable platforms and low resource cost. At the same time, the off-site location of the resources of this service model makes it more vulnerable to certain types of adversarial actions. Cloud computing has not only gained major user base, but also, it has the features that attackers can leverage to remain anonymous and stealth. With convenient access to data and techn… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  47. arXiv:2005.10957  [pdf, other

    eess.IV cs.CV

    Classification of Epithelial Ovarian Carcinoma Whole-Slide Pathology Images Using Deep Transfer Learning

    Authors: Yiping Wang, David Farnell, Hossein Farahani, Mitchell Nursey, Basile Tessier-Cloutier, Steven J. M. Jones, David G. Huntsman, C. Blake Gilks, Ali Bashashati

    Abstract: Ovarian cancer is the most lethal cancer of the female reproductive organs. There are $5$ major histological subtypes of epithelial ovarian cancer, each with distinct morphological, genetic, and clinical features. Currently, these histotypes are determined by a pathologist's microscopic examination of tumor whole-slide images (WSI). This process has been hampered by poor inter-observer agreement (… ▽ More

    Submitted 28 June, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Report number: MIDL/2020/ExtendedAbstract/VXdQD8B307

  48. arXiv:2003.07146  [pdf, other

    cs.SI physics.soc-ph

    Inference and Influence of Large-Scale Social Networks Using Snapshot Population Behaviour without Network Data

    Authors: Antonia Godoy-Lorite, Nick S. Jones

    Abstract: Population behaviours, such as voting and vaccination, depend on social networks. Social networks can differ depending on behaviour type and are typically hidden. However, we do often have large-scale behavioural data, albeit only snapshots taken at one timepoint. We present a method that jointly infers large-scale network structure and a networked model of human behaviour using only snapshot popu… ▽ More

    Submitted 23 March, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

  49. arXiv:2003.05980  [pdf, other

    cs.CY cs.LG stat.AP

    Educational Question Mining At Scale: Prediction, Analysis and Personalization

    Authors: Zichao Wang, Sebastian Tschiatschek, Simon Woodhead, Jose Miguel Hernandez-Lobato, Simon Peyton Jones, Richard G. Baraniuk, Cheng Zhang

    Abstract: Online education platforms enable teachers to share a large number of educational resources such as questions to form exercises and quizzes for students. With large volumes of available questions, it is important to have an automated way to quantify their properties and intelligently select them for students, enabling effective and personalized learning experiences. In this work, we propose a fram… ▽ More

    Submitted 28 February, 2021; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Accepted at AAAI-EAAI 2021

  50. arXiv:1910.11717  [pdf, other

    cs.PL

    Selective Lambda Lifting

    Authors: Sebastian Graf, Simon Peyton Jones

    Abstract: Lambda lifting is a well-known transformation, traditionally employed for compiling functional programs to supercombinators. However, more recent abstract machines for functional languages like OCaml and Haskell tend to do closure conversion instead for direct access to the environment, so lambda lifting is no longer necessary to generate machine code. We propose to revisit selective lambda liftin… ▽ More

    Submitted 28 October, 2019; v1 submitted 25 October, 2019; originally announced October 2019.

    Comments: Rejected from ICFP 2019