Skip to main content

Showing 1–11 of 11 results for author: Cardenas, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10643  [pdf, other

    cs.CL cs.AI

    `Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory

    Authors: Ronald Cardenas, Matthias Galle, Shay B. Cohen

    Abstract: Extractive summaries are usually presented as lists of sentences with no expected cohesion between them. In this paper, we aim to enforce cohesion whilst controlling for informativeness and redundancy in summaries, in cases where the input exhibits high redundancy. The pipeline controls for redundancy in long inputs as it is consumed, and balances informativeness and cohesion during sentence selec… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  2. Mercury: A modeling, simulation, and optimization framework for data stream-oriented IoT applications

    Authors: Román Cárdenas, Patricia Arroba, Roberto Blanco, Pedro Malagón, José L. Risco-Martín, José M. Moya

    Abstract: The Internet of Things is transforming our society by monitoring users and infrastructures' behavior to enable new services that will improve life quality and resource management. These applications require a vast amount of localized information to be processed in real-time so, the deployment of new fog computing infrastructures that bring computing closer to the data sources is a major concern. I… ▽ More

    Submitted 2 November, 2023; originally announced December 2023.

    Journal ref: Simulation Modelling Practice and Theory, 101, 2019

  3. arXiv:2310.15077  [pdf, other

    cs.CL

    'Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Science Journalism

    Authors: Ronald Cardenas, Bingsheng Yao, Dakuo Wang, Yufang Hou

    Abstract: Science journalism refers to the task of reporting technical findings of a scientific paper as a less technical news article to the general public audience. We aim to design an automated system to support this real-world task (i.e., automatic science journalism) by 1) introducing a newly-constructed and real-world dataset (SciTechNews), with tuples of a publicly-available scientific paper, its cor… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  4. arXiv:2309.16544  [pdf

    cs.SE cs.PF

    The DEVStone Metric: Performance Analysis of DEVS Simulation Engines

    Authors: Román Cárdenas, Kevin Henares, Patricia Arroba, José L. Risco-Martín, Gabriel A. Wainer

    Abstract: The DEVStone benchmark allows us to evaluate the performance of discrete-event simulators based on the DEVS formalism. It provides model sets with different characteristics, enabling the analysis of specific issues of simulation engines. However, this heterogeneity hinders the comparison of the results among studies, as the results obtained on each research work depend on the chosen subset of DEVS… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Journal ref: ACM Transactions on Modeling and Computer Simulation, 32(3), pp. 1-20, 2022

  5. arXiv:2306.07981  [pdf

    cs.CR cs.LG cs.SE

    Feature Engineering-Based Detection of Buffer Overflow Vulnerability in Source Code Using Neural Networks

    Authors: Mst Shapna Akter, Hossain Shahriar, Juan Rodriguez Cardenas, Sheikh Iqbal Ahamed, Alfredo Cuzzocrea

    Abstract: One of the most significant challenges in the field of software code auditing is the presence of vulnerabilities in software source code. Every year, more and more software flaws are discovered, either internally in proprietary code or publicly disclosed. These flaws are highly likely to be exploited and can lead to system compromise, data leakage, or denial of service. To create a large-scale mac… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  6. Bringing AI to the edge: A formal M&S specification to deploy effective IoT architectures

    Authors: Román Cárdenas, Patricia Arroba, José L. Risco-Martín

    Abstract: The Internet of Things is transforming our society, providing new services that improve the quality of life and resource management. These applications are based on ubiquitous networks of multiple distributed devices, with limited computing resources and power, capable of collecting and storing data from heterogeneous sources in real-time. To avoid network saturation and high delays, new architect… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Journal ref: Journal of Simulation, 16(5), pp. 494-511, 2019

  7. Sustainable Edge Computing: Challenges and Future Directions

    Authors: Patricia Arroba, Rajkumar Buyya, Román Cárdenas, José L. Risco-Martín, José M. Moya

    Abstract: An increasing amount of data is being injected into the network from IoT (Internet of Things) applications. Many of these applications, developed to improve society's quality of life, are latency-critical and inject large amounts of data into the network. These requirements of IoT applications trigger the emergence of Edge computing paradigm. Currently, data centers are responsible for a global en… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 26 pages, 16 figures

  8. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  9. On the Trade-off between Redundancy and Local Coherence in Summarization

    Authors: Ronald Cardenas, Matthias Galle, Shay B. Cohen

    Abstract: Extractive summaries are usually presented as lists of sentences with no expected cohesion between them and with plenty of redundant information if not accounted for. In this paper, we investigate the trade-offs incurred when aiming to control for inter-sentential cohesion and redundancy in extracted summaries, and their impact on their informativeness. As case study, we focus on the summarization… ▽ More

    Submitted 6 June, 2024; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted to JAIR

    Journal ref: Journal of Artificial Intelligence Research, 80, 273-326 (2024)

  10. arXiv:2104.08392  [pdf, other

    cs.CL

    Unsupervised Extractive Summarization by Human Memory Simulation

    Authors: Ronald Cardenas, Matthias Galle, Shay B. Cohen

    Abstract: Summarization systems face the core challenge of identifying and selecting important information. In this paper, we tackle the problem of content selection in unsupervised extractive summarization of long, structured documents. We introduce a wide range of heuristics that leverage cognitive representations of content units and how these are retained or forgotten in human memory. We find that prope… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  11. arXiv:1904.05426  [pdf, other

    cs.CL cs.AI cs.LG

    A Grounded Unsupervised Universal Part-of-Speech Tagger for Low-Resource Languages

    Authors: Ronald Cardenas, Ying Lin, Heng Ji, Jonathan May

    Abstract: Unsupervised part of speech (POS) tagging is often framed as a clustering problem, but practical taggers need to \textit{ground} their clusters as well. Grounding generally requires reference labeled data, a luxury a low-resource language might not have. In this work, we describe an approach for low-resource unsupervised POS tagging that yields fully grounded output and requires no labeled trainin… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: NAACL-HLT 2019, 12 pages, code available at https://github.com/isi-nlp/universal-cipher-pos-tagging