Showing 1–12 of 12 results for author: Wigington, C

  1. arXiv:2406.12044

    cs.CV

    ARTIST: Improving the Generation of Text-rich Images by Disentanglement

    Authors: Jianyi Zhang, Yufan Zhou, Jiuxiang Gu, Curtis Wigington, Tong Yu, Yiran Chen, Tong Sun, Ruiyi Zhang

    Abstract: Diffusion models have demonstrated exceptional capabilities in generating a broad spectrum of visual content, yet their proficiency in rendering text is still limited: they often generate inaccurate characters or words that fail to blend well with the underlying image. To address these shortcomings, we introduce a new framework named ARTIST. This framework incorporates a dedicated textual diffusio…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.08354

    cs.CV cs.AI cs.LG

    DocSynthv2: A Practical Autoregressive Modeling for Document Generation

    Authors: Sanket Biswas, Rajiv Jain, Vlad I. Morariu, Jiuxiang Gu, Puneet Mathur, Curtis Wigington, Tong Sun, Josep Lladós

    Abstract: While the generation of document layouts has been extensively explored, comprehensive document generation encompassing both layout and content presents a more complex challenge. This paper delves into this advanced domain, proposing a novel approach called DocSynthv2 through the development of a simple yet effective autoregressive structured model. Our model, distinct in its integration of both la…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Spotlight (Oral) Acceptance to CVPR 2024 Workshop for Graphic Design Understanding and Generation (GDUG)

  3. Experts prefer text but videos help novices: an analysis of the utility of multi-media content

    Authors: Hayeong Song, Jennifer Healey, Alexa Siu, Curtis Wigington, John Stasko

    Abstract: Multi-media increases engagement and is increasingly prevalent in online content including news, web blogs, and social media, however, it may not always be beneficial to users. To determine what types of media users actually wanted, we conducted an exploratory study where users got to choose their own media augmentation. Our findings showed that users desired different amounts and types of media d…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: in CHI'23 Extended Abstracts on Human Factors in Computing Systems, 2023

  4. arXiv:2207.07972

    cs.LG cs.CR

    Certified Neural Network Watermarks with Randomized Smoothing

    Authors: Arpit Bansal, Ping-yeh Chiang, Michael Curry, Rajiv Jain, Curtis Wigington, Varun Manjunatha, John P Dickerson, Tom Goldstein

    Abstract: Watermarking is a commonly used strategy to protect creators' rights to digital images, videos and audio. Recently, watermarking methods have been extended to deep learning models -- in principle, the watermark should be preserved when an adversary tries to copy the model. However, in practice, watermarks can often be removed by an intelligent adversary. Several papers have proposed watermarking m…

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: ICML 2022

    Journal ref: ICML 2022

  5. arXiv:2203.16618

    cs.CV

    End-to-end Document Recognition and Understanding with Dessurt

    Authors: Brian Davis, Bryan Morse, Bryan Price, Chris Tensmeyer, Curtis Wigington, Vlad Morariu

    Abstract: We introduce Dessurt, a relatively simple document understanding transformer capable of being fine-tuned on a greater variety of document tasks than prior methods. It receives a document image and task string as input and generates arbitrary text autoregressively as output. Because Dessurt is an end-to-end architecture that performs text recognition in addition to the document understanding, it do…

    Submitted 15 June, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  6. arXiv:2104.08689

    cs.CV

    RPCL: A Framework for Improving Cross-Domain Detection with Auxiliary Tasks

    Authors: Kai Li, Curtis Wigington, Chris Tensmeyer, Vlad I. Morariu, Handong Zhao, Varun Manjunatha, Nikolaos Barmpalios, Yun Fu

    Abstract: Cross-Domain Detection (XDD) aims to train an object detector using labeled image from a source domain but have good performance in the target domain with only unlabeled images. Existing approaches achieve this either by aligning the feature maps or the region proposals from the two domains, or by transferring the style of source images to that of target image. Contrasted with prior work, this pap…

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 10 pages, 5 figures

  7. arXiv:2104.06536

    cs.HC

    Lets Make A Story Measuring MR Child Engagement

    Authors: Duotun Wang, Jennifer Healey, Jing Qian, Curtis Wigington, Tong Sun, Huaishu Peng

    Abstract: We present the result of a pilot study measuring child engagement with the Lets Make A Story system, a novel mixed reality, MR, collaborative storytelling system designed for grandparents and grandchildren. We compare our MR experience against an equivalent paper story experience. The goal of our pilot was to test the system with actual child users and assess the goodness of using metrics of time,…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 3 pages, 3 figures, to be presented in the CHI Workshop on Evaluating User Experiences in Mixed Reality, see "https://sputze.github.io/evaluating-mr/"

    ACM Class: H.5.1; H.5.2

  8. arXiv:2009.00678

    cs.CV

    Text and Style Conditioned GAN for Generation of Offline Handwriting Lines

    Authors: Brian Davis, Chris Tensmeyer, Brian Price, Curtis Wigington, Bryan Morse, Rajiv Jain

    Abstract: This paper presents a GAN for generating images of handwritten lines conditioned on arbitrary text and latent style vectors. Unlike prior work, which produce stroke points or single-word images, this model generates entire lines of offline handwriting. The model produces variable-sized images by using style vectors to determine character widths. A generator network is trained with GAN and autoenco…

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: Includes Supplementary Material. Accepted at BMVC 2020. 32 pages, 30 figures

  9. arXiv:2004.12016

    cs.HC

    Using Behavioral Interactions from a Mobile Device to Classify the Reader's Prior Familiarity and Goal Conditions

    Authors: Sungjin Nam, Zoya Bylinskii, Christopher Tensmeyer, Curtis Wigington, Rajiv Jain, Tong Sun

    Abstract: A student reads a textbook to learn a new topic; an attorney leafs through familiar legal documents. Each reader may have a different goal for, and prior knowledge of, their reading. A mobile context, which captures interaction behavior, can provide insights about these reading conditions. In this paper, we focus on understanding the different reading conditions of mobile readers, as such an under…

    Submitted 24 April, 2020; originally announced April 2020.

  10. arXiv:2003.13197

    cs.CV

    Cross-Domain Document Object Detection: Benchmark Suite and Method

    Authors: Kai Li, Curtis Wigington, Chris Tensmeyer, Handong Zhao, Nikolaos Barmpalios, Vlad I. Morariu, Varun Manjunatha, Tong Sun, Yun Fu

    Abstract: Decomposing images of document pages into high-level semantic regions (e.g., figures, tables, paragraphs), document object detection (DOD) is fundamental for downstream tasks like intelligent document editing and understanding. DOD remains a challenging problem as document objects vary significantly in layout, size, aspect ratio, texture, etc. An additional challenge arises in practice because lar…

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: To appear in CVPR 2020

  11. arXiv:1808.01423

    cs.CV

    Language Model Supervision for Handwriting Recognition Model Adaptation

    Authors: Chris Tensmeyer, Curtis Wigington, Brian Davis, Seth Stewart, Tony Martinez, William Barrett

    Abstract: Training state-of-the-art offline handwriting recognition (HWR) models requires large labeled datasets, but unfortunately such datasets are not available in all languages and domains due to the high cost of manual labeling. We address this problem by showing how high resource languages can be leveraged to help train models for low resource languages. We propose a transfer learning methodology where…

    Submitted 4 August, 2018; originally announced August 2018.

  12. arXiv:1709.01618

    cs.CV

    PageNet: Page Boundary Extraction in Historical Handwritten Documents

    Authors: Chris Tensmeyer, Brian Davis, Curtis Wigington, Iain Lee, Bill Barrett

    Abstract: When digitizing a document into an image, it is common to include a surrounding border region to visually indicate that the entire document is present in the image. However, this border should be removed prior to automated processing. In this work, we present a deep learning based system, PageNet, which identifies the main page region in an image in order to segment content from both textual and n…

    Submitted 5 September, 2017; originally announced September 2017.

    Comments: HIP 2017 (in submission)