Jacek Siciarek - Science profile - Bridge of Knowledge

Search

Publication showcase

  • Extraction of information from born-digital PDF documents for reproducible research

    Born-digital PDF electronic documents might reasonably be expected to preserve useful data units of their source originals that suffice to produce executable papers for reproducible research. Unfortunately, developers of authoring tools may adopt arbitrary PDF generation strategies, producing a plethora of internal data representations. Such common information units as text paragraphs, tables, function graphs and flow diagrams,...

    Full text available to download

  • Semantic Driven Table Understanding in Born-Digital Documents

    - Year 2014

    This paper presents a new approach to table understanding, suitable for born-digital PDF documents. Advance beyond the current state of the art in table understanding is provided by the proposed reverse MVC method, which takes advantage of only partial logic structure loss (degradation) in born-digital PDF documents, as opposed to unrecoverable loss (deterioration) taking place in scan based PDF documents.

    Full text to download in external service

  • For Your Eyes Only – Biometric Protection of PDF Documents

    The paper introduces a concept of a digital document content encryption/decryption with facial biometric data coming from a legitimate user. Access to the document content is simple and straightforward, especially during collaborative work with mobile devices equipped with cameras. Various contexts of document exchange are presented with regard to the next generation pro-active digital documents proposed by authors. An important...

    Full text available to download

seen 653 times