Extract Custom Metadata from PDF Files

You can extract custom metadata from PDF files either by specifying individual metadata tag names, or by extracting all custom metadata at once.

Extract Custom Metadata by Tag

To extract custom metadata by metadata tag, add the custom metadata names to the pdfsr.ini file provided, and copy the modified file to the bin directory. You can then extract metadata as you normally would.

The pdfsr.ini is in the directory samples\pdfini, and has the following structure:

<META>
<TOTAL>total_item_number</TOTAL>,
/metadata_tag_name datatype,
</META>
Parameter Description
total item number The total number of metadata tags that are listed.
metadata_tag_name The metadata tag name used in the PDF files.
datatype The data type of the metadata field. Data types are defined in KVSumInfoType.

For example:

<META>
<TOTAL>4</TOTAL>
/part_number      INT4
/volume           INT4
/purchase_date    DATETIME
/customer         STRING
</META>

Extract All Custom Metadata

You can extract all metadata through the API.

To extract all metadata by using the API

  1. Call the fpInit() function.
  2. Call the fpFilterConfig() function with the following arguments:

    Argument Parameter
    nType KVFLT_EXPORTALLMETADATA
    nValue TRUE
    pData NULL

    For example:

    (*fpFilterConfig)(pKVFilter, KVFLT_EXPORTALLMETADATA, TRUE, NULL);
  3. Call the fpGetOLESummaryInfo() or fpGetOLESummaryInfoFile() function.

_HP_HTML5_bannerTitle.htm