Import delimited files

Delimited information contains data fields that are delimited by certain separators.

For example:

ISBN-0749401161 Winnie the Pooh, 5 years old, main character of The house at Pooh corner, Andrew A Milne.

This content could be imported to obtain the fields (in order):

DREREFERENCE, DRETITLE, AGEGROUP, DRECONTENT, AUTHOR

You can have multiple entries in one delimited file. The Import Module is also flexible enough to deduce that if the start delimiter is not specified, the next field must be taken from the last end delimiter. This saves typing in duplicate delimiters.


Note:

  • For any content to be imported, one of the fields to which you import delimited information must be DREREFERENCE.
  • If you are importing from files that contain binary characters (which are not permitted in ASCII text editors/readers and so on), you can use the following control characters to allow the Import Module to import binary characters correctly.
Character ASCII code   Character ASCII code   Character ASCII code
<NUL> 0   <VT> 11   <SYN> 22
<SOH> 1   <FF> 12   <ETB> 23
<STXH> 2   <CR> 13   <CAN> 24
<ETX> 3   <SO> 14   <EM> 25
<EOT> 4   <SI> 15   <SIB> 26
<ENQ> 5   <SLE> 16   <ESC> 27
<ACK> 6   <CS1> 17   <FS> 28
<BEL> 7   <DC2> 18   <GS> 29
<BS> 8   <DC3> 19   <RS> 30
<HT> 9   <DC4> 20   <US> 31
<LF> 10   <NAK> 21   <SP> 32

You can use these control characters in the following parameters:


ImportDelimitedDocEnd

ImportDelimitedDocStart

ImportDelimitedEnd<N>

ImportDelimitedStart<N>


For example:

String↵
text text text text
String↵
text text text text

In this example, the delimited document contains binary characters (carriage returns) that act as delimiters. To import the text content correctly, you need to set the ImportDelimitedDocStart parameter to String<CR>.


You can also use the following parameters to specify how the Import Module deals with delimited files:


ImportDelimitedEncoding

ImportDelimitedExtns

ImportDelimitedField<N>

ImportDelimitedFillInRefFromHTML

ImportDelimitedSkipChars<N>