Previous Next


                                                    954
       CHAPTER 10                                                           Document Interchange



KEY   TYPE             VALUE

ID    byte string      (Required) The digital identifier of the content set (see “Digital Identifiers” on page
                       950). If the content set has been located by means of the URLS name tree, this allows its
                       related entry in the IDS name tree to be found.

O     array            (Required) An array of indirect references to the objects belonging to the content set.
                       The order of objects in the array is undefined in general but may be restricted by spe-
                       cific content set subtypes.

SI    dictionary       (Req uired) A source information dictionary (see Section 10.9.4, “Source Information”)
      or array         or an array of such dictionaries, describing the sources from which the objects belong-
                       ing to the content set were created.

CT    ASCII string     (Optional) The content type, an ASCII string characterizing the source from which the
                       objects belonging to the content set were created. The string should conform to the
                       content type specification described in Internet RFC 2045, Multipurpose Internet Mail
                       Extensions (MIME) Part One: Format of Internet Message Bodies (see the Bibliography).
                       For example, for a page set consisting of a group of PDF pages created from an HTML
                       file, the content type would be text / html.

TS    date             (Optional) A time stamp giving the date and time at which the content set was created.


                    TABLE 10.39 Additional entries specific to a Web Capture page set
KEY   TYPE             VALUE

S     name             (Required) The subtype of content set that this dictionary describes; must be SPS (“Spi-
                       der page set”) for a page set.

T     text string      (Optional) The title of the page set, a text string representing it in human-readable
                       form.

TID   byte string      (Optional) A text identifier generated from the text of the page set, as described in
                       “Digital Identifiers” on page 950.


       Image Sets

       An image set is a content set containing a group of image XObjects generated
       from a common source, such as multiple frames of an animated GIF image. (Web
       Capture 4.0 always generates a single image XObject for a given image.) A single
       XObject may not belong to more than one image set. Table 10.40 shows the con-
       tent set dictionary entries specific to this type of content set.

Previous Next