vtkDocumentReader Class Reference

#include <vtkDocumentReader.h>


Detailed Description

Reads documents into memory for text analysis.

Reads zero-to-many documents into memory, producing a vtkTable suitable for use as an input to other VTK text analysis filters.

Parameters: "Files": a collection of filesystem paths to be loaded.

Outputs: Output port 0: A vtkTable containing "document", "uri", and "content" columns.

The output "document" column will contain a zero-based integer document index; "uri" will contain the filepath to the document formatted as a file:// URI; "contents" will contain the binary contents of the document.

Warning:
As a workaround, vtkDocumentReader stores the contents of each document in the "contents" column, which is a vtkStdString array. Note that the contents of a document may actually be binary data, so check the MIME-Type before treating the contents as a string.
Thanks:
Developed by Timothy M. Shead (tshead@sandia.gov) at Sandia National Laboratories.
BTX Implementation* const Internal; ETX

Events:
vtkCommand::ProgressEvent
Tests:
vtkDocumentReader (Tests)

Definition at line 26 of file vtkDocumentReader.h.


The documentation for this class was generated from the following file:

Generated on Mon Sep 27 18:22:24 2010 for VTK by  doxygen 1.5.6