Hitachi Vantara Pentaho Community Wiki
Skip to end of metadata
Go to start of metadata

Description

LDIF is short for LDAP Data Interchange Format.  See also: http://en.wikipedia.org/wiki/LDIF .
The LDIF Input step reads information from such files.

Files Tab

The files tab is where you define the location of the files from which you want to read. The table below contains options associated with the Files tab:

Option

Description

Step Name

Name of the step; the name has to be unique in a single transformation.

Files origin

  • Get filename from field? : the previous step is giving filenames in a certain field in the input stream.  These are read.
  • Filename field : specify the field to the filenames from.

File or directory

Specifies the location and/or name of the input text file.
Important: Click Add to add the file/directory/wildcard combination to the list of selected files (grid) below.

Regular expression

Specifies the regular expression you want to use to select the files in the directory specified in the previous option.

Selected Files

Contains a list of selected files (or wildcard selections) and a property specifying if file is required or not. If a file is required and it is not found, an error is generated;otherwise, the file name is skipped.

Show filenames(s)...

Displays a list of all files that will be loaded based on the current selected file definitions

Content Tab

Option

Description

Include filename in output?

Allows you to specify a field name to include the file name (String) in the output of this step.

Rownum in output

Allows you to specify a field name to include the row number (Integer) in the output of this step.

Include content type?

Allows you to specify a field name to include the content type (String) in the output of this step.

Limit

Limits the number of rows to this number (zero (0) means all rows).

Multi valued field separator

For multi-valued content fields, you can specify a separator here. (default is a comma)

Result filenames

  • Add files to result filename : Adds the XML filenames read to the result of this transformation.  A unique list is being kept in memory that can be used in the next job entry in a job, for example in another transformation.

Fields

Option

Description

Name

The name of the output field

Attribute

The attribute to read

Type

The data type to convert to

Format

The format or conversion mask to use in the data type conversion

Length

The length of the output data type

Precision

The precision of the output data type

Currency

The currency symbol to use during data type conversion

Decimal

The numeric decimal symbol to use during data type conversion

Group

The numeric grouping symbol to use during data type conversion

Trim type

The type of trimming to use during data type conversion

Repeat

Repeat the column value of the previous row if the column value is empty (null)


  • No labels