Hitachi Vantara Pentaho Community Wiki
Skip to end of metadata
Go to start of metadata

Description

 

Use the HTTP job entry to retrieve a file from a Web server using the HTTP protocol. This job entry could be used to access data on partner Web sites. For example, the daily data export or daily list of customers is located at a specified Web site. Also, SaaS providers may give you a URL to locate a report. You can call that URL to retrieve an Excel file or zip file that contains the data. Salesforce requires that you use SOAP APIs to retrieve data.

If HTTP traffic is too heavy in your corporate environment, you may choose to use a proxy server with HTTP authentication.

General

Option

Definition

Name of job entry

The name of the job entry. This name has to be unique in a single job. A job entry can be placed several times on the canvas, however it will be the same job entry.

URL

The HTTP URL of the file to retrieve, or the directory name to store an uploaded file to (for example: http://kettle.pentaho.org)

Run for every result row?

Check this if you want to run this job entry for every row that was generated by a previous transformation. Use the "Copy rows to result". If selected, an HTTP request will be made for each result. Otherwise, the file is only retrieved once

Input field which contains URL

If the "Run for every result row?" option is selected, the field specified here will determine the file URL for each row

Input field which contains upload file name

If the "Run for every result row?" option is selected, the field specified here will determine the local file that will be uploaded to the URL associated with the "Input field which contains URL" field.

Input field which contains destination file name

If the "Run for every result row?" option is selected, the field specified here will determine the local file where the result downloaded from the URL associated with the "Input field which contains URL" field.

Username

If the site requires authentication, use this username to log in

Password

If a username is defined, this is the password for it

Proxy server for upload

The URL of a proxy server that you want to connect to the HTTP URL through

Proxy port

If a proxy server is defined, this is the port number it listens on

Ignore proxy for hosts

A regular expression list of exceptions for proxy redirection. This may be useful when working on an intranet or VPN

Upload file

If you are uploading a file, this will be its name on the remote server

Target file

If you are downloading a file, this its name on your local filesystem

Append to specified target file?

If selected, and if the target file already exists, PDI will append all new data to the end of the file

Add date and time to file name?

If selected, the date and time of the HTTP request (in yyyMMdd_HHmmss format) will be added to the target filename

Target file extension

If the previous option is selected, this field specifies the extension (letters after the dot) of the target filename

Add filename to result filename

Any files that are copied will appear as a result from this step; shows a list of files that were copied in this step

Headers

Option

Definition

#

Order that the header should be processed

Name

The name of this HTTP header

Value

The actual header value to pass

  • No labels