Skip to main content

Microsoft Excel Input

Description

Microsoft Access Input is a step in the Input Plugin for Process Studio Workflows. Microsoft Access Input plugin step reads directly from Microsoft Access "MDB" files.

Configurations

No.Field NameDescription
1Step nameSpecify the name of the step. This name can me modified from the default name and has to be unique within the workflow.
2Spread sheet type (engine)Specify the spreadsheet type according to the file to be read.

- EXCEL 97-2003 XLS (JXL)

- EXCEL 2007 XLSX (Apache POI)

- EXCEL 2007 XLSX (Apache POI Streaming)

- Open Office ODS(ODFDOM)

- EXCEL Streamer

- EXCEL Binary XLSB

- Default value: EXCEL 97-2003 XLS (JXL)

Notes:

- Excel streamer like Apache POI Streaming is also for reading large XLSX files. It is memory efficient and works across machine configurations. Comparatively, Apache POI is faster in reading files but fails for some lower configuration machines with memory error.

- Excel Binary XLSB is provided to read Excel binary files.

- The File format is available from Excel 2007 onwards.

Advantages:

- The Excel workbook sheets can be read and written a bit faster, making them useful for the larger spreadsheets.

Notes:

- A little extra time is required for fetching headers of the file.

- By default, the type of all the columns are defined as String. The developer can change the type from the drop-down in Fields tab.

- Some options on Content tab (Stop on empty rows, No empty rows, Encoding) and Error handling tab (Ignore Errors) are disabled.

Limitations:

- File type as runtime parameter is not supported for Excel Binary files.

- Add filenames to result’ checkbox in Parameters Content tab to get Result Filenames is not supported for Excel Binary files.

3File or directorySpecify the location or the name of the input file or browse for file.
4Button: AddClick to add files to the selected files group.
5Button: FolderClick to browse and select a folder.
6Button: FileClick to browse and select a file.
7Regular ExpressionSpecify a regular expression for selecting files in the directory specified in the previous option.
8Exclude Regular ExpressionSpecify a location to exclude all files that meet criteria specified by the regular expression.
9File PasswordSpecify the password for the file.

Note: The field is available only if you select any one of the following options in the Spread sheet type (engine) field:

- EXCEL 2007 XLSX (Apache POI)

- EXCEL 2007 XLSX (Apache POI Streaming)

If the file you are using is already password protected, then ensure that you specify the same password in the field.

10Selected filesIt contains a tabular list of selected files (or wildcard selections) and a property specifying whether the file is required or not. The tabular list also includes columns for subfolders and file password (which appears encrypted).

Notes:

- If a file is required and it is not found, and an error is generated, otherwise, the file name is skipped.

- If the directory contains password-protected files, all files must have the same password.

- The step will fail if any file has a different password in the case of a directory path.

11DeleteClick Delete button to delete rows from Selected files data grid.
12EditClick Edit button to enable fields to be edited in data grid.
Accept filename from previous steps:
13Accept filenames from previous stepCheck the checkbox to read file names from the previous step in the workflow.
14Step to read filenames fromSpecify the step name to read filenames from.
15Field in the input to use as filenameSpecify the field in the input to use as a filename.
16Field in the input to use as passwordSelect the field that contains the file(s) password.

Note:

- The file name must be provided by the previous field.

- For file(s) that are not password-protected, leave the field value blank.

- The field is enabled only if the Spreed sheet type (engine) is Excel 2007 XLSX (Apache POI Streaming) or, Excel 2007 XLSX (Apache POI).

17Button: Show filenamesClick to view the file names of the sources connected successfully to the Microsoft Excel Input step.
Sheet Tab:
1SheetSpecify single or multiple sheet names or index in the excel sheet.

Note: You can also specify the sheet name as a variable or field.

2Select By

Select either Name or Index if you want to validate the value specified in the Sheet column, that is, either sheet name or sheet index.

You will see an error message if the value Select By column is unable to validate the value in the Sheet column. For example, if you provide the name of the sheet in the Sheet column and select Index in the Select By column.

3Start rowSpecify a start row in the corresponding sheet.

Note: You can also specify the start row as a variable or field.

4Start columnSpecify the start column in the corresponding sheet.

Note: You can also specify the start column as a variable or field.

5Button: Get Sheet name(s)Click to retrieves a list of sheet names in the input excel file.
Content Tab**
1HeaderIf checked, it skips a header row in the sheet. The default value is checked.
2No empty rowsIf checked, no empty rows are generated in the output sheet. Default value: checked. The default value is checked.
3Stop on empty rowEnable checkbox to stop reading the current sheet of a file when an empty line is encountered.
4LimitSpecify the Limit on the number of rows.
5EncodingSpecify text file encoding to use.
6Enable BatchingThis checkbox is editable only if the Spreadsheet type engine in the Files tab is Excel Binary (XLSB).

Enable this option for reading large XLSB files using batch processing.

7Batch SizeThis field is enabled only if the Enable Batching checkbox above is checked.

Specify the number of rows per batch, for processing. The data type is number.

Result filenames:
1Add filenames to resultEnable checkbox to add filenames to result. Default value: checked
Error Handling Tab
1Strict types?Enable checkbox to report data type errors in the input.
2Ignore errors?Enable checkbox to ignore errors during parsing.
3Skip error lines?Enable checkbox to it skips lines with errors.
4Warning files directorySpecify the location for storing warnings. It can be set by variables using Variable button or you may browse a directory location using the browse button.
5Error files directorySpecify the location for storing errors. It can be set by variables using Variable button or you may browse a directory location using the browse button.
6Failing line numbers files directorySpecify the location of files with errors on a line number. It can be set by variables using Variable button or you may browse a directory location using the browse button.
Fields Tab:
1Read Based on Column NameCheck if you want to get input based on name of the column. Table data is copied to the Get fields based to column name table when you click Get fields from header row.

You can specify the column name as a static value, field, parameter, or global or local PS variable.

Specify the column name as Sheetname.ColumnName. For example, if the name of the sheet is Country and the column name is Capital, then define a parameter as Param1 with value Country.Capital and in the Get Fields Based on ColName table specify the parameterized column name as ${Param1}.

If the value is set in a field, then select the field name in Get Fields Based on ColName table.

Note:

- If the checkbox is not selected, table data is copied from column-based table to index based table.

- If the checkbox is selected, then the Fail on column name not found and Case insensitive comparison fields are available.

- The option is also available for XLSB type of file(s).

Get fields based on column index
i.NameSpecify the name of the field.
ii.TypeSpecify the data type of the field.

- Number

- String

- Date

- Boolean

- Integer

- BigNumber

- Binary

- Timestamp

- Internet Address

iii.LengthSpecify the maximum length of the field.
iv.PrecisionSpecify Precision of the field.
v.Trim TypeTruncate spaces for the field by selecting one of the following.

- None

- Left

- Right

- Both

vi.RepeatSpecify a Boolean value for repeating field.
vii.FormatSpecify Format of the field.
viii.CurrencySpecify symbol for Currency.
ix.DecimalSpecify Decimal value.
x.GroupingSpecify grouping of numbers.
Get fields based on column name
2Fail on column name not foundSelect the checkbox, if required.

If the checkbox is selected, then the workflow will fail if a specified column is not found in the Excel file. Else, the workflow will continue and return null for any missing column.

The workflow will also fail, if Ignore Case for Column Name and Alias Names checkbox is clear, and the column names are specified with different case in the Excel file and the step. For example, Username and userName

3Ignore Case for Column Name and Alias NamesSelect the checkbox, if required.

If you select the checkbox then the names of the columns in the Excel file and those specified in the Column Name and Alias Names fields are compared, while the difference in case, if any, is ignored, that is, the workflow will execute even if the column name in the Excel file is, Username and in the step the column name is userName.

And, if the checkbox is clear then the workflow will fail if the case of the column names is different.

Get fields based on column name: Table
i.Column NameSelect the parameterized column name.
ii.Field NameSpecify the name of the field.
iii.Alias NameClick in the cell and in the List of Alias Names box, specify another name for the field name.
iv.TypeSpecify the data type of the field.

- Number

- String

- Date

- Boolean

- Integer

- BigNumber

- Binary

- Timestamp

- Internet Address

v.LengthSpecify the maximum length of the field.
vi.PrecisionSpecify Precision of the field.
vii.Trim typeTruncate spaces for the field by selecting one of the following.

- None

- Left

- Right

- Both

viii.RepeatSpecify a Boolean value for repeating field.
ix.FormatSpecify Format of the field.
x.CurrencySpecify symbol for Currency.
xi.DecimalSpecify the decimal value.
xii.GroupingSpecify grouping of numbers.
Get fields from header rowClick to populate the field names from the header row.
Additional Outputs fields Tab
1Full filename fieldSpecify Full filename along with the extension.
2Sheet name fieldSpecify worksheet name.
3Sheet row nr fieldSpecify current sheet row number.
4Row nr written fieldSpecify the number of rows written.
5Short filename fieldSpecify a filename without a path.
6Extension fieldSpecify extension of the file.
7Path fieldSpecify the path in operating system format.
8Size fieldSpecify the size of the file.
9Is hidden fieldSpecify whether the file is hidden or not.
10Last modification fieldSpecify field name to contain the latest modified field.
11Uri fieldSpecify a field name that contains URI.
12Root uri fieldSpecify a field name that contains only root part of URI.