Excel Parser Node

Basic Information

Description	This node is use to extract data from excel files.
Input	Single or multiple files that are stored in instance metadata
Output	One table for every files
Usage Tags	READER , SOURCES

Node Settings

FilePath ( string, required) : A path to the file that will be processed
Use Datatype ( bool, not required) : A boolean indicating if node tries to set data type to columns.
Included sheets ( List<string>, not required) : A list of sheet names that will be included in resulting table. If empty all sheets will be returned.
Included columns ( List<string>, not required) : A list of column names that will be included in resulting table. If empty all columns will be returned.
File filter ( string, not required) : A regex filter that will be applied on a file path. If match is successful then file will be processed.
Derived columns ( List<object>, not required) : A list of columns that will be extracted from file path. This is very useful when file path contains valuable information like the project name, experiment,…
- Column Name ( string, required) : A name of the column that will be created.
- Regex ( string, required) : A regex expression that must contain group. Regex will be applied on file path and resulting group will be extracted and stored in a generated column.

Excel parser node have a very general purpose. It can parse files based on multiple separators and can work in manual and automatic processes. It can handle files that are uploaded to process or automatically read from some directory.

Examples

Example 1: Parsing manually uploaded files

In this example the csv parser node will extract table from a csv file that was uploaded by user in one of previous steps.

Example json configuration

JSON

{
  "properties": {
    "filePath": "@filePaths[0]",
    "seperators": [
      ";"
    ],
    "useDataType": false,
    "includedColumns": [],
    "regexFileFilter": ".*xlsx",
    "fileNameColunms": []
  }
}

Example 2: Parsing files that were read from directory

In this example multiple files were already read from some directory. The node will parse all of them and create one table for every file. It is usually useful to later make a single table from all extracted tables using Union node.

Example json configuration

JSON

{
  "properties": {
    "filePath": "@filePaths",
    "seperators": [
      ";"
    ],
    "useDataType": false,
    "includedColumns": [],
    "regexFileFilter": ".*xlsx",
    "fileNameColunms": []
  }
}