Python Parser Multiple Node
Basic Information
Description | Parses files into tabular format. |
Input | One file. |
Output | Multiple tables. |
Usage Tags | PYTHON, READER |
Node Settings
FilePath (string, required): Connection to the source of the files. Can be filePaths (read from file storage), filePaths[0] (manually uploaded file), or sharepointFiles (read from Sharepoint Connector).
File Regex Group (string, optional): A regular expression with named groups. The node builds a dictionary with each group name as a key and its match as the value, and passes this dictionary to the Python script as an input argument.
Filename columns (List<FilenameColumn>, optional)
Column Name (string, required): Name of the column to create.
Regex (string, required): A regular expression with a capturing group. The captured group becomes the value of the new column.
Script (string, required): A Python script that processes the given files. See the Python Scripting documentation for syntax.
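As a rough illustration of how the two regex settings above behave, the sketch below uses Python's standard re module. The file name and both patterns are made up for this example. It also assumes that the expressions are matched against the file name, which the settings imply but do not state outright, so the node's actual matching logic may differ.

```python
import re

# Hypothetical file name and patterns, for illustration only.
filename = "sales_2023-04_EU.csv"

# File Regex Group: each named group becomes a key in the dictionary.
file_regex_group = (
    r"(?P<report>[a-z]+)_(?P<period>\d{4}-\d{2})_(?P<region>[A-Z]+)\.csv"
)
match = re.search(file_regex_group, filename)
groups = match.groupdict() if match else {}
print(groups)

# Filename column: the capturing group supplies the value of the new column.
column_regex = r"_(\d{4}-\d{2})_"
col_match = re.search(column_regex, filename)
period_value = col_match.group(1) if col_match else None
print(period_value)
```

Named groups use the `(?P<name>...)` syntax in Python. A pattern that does not match yields an empty dictionary or `None` in this sketch; how the node itself treats a non-match is not covered here.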
If multiple files are processed in one instance, the Python script is executed once for every file. A special ParseInfo table is also generated with information about the parsing status.
If an error occurs in the Python script, the node still finishes successfully. To read the error, either import the ParseInfo table or check the execution details in Table view.
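The per-file behaviour described above can be pictured with a small sketch in plain Python. This is not the node's implementation. The function names and the status columns (`file`, `status`, `error`) are assumptions chosen for illustration, and the real ParseInfo table may use a different layout.

```python
def run_per_file(paths, parse):
    """Run `parse` once per file and record a status row for each one."""
    tables = {}
    parse_info = []
    for path in paths:
        try:
            tables[path] = parse(path)
            parse_info.append({"file": path, "status": "OK", "error": ""})
        except Exception as exc:  # a failing file must not stop the others
            parse_info.append(
                {"file": path, "status": "ERROR", "error": str(exc)}
            )
    return tables, parse_info


def demo_parse(path):
    # Hypothetical parser: rejects anything that is not a .csv file.
    if not path.endswith(".csv"):
        raise ValueError(f"unsupported file type: {path}")
    return [{"source": path}]


tables, parse_info = run_per_file(["a.csv", "b.bin"], demo_parse)
for row in parse_info:
    print(row)
```

Because each failure is caught and recorded rather than raised, the run as a whole completes, which is why the status table is the place to look for errors.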
Tips and Advanced usage
This node is generally used when the existing parsers are not sufficient to extract data from special files. In that case, a custom Python script can be developed to parse the given file type.
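As an example of the kind of logic such a custom script might contain, the sketch below splits a text file made of `[section]` headers followed by CSV rows into one table per section. The file layout and the `parse_sections` helper are hypothetical. How the node hands the file to the script and receives tables back is defined in the Python Scripting documentation and is not shown here.

```python
import csv
import io


def parse_sections(text):
    """Return one table (a list of row dicts) per [section] in the text."""
    sections = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("[") and line.endswith("]"):
            current = line[1:-1]          # start a new section
            sections[current] = []
        elif current is not None:
            sections[current].append(line)

    tables = {}
    for name, lines in sections.items():
        # The first line of each section is treated as the CSV header.
        reader = csv.DictReader(io.StringIO("\n".join(lines)))
        tables[name] = list(reader)
    return tables


sample = """[orders]
id,amount
1,9.50
2,12.00

[customers]
id,name
1,Ada
"""

tables = parse_sections(sample)
for name, rows in tables.items():
    print(name, rows)
```

All values stay as strings in this sketch; type conversion would be a separate step in a real script.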