After you have an Ascend Read Connection set up, then you continue to set up the Read Connector in your dataflow.
In Figure 1 above:
- Name (required): The name to identify this connector with.
- Description (optional): Description of what data this connector will read.
You can either manually provide Object Pattern(required), which is the directory of the files you are looking for or click on Browse and Select Data: this button allows to explore resource and locate assets to ingest. This will give you access to the SFTP Server database in a modal dialog (Figure 3 and 4 below), where you can navigate to the desired table to be imported. Select the file or files you want to ingest and press Select.
Select the Parser(required) that match the data. If needed, some other advanced options are shown such as Field Delimiter and Date Format for CSV files, as shown in Figure 2
Once you click on the GENERATE SCHEMA button, the parser will create a schema and a data preview will be populated as in the Figure 5 below.
- Add schema column: Add a custom column to the generated schema
Update the status of the read connector by marking it either Running to mark it active or Paused to pause the connector from running.
The refresh schedule specifies how often Ascend checks the data location to see if there's new data. Ascend will automatically kick off the corresponding big data jobs once new or updated data is discovered.
When resources are constrained, Processing Priority will be used to determine which components to schedule first.
Higher priority numbers are scheduled before lower ones. Increasing the priority on a component also causes all its upstream components to be prioritized higher. Negative priorities can be used to postpone work until excess capacity becomes available.
Updated 11 months ago