Improve Parallel Execution of Clustering in CWL Workflow

The problem is that CWL takes a very long time to evaluate the JavaScript expressions that read the components and files processed in a single batch. Either of two alternatives would fix this issue:

  • Store the Componentbatch / ClusteringWorkload in the graph database and read it as the first step of the clustering. This might require model changes.
  • Pass only the file(s) produced by the GraphComponents step, and read the necessary files and components before the clustering step.
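
The second alternative could look roughly like the sketch below: a small script that runs before clustering and resolves the batch contents itself, so no CWL JavaScript expression has to do it. Everything here is an assumption for illustration only: the function name `load_batch`, the idea that the GraphComponents output is a JSON manifest, and the manifest layout are hypothetical and not taken from the existing workflow.

```python
import json
from pathlib import Path


def load_batch(manifest_path, batch_id):
    """Return (components, files) for one batch.

    Hypothetical manifest layout (an assumption, not the real output
    format of the GraphComponents step):
    {"batches": {"<batch_id>": {"components": [...], "files": [...]}}}
    """
    # Parse the manifest once in Python instead of in a CWL expression.
    manifest = json.loads(Path(manifest_path).read_text())
    batch = manifest["batches"][batch_id]
    return batch["components"], batch["files"]
```

With this approach the CWL step would receive only the manifest file and a batch identifier as inputs, and the clustering tool (or a wrapper around it) would call something like `load_batch(manifest, "0")` at startup.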