Hi Community,
We have created a flow and recipe in Dataprep.
We run our flows via GCP Composer and also directly in Dataprep.
Our flow Output Running environment parameter is set to Dataflow + BigQuery.
When running the flow directly from Dataprep (UI), the selected running env is BigQuery and the job succeeds.
Nevertheless, when running the same flow via Composer (Orchestrator) calling Dataprep via DAG, the running environment is DataFlow.
When Dataflow, the job fails with a schema error
2023-12-07 18:19:29.761 EST
Error message from worker: java.lang.IllegalStateException: The schema of the BigQuery table does not match the recipe. Expected [String, Integer, String, etc
Questions:
1) Why would this error occur for the same flow?
2) Why does it not pick BigQuery running environment? Is there a setup configuration?
3) Why does it fail when running via Dataflow vs BigQuery?
Thank you for your help.
A