-
Note this line: --conf "spark.spline.lineageDispatcher=logging"
With that you set a logging dispatcher, so the lineage is only written to the logs. If you want to send lineage info over HTTP and log it at the same time, use a composite dispatcher:
--conf "spark.spline.lineageDispatcher=composite"
--conf "spark.spline.lineageDispatcher.composite.dispatchers=http,logging"
See the lineage dispatchers doc for more info.
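As a rough illustration of the two settings above, here is a minimal Python sketch that builds the matching `--conf` arguments for a spark-submit call. The helper name `composite_dispatcher_conf` is hypothetical and not part of Spline or Spark; only the two property names come from this reply.

```python
from typing import List, Sequence


def composite_dispatcher_conf(dispatchers: Sequence[str] = ("http", "logging")) -> List[str]:
    """Build spark-submit --conf arguments for a composite lineage dispatcher.

    Hypothetical helper: it only formats the two properties quoted in this thread.
    """
    settings = {
        "spark.spline.lineageDispatcher": "composite",
        "spark.spline.lineageDispatcher.composite.dispatchers": ",".join(dispatchers),
    }
    args: List[str] = []
    for key, value in settings.items():
        # Each property becomes its own "--conf key=value" pair.
        args += ["--conf", f"{key}={value}"]
    return args
```

The returned list can be spliced into the argument list passed to spark-submit, for example through `subprocess.run`, so the dispatcher names live in one place.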
-
Do we need Docker to view the Spline UI?
-
We also went through both links mentioned above, but neither explains how to run our own custom PySpark job. Do we build a Docker image from the Spark job and reference it in the .yml file? If so, could you kindly explain how?
-
Hi, we created a PySpark job on our EC2 instance and submitted it with the following command:
sudo ./bin/spark-submit --packages za.co.absa.spline.agent.spark:spark-3.1-spline-agent-bundle_2.12:0.6.1 --conf "spark.sql.queryExecutionListeners=za.co.absa.spline.harvester.listener.SplineQueryExecutionListener" --conf "spark.spline.producer.url=http://<ec2-ip>:8080/producer" /home/ec2-user/spline-sandbox/mysparkjob.py
We can see the job output in the console, but nothing shows up in the Spline UI. Which step are we missing? We are not using Docker here; we just created a sample PySpark job and ran it with the command above.
-
Once again, if you said you see the lineage info in the logs, the agent is working properly. All you need to do is run the Spline REST gateway and give the agent the correct Producer Base URL. Make sure the Spline server is up and ready.
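One way to check that something is actually listening at the Producer Base URL before submitting the job is a plain HTTP probe. This is only a sketch using the Python standard library: `is_reachable` is a hypothetical helper, it says nothing about whether the server behind the URL is really Spline, and the URL in the usage comment is a placeholder.

```python
import urllib.error
import urllib.request


def is_reachable(url: str, timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at `url`, whatever the status code."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a success status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return False


# Usage (placeholder host, replace with your own):
# is_reachable("http://<ec2-ip>:8080/producer")
```

If this returns False from the machine that runs spark-submit, the agent cannot deliver lineage over HTTP either, so the server or the network path needs fixing first.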
-
Hi, we are not able to see the lineage info in the logs; we only see the output of the Spark transformations, that is, of the Spark job itself. The Spline server is not running either, because we have not run the docker compose up command.
-
We created a sample PySpark job on an EC2 instance and submitted it with spark-submit.
We can see the output in the console but not in the Spline UI. What additional steps are needed?