Stormcrawler-docker and OpenSearch #1078
-
Hi all, |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 25 replies
-
Hi I have just realised that the stormcrawler-docker had no discussions section, I have created one. Your question would be better there but I'll start answering here for now. the purpose of the stormcrawler-docker setup is mostly to show to run the Storm services in Docker. Understanding how Storm runs is a difficulty for new users and the stormcrawler-docker resources try to help with that.
Of course when it comes to StormCrawler, you are not limited to a particular backend, you can use Elasticsearch, SOLR, OpenSearch or URLFrontier. You don't need more than one. In your case, you can remove the sections about URLFrontier, ElasticSearch, Kibana and just add service definitions for OpenSearch.
This is the implementation of the URLFrontier service running on Opensearch. This would be used instead of using the spouts and statusupdater bolts from the Opensearch module in SC. It won't help you with indexing the documents into OpenSearch or storing metrics. It is probably easier to start with the Opensearch module instead of URLFrontier. One you have the services running in Docker (the Storm ones + OpenSearch), have a look at the opensearch module to see how to use the Maven archetype. Of course you don't have to use Docker and could install OpenSearch and Apache Storm on your computer. I hope this helps |
Beta Was this translation helpful? Give feedback.
-
A simpler approach to get you unstuck in the short term would be to connect to the supervisor node and run the topology locally instead of doing so in distributed mode. Simply replace |
Beta Was this translation helpful? Give feedback.
Hi I have just realised that the stormcrawler-docker had no discussions section, I have created one. Your question would be better there but I'll start answering here for now.
the purpose of the stormcrawler-docker setup is mostly to show to run the Storm services in Docker. Understanding how Storm runs is a difficulty for new users and the stormcrawler-docker resources try to help with that.
Of course when it comes to StormCrawler, you are not limited to a particular backend, you can use Elasticsearch, SOLR, OpenSearch or URLFrontier. You don't need more than one. In your case, you can remov…