Java – how to run different jobs of Apache nutch in parallel
•
Java
I use hen 2.3 All jobs run one after another, that is, the first generator, extraction, parsing, indexing, etc I want to run some jobs at the same time I know that some jobs cannot run in parallel, but other jobs can parse jobs. Dbupdate and index jobs should be run with fetch
Is it possible? My basic goal is to run the grab all the time I think we can do it with different timestamps Can anyone guide me in the right way?
Solution
If you look at nutch web application server, you will find that it can perform multiple crawl jobs in parallel You should check the source code of nutch 2.3 for webapp [nutchuiserver] I hope this will help
The content of this article comes from the network collection of netizens. It is used as a learning reference. The copyright belongs to the original author.
THE END
二维码