Java – how to run different jobs of Apache nutch in parallel

I use hen 2.3 All jobs run one after another, that is, the first generator, extraction, parsing, indexing, etc I want to run some jobs at the same time I know that some jobs cannot run in parallel, but other jobs can parse jobs. Dbupdate and index jobs should be run with fetch

Is it possible? My basic goal is to run the grab all the time I think we can do it with different timestamps Can anyone guide me in the right way?

Solution

If you look at nutch web application server, you will find that it can perform multiple crawl jobs in parallel You should check the source code of nutch 2.3 for webapp [nutchuiserver] I hope this will help

The content of this article comes from the network collection of netizens. It is used as a learning reference. The copyright belongs to the original author.
THE END
分享
二维码
< <上一篇
下一篇>>