jeudi 9 avril 2015

run nutch 1.8 or 1.9 as a hadoop job


Vote count:

0




If I understand correctly, you can not run nutch 1.8 and 1.9 as a hadoop job, because these versions do not have a Crawl class which serves as a wrapper for all the crawl steps. This means there is no one class you can specify in the hadoop call to run a whole job. In nutch 1.7, this used to be the org.apache.nutch.crawl.Crawl class.


Am I missing something? Any one figure out a way around this?



asked 42 secs ago







run nutch 1.8 or 1.9 as a hadoop job

Aucun commentaire:

Enregistrer un commentaire