Package org.apache.nutch.tools
Class ResolveUrls
- java.lang.Object
- 
- org.apache.nutch.tools.ResolveUrls
 
- 
 public class ResolveUrls extends Object A simple tool that will spin up multiple threads to resolve urls to ip addresses. This can be used to verify that pages that are failing due to UnknownHostException during fetching are actually bad and are not failing due to a dns problem in fetching.
- 
- 
Constructor SummaryConstructors Constructor Description ResolveUrls(String urlsFile)Create a new ResolveUrls with a file from the local file system.ResolveUrls(String urlsFile, int numThreads)Create a new ResolveUrls with a urls file and a number of threads for the Thread pool.
 - 
Method SummaryAll Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description static voidmain(String[] args)Runs the resolve urls tool.voidresolveUrls()Creates a thread pool for resolving urls.
 
- 
- 
- 
Constructor Detail- 
ResolveUrlspublic ResolveUrls(String urlsFile) Create a new ResolveUrls with a file from the local file system.- Parameters:
- urlsFile- The local urls file, one url per line.
 
 - 
ResolveUrlspublic ResolveUrls(String urlsFile, int numThreads) Create a new ResolveUrls with a urls file and a number of threads for the Thread pool. Number of threads is 100 by default.- Parameters:
- urlsFile- The local urls file, one url per line.
- numThreads- The number of threads used to resolve urls in parallel.
 
 
- 
 - 
Method Detail- 
resolveUrlspublic void resolveUrls() Creates a thread pool for resolving urls. Reads in the url file on the local filesystem. For each url it attempts to resolve it keeping a total account of the number resolved, errored, and the amount of time.
 - 
mainpublic static void main(String[] args) Runs the resolve urls tool.- Parameters:
- args- the input arguments for this tool. Running with 'help' will print parameter options.
 
 
- 
 
-