Package org.apache.nutch.crawl
Class LinkDb
- java.lang.Object
- 
- org.apache.hadoop.conf.Configured
- 
- org.apache.nutch.util.NutchTool
- 
- org.apache.nutch.crawl.LinkDb
 
 
 
- 
- All Implemented Interfaces:
- Configurable,- Tool
 
 public class LinkDb extends NutchTool implements Tool Maintains an inverted link map, listing incoming links for each url.
- 
- 
Nested Class SummaryNested Classes Modifier and Type Class Description static classLinkDb.LinkDbMapper
 - 
Field SummaryFields Modifier and Type Field Description static StringCURRENT_NAMEstatic StringIGNORE_EXTERNAL_LINKSstatic StringIGNORE_INTERNAL_LINKSstatic StringLOCK_NAME- 
Fields inherited from class org.apache.nutch.util.NutchToolcurrentJob, currentJobNum, numJobs, results, status
 
- 
 - 
Constructor SummaryConstructors Constructor Description LinkDb()LinkDb(Configuration conf)
 - 
Method SummaryAll Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description static voidinstall(Job job, Path linkDb)voidinvert(Path linkDb, Path[] segments, boolean normalize, boolean filter, boolean force)voidinvert(Path linkDb, Path segmentsDir, boolean normalize, boolean filter, boolean force)static voidmain(String[] args)intrun(String[] args)Map<String,Object>run(Map<String,Object> args, String crawlId)Runs the tool, using a map of arguments.- 
Methods inherited from class org.apache.nutch.util.NutchToolgetProgress, getStatus, killJob, setConf, stopJob
 - 
Methods inherited from class org.apache.hadoop.conf.ConfiguredgetConf
 - 
Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 - 
Methods inherited from interface org.apache.hadoop.conf.ConfigurablegetConf, setConf
 
- 
 
- 
- 
- 
Field Detail- 
IGNORE_INTERNAL_LINKSpublic static final String IGNORE_INTERNAL_LINKS - See Also:
- Constant Field Values
 
 - 
IGNORE_EXTERNAL_LINKSpublic static final String IGNORE_EXTERNAL_LINKS - See Also:
- Constant Field Values
 
 - 
CURRENT_NAMEpublic static final String CURRENT_NAME - See Also:
- Constant Field Values
 
 - 
LOCK_NAMEpublic static final String LOCK_NAME - See Also:
- Constant Field Values
 
 
- 
 - 
Constructor Detail- 
LinkDbpublic LinkDb() 
 - 
LinkDbpublic LinkDb(Configuration conf) 
 
- 
 - 
Method Detail- 
invertpublic void invert(Path linkDb, Path segmentsDir, boolean normalize, boolean filter, boolean force) throws IOException, InterruptedException, ClassNotFoundException 
 - 
invertpublic void invert(Path linkDb, Path[] segments, boolean normalize, boolean filter, boolean force) throws IOException, InterruptedException, ClassNotFoundException 
 - 
installpublic static void install(Job job, Path linkDb) throws IOException - Throws:
- IOException
 
 - 
runpublic Map<String,Object> run(Map<String,Object> args, String crawlId) throws Exception Description copied from class:NutchToolRuns the tool, using a map of arguments. May return results, or null.- Specified by:
- runin class- NutchTool
- Parameters:
- args- a- Mapof arguments to be run with the tool
- crawlId- a crawl identifier to associate with the tool invocation
- Returns:
- Map results object if tool executes successfully otherwise null
- Throws:
- Exception- if there is an error during the tool execution
 
 
- 
 
-