Package org.apache.nutch.util
Class AbstractChecker
- java.lang.Object
- 
- org.apache.hadoop.conf.Configured
- 
- org.apache.nutch.util.AbstractChecker
 
 
- 
- All Implemented Interfaces:
- Configurable,- Tool
 - Direct Known Subclasses:
- CrawlDbReader,- IndexingFiltersChecker,- LinkDbReader,- ParserChecker,- URLFilterChecker,- URLNormalizerChecker
 
 public abstract class AbstractChecker extends Configured implements Tool Scaffolding class for the various Checker implementations. Can process cmdline input, stdin and TCP connections.- Author:
- Jurian Broertjes
 
- 
- 
Field SummaryFields Modifier and Type Field Description protected booleankeepClientCnxOpenprotected booleanstdinprotected inttcpPortprotected Stringusage
 - 
Constructor SummaryConstructors Constructor Description AbstractChecker()
 - 
Method SummaryAll Methods Instance Methods Abstract Methods Concrete Methods Modifier and Type Method Description protected ProtocolOutputgetProtocolOutput(String url, CrawlDatum datum, boolean checkRobotsTxt)protected intparseArgs(String[] args, int i)protected abstract intprocess(String line, StringBuilder output)protected intprocessSingle(String input)protected intprocessStdin()protected intprocessTCP(int tcpPort)protected intrun()- 
Methods inherited from class org.apache.hadoop.conf.ConfiguredgetConf, setConf
 - 
Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 - 
Methods inherited from interface org.apache.hadoop.conf.ConfigurablegetConf, setConf
 
- 
 
- 
- 
- 
Field Detail- 
keepClientCnxOpenprotected boolean keepClientCnxOpen 
 - 
tcpPortprotected int tcpPort 
 - 
stdinprotected boolean stdin 
 - 
usageprotected String usage 
 
- 
 - 
Method Detail- 
processprotected abstract int process(String line, StringBuilder output) throws Exception - Throws:
- Exception
 
 - 
parseArgsprotected int parseArgs(String[] args, int i) 
 - 
getProtocolOutputprotected ProtocolOutput getProtocolOutput(String url, CrawlDatum datum, boolean checkRobotsTxt) throws Exception - Throws:
- Exception
 
 
- 
 
-