Package org.apache.nutch.tools
Class CommonCrawlFormatJackson
- java.lang.Object
- 
- org.apache.nutch.tools.AbstractCommonCrawlFormat
- 
- org.apache.nutch.tools.CommonCrawlFormatJackson
 
 
- 
- All Implemented Interfaces:
- Closeable,- AutoCloseable,- CommonCrawlFormat
 
 public class CommonCrawlFormatJackson extends AbstractCommonCrawlFormat This class provides methods to map crawled data on JSON using Jackson Streaming APIs.
- 
- 
Field Summary- 
Fields inherited from class org.apache.nutch.tools.AbstractCommonCrawlFormatconf, content, inLinks, jsonArray, keyPrefix, LOG, metadata, reverseKey, reverseKeyValue, simpleDateFormat, url
 
- 
 - 
Constructor SummaryConstructors Constructor Description CommonCrawlFormatJackson(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config)CommonCrawlFormatJackson(Configuration nutchConf, CommonCrawlConfig config)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description protected voidcloseArray(String key, boolean nested, boolean newline)protected voidcloseObject(String key)protected StringgenerateJson()protected voidstartArray(String key, boolean nested, boolean newline)protected voidstartObject(String key)protected voidwriteArrayValue(String value)protected voidwriteKeyNull(String key)protected voidwriteKeyValue(String key, String value)- 
Methods inherited from class org.apache.nutch.tools.AbstractCommonCrawlFormatclose, getImported, getInLinks, getJsonData, getJsonData, getJsonData, getKey, getMethod, getRequestAccept, getRequestAcceptEncoding, getRequestAcceptLanguage, getRequestContactEmail, getRequestContactName, getRequestHostAddress, getRequestHostName, getRequestRobots, getRequestSoftware, getRequestUserAgent, getResponseAddress, getResponseContent, getResponseContentEncoding, getResponseContentType, getResponseDate, getResponseHostName, getResponseServer, getResponseStatus, getTimestamp, getUrl, setInLinks
 
- 
 
- 
- 
- 
Constructor Detail- 
CommonCrawlFormatJacksonpublic CommonCrawlFormatJackson(Configuration nutchConf, CommonCrawlConfig config) throws IOException - Throws:
- IOException
 
 - 
CommonCrawlFormatJacksonpublic CommonCrawlFormatJackson(String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) throws IOException - Throws:
- IOException
 
 
- 
 - 
Method Detail- 
writeKeyValueprotected void writeKeyValue(String key, String value) throws IOException - Specified by:
- writeKeyValuein class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
writeKeyNullprotected void writeKeyNull(String key) throws IOException - Specified by:
- writeKeyNullin class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
startArrayprotected void startArray(String key, boolean nested, boolean newline) throws IOException - Specified by:
- startArrayin class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
closeArrayprotected void closeArray(String key, boolean nested, boolean newline) throws IOException - Specified by:
- closeArrayin class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
writeArrayValueprotected void writeArrayValue(String value) throws IOException - Specified by:
- writeArrayValuein class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
startObjectprotected void startObject(String key) throws IOException - Specified by:
- startObjectin class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
closeObjectprotected void closeObject(String key) throws IOException - Specified by:
- closeObjectin class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 - 
generateJsonprotected String generateJson() throws IOException - Specified by:
- generateJsonin class- AbstractCommonCrawlFormat
- Throws:
- IOException
 
 
- 
 
-