public static class ExtractingDocumentLoader.MostlyPassthroughHtmlMapper extends Object implements org.apache.tika.parser.html.HtmlMapper
| Modifier and Type | Field and Description |
|---|---|
| static org.apache.tika.parser.html.HtmlMapper | INSTANCE |
| Constructor and Description |
|---|
| ExtractingDocumentLoader.MostlyPassthroughHtmlMapper() |
| Modifier and Type | Method and Description |
|---|---|
| boolean | isDiscardElement(String name) Keep all elements and their content. |
| String | mapSafeAttribute(String elementName, String attributeName) Lowercases the attribute name. |
| String | mapSafeElement(String name) Lowercases the element name, but returns null for <BR>, which suppresses the start-element event for <BR> tags. |
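The behavior summarized in the method table can be sketched as follows. This is a minimal illustration written only from the descriptions above, not the Solr source. `HtmlMapperSketch` and `MostlyPassthroughSketch` are hypothetical names: the interface is a local stand-in for `org.apache.tika.parser.html.HtmlMapper` so that the example compiles without Tika on the classpath, and the use of `Locale.ROOT` for lowercasing is an assumption, since the summary does not name a locale.

```java
import java.util.Locale;

// Hypothetical stand-in for org.apache.tika.parser.html.HtmlMapper,
// declared locally so the sketch compiles without Tika.
interface HtmlMapperSketch {
    String mapSafeElement(String name);
    boolean isDiscardElement(String name);
    String mapSafeAttribute(String elementName, String attributeName);
}

// Illustrative re-implementation of the documented behavior (assumption:
// only the rules stated in the method summary are modeled here).
class MostlyPassthroughSketch implements HtmlMapperSketch {
    // Shared instance, mirroring the INSTANCE field of the documented class.
    static final HtmlMapperSketch INSTANCE = new MostlyPassthroughSketch();

    // Keep all elements and their content: nothing is ever discarded.
    @Override
    public boolean isDiscardElement(String name) {
        return false;
    }

    // Pass every attribute through, lowercasing its name.
    @Override
    public String mapSafeAttribute(String elementName, String attributeName) {
        return attributeName.toLowerCase(Locale.ROOT);
    }

    // Lowercase the element name; returning null for BR suppresses
    // the start-element event for that tag.
    @Override
    public String mapSafeElement(String name) {
        String lower = name.toLowerCase(Locale.ROOT);
        return lower.equals("br") ? null : lower;
    }
}
```

With Tika available, a mapper of this kind would normally be handed to the HTML parser through the parse context, for example `context.set(HtmlMapper.class, ExtractingDocumentLoader.MostlyPassthroughHtmlMapper.INSTANCE)` on a `ParseContext`.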
public ExtractingDocumentLoader.MostlyPassthroughHtmlMapper()
public boolean isDiscardElement(String name)
isDiscardElement in interface org.apache.tika.parser.html.HtmlMapper
public String mapSafeAttribute(String elementName, String attributeName)
mapSafeAttribute in interface org.apache.tika.parser.html.HtmlMapper
Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.