| Interface | Description |
|---|---|
| PageBuilder |
Allows a TagRule to add information to a Page object.
|
| Class | Description |
|---|---|
| BodyTagRule | |
| ContentBlockExtractingRule | |
| FramesetRule | |
| HeadExtractingRule | |
| HtmlAttributesRule | |
| MetaTagRule | |
| MSOfficeDocumentPropertiesRule |
Extracts the extra properties saved in HTML from MS Office applications (Word and Excel),
such as Author, Company, Version, etc.
|
| ParameterExtractingRule | |
| RegexReplacementTextFilter |
TextFilter that substitutes content using a JDK 1.4 regular expression.
|
| TagReplaceRule |
Very simple rule for replacing all occurences of one tag with another.
|
| TitleExtractingRule |