public class Extract
extends java.lang.Object
Constructor and Description |
---|
Extract(java.lang.String from) |
Modifier and Type | Method and Description |
---|---|
java.lang.String |
authors(java.lang.String pattern)
Extracts the authors list and cleans it.
|
java.lang.String |
title(java.lang.String pattern)
Extracts the title and cleans it.
|
java.lang.String |
URL(java.lang.String pattern,
java.lang.String baseUrl,
java.lang.String modFormat)
Extracts one URL.
|
java.util.List<java.lang.String> |
URLs(java.lang.String pattern,
java.lang.String baseUrl,
java.lang.String modFormat)
Extracts all URLs.
|
int |
year(java.lang.String pattern)
Extracts the year.
|
public Extract(java.lang.String from)
from
- The input text, can be HTML code.public java.lang.String URL(java.lang.String pattern, java.lang.String baseUrl, java.lang.String modFormat)
pattern
- Pattern for the URL.baseUrl
- Base URL - used when matching URL is relative.modFormat
- Modification, for example an additional suffix.public java.util.List<java.lang.String> URLs(java.lang.String pattern, java.lang.String baseUrl, java.lang.String modFormat)
pattern
- Pattern for one URL.baseUrl
- Base URL - used when matching URL is relative.modFormat
- Modification, for example an additional suffix.public java.lang.String authors(java.lang.String pattern)
pattern
- Pattern for authors list.public java.lang.String title(java.lang.String pattern)
pattern
- Pattern for titlepublic int year(java.lang.String pattern)
pattern
- Pattern for year.