Class GetPDFText
java.lang.Object
org.deltava.dao.file.DAO
org.deltava.dao.file.GetPDFText
A Data Access Object to extract text from a PDF document.
- Since:
- 9.0
- Version:
- 10.2
- Author:
- Luke
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptiongetText()
Extracts the PDF document text.void
setEndPage
(int page) Updates the last page to extract.void
setSortByPosition
(boolean sbp) void
setStartPage
(int page) Updates the first page to extract.Methods inherited from class org.deltava.dao.file.DAO
getReader, getStream, setBufferSize
-
Constructor Details
-
GetPDFText
Initializes the Data Access Object.- Parameters:
is
- the InputStream to read
-
-
Method Details
-
setStartPage
public void setStartPage(int page) Updates the first page to extract.- Parameters:
page
- the page number
-
setEndPage
public void setEndPage(int page) Updates the last page to extract.- Parameters:
page
- the page number
-
setSortByPosition
public void setSortByPosition(boolean sbp) -
getText
Extracts the PDF document text.- Returns:
- the PDF text
- Throws:
DAOException
- if an I/O error occurs
-