Class GetPDFText

java.lang.Object
org.deltava.dao.file.DAO
org.deltava.dao.file.GetPDFText

public class GetPDFText extends DAO
A Data Access Object to extract text from a PDF document.
Since:
9.0
Version:
10.2
Author:
Luke
  • Constructor Details

    • GetPDFText

      public GetPDFText(InputStream is)
      Initializes the Data Access Object.
      Parameters:
      is - the InputStream to read the PDF document from
  • Method Details

    • setStartPage

      public void setStartPage(int page)
      Updates the first page to extract.
      Parameters:
      page - the page number
    • setEndPage

      public void setEndPage(int page)
      Updates the last page to extract.
      Parameters:
      page - the page number
    • setSortByPosition

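      public void setSortByPosition(boolean sbp)
      Sets whether the extracted text is sorted by its position on the page.
      Parameters:
      sbp - TRUE to sort text by position, otherwise FALSE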
    • getText

      public String getText() throws DAOException
      Extracts the PDF document text.
      Returns:
      the PDF text
      Throws:
      DAOException - if an I/O error occurs
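
Example:
The constructor and methods above suggest a simple call sequence: wrap the stream, narrow the page range, then extract. The sketch below illustrates that sequence only. So that it compiles without the library, it declares hypothetical stand-ins for GetPDFText and DAOException that mirror the documented signatures; the stub does not parse anything and just reports the settings it was given. The extract helper, the stub's return string, and the sample page numbers are assumptions made for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;

public class GetPDFTextUsage {

    // Hypothetical stand-in for the library's DAOException.
    static class DAOException extends Exception {
        DAOException(String msg) { super(msg); }
    }

    // Hypothetical stand-in that mirrors only the documented signatures.
    // The real class reads the PDF; this stub just records the settings.
    static class GetPDFText {
        private final InputStream is;
        private int startPage;
        private int endPage;
        private boolean sortByPosition;

        public GetPDFText(InputStream is) { this.is = is; }
        public void setStartPage(int page) { this.startPage = page; }
        public void setEndPage(int page) { this.endPage = page; }
        public void setSortByPosition(boolean sbp) { this.sortByPosition = sbp; }

        public String getText() throws DAOException {
            if (is == null)
                throw new DAOException("no input stream");
            return "pages " + startPage + "-" + endPage + ", sortByPosition=" + sortByPosition;
        }
    }

    // Hypothetical helper: configure the page range, then extract.
    static String extract(InputStream is, int first, int last) throws DAOException {
        GetPDFText dao = new GetPDFText(is);
        dao.setStartPage(first);
        dao.setEndPage(last);
        dao.setSortByPosition(true);
        return dao.getText();
    }

    public static void main(String[] args) throws DAOException {
        // An empty stream is enough for the stub; real code would pass PDF bytes.
        InputStream is = new ByteArrayInputStream(new byte[0]);
        System.out.println(extract(is, 2, 5));
    }
}
```

With the real class, the caller would supply a stream containing a PDF document and handle DAOException around getText(). Whether page numbers are 1-based, and what the defaults are when the setters are not called, is not stated above and should be checked against the implementation.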