Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Invalid
-
2.0.18
-
None
-
None
-
Windows
Description
I try to read PDF file from a given URL, but I got following error message:
Exception in thread "main" java.io.IOException: Error: End-of-File, expected lineException in thread "main" java.io.IOException: Error: End-of-File, expected line at org.apache.pdfbox.pdfparser.BaseParser.readLine(BaseParser.java:1124) at org.apache.pdfbox.pdfparser.COSParser.parseHeader(COSParser.java:2595) at org.apache.pdfbox.pdfparser.COSParser.parsePDFHeader(COSParser.java:2574) at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:219) at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1222) at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1122) at ScreenshotFromPdf.Pdf2Image(ScreenshotFromPdf.java:19) at ScreenshotFromPdf.main(ScreenshotFromPdf.java:33)
Example pdf file: http://aplaidshirt.epizy.com/samplePDF.pdf
Code:
public class ScreenshotFromPdf { public static void Pdf2Image(String html) throws IOException, InterruptedException { Thread.sleep(5000); URL url=new URL(html); HttpURLConnection connection=(HttpURLConnection)url.openConnection(); InputStream is=connection.getInputStream(); PDDocument document = PDDocument.load(is); PDFRenderer pdfRenderer = new PDFRenderer(document); for (int page = 0; page < document.getNumberOfPages(); ++page) { BufferedImage bim = pdfRenderer.renderImageWithDPI(page, 300, ImageType.RGB); File outputFile = new File("C:\\_privat\\pdftest\\" + page + "image.jpg"); System.out.println(outputFile.toString()); ImageIO.write(bim, "jpg", outputFile); } document.close(); } public static void main(String[] args) throws IOException, InterruptedException { String url = "http://aplaidshirt.epizy.com/samplePDF.pdf"; ScreenshotFromPdf.Pdf2Image(url); } }