Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

tabula.convert_into only converts 1st page of pdf

See original GitHub issue

Summary of your issue

Using tabula.convert_into on a multipage pdf only converts the 1st page of the pdf

Environment

Write and check your environment. Please paste outputs of specific commands if required.

Paste the output of python --version command on your terminal: ?

Python 2.7.10

Paste the java version “1.8.0_71”

Java™ SE Runtime Environment (build 1.8.0_71-b15) Java HotSpot™ 64-Bit Server VM (build 25.71-b15, mixed mode) of java -version command on your terminal: ?

Does java -h command work well?; Ensure your java command is included in PATH

yes

Write your OS and it’s version: ? macOS Sierra 10.12.3
(Optional, but really helpful) Your PDF URL:

http://alabcboard.gov/sites/default/files/inline-files/Store Phone List.pdf#

Example code:

def main():
    download_file("http://alabcboard.gov/sites/default/files/inline-files/Store%20Phone%20List.pdf")
    tabula.convert_into("document.pdf", "output.csv", output_format="csv")

def download_file(download_url):
    response = urllib2.urlopen(download_url)
    file = open("document.pdf", 'w')
    file.write(response.read())
    file.close()

if __name__ == "__main__":
    main()
#

## Output:
```cat output.csv```.  Only the first page is included.

## What did you intend to be?
All pages of the original pdf should be included.