How to Extract Text Contents from PDF (part 3/3)

Uploaded on Mar 22, 2010

Demonstrates extracting text contents from PDF by hand, using basic UNIX tools only.

PDFMiner (PDF extraction tool in Python):
http://www.unixuser.org/~euske/python/pdfminer/

Category:

Education

License:

Standard YouTube License


All Comments (5)

  • Nice tutorial, thanks!

  • Thanks for the quick intro to this horrible PDF structure thing... and above all, thanks a lot for the pdfminer tools, which are definitely exactly the solution for my problem.

  • Thanks for the tutorial!

  • Nice demo. Thanks!

  • hi there,

    i'm trying to make a shell script that i can pipe data into. so far, i haven't found a way to do it. your text extraction script is able to do it, but i don't really understand it. could you elaborate a bit on how '( /bin/echo -ne; cat "$@" )' works, especially the cat "$@" part?

    thank you,

    bamdad
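
    A minimal sketch of the idiom asked about above. It is not the script from the video, and the name to_upper is hypothetical. "$@" expands to all of the script's (or function's) arguments, each as a separate quoted word. When there are no arguments, cat receives no file names and reads standard input; when file names are given, cat reads those files in order. That is what lets the same script work both as a pipe filter and on named files. The parentheses run the enclosed commands in a subshell so that their combined output goes to a single destination.

```shell
#!/bin/sh
# Hypothetical filter illustrating cat "$@"; not the author's script.
# Inside a function, "$@" refers to the function's arguments;
# at the top level of a script it refers to the script's arguments.
to_upper() {
    # No arguments: cat reads stdin. With arguments: cat reads those files.
    # The ( ... ) group sends everything it prints into one pipe.
    ( cat "$@" ) | tr 'a-z' 'A-Z'
}

# Used as a filter: data arrives on standard input.
printf 'hello\n' | to_upper

# Used with a file name: the same function reads the file instead.
tmp=$(mktemp)
printf 'world\n' > "$tmp"
to_upper "$tmp"
rm -f "$tmp"
```

    Saved as a script with the function body at the top level, the same line would accept either `some_command | script.sh` or `script.sh file1 file2`.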
