Difference between revisions of "PDF to Text Converter"
Jump to navigation
Jump to search
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
{{Project | {{Project | ||
+ | |Has project output=Tool | ||
+ | |Has sponsor=McNair Center | ||
|Has title=PDF to Text Converter | |Has title=PDF to Text Converter | ||
|Has owner=Christy Warden, | |Has owner=Christy Warden, |
Latest revision as of 13:47, 21 September 2020
PDF to Text Converter | |
---|---|
Project Information | |
Has title | PDF to Text Converter |
Has owner | Christy Warden |
Has start date | |
Has deadline date | |
Has keywords | PDF, txt, python |
Has project status | |
Dependent(s): | Patent Thicket |
Has sponsor | McNair Center |
Has project output | Tool |
Copyright © 2019 edegan.com. All Rights Reserved. |
Overview
This code is located at E:/McNair/Software/Google_Scholar_Crawler/pdf_to_txt_bulk_PTLR.py
This program converts a directory of PDFs to .txt files. All the new txt files will be placed in a new folder 'within' the provided directory of PDFs called 'Text Versions'
How to use
Open the python file in Komodo. At the bottom of the file, change the variable src_dir to the name of the directory of PDF files you want to convert to txt. Uncomment the line that says
#main(src_dir)
Click the play button the top center of the screen.
Notes
This program runs painfully slowly because PDFs are painful and slow.