Changes

USITC (view source)

Revision as of 14:47, 28 September 2017

191 bytes added , 14:47, 28 September 2017

no edit summary

Using the pdf scraper from previous project found here

E:\McNair\software\utilities\PDF_RIPPER

You can scrape the PDFs. This file was modified to scrape all of the pdfs in the pdfs folder. The modified code is in the McNair/Project/USITC/ directory and it is

called pdf_to_text_bulk.py

An example of PDF parsing that works parsing this PDF: https://www.usitc.gov/secretary/fed_reg_notices/337/337_959_notice02062017sgl.pdf

Hbrown512

Bureaucrats, Administrators (Semantic MediaWiki), Administrators

111

edits

Changes

USITC (view source)

Revision as of 14:47, 28 September 2017

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Sites

Sections

Organizations

Help

Tools