Changes

Jump to navigation Jump to search
886 bytes added ,  13:44, 21 September 2020
no edit summary
{{Project|Has project output=Data,Tool|Has sponsor=McNair ProjectsCenter
|Has title=USITC Data
|Has owner=Harrison Brown
=New Work=
==JSON InformationUSITC 337 Cases Tab Delimited Text==USITC patent information was gathered from the investigations.json file downloaded from the USITC website (https://pubapps2.usitc.gov/337external/, Click on Cases Instituted After 2008).This contains information on 337 cases and their respondents/complainants and the patents that were part of the case. The tab-delimited test files code and results for this program are here:
Projects/USITC/JSON_scraping_python
The program grabs the information, places it into lists of lists in Python, and then writes to the file names listed below. The files do not have headers and null values are set to be empty strings.To create the tab delimited text files, run code.py in the JSON_scraping_python directory. This has all of the file names hard coded. It will createthe following files
investigation_info.txt
Schema for this file is id, title, investigation number, investigation tpyetype, docket number, date of publication notice
complainant_info.txt
Schema for this file is investigation id, investigation number, Complaintant name, complainant outside party ID, comp_city, comp country
respondent_info.txt
Schema for this file is investigation id, investigation number, Respondent Outside Party ID , Respondent Name, Respondent City, Respondent Country
patent_info.txt
Schema for this file is Investigation Number, Patent ID, Patent Number, Active Date, Inactive Date,
==XML Information==

Navigation menu