Changes

Jump to navigation Jump to search
1,111 bytes added ,  13:44, 21 September 2020
no edit summary
{{Project
|Has project output=Data,Tool
|Has sponsor=McNair Center
|Has title=Trial Data Project
|Has owner=Jeemin Sim, Catherine Kirby,
|Has project status=Complete
}}
==Summary==
 
This project works out how to reprocess the Clinical Trial Data from ClinicalTrials.gov into structured and cleaned datasets. The data covers 239,638 studies from 2000 to present.
 
== Information Source ==
* https://clinicaltrials.gov/ct2/resources/download
* https://clinicaltrials.gov/ct2/html/images/info/public.xsd
 
== Steps Followed to Extract the Trial Data ==
output: Jeemin_FDATrial_as_key_data.ripping_output.txt
== Tables to generate ==+ Clinical ID -> All Table 1 to 1 trial info: row_headers1 = ['nct_id', 'brief title', 'oversight authority', 'brief summary', 'detailed description', 'overall status', 'start date', 'completion date', 'phase', 'study type', 'study design', 'condition', 'intervention type', 'intervention name', 'eligibility description','eligibility gender', 'eligibility min age', 'eligibility max age', 'verification date', 'lastchanged date', 'firstreceived date', 'has expanded access']  Table 2: + (Clinical ID) row_headers2 = ['nct_id', Agency Name -> Agency Class'sponsor agency', 'sponsor class', 'lead (Y/N)or collaborator']  Table 3:+ (Clinical ID) -> row_headers3 = ['nct_id', 'facility name', 'city', 'state', 'zip', 'country']  Table 4: row_headers4 = ['nct_id', 'MeSH Termterm']  Table 5:+ (Clinical ID) -> Investigator row_headers5 = ['nct_id', 'keyword']

Navigation menu