Changes

Jump to navigation Jump to search
790 bytes added ,  19:32, 22 March 2019
no edit summary
The sql script get_data.sql from last year is copied to the current Crunchbase3 directory. However, two databases are very different now and adjustments are necessary. To keep track of the data type from each csv file used to copy to sql tables, a file get_type.py is included in E:\projects\crunchbase3. This python script will print the first 5 rows of every data frame in the current directory.
All the crunchbase3 data from drive E are now also in drive Z:/crunchbase3 A version of crunchbase3 database is live on the postgresql in Z:/crunchbase3. However, a few csv files have not been copied to the SQL database because of data type errors, which is a small problem but need to spend some time to fix. Hiep will continue fixing fix it next week (March 28th). Right now, a modification of load_crunchbase.sql is in both Z:/crunchbase3 and E:/projects/crunchbase3. Changes in dataset, datatype, and data columns have been made a lot compared to the previous version. The columns that are not yet added to the postgres db are noted inside two lines of ################'s. Since the get_datadata has changed a lot compared to last year, using \i load_crunchbase.sql was not very useful, one may need to copy one table at a time by pasting the sql script on 03/22/2019into the terminal.
82

edits

Navigation menu