*Initiate the idea of data preprocessing: create proper input dataset for the CNN model
'''5/2/2019'''
*Work on data preprocessing
 
'''5/6/2019'''
*Keep working on data preprocessing
*Generate screenshot
 
'''5/7/2019'''
*some issues occurred during screenshot generation (will work on this more tomorrow)
*try to set up CNN model
**https://www.datacamp.com/community/tutorials/cnn-tensorflow-python
 
'''5/8/2019'''
*fix the screenshot tool by switching to Firefox
*Data preprocessing
 
'''5/12/2019'''
*Finish image data preprocessing
'''5/13/2019'''
*Set up initial CNN model using Keras
**issue: Keras freezes on last batch of first epoch; make sure of the following:
***<code>steps_per_epoch = number of train samples // batch_size</code>
***<code>validation_steps = number of validation samples // batch_size</code>
*Research on how to feed mixed data (categorical + images) to our model
**https://www.pyimagesearch.com/2019/02/04/keras-multiple-inputs-and-mixed-data/

'''5/14/2019'''
*Object detection using CNN
*Implement the CNN model
*Work on some changes in the data preprocessing part (image data)
*https://gluon.mxnet.io/chapter08_computer-vision/object-detection.html
*place class label in image filename
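The step counts from the 5/13 Keras issue can be worked out with a small helper. This is a minimal sketch only: the sample counts and batch size are hypothetical placeholders, not the project's real numbers, and the floor of 1 step is an added safeguard for datasets smaller than one batch.

```python
def steps_for(num_samples, batch_size):
    """Return the number of full batches, i.e. num_samples // batch_size.

    The max(1, ...) guard is an assumption added here so that a dataset
    smaller than one batch still yields one step instead of zero.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return max(1, num_samples // batch_size)


# Hypothetical sizes, for illustration only.
num_train_samples = 1000
num_validation_samples = 250
batch_size = 32

steps_per_epoch = steps_for(num_train_samples, batch_size)        # 1000 // 32 = 31
validation_steps = steps_for(num_validation_samples, batch_size)  # 250 // 32 = 7

print(steps_per_epoch, validation_steps)
```

These two values would then be passed as the <code>steps_per_epoch</code> and <code>validation_steps</code> arguments when training from a generator in Keras.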
'''5/15/2019'''
*Correct some out-of-date data in <code>The File to Rule Them ALL.csv</code>, new file saved as <code>The File to Rule Them ALL_NEW.csv</code>
*implement generate_dataset.py and sitemap tool
**regenerate dataset using updated data and tool
'''5/16/2019'''
*Work on CNN implementation
*Some problems to consider:
**some websites have more than 1 cohort page: a list of cohorts for each year
**class label is highly imbalanced: https://towardsdatascience.com/deep-learning-unbalanced-training-data-solve-it-like-this-6c528e9efea6

'''5/17/2019'''
*have to go back to the old plan of separating image data :(
*documentation on wiki
*test run on the GPU server
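
One common way to handle the class imbalance noted on 5/16 is to weight each class inversely to its frequency. The sketch below is an assumption about a possible approach, not necessarily what the linked article recommends or what this project ended up using; the label values and counts are made up for illustration.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count).

    Rare classes get weights above 1 and frequent classes get weights
    below 1, so each class contributes equally to the loss overall.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical labels: 0 = not a cohort page, 1 = cohort page.
labels = [0] * 90 + [1] * 10
class_weight = inverse_frequency_weights(labels)

print(class_weight)
```

A dictionary of this shape can be passed to Keras through the <code>class_weight</code> argument of <code>fit</code>.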