==How to Run the Code==
My code has been intentionally broken into two parts: data preprocessing and model training/prediction
The first part of the code is all about data preprocessing which I will discuss later. But basically this is where you transform your single "XXX.txt" input file into a pickle file that the later part of the code can use for training and prediction. To run this part: