Changes

Jump to navigation Jump to search
278 bytes added ,  16:41, 19 October 2017
no edit summary
==NLTK==
NLTK is the Natural Language Toolkit
*NLTK Information
**Need to convert text to ascii. Had issues with my PDF texts and had to convert
**Can use sent_tokenize() function to split document into sentences, easier that regular expressions
**Use pos_tag() to tag the sentences. This can be used to extract proper noun

Navigation menu