Changes

Ecosystem Organization Classifier (view source)

Revision as of 14:56, 30 March 2019

46 bytes added, 14:56, 30 March 2019

===Text Processing===

There are two obvious classification methods for processing the textual descriptions. The first is a "Bag of Words" approach, which uses Term Frequency – Inverse Document Frequency (TF-IDF) to do basic natural language processing and select words or phrases that have discriminant capabilities. The second is a Word2Vec approach, which uses a shallow two-layer neural network to reduce descriptions to a vector with high discriminant potential. (See "Memo for Evan" in E:\mcnair\Projects\Incubators for further detail.) We will try both approaches.
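As a rough illustration of the first approach only, the sketch below computes plain, unsmoothed TF-IDF weights in pure Python. The three sample descriptions and the names `tokenize` and `tf_idf` are hypothetical and are not taken from the project's code; an actual implementation would likely rely on an established library and a real tokenizer.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, and keep alphabetic tokens only."""
    return [word for word in text.lower().split() if word.isalpha()]


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    weight = (term count / document length) * log(N / document frequency),
    the plain textbook variant with no smoothing.
    """
    tokenized = [tokenize(doc) for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: the number of documents containing each term.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty descriptions
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical organization descriptions, for illustration only.
descriptions = [
    "we are an incubator for early stage startups",
    "we are a coworking space for freelancers",
    "we are a venture fund for growth stage startups",
]

weights = tf_idf(descriptions)
```

Terms that appear in every description (such as "we") receive a weight of zero, while terms unique to one description (such as "incubator") receive the highest weights. That is the sense in which TF-IDF selects words with discriminant capabilities.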

==Related Projects==

Ed
