[20161015] CS297 Weekly Report

Topic: Use Word2Vec to select features

  1. Analysis of features
    1. Number of feature pairs per similarity range: [0.9, 1): 80 pairs; [0.8, 0.9): 1570 pairs; [0.7, 0.8): 8660 pairs; [0.6, 0.7): 55896 pairs
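
The counts above can be reproduced in principle by scoring every feature pair and binning the scores. The sketch below assumes cosine similarity between Word2Vec vectors (the report does not name the measure) and uses hypothetical toy vectors in place of the real embeddings; `cosine`, `count_pairs_by_bin`, and `toy_vectors` are illustrative names only.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def count_pairs_by_bin(vectors, edges=(0.6, 0.7, 0.8, 0.9, 1.0)):
    """Count feature pairs whose similarity falls in each half-open bin [lo, hi)."""
    counts = {(lo, hi): 0 for lo, hi in zip(edges, edges[1:])}
    for a, b in combinations(sorted(vectors), 2):
        s = cosine(vectors[a], vectors[b])
        for lo, hi in counts:
            if lo <= s < hi:
                counts[(lo, hi)] += 1
                break
    return counts


# Hypothetical 2-d embeddings; real Word2Vec vectors have many more dimensions.
toy_vectors = {
    "good": [1.0, 0.0],
    "great": [0.9, 0.1],
    "bad": [0.0, 1.0],
}
pair_counts = count_pairs_by_bin(toy_vectors)
```

Scoring all pairs is quadratic in the number of features, so for a large vocabulary a vectorized or approximate nearest-neighbour search would likely be needed instead.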


Add up all features that have high similarity (> 0.9).

However, similar features are intertwined: a feature can appear in more than one high-similarity pair.

-> Use graph search to identify all connected components.
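
A minimal sketch of this step, assuming the high-similarity pairs are available as a list of (feature, feature) tuples: each pair is treated as an undirected edge and breadth-first search collects the components. The report does not say which graph search was used, so BFS is an assumption here, and `connected_components` and `similar_pairs` are hypothetical names with made-up data.

```python
from collections import defaultdict, deque


def connected_components(pairs):
    """Group features linked directly or transitively by a similar pair."""
    graph = defaultdict(set)
    for a, b in pairs:
        graph[a].add(b)
        graph[b].add(a)

    seen = set()
    components = []
    for start in sorted(graph):
        if start in seen:
            continue
        # Breadth-first search from an unvisited feature.
        seen.add(start)
        queue = deque([start])
        group = []
        while queue:
            node = queue.popleft()
            group.append(node)
            for neighbour in sorted(graph[node]):
                if neighbour not in seen:
                    seen.add(neighbour)
                    queue.append(neighbour)
        components.append(sorted(group))
    return components


# Hypothetical pairs with similarity > 0.9; "great" links two pairs together.
similar_pairs = [("good", "great"), ("great", "excellent"), ("bad", "awful")]
groups = connected_components(similar_pairs)
```

Features that appear in no high-similarity pair never enter the graph, so they stay as standalone features.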


-> Add up the weights within each group; each group's sum becomes a new feature.
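
The merge could look roughly like the sketch below. It assumes each feature carries one numeric weight (for example a term count; the report does not define the weight) and that groups come from the connected-components step. `merge_feature_weights`, the `+`-joined group names, and the toy data are all hypothetical.

```python
def merge_feature_weights(weights, groups):
    """Sum the weights of grouped features; ungrouped features pass through."""
    # Map every grouped feature to a label for its group, e.g. "good+great".
    group_of = {feature: "+".join(group) for group in groups for feature in group}
    merged = {}
    for feature, weight in weights.items():
        key = group_of.get(feature, feature)
        merged[key] = merged.get(key, 0.0) + weight
    return merged


# Hypothetical weights for one document and one group of similar features.
toy_weights = {"good": 2.0, "great": 1.0, "excellent": 0.5, "bad": 3.0, "table": 1.5}
toy_groups = [["excellent", "good", "great"]]
merged = merge_feature_weights(toy_weights, toy_groups)
```

With the toy data, the three grouped features collapse into a single feature whose weight is their sum, while "bad" and "table" keep their own weights.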





