[20160919] CS297 Weekly Report

DataSet

1. Text categorization Benchmark

1. Reuters-21578 –> 21,578 docs, 135 different topics

http://www.daviddlewis.com/resources/testcollections/reuters21578/

screenshot.png

2. 20 Newsgroups –>20,000 docs, 20 different topics

http://qwone.com/~jason/20Newsgroups/

screenshot.png

2. Practical text categorization 

Literature 

screenshot.png

 

 

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s