1. Histogram of most frequently-used words.
2. Proportion of unique words per page (i.e. [unique words]/[total words]).
3. Average word length
4. Average sentence length.
5. Average # sentences per paragraph.
2. Proportion of unique words per page (i.e. [unique words]/[total words]).
3. Average word length
4. Average sentence length.
5. Average # sentences per paragraph.
Natural language processing
18/11/2012 03:47:47 PM
- 703 Views
A few ideas
19/11/2012 07:11:43 PM
- 314 Views