Archive for 'Linguistics'
Clustering texts with an obvious grouping.
It was pointed out to me by Kenny Easwaran that I ought to try clustering texts that already have a natural grouping. So I ran the clustering program on 15 texts written by three authors, and here is the result: The largest eigenvalue is 25 times bigger than the next largest eigenvalue, and picks out the author […]
Posted: January 27th, 2008 under Personal, Linguistics.
Comments: 1
Clustering Shakespeare.
I ran my clustering program (which I just ran on the New Testament) on Shakespeare’s plays—which were conveniently packaged into a text file by Open Source Shakespeare. The result was the following graph: I know little about Shakespeare, so I can’t say too much about the above image. I’d love to know what you think: does […]
Posted: January 22nd, 2008 under Personal, Linguistics.
Comments: 1