1. The Gini index criterion is discussed in this chapter (for cluster validity). Show how you can pair this criterion with the k-means algorithm to perform unsupervised feature selection. Which other cluster validity criterion (or criteria) can you use for unsupervised feature selection in this manner?
2. Implement the feature selection criterion for term strength.
3. Suppose your text documents have a representation in which you only know about the presence or absence of words in half the lexicon and you know the exact frequencies of words in the remaining half. Show how you can combine the Bernoulli and multivariate models to perform text clustering.