Consider the term “elections,” which is present in only 50 documents in a corpus of 1000 documents. Furthermore, assume that the corpus contains 100 documents belonging to the Politics category and 900 documents belonging to the Not-Politics category. The term “elections” is contained in 25 documents belonging to the Politics category.

(a) Compute the unnormalized Gini index and the normalized Gini index Gn(·) of the term “elections.”
(b) Compute the entropy of the class distribution with respect to the entire data set.
(c) Compute the conditional entropy of the class distribution with respect to the term “elections.”
(d) Compute the mutual information of the term “elections” according to Eq. 5.6. How are your answers to (b), (c), and (d) related?
(e) Compute the information gain of the term “elections” according to Eq. 5.7. How are your answers to (d) and (e) related?
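The counts in the exercise can be turned into numbers with a short script. The following is a minimal sketch in Python, not a reference solution. Eq. 5.6 and Eq. 5.7 are not reproduced on this page, so the code does not claim to implement them. It assumes the common textbook definitions instead: the Gini index as the sum of squared class fractions among documents containing the term, the normalized version obtained by dividing each fraction by its class prior and rescaling, and Shannon entropy in bits. All variable names are illustrative.

```python
from math import log2

# Counts taken directly from the exercise statement.
N = 1000               # documents in the corpus
N_POLITICS = 100       # documents in the Politics class
N_TERM = 50            # documents containing "elections"
N_TERM_POLITICS = 25   # Politics documents containing "elections"


def entropy(probs):
    """Shannon entropy in bits; zero-probability outcomes contribute nothing."""
    return -sum(p * log2(p) for p in probs if p > 0)


# Global class priors P_i and class fractions p_i(t) among documents with the term.
priors = [N_POLITICS / N, (N - N_POLITICS) / N]
p_given_term = [N_TERM_POLITICS / N_TERM, (N_TERM - N_TERM_POLITICS) / N_TERM]

# (a) Assumed definition: G(t) = sum_i p_i(t)^2.
gini = sum(p ** 2 for p in p_given_term)

# Assumed normalization: divide each p_i(t) by its prior P_i, rescale to sum to 1,
# then apply the same sum of squares.
ratios = [p / prior for p, prior in zip(p_given_term, priors)]
p_norm = [r / sum(ratios) for r in ratios]
gini_normalized = sum(p ** 2 for p in p_norm)

# (b) Entropy of the class distribution over the whole corpus.
class_entropy = entropy(priors)

# (c) Conditional entropy: entropy within the "term present" and "term absent"
# groups, weighted by the fraction of documents in each group.
n_absent = N - N_TERM
politics_absent = N_POLITICS - N_TERM_POLITICS
p_given_absent = [politics_absent / n_absent,
                  (n_absent - politics_absent) / n_absent]
conditional_entropy = ((N_TERM / N) * entropy(p_given_term)
                       + (n_absent / N) * entropy(p_given_absent))

# Difference between (b) and (c): the reduction in class entropy from
# knowing whether the term occurs.
entropy_reduction = class_entropy - conditional_entropy

print(f"Gini index:            {gini:.4f}")
print(f"Normalized Gini index: {gini_normalized:.4f}")
print(f"Class entropy:         {class_entropy:.4f} bits")
print(f"Conditional entropy:   {conditional_entropy:.4f} bits")
print(f"Entropy reduction:     {entropy_reduction:.4f} bits")
```

Under these assumed definitions the script gives a Gini index of 0.5, a normalized Gini index of 0.82, a class entropy of roughly 0.469 bits, and a conditional entropy of roughly 0.429 bits. The difference between the last two is what is often called information gain; whether it coincides with the quantity in Eq. 5.6 or Eq. 5.7 depends on how the book defines those equations, so check the formulas before relying on it for parts (d) and (e).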
