An automated answer-rating site marks each post in a community forum website as “good” or “bad” based on the quality of the post. The CSV file, which you can download from OA 9.14, contains the quality attributes measured by the tool. The dataset contains the following attributes:

i. num_words: number of words in the post

ii. num_characters: number of characters in the post

iii. num_misspelled: number of misspelled words

iv. bin_end_qmark: whether the post ends with a question mark

v. num_interrogative: number of interrogative words in the post

vi. bin_start_small: if the answer starts with a lowercase letter (“1” means yes, otherwise no)

vii. num_sentences: number of sentences per post

viii. num_punctuations: number of punctuation symbols in the post

ix. label: the label of the post (“G” for good and “B” for bad) as determined by the tool.

Create a logistic regression model to predict the class label from the first eight attributes of the dataset. Evaluate the accuracy of your model.
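One possible way to approach the task is sketched below in plain Python. It is a minimal illustration, not a reference solution. It assumes the CSV header uses exactly the attribute names listed above, that every feature is numeric, and that “G” is treated as the positive class. The 70/30 train/test split, the learning rate, the number of epochs, and all function names are illustrative choices that do not come from the exercise. The model is fitted with simple batch gradient descent on standardized features.

```python
import csv
import math
import random

# Column names taken from the attribute list; assumed to match the CSV header.
FEATURES = [
    "num_words", "num_characters", "num_misspelled", "bin_end_qmark",
    "num_interrogative", "bin_start_small", "num_sentences", "num_punctuations",
]

def load_posts(path):
    """Read the CSV into feature rows X and 0/1 targets y (1 = "G")."""
    X, y = [], []
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            X.append([float(row[name]) for name in FEATURES])
            y.append(1 if row["label"].strip() == "G" else 0)
    return X, y

def standardize(train, test):
    """Scale every column with the mean and standard deviation of the training rows."""
    cols = list(zip(*train))
    means = [sum(c) / len(c) for c in cols]
    # A constant column has zero spread; fall back to 1.0 to avoid dividing by zero.
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]

    def scale(rows):
        return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]

    return scale(train), scale(test)

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit(X, y, lr=0.1, epochs=500):
    """Batch gradient descent on the mean log-loss; returns (weights, bias)."""
    w, b = [0.0] * len(X[0]), 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            grad_b += err
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict(X, w, b):
    """Predict 1 ("G") when the estimated probability is at least 0.5."""
    return [1 if sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) >= 0.5 else 0
            for xi in X]

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def evaluate(X, y, test_fraction=0.3, seed=0):
    """Shuffle, split, fit on the training part, and return held-out accuracy."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    cut = int(len(idx) * (1 - test_fraction))
    train, test = idx[:cut], idx[cut:]
    X_train, X_test = standardize([X[i] for i in train], [X[i] for i in test])
    w, b = fit(X_train, [y[i] for i in train])
    return accuracy([y[i] for i in test], predict(X_test, w, b))
```

With the file saved locally under a hypothetical name such as posts.csv, calling `X, y = load_posts("posts.csv")` followed by `print(evaluate(X, y))` would report the accuracy on the held-out 30% of posts. Measuring accuracy on rows the model has not seen gives a fairer estimate than scoring it on its own training data. In practice a library implementation such as scikit-learn's LogisticRegression would be the more usual choice.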
