Solution Manual Big Data Analytics 17CS82 VTU CBCS

Outlook	Temp	Humidity	Windy	Play
Sunny	Hot	High	False	No
Sunny	Hot	High	True	No
Overcast	Hot	High	False	Yes
Rainy	Mild	High	False	Yes
Rainy	Cool	Normal	False	Yes
Rainy	Cool	Normal	True	No
Overcast	Cool	Normal	True	Yes
Sunny	Mild	High	False	No
Sunny	Cool	Normal	False	Yes
Rainy	Mild	Normal	False	Yes
Sunny	Mild	Normal	True	Yes
Overcast	Mild	High	True	Yes
Overcast	Hot	Normal	False	Yes
Rainy	Mild	High	True	No

Outlook	Temp	Humidity	Windy	Play
Sunny	Hot	Normal	True	?

4. Create a decision tree for the following dataset and predict whether the loan is approved or not (8 M July 2019)

Age	Job	House	Credit	Loan Approved
Young	False	No	Fair	No
Young	False	No	Good	No
Young	True	No	Good	Yes
Young	True	Yes	Fair	Yes
Young	False	No	Fair	No
Middle	False	No	Fair	No
Middle	False	No	Good	No
Middle	True	Yes	Good	Yes
Middle	False	Yes	Excellent	Yes
Middle	False	Yes	Excellent	Yes
Old	False	Yes	Excellent	Yes
Old	False	Yes	Good	Yes
Old	True	No	Good	Yes
Old	True	No	Excellent	Yes
Old	False	No	Fair	No

Age	Job	House	Credit	Loan Approved
Young	False	No	Good	?
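One way to approach Question 4 is an ID3-style tree that splits on the attribute with the highest information gain. The sketch below is a minimal illustration under that assumption (the question does not name a specific splitting criterion); the helper names `entropy`, `info_gain`, `build`, and `predict` are chosen here for illustration only.

```python
from collections import Counter
from math import log2

ATTRS = ["Age", "Job", "House", "Credit"]
# Each row: Age, Job, House, Credit, Loan Approved (copied from the table above).
ROWS = [
    ("Young", "False", "No", "Fair", "No"),
    ("Young", "False", "No", "Good", "No"),
    ("Young", "True", "No", "Good", "Yes"),
    ("Young", "True", "Yes", "Fair", "Yes"),
    ("Young", "False", "No", "Fair", "No"),
    ("Middle", "False", "No", "Fair", "No"),
    ("Middle", "False", "No", "Good", "No"),
    ("Middle", "True", "Yes", "Good", "Yes"),
    ("Middle", "False", "Yes", "Excellent", "Yes"),
    ("Middle", "False", "Yes", "Excellent", "Yes"),
    ("Old", "False", "Yes", "Excellent", "Yes"),
    ("Old", "False", "Yes", "Good", "Yes"),
    ("Old", "True", "No", "Good", "Yes"),
    ("Old", "True", "No", "Excellent", "Yes"),
    ("Old", "False", "No", "Fair", "No"),
]

def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def info_gain(rows, idx):
    """Entropy reduction obtained by splitting rows on attribute column idx."""
    parts = {}
    for r in rows:
        parts.setdefault(r[idx], []).append(r[-1])
    remainder = sum(len(p) / len(rows) * entropy(p) for p in parts.values())
    return entropy([r[-1] for r in rows]) - remainder

def build(rows, attr_idxs):
    """Return a leaf label, or (attribute index, {value: subtree})."""
    labels = [r[-1] for r in rows]
    if len(set(labels)) == 1 or not attr_idxs:
        return Counter(labels).most_common(1)[0][0]
    best = max(attr_idxs, key=lambda i: info_gain(rows, i))
    rest = [i for i in attr_idxs if i != best]
    values = {r[best] for r in rows}
    return best, {v: build([r for r in rows if r[best] == v], rest) for v in values}

def predict(tree, row):
    """Walk the tree until a leaf label is reached."""
    while isinstance(tree, tuple):
        idx, branches = tree
        tree = branches[row[idx]]
    return tree

tree = build(ROWS, list(range(len(ATTRS))))
for i, name in enumerate(ATTRS):
    print(name, round(info_gain(ROWS, i), 3))
print("root:", ATTRS[tree[0]])
print("prediction:", predict(tree, ("Young", "False", "No", "Good")))
```

On this table the largest gain is for House, so it becomes the root; every House = Yes row is approved, and the House = No rows are separated completely by Job. The new applicant (House = No, Job = False) therefore falls in the "No" leaf.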

5. Explain the design principles of an artificial neural network. (8 M July 2019)

6. Explain the design principles of an artificial neural network by constructing a model representation for a single-layer and a multilayer perceptron. Describe the steps to build an artificial neural network (ANN). (10 M Nov 2020)

7. How does the Apriori algorithm work? Apply it to the following example, assuming a minimum support count of 2. (8 M July 2019)

TID	List of Items IDs
T100	I1, I2, I5
T200	I2, I4
T300	I2, I3
T400	I1, I2, I4
T500	I1, I3
T600	I2, I3
T700	I1, I3
T800	I1, I2, I3, I5
T900	I1, I2, I3
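The level-wise procedure asked for in Question 7 can be sketched as follows. This is a minimal illustration rather than a reference implementation: the function name `apriori` and its internals are assumptions made here, and support is counted by a plain scan of all transactions at every level.

```python
from itertools import combinations

TRANSACTIONS = {
    "T100": {"I1", "I2", "I5"},
    "T200": {"I2", "I4"},
    "T300": {"I2", "I3"},
    "T400": {"I1", "I2", "I4"},
    "T500": {"I1", "I3"},
    "T600": {"I2", "I3"},
    "T700": {"I1", "I3"},
    "T800": {"I1", "I2", "I3", "I5"},
    "T900": {"I1", "I2", "I3"},
}

def apriori(transactions, min_count):
    """Return {frozenset(itemset): support count} for all frequent itemsets."""
    baskets = [frozenset(t) for t in transactions]

    def count(itemset):
        return sum(itemset <= b for b in baskets)

    singles = {frozenset([i]) for b in baskets for i in b}
    level = {s: count(s) for s in singles if count(s) >= min_count}
    frequent, k = {}, 1
    while level:
        frequent.update(level)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in level for s in combinations(c, k - 1))}
        level = {c: count(c) for c in candidates if count(c) >= min_count}
    return frequent

frequent = apriori(TRANSACTIONS.values(), min_count=2)
for itemset, n in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), n)
```

With a support count of 2, the run ends at two frequent 3-itemsets, {I1, I2, I3} and {I1, I2, I5}; their only possible 4-item join is pruned because {I1, I3, I5} is not frequent.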

8. Describe the advantages and disadvantages of a regression model. (8 M Jan 2020)

9. Write the different steps involved in developing an artificial neural network. (5 M Jan 2020)

10. Describe the advantages of using ANN. (3 M Jan 2020)

11. For the following example, describe the steps of forming association rules using the Apriori algorithm with a minimum support of 33% and a minimum confidence of 50%. (8 M Jan 2020, Nov 2020)

1	Milk	Egg	Bread	Butter
2	Milk	Butter	Egg	Ketchup
3	Bread	Butter	Ketchup
4	Milk	Bread	Butter
5	Bread	Butter	Cookies
6	Milk	Bread	Butter	Cookies
7	Milk	Cookies
8	Milk	Bread	Butter
9	Bread	Butter	Egg	Cookies
10	Milk	Butter	Bread
11	Milk	Bread	Butter
12	Milk	Bread	Cookies	Ketchup
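The rule-generation step of Question 11 can be illustrated with a short sketch. It assumes the frequent-itemset search has already been carried out and starts from {Milk, Bread, Butter}, which appears in 6 of the 12 transactions and so clears the 33% support threshold. The helper names `support` and `rules_from` are hypothetical, introduced only for this example.

```python
from itertools import combinations

TRANSACTIONS = [
    {"Milk", "Egg", "Bread", "Butter"},
    {"Milk", "Butter", "Egg", "Ketchup"},
    {"Bread", "Butter", "Ketchup"},
    {"Milk", "Bread", "Butter"},
    {"Bread", "Butter", "Cookies"},
    {"Milk", "Bread", "Butter", "Cookies"},
    {"Milk", "Cookies"},
    {"Milk", "Bread", "Butter"},
    {"Bread", "Butter", "Egg", "Cookies"},
    {"Milk", "Butter", "Bread"},
    {"Milk", "Bread", "Butter"},
    {"Milk", "Bread", "Cookies", "Ketchup"},
]

def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    return sum(itemset <= t for t in TRANSACTIONS) / len(TRANSACTIONS)

def rules_from(itemset, min_conf):
    """All rules lhs -> (itemset - lhs) whose confidence meets min_conf."""
    itemset = frozenset(itemset)
    rules = []
    for size in range(1, len(itemset)):
        for lhs in combinations(sorted(itemset), size):
            lhs = frozenset(lhs)
            # confidence(lhs -> rhs) = support(lhs and rhs) / support(lhs)
            conf = support(itemset) / support(lhs)
            if conf >= min_conf:
                rules.append((lhs, itemset - lhs, conf))
    return rules

ITEMSET = {"Milk", "Bread", "Butter"}
rules = rules_from(ITEMSET, min_conf=0.5)
for lhs, rhs, conf in rules:
    print(sorted(lhs), "->", sorted(rhs), f"confidence={conf:.2f}")
```

The same function can be applied to each frequent 2-itemset found in the earlier passes to list the remaining rules.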

12. For the given dataset with attributes City Size, Avg. Income, Local Investors, and LOHAS Awareness, apply the decision tree algorithm and find the optimal decision tree. Also predict the class label for the new example.

City Size	Avg. Income	Local Investors	LOHAS Awareness	Decision
Big	High	Yes	High	Yes
Med	Med	No	Med	No
Small	Low	Yes	Low	No
Big	High	No	High	Yes
Small	Med	Yes	High	No
Med	High	Yes	Med	Yes
Med	Med	Yes	Med	No
Big	Med	No	Med	No
Med	High	Yes	Low	No
Small	High	No	High	Yes
Small	Med	No	High	No
Med	High	No	Med	No

City Size	Avg. Income	Local Investors	LOHAS Awareness	Decision
Med	Med	No	Med	?

Big Data Analytics Module 5 Question Bank with Answers

1. Compare text mining with data mining. (8 M Nov 2020)

2. What is the Naïve Bayes technique? Explain its model. (5 M July 2019)

3. Explain steps in the text mining process and architecture (8 M Nov 2020)

4. What is a support vector machine? Explain its model. (8 M July 2019)

5. Mention the 3-step process of Text Mining. (3 M July 2019)

6. Explain briefly the three different types of web mining. (6 M July 2019)

7. Compute the rank values for the nodes of the network shown in the figure below. Which is the highest-ranked node? Solve using eight iterations. (8 M July 2019, Nov 2020)

8. Describe the difference between text mining and data mining. (6 M)

9. Explain the Naïve Bayes model. What are the advantages and disadvantages of the Naïve Bayes model?

10. Briefly describe the Support vector machine (SVM) technique. (4 M)

11. What are the advantages and disadvantages of the support vector machine (SVM)?

12. Explain the Naïve Bayes model to classify the text data into the right class using the following dataset. (6 M)

Document ID	Keywords in the document	Class h
1	Love Happy Joy Joy Happy	Yes
2	Happy Love Kick Joy Happy	Yes
3	Love Move Joy Good	Yes
4	Love Happy Joy Love Pain	Yes
5	Joy Love Pain Kick Pain	No
6	Pain Pain Love Kick	No
7	Love Pain Joy Love Kick	?
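Question 12 can be worked through with a multinomial Naïve Bayes classifier. The sketch below assumes add-one (Laplace) smoothing and case-insensitive keywords, neither of which is stated in the question; the function names `train` and `log_scores` are illustrative only.

```python
from collections import Counter
from math import log

TRAIN = [
    ("Love Happy Joy Joy Happy", "Yes"),
    ("Happy Love Kick Joy Happy", "Yes"),
    ("Love Move Joy Good", "Yes"),
    ("Love Happy Joy Love Pain", "Yes"),
    ("Joy Love Pain Kick Pain", "No"),
    ("Pain Pain Love kick", "No"),
]
QUERY = "Love Pain Joy Love Kick"

def train(docs):
    """Count documents per class and keyword occurrences per class."""
    class_docs = Counter()
    word_counts = {}
    for text, label in docs:
        class_docs[label] += 1
        word_counts.setdefault(label, Counter()).update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return class_docs, word_counts, vocab

def log_scores(text, class_docs, word_counts, vocab):
    """Log of prior times smoothed keyword likelihoods, for each class."""
    total_docs = sum(class_docs.values())
    scores = {}
    for label, n_docs in class_docs.items():
        counts = word_counts[label]
        denom = sum(counts.values()) + len(vocab)  # add-one smoothing
        score = log(n_docs / total_docs)
        for word in text.lower().split():
            score += log((counts[word] + 1) / denom)
        scores[label] = score
    return scores

class_docs, word_counts, vocab = train(TRAIN)
scores = log_scores(QUERY, class_docs, word_counts, vocab)
prediction = max(scores, key=scores.get)
print(scores)
print("prediction:", prediction)
```

Under these assumptions the "Yes" class has 19 keyword occurrences, the "No" class has 9, and the vocabulary has 7 distinct words. The high frequency of Pain and Kick in the "No" documents outweighs the larger "Yes" prior, so document 7 is assigned to "No".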

13. What is web mining? Explain the different types of web mining. (8 M)

14. Explain three types of web mining. Use an appropriate flow diagram to represent the same. (8 M Nov 2020)

15. Write a short note on Social Network Analysis (SNA). Numerical examples on Naïve Bayes Model, SVM, and SNA (Rank Calculation).

16. Suppose we have the height, weight, and T-shirt size of some customers, and we need to predict the T-shirt size of a new customer given only their height and weight. The data, including height, weight, and T-shirt size, is shown below:

Height (in cm)	158	158	160	163	163	160	163	165	165	165	170	170	170
Weight (in kg)	58	59	60	60	61	64	64	61	62	65	63	64	68
T-Shirt Size	M	M	M	M	L	L	L	L	L	L	L	L	L

Determine the T-shirt size of a new customer with a weight of 61 kg and a height of 161 cm using KNN with K=5.
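A K-nearest-neighbours vote for Question 16 can be sketched as below. The sketch assumes unscaled Euclidean distance on (height, weight) and breaks distance ties by table order; both are choices made here, not part of the question. Squared distances are used because they give the same ranking and stay exact integers.

```python
from collections import Counter

HEIGHTS = [158, 158, 160, 163, 163, 160, 163, 165, 165, 165, 170, 170, 170]
WEIGHTS = [58, 59, 60, 60, 61, 64, 64, 61, 62, 65, 63, 64, 68]
SIZES = ["M", "M", "M", "M", "L", "L", "L", "L", "L", "L", "L", "L", "L"]

def knn_predict(height, weight, k=5):
    """Majority vote among the k closest customers (squared Euclidean distance)."""
    scored = [((h - height) ** 2 + (w - weight) ** 2, s)
              for h, w, s in zip(HEIGHTS, WEIGHTS, SIZES)]
    # sorted() is stable, so customers at equal distance keep their table order.
    ranked = sorted(scored, key=lambda pair: pair[0])
    nearest = ranked[:k]
    votes = Counter(size for _, size in nearest)
    return votes.most_common(1)[0][0], nearest, ranked

size, nearest, ranked = knn_predict(161, 61, k=5)
print("nearest:", nearest)
print("prediction:", size)
```

With the data as printed, the fifth and sixth closest customers, (158, 59, M) and (163, 64, L), are equally distant from the query, and the first four neighbours split two M and two L. The vote is therefore decided by the tie-break: keeping table order gives M, while preferring the other customer would give L, so a written answer should state which rule it uses.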
