Data Mining questions
1. What's noise? How can noise be reduced in a dataset?
2. Define outlier. Describe 2 different approaches to detect outliers in a dataset.
3. Give 2 examples in which aggregation is useful.
4. What's stratified sampling? Why is it preferred?
5. Provide a brief description of what Principal Components Analysis (PCA) does. [Hint: See Appendix A and your lecture notes.] State what the input and the output of PCA are.
6. What's the difference between dimensionality reduction and feature selection?
7. What's the difference between feature selection and feature extraction?
8. Give two examples of data in which feature extraction would be useful.
9. What's data discretization and when is it needed?
10. How are correlation and covariance used in data pre-processing?
Textbook: Tan, P., Steinbach, M. & Kumar, V. (2019). Introduction to data mining. 2nd Edition. Boston: Pearson Addison Wesley. ISBN 0-13-312890-3
Chapter 2.
Any references used for the study.