Data Analysis and Data Mining
space
Course code
MLM6310.DT
old course code
Course title in Estonian
Andmeanalüüs ja andmekaeve
Course title in English
Data Analysis and Data Mining
ECTS credits
6.0
Assessment form
assessment
lecturer of 2023/2024 Spring semester
Not opened for teaching. Click the study programme link below to see the nominal division schedule.
lecturer of 2024/2025 Autumn semester
Not opened for teaching. Click the study programme link below to see the nominal division schedule.
Course aims
Develop knowledge and skills necessary for conducting Statistical Data Analysis using the R programming language. To learn how to collect and clean data; make
exploratory techniques for summarizing data; to make prediction functions; to cover the basics of machine learning; to learn how to use R for effective data analysis.
Brief description of the course
Drawing conclusions about populations, statistical significance tests. Parametric and nonparametric tests, Regression analysis, ANOVA and ANCOVA models.
The course covers the basics needed for collecting and cleaning data, the essential exploratory techniques for summarizing data, the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, and error rates. The course covers program R possibilities for processing data and data analysis. We will cover the plotting systems in R as well as
some of the basic principles of constructing data graphics.
Learning outcomes in the course
Upon completing the course the student:
- knows main notions of data analysis and data mining;
- is familiar with the main methods and their applications;
- is able to solve the standard exercises using R.
Teacher
Maria Zeltser
space