Data Processing and Analysis

Data Processing and Analysis is a comprehensive online course offered through edX. It runs for 15 weeks at 2-4 hours per week and is taught by Natalia Grafeeva, Elena Mikhailova, Olga Egorova, Anton Boitsev, Aleksei Romanov and Dmitry Volchek. Upon completion of the course, you can receive an e-certificate from edX. The course is taught in English and costs $447.00. Visit the course page at edX for detailed price information.

Overview
  • The demand for skilled data analysts in both science and industry is constantly growing. The Data Processing and Analysis Professional Certificate Program gives you the knowledge base and practical skills to face data analysis challenges in your professional field.

    The first course of the program covers data analytics concepts such as data preprocessing and visualization, management and storage of large datasets with SQL and NoSQL database management systems, and data series analysis.

    The second course of the program discusses what machine learning is and mainly focuses on the regression problem (linear regression, polynomial and multivariable regression), classification methods (logistic regression, Naïve Bayes and K-nearest neighbors) and clustering methods (hierarchical and k-means clustering).

    The last course covers advanced methods of machine learning. You will learn how to analyze large datasets, find regularities in your data, and apply more sophisticated clustering and classification techniques. More precisely, you will encounter the concept of factor analysis through Principal Component Analysis (PCA), learn about support vector machines (SVM) and decision trees for classification, become familiar with some popular resampling methods, and apply them in ensemble learning. Finally, you will work on the problem of reinforcement learning and learn some useful algorithms.

    In all courses, the weekly practical tasks will refine your understanding of the main concepts and strengthen your data engineering skills.

    The program helps you develop skills in Excel data analysis, MS Azure Machine Learning Studio, Python notebooks, Oracle APEX and MongoDB. MS Excel and database management systems are used in the first course. The machine learning courses offer two learning tracks: one for those with coding experience in Python, and one for students with no coding experience, in which the tasks are carried out in MS Azure.

    Founded in 1900, ITMO University is the top higher education institution in computer science in Russia and a trailblazer shaping national education and research policy. The Higher School of Digital Culture is delighted to share with you its experience in the field of data science as well as in interdisciplinary research.

Syllabus
  • Courses under this program:
    Course 1: Data Storage and Processing

    Master the culture of data representation, interpretation and outcome evaluation. Learn the fundamentals of relational and NoSQL database management systems.

    Course 2: Introduction to Machine Learning

    Learn the essentials of machine learning and the algorithms of statistical data analysis.

    Course 3: Advanced Machine Learning

    An advanced course on machine learning. You will learn specific techniques and methods for analyzing large amounts of data.