Data Science Foundations: Python Scientific Stack

Go to class
Write Review

Free Online Course: Data Science Foundations: Python Scientific Stack provided by LinkedIn Learning is a comprehensive online course, which lasts for 3-4 hours worth of material. The course is taught in English and is free of charge. Upon completion of the course, you can receive an e-certificate from LinkedIn Learning. Data Science Foundations: Python Scientific Stack is taught by Miki Tebeka.

Overview
  • Learn how to use the Python scientific stack to complete data science tasks. Find out how to work with pandas for data crunching, NumPy for numeric computation, and more.

    Data science provides organizations with striking—and highly valuable—insights into human behavior. While data mining can seem a bit daunting, you don't need to be a highly-skilled programmer to process your own data. In this hands-on course, learn how to use the Python scientific stack to complete common data science tasks. Miki Tebeka covers the tools and concepts you need to effectively process data with the Python scientific stack, including Pandas for data crunching, matplotlib for data visualization, NumPy for numeric computation, and more.

Syllabus
  • Introduction

    • Welcome
    • What you should know
    • Mac setup
    • Windows setup
    • Linux setup
    • How to use the exercise files
    1. Scientific Python Overview
    • Ramp up with Scientific Python
    2. The Jupyter Notebook
    • Start the notebook server
    • Use code cells
    • Extensions to Python language
    • Understand markdown cells
    • Edit notebooks
    3. NumPy Basics
    • Overview: NumPy
    • NumPy arrays
    • Slicing
    • Learn Boolean indexing
    • Understand broadcasting
    • Understand array operations
    • Understand ufuncs
    4. Pandas
    • Pandas overview
    • Load CSV files
    • Parse time
    • Access rows and columns
    • Use pure Python packages
    • Calculate speed
    • Display a speed box plot
    5. Conda
    • Introduction to Python packages
    • Manage environments
    6. Folium and Geo
    • Create an initial map
    • Draw a track on the map
    • Use geo data with Shapely
    • Generate a report
    7. NY Taxi Data
    • Examine data
    • Load data from CSV files
    • Work with categorical data
    • Work with data: Hourly trip rides
    • Work with data: Rides per hour
    • Work with data: Weather data
    8. scikit-learn
    • Introduction: scikit-learn
    • Learn regression on Boston dataset
    • Understand train/test splits
    • Preprocess data
    • Compose pipelines
    • Save and load models
    9. Plotting
    • Overview: matplotlib
    • Use styles
    • Customize Pandas output
    • Use matplotlib
    • Tips and tricks
    • Understand bokeh
    10. Other Packages
    • Other packages overview
    • Go faster with Numba and Cython
    • Understand deep learning
    • Work with image processing
    • Understand NLP: NLTK
    • Understand NLP: SpaCy
    • Bigger data with HDF5 and dask
    11. Development Process
    • Overview
    • Understand source control
    • Learn code review
    • Testing overview
    • Testing example
    Conclusion
    • Next steps